2025-12-04T09:32:17.0999740Z Current runner version: '2.330.0' 2025-12-04T09:32:17.1006947Z Runner name: 'i-0144021b78afd54ea' 2025-12-04T09:32:17.1007853Z Runner group name: 'default' 2025-12-04T09:32:17.1008828Z Machine name: 'ip-10-0-6-82' 2025-12-04T09:32:17.1012158Z ##[group]GITHUB_TOKEN Permissions 2025-12-04T09:32:17.1014773Z Contents: read 2025-12-04T09:32:17.1015438Z Metadata: read 2025-12-04T09:32:17.1016124Z ##[endgroup] 2025-12-04T09:32:17.1018669Z Secret source: Actions 2025-12-04T09:32:17.1019657Z Prepare workflow directory 2025-12-04T09:32:17.1608128Z Prepare all required actions 2025-12-04T09:32:17.1654565Z Getting action download info 2025-12-04T09:32:17.5612698Z Download action repository 'pytorch/test-infra@main' (SHA:39aa74d619174326f4e2fb0e216151c2f29d9ffd) 2025-12-04T09:32:19.8612667Z Download action repository 'pytorch/pytorch@main' (SHA:7716da9fb23f27a65b41f9f016a2afadf281c18f) 2025-12-04T09:32:34.7615463Z Download action repository 'actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065' (SHA:a26af69be951a213d495a4c3e4e4022e16d87065) 2025-12-04T09:32:35.1175213Z Download action repository 'aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722' (SHA:ececac1a45f3b08a01d2dd070d28d111c5fe6722) 2025-12-04T09:32:35.3575262Z Download action repository 'aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2025-12-04T09:32:35.5615303Z Download action repository 'seemethere/download-artifact-s3@1da556a7aa0a088e3153970611f6c432d58e80e6' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2025-12-04T09:32:35.8034951Z Download action repository 'seemethere/upload-artifact-s3@baba72d0712b404f646cebe0730933554ebce96a' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-12-04T09:32:36.1441027Z Getting action download info 2025-12-04T09:32:36.3546094Z Download action repository 'actions/checkout@v4' (SHA:34e114876b0b11c390a56381ad16ebd13914f8d5) 2025-12-04T09:32:36.6530758Z Getting action download info 2025-12-04T09:32:36.7897878Z Download action repository 'nick-fields/retry@v3.0.0' (SHA:7152eba30c6575329ac0576536151aca5a72780e) 2025-12-04T09:32:37.0232968Z Getting action download info 2025-12-04T09:32:37.1517506Z Download action repository 'nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482' (SHA:3e91a01664abd3c5cd539100d10d33b9c5b68482) 2025-12-04T09:32:37.3581400Z Getting action download info 2025-12-04T09:32:37.5137463Z Uses: pytorch/pytorch/.github/workflows/_linux-test.yml@refs/heads/main (ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32) 2025-12-04T09:32:37.5142056Z ##[group] Inputs 2025-12-04T09:32:37.5142498Z build-environment: linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T09:32:37.5150004Z test-matrix: {"include": [{"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:32:37.5158199Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:32:37.5159221Z sync-tag: 2025-12-04T09:32:37.5160119Z timeout-minutes: 240 2025-12-04T09:32:37.5160412Z use-gha: 2025-12-04T09:32:37.5160672Z dashboard-tag: 2025-12-04T09:32:37.5160963Z s3-bucket: gha-artifacts 2025-12-04T09:32:37.5161270Z aws-role-to-assume: 2025-12-04T09:32:37.5161913Z disable-monitor: false 2025-12-04T09:32:37.5162263Z monitor-log-interval: 5 2025-12-04T09:32:37.5162618Z monitor-data-collect-interval: 1 2025-12-04T09:32:37.5163001Z ##[endgroup] 2025-12-04T09:32:37.5163760Z Complete job name: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:32:37.5738361Z A job started hook has been configured by the self-hosted runner administrator 2025-12-04T09:32:37.5851640Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-12-04T09:32:37.5862464Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:32:37.5863223Z ##[endgroup] 2025-12-04T09:32:38.9397561Z Runner Type: linux.g4dn.4xlarge.nvidia.gpu 2025-12-04T09:32:38.9398196Z Instance Type: g4dn.4xlarge 2025-12-04T09:32:38.9398506Z AMI Name: unknown 2025-12-04T09:32:38.9440190Z AMI ID: ami-08982f1c5bf93d976 2025-12-04T09:32:44.9209169Z ##[group]Run pytorch/test-infra/.github/actions/setup-ssh@main 2025-12-04T09:32:44.9209703Z with: 2025-12-04T09:32:44.9210319Z github-secret: *** 2025-12-04T09:32:44.9211177Z instructions: All testing is done inside the container, to start an interactive session run: docker exec -it $(docker container ps --format '{{.ID}}') bash 2025-12-04T09:32:44.9212119Z activate-with-label: false 2025-12-04T09:32:44.9212446Z label: with-ssh 2025-12-04T09:32:44.9212720Z remove-existing-keys: true 2025-12-04T09:32:44.9213046Z fail-silently: true 2025-12-04T09:32:44.9213325Z env: 2025-12-04T09:32:44.9213560Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:32:44.9213868Z ##[endgroup] 2025-12-04T09:32:45.0757403Z Please see https://github.com/pytorch/pytorch/wiki/Debugging-using-with-ssh-for-Github-Actions for more info. 2025-12-04T09:32:45.0759244Z Not on pull request and ciflow reference could not be extracted, skipping adding ssh keys 2025-12-04T09:32:45.1088060Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2025-12-04T09:32:45.1088597Z with: 2025-12-04T09:32:45.1088852Z no-sudo: true 2025-12-04T09:32:45.1089113Z submodules: recursive 2025-12-04T09:32:45.1089425Z fetch-depth: 0 2025-12-04T09:32:45.1089707Z env: 2025-12-04T09:32:45.1089940Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:32:45.1090244Z ##[endgroup] 2025-12-04T09:32:45.1179412Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:32:45.1180560Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:32:45.1191788Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:32:45.1192250Z env: 2025-12-04T09:32:45.1192526Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:32:45.1192872Z ##[endgroup] 2025-12-04T09:32:45.1281588Z ##[group]Run # Use all available CPUs for fetching 2025-12-04T09:32:45.1282097Z # Use all available CPUs for fetching 2025-12-04T09:32:45.1282511Z cd "${GITHUB_WORKSPACE}" 2025-12-04T09:32:45.1282900Z git config --global fetch.parallel 0 2025-12-04T09:32:45.1283515Z git config --global submodule.fetchJobs 0 2025-12-04T09:32:45.1283921Z  2025-12-04T09:32:45.1284332Z # Clean workspace. The default checkout action should also do this, but 2025-12-04T09:32:45.1284896Z # do it here as well just in case 2025-12-04T09:32:45.1285276Z if [[ -d .git ]]; then 2025-12-04T09:32:45.1285607Z  if [ -z "${NO_SUDO}" ]; then 2025-12-04T09:32:45.1285976Z  sudo git clean -ffdx 2025-12-04T09:32:45.1286312Z  else 2025-12-04T09:32:45.1286584Z  git clean -ffdx 2025-12-04T09:32:45.1286877Z  fi 2025-12-04T09:32:45.1287128Z fi 2025-12-04T09:32:45.1293661Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:32:45.1294103Z env: 2025-12-04T09:32:45.1294439Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:32:45.1294775Z NO_SUDO: true 2025-12-04T09:32:45.1295023Z ##[endgroup] 2025-12-04T09:32:45.1428925Z ##[group]Run actions/checkout@v4 2025-12-04T09:32:45.1429451Z with: 2025-12-04T09:32:45.1429801Z ref: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:32:45.1430316Z fetch-depth: 0 2025-12-04T09:32:45.1430742Z submodules: recursive 2025-12-04T09:32:45.1431105Z show-progress: false 2025-12-04T09:32:45.1431539Z repository: pytorch/pytorch 2025-12-04T09:32:45.1432185Z token: *** 2025-12-04T09:32:45.1432562Z ssh-strict: true 2025-12-04T09:32:45.1432942Z ssh-user: git 2025-12-04T09:32:45.1433373Z persist-credentials: true 2025-12-04T09:32:45.1433809Z clean: true 2025-12-04T09:32:45.1434161Z sparse-checkout-cone-mode: true 2025-12-04T09:32:45.1434657Z fetch-tags: false 2025-12-04T09:32:45.1435046Z lfs: false 2025-12-04T09:32:45.1435367Z set-safe-directory: true 2025-12-04T09:32:45.1435828Z env: 2025-12-04T09:32:45.1436185Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:32:45.1436546Z ##[endgroup] 2025-12-04T09:32:45.2688471Z Syncing repository: pytorch/pytorch 2025-12-04T09:32:45.2689985Z ##[group]Getting Git version info 2025-12-04T09:32:45.2690585Z Working directory is '/home/ec2-user/actions-runner/_work/pytorch/pytorch' 2025-12-04T09:32:45.2691364Z [command]/usr/bin/git version 2025-12-04T09:32:45.2859847Z git version 2.50.1 2025-12-04T09:32:45.2886887Z ##[endgroup] 2025-12-04T09:32:45.2897918Z Copying '/home/ec2-user/.gitconfig' to '/home/ec2-user/actions-runner/_work/_temp/7726e826-277c-4b92-a51e-ef1fbdd479dd/.gitconfig' 2025-12-04T09:32:45.2916869Z Temporarily overriding HOME='/home/ec2-user/actions-runner/_work/_temp/7726e826-277c-4b92-a51e-ef1fbdd479dd' before making global git config changes 2025-12-04T09:32:45.2918057Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T09:32:45.2922888Z [command]/usr/bin/git config --global --add safe.directory /home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-12-04T09:32:45.2968483Z Deleting the contents of '/home/ec2-user/actions-runner/_work/pytorch/pytorch' 2025-12-04T09:32:45.2971654Z ##[group]Initializing the repository 2025-12-04T09:32:45.2976117Z [command]/usr/bin/git init /home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-12-04T09:32:45.3042800Z hint: Using 'master' as the name for the initial branch. This default branch name 2025-12-04T09:32:45.3043530Z hint: is subject to change. To configure the initial branch name to use in all 2025-12-04T09:32:45.3044356Z hint: of your new repositories, which will suppress this warning, call: 2025-12-04T09:32:45.3044855Z hint: 2025-12-04T09:32:45.3045199Z hint: git config --global init.defaultBranch 2025-12-04T09:32:45.3045595Z hint: 2025-12-04T09:32:45.3045987Z hint: Names commonly chosen instead of 'master' are 'main', 'trunk' and 2025-12-04T09:32:45.3046683Z hint: 'development'. The just-created branch can be renamed via this command: 2025-12-04T09:32:45.3047189Z hint: 2025-12-04T09:32:45.3047443Z hint: git branch -m 2025-12-04T09:32:45.3047791Z hint: 2025-12-04T09:32:45.3048209Z hint: Disable this message with "git config set advice.defaultBranchName false" 2025-12-04T09:32:45.3052456Z Initialized empty Git repository in /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/ 2025-12-04T09:32:45.3061808Z [command]/usr/bin/git remote add origin https://github.com/pytorch/pytorch 2025-12-04T09:32:45.3100432Z ##[endgroup] 2025-12-04T09:32:45.3100945Z ##[group]Disabling automatic garbage collection 2025-12-04T09:32:45.3104128Z [command]/usr/bin/git config --local gc.auto 0 2025-12-04T09:32:45.3131744Z ##[endgroup] 2025-12-04T09:32:45.3132217Z ##[group]Setting up auth 2025-12-04T09:32:45.3138418Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T09:32:45.3167065Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T09:32:45.3497990Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T09:32:45.3524822Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T09:32:45.3823967Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T09:32:45.3851823Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T09:32:45.4183205Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T09:32:45.4235341Z ##[endgroup] 2025-12-04T09:32:45.4235847Z ##[group]Fetching the repository 2025-12-04T09:32:45.4244090Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-12-04T09:33:39.7661763Z From https://github.com/pytorch/pytorch 2025-12-04T09:33:39.7662386Z * [new branch] 2.6.0.dev20241004+ -> origin/2.6.0.dev20241004+ 2025-12-04T09:33:39.7663610Z * [new branch] 2.9.1 -> origin/2.9.1 2025-12-04T09:33:39.7664343Z * [new branch] AaronWang04_addmmfusion_perftest -> origin/AaronWang04_addmmfusion_perftest 2025-12-04T09:33:39.7665212Z * [new branch] Flamefire-patch-1 -> origin/Flamefire-patch-1 2025-12-04T09:33:39.7665944Z * [new branch] HDCharles-2.6.0-release-notes -> origin/HDCharles-2.6.0-release-notes 2025-12-04T09:33:39.7666642Z * [new branch] HOPrintFunc -> origin/HOPrintFunc 2025-12-04T09:33:39.7669027Z * [new branch] IvanKobzarev/stack/1 -> origin/IvanKobzarev/stack/1 2025-12-04T09:33:39.7671171Z * [new branch] NicoshevSVE128 -> origin/NicoshevSVE128 2025-12-04T09:33:39.7672556Z * [new branch] PR-AOTInductorNoneBug -> origin/PR-AOTInductorNoneBug 2025-12-04T09:33:39.7673868Z * [new branch] PR-AOTInductorNoneBugFix -> origin/PR-AOTInductorNoneBugFix 2025-12-04T09:33:39.7675149Z * [new branch] PR-FixConfigsIssue -> origin/PR-FixConfigsIssue 2025-12-04T09:33:39.7676389Z * [new branch] PR-NoneBugFix-viable -> origin/PR-NoneBugFix-viable 2025-12-04T09:33:39.7677819Z * [new branch] PR-ResetToZero -> origin/PR-ResetToZero 2025-12-04T09:33:39.7679366Z * [new branch] Update-Flash-Packaging -> origin/Update-Flash-Packaging 2025-12-04T09:33:39.7680434Z * [new branch] VLA_exp -> origin/VLA_exp 2025-12-04T09:33:39.7681997Z * [new branch] activation_bench -> origin/activation_bench 2025-12-04T09:33:39.7683848Z * [new branch] addmm-heuristic -> origin/addmm-heuristic 2025-12-04T09:33:39.7685677Z * [new branch] adi/onednn_aarch64 -> origin/adi/onednn_aarch64 2025-12-04T09:33:39.7686882Z * [new branch] adi/test -> origin/adi/test 2025-12-04T09:33:39.7688342Z * [new branch] adi/test_bgemm -> origin/adi/test_bgemm 2025-12-04T09:33:39.7689883Z * [new branch] adi/test_m8g -> origin/adi/test_m8g 2025-12-04T09:33:39.7691094Z * [new branch] adi/test_onednn -> origin/adi/test_onednn 2025-12-04T09:33:39.7692495Z * [new branch] adi/test_onednn_v3.9 -> origin/adi/test_onednn_v3.9 2025-12-04T09:33:39.7693745Z * [new branch] adi/test_presve_change -> origin/adi/test_presve_change 2025-12-04T09:33:39.7694925Z * [new branch] adi/test_timm -> origin/adi/test_timm 2025-12-04T09:33:39.7696700Z * [new branch] adi/testpresve_change -> origin/adi/testpresve_change 2025-12-04T09:33:39.7699151Z * [new branch] aditew01/test/vec_bf16 -> origin/aditew01/test/vec_bf16 2025-12-04T09:33:39.7700451Z * [new branch] ah-globalfeedback-hook -> origin/ah-globalfeedback-hook 2025-12-04T09:33:39.7702429Z * [new branch] albanD-patch-1 -> origin/albanD-patch-1 2025-12-04T09:33:39.7703213Z * [new branch] also-surround-shimh -> origin/also-surround-shimh 2025-12-04T09:33:39.7705124Z * [new branch] angelayi/aot_compile -> origin/angelayi/aot_compile 2025-12-04T09:33:39.7706478Z * [new branch] angelayi/aoti_additional_files -> origin/angelayi/aoti_additional_files 2025-12-04T09:33:39.7707692Z * [new branch] angelayi/benchmark -> origin/angelayi/benchmark 2025-12-04T09:33:39.7709055Z * [new branch] angelayi/change_pytree_serialization -> origin/angelayi/change_pytree_serialization 2025-12-04T09:33:39.7710145Z * [new branch] angelayi/cpp_loader -> origin/angelayi/cpp_loader 2025-12-04T09:33:39.7711466Z * [new branch] angelayi/inductor_const -> origin/angelayi/inductor_const 2025-12-04T09:33:39.7713478Z * [new branch] angelayi/lstm -> origin/angelayi/lstm 2025-12-04T09:33:39.7714821Z * [new branch] angelayi/no_so_weight -> origin/angelayi/no_so_weight 2025-12-04T09:33:39.7716368Z * [new branch] angelayi/scan_layers -> origin/angelayi/scan_layers 2025-12-04T09:33:39.7717602Z * [new branch] angelayi/side_eff -> origin/angelayi/side_eff 2025-12-04T09:33:39.7718910Z * [new branch] angelayi/state_dict -> origin/angelayi/state_dict 2025-12-04T09:33:39.7720280Z * [new branch] angelayi/symint_input -> origin/angelayi/symint_input 2025-12-04T09:33:39.7722105Z * [new branch] angelayi/symm_mem -> origin/angelayi/symm_mem 2025-12-04T09:33:39.7723141Z * [new branch] angelayi/test_cpp -> origin/angelayi/test_cpp 2025-12-04T09:33:39.7724263Z * [new branch] angelayi/torch_size -> origin/angelayi/torch_size 2025-12-04T09:33:39.7725572Z * [new branch] annotate_assert -> origin/annotate_assert 2025-12-04T09:33:39.7726958Z * [new branch] annotate_fallback_kernel -> origin/annotate_fallback_kernel 2025-12-04T09:33:39.7728244Z * [new branch] annotation_deepcopy -> origin/annotation_deepcopy 2025-12-04T09:33:39.7729564Z * [new branch] annotation_dynamo -> origin/annotation_dynamo 2025-12-04T09:33:39.7730827Z * [new branch] aot_eager_stack_trace -> origin/aot_eager_stack_trace 2025-12-04T09:33:39.7732127Z * [new branch] aoti-cuda-alloc -> origin/aoti-cuda-alloc 2025-12-04T09:33:39.7733439Z * [new branch] aoti_const_device -> origin/aoti_const_device 2025-12-04T09:33:39.7734748Z * [new branch] aoti_fqn_name_interface -> origin/aoti_fqn_name_interface 2025-12-04T09:33:39.7736023Z * [new branch] aoti_package_weights_binary -> origin/aoti_package_weights_binary 2025-12-04T09:33:39.7737271Z * [new branch] aoti_target_windows -> origin/aoti_target_windows 2025-12-04T09:33:39.7739922Z * [new branch] arsh/feat/inductor_check_profiling -> origin/arsh/feat/inductor_check_profiling 2025-12-04T09:33:39.7740954Z * [new branch] async_tp -> origin/async_tp 2025-12-04T09:33:39.7742477Z * [new branch] atalman-inductor-perf-cu124 -> origin/atalman-inductor-perf-cu124 2025-12-04T09:33:39.7743802Z * [new branch] atalman-inductor-perf-cu124.1 -> origin/atalman-inductor-perf-cu124.1 2025-12-04T09:33:39.7745168Z * [new branch] atalman-patch-2 -> origin/atalman-patch-2 2025-12-04T09:33:39.7746562Z * [new branch] atalman-patch-3 -> origin/atalman-patch-3 2025-12-04T09:33:39.7747901Z * [new branch] atalman-patch-4 -> origin/atalman-patch-4 2025-12-04T09:33:39.7749324Z * [new branch] atalman-patch-5 -> origin/atalman-patch-5 2025-12-04T09:33:39.7750680Z * [new branch] atalman-patch-6 -> origin/atalman-patch-6 2025-12-04T09:33:39.7752064Z * [new branch] atalman-patch-7 -> origin/atalman-patch-7 2025-12-04T09:33:39.7753455Z * [new branch] atalman-patch-8 -> origin/atalman-patch-8 2025-12-04T09:33:39.7754792Z * [new branch] atalman_inductor_2.3.1 -> origin/atalman_inductor_2.3.1 2025-12-04T09:33:39.7756059Z * [new branch] atalman_inductor_2.4.0 -> origin/atalman_inductor_2.4.0 2025-12-04T09:33:39.7757427Z * [new branch] atalman_inductor_2.4.x -> origin/atalman_inductor_2.4.x 2025-12-04T09:33:39.7758919Z * [new branch] attention_benchmarking_clean -> origin/attention_benchmarking_clean 2025-12-04T09:33:39.7760725Z * [new branch] bahuang/dt_fix_scalar_add -> origin/bahuang/dt_fix_scalar_add 2025-12-04T09:33:39.7761799Z * [new branch] bahuang/fix_debug_mode -> origin/bahuang/fix_debug_mode 2025-12-04T09:33:39.7763068Z * [new branch] bahuang/fix_expand -> origin/bahuang/fix_expand 2025-12-04T09:33:39.7764335Z * [new branch] bahuang/test -> origin/bahuang/test 2025-12-04T09:33:39.7766391Z * [new branch] base/1.5 -> origin/base/1.5 2025-12-04T09:33:39.7768011Z * [new branch] batching_sdpa_efficient_attention -> origin/batching_sdpa_efficient_attention 2025-12-04T09:33:39.7769227Z * [new branch] bench_scaled_mm_ops -> origin/bench_scaled_mm_ops 2025-12-04T09:33:39.7770677Z * [new branch] benchmark-updates -> origin/benchmark-updates 2025-12-04T09:33:39.7771921Z * [new branch] benchmarking-script -> origin/benchmarking-script 2025-12-04T09:33:39.7773700Z * [new branch] bertmaher/pinbump26 -> origin/bertmaher/pinbump26 2025-12-04T09:33:39.7775444Z * [new branch] bertrand/cutlass -> origin/bertrand/cutlass 2025-12-04T09:33:39.7777198Z * [new branch] bf/bug-static-input -> origin/bf/bug-static-input 2025-12-04T09:33:39.7778358Z * [new branch] bf/cg-backend -> origin/bf/cg-backend 2025-12-04T09:33:39.7779757Z * [new branch] bf/cg-nccl-test -> origin/bf/cg-nccl-test 2025-12-04T09:33:39.7780908Z * [new branch] bf/cg-remove-check -> origin/bf/cg-remove-check 2025-12-04T09:33:39.7782282Z * [new branch] bf/clean-torchbench-hf -> origin/bf/clean-torchbench-hf 2025-12-04T09:33:39.7783484Z * [new branch] bf/combo-debug-log -> origin/bf/combo-debug-log 2025-12-04T09:33:39.7784742Z * [new branch] bf/cudagraph -> origin/bf/cudagraph 2025-12-04T09:33:39.7786707Z * [new branch] bf/cudagraph-disable-input-mutation -> origin/bf/cudagraph-disable-input-mutation 2025-12-04T09:33:39.7788472Z * [new branch] bf/cudagraph-enable-input-mutation-support-benchmark -> origin/bf/cudagraph-enable-input-mutation-support-benchmark 2025-12-04T09:33:39.7789840Z * [new branch] bf/cudagraph-partition -> origin/bf/cudagraph-partition 2025-12-04T09:33:39.7790815Z * [new branch] bf/donated-buffer-bench -> origin/bf/donated-buffer-bench 2025-12-04T09:33:39.7791979Z * [new branch] bf/dynamo-partition -> origin/bf/dynamo-partition 2025-12-04T09:33:39.7793219Z * [new branch] bf/lite -> origin/bf/lite 2025-12-04T09:33:39.7794540Z * [new branch] bf/pa-non-divisible -> origin/bf/pa-non-divisible 2025-12-04T09:33:39.7796020Z * [new branch] bf/partition-cache-free-symbols -> origin/bf/partition-cache-free-symbols 2025-12-04T09:33:39.7797278Z * [new branch] bf/partition-memory-plan -> origin/bf/partition-memory-plan 2025-12-04T09:33:39.7798613Z * [new branch] bf/partition-move-cpu -> origin/bf/partition-move-cpu 2025-12-04T09:33:39.7799947Z * [new branch] bf/partition-view-fallback -> origin/bf/partition-view-fallback 2025-12-04T09:33:39.7801135Z * [new branch] bf/remove-check-55b0c39d -> origin/bf/remove-check-55b0c39d 2025-12-04T09:33:39.7802474Z * [new branch] bf/timm-nov-26-2025 -> origin/bf/timm-nov-26-2025 2025-12-04T09:33:39.7803752Z * [new branch] bf/transformer-pin-4-57-3 -> origin/bf/transformer-pin-4-57-3 2025-12-04T09:33:39.7805108Z * [new branch] bisect_perf_hf_T5_3acc6eac492 -> origin/bisect_perf_hf_T5_3acc6eac492 2025-12-04T09:33:39.7806324Z * [new branch] bisect_perf_hf_T5_3fcf66f61fb -> origin/bisect_perf_hf_T5_3fcf66f61fb 2025-12-04T09:33:39.7807628Z * [new branch] bisect_perf_hf_T5_4009d154129 -> origin/bisect_perf_hf_T5_4009d154129 2025-12-04T09:33:39.7808827Z * [new branch] bisect_perf_hf_T5_40d0740e73d -> origin/bisect_perf_hf_T5_40d0740e73d 2025-12-04T09:33:39.7810019Z * [new branch] bisect_perf_hf_T5_5268754e -> origin/bisect_perf_hf_T5_5268754e 2025-12-04T09:33:39.7811320Z * [new branch] bisect_perf_hf_T5_7d89a8d385c -> origin/bisect_perf_hf_T5_7d89a8d385c 2025-12-04T09:33:39.7812469Z * [new branch] bisect_perf_hf_T5_b7a25c1ee7c -> origin/bisect_perf_hf_T5_b7a25c1ee7c 2025-12-04T09:33:39.7813735Z * [new branch] bisect_perf_hf_T5_c25b201583f -> origin/bisect_perf_hf_T5_c25b201583f 2025-12-04T09:33:39.7814988Z * [new branch] bisect_perf_hf_T5_c93e57efac0 -> origin/bisect_perf_hf_T5_c93e57efac0 2025-12-04T09:33:39.7816554Z * [new branch] bisect_perf_hf_T5_ca9813ea149 -> origin/bisect_perf_hf_T5_ca9813ea149 2025-12-04T09:33:39.7817577Z * [new branch] bisect_perf_hf_T5_d65f194a -> origin/bisect_perf_hf_T5_d65f194a 2025-12-04T09:33:39.7818904Z * [new branch] bisect_perf_hf_T5_da94ab0b -> origin/bisect_perf_hf_T5_da94ab0b 2025-12-04T09:33:39.7820328Z * [new branch] bisect_perf_hf_T5_da94ab0b_new -> origin/bisect_perf_hf_T5_da94ab0b_new 2025-12-04T09:33:39.7821422Z * [new branch] bisect_perf_hf_T5_db4e8a1d8a8 -> origin/bisect_perf_hf_T5_db4e8a1d8a8 2025-12-04T09:33:39.7822704Z * [new branch] bisect_perf_hf_T5_e0d97e936a2 -> origin/bisect_perf_hf_T5_e0d97e936a2 2025-12-04T09:33:39.7823933Z * [new branch] bisect_perf_hf_T5_f23621ec563 -> origin/bisect_perf_hf_T5_f23621ec563 2025-12-04T09:33:39.7825849Z * [new branch] brister/fx_device_type -> origin/brister/fx_device_type 2025-12-04T09:33:39.7827147Z * [new branch] brister/test_inductor_all_fx -> origin/brister/test_inductor_all_fx 2025-12-04T09:33:39.7828392Z * [new branch] brister/tiled_reduction_no_numel_check -> origin/brister/tiled_reduction_no_numel_check 2025-12-04T09:33:39.7829581Z * [new branch] bwd-backup -> origin/bwd-backup 2025-12-04T09:33:39.7831057Z * [new branch] c57382a49 -> origin/c57382a49 2025-12-04T09:33:39.7832291Z * [new branch] ca_0431d47eaa -> origin/ca_0431d47eaa 2025-12-04T09:33:39.7833510Z * [new branch] ca_fix_0431d47eaa -> origin/ca_fix_0431d47eaa 2025-12-04T09:33:39.7835380Z * [new branch] camyllh/test_setup_hooks_push -> origin/camyllh/test_setup_hooks_push 2025-12-04T09:33:39.7836764Z * [new branch] cccclai-patch-1 -> origin/cccclai-patch-1 2025-12-04T09:33:39.7838234Z * [new branch] cherry-pick-159969-by-pytorch_bot_bot_ -> origin/cherry-pick-159969-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7839563Z * [new branch] cherry-pick-160586-by-pytorch_bot_bot_ -> origin/cherry-pick-160586-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7840935Z * [new branch] cherry-pick-162208-by-pytorch_bot_bot_ -> origin/cherry-pick-162208-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7842296Z * [new branch] cherry-pick-163169-by-pytorch_bot_bot_ -> origin/cherry-pick-163169-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7843612Z * [new branch] cherry-pick-165086-by-pytorch_bot_bot_ -> origin/cherry-pick-165086-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7845135Z * [new branch] cherry-pick-165514-by-pytorch_bot_bot_ -> origin/cherry-pick-165514-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7846337Z * [new branch] cherry-pick-165601-by-pytorch_bot_bot_ -> origin/cherry-pick-165601-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7847694Z * [new branch] cherry-pick-165667-by-pytorch_bot_bot_ -> origin/cherry-pick-165667-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7849193Z * [new branch] cherry-pick-165815-by-pytorch_bot_bot_ -> origin/cherry-pick-165815-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7850515Z * [new branch] cherry-pick-165922-by-pytorch_bot_bot_ -> origin/cherry-pick-165922-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7851902Z * [new branch] cherry-pick-166148-by-pytorch_bot_bot_ -> origin/cherry-pick-166148-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7853110Z * [new branch] cherry-pick-166181-by-pytorch_bot_bot_ -> origin/cherry-pick-166181-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7854479Z * [new branch] cherry-pick-166404-by-pytorch_bot_bot_ -> origin/cherry-pick-166404-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7855803Z * [new branch] cherry-pick-166427-by-pytorch_bot_bot_ -> origin/cherry-pick-166427-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7857263Z * [new branch] cherry-pick-166480-by-pytorch_bot_bot_ -> origin/cherry-pick-166480-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7858577Z * [new branch] cherry-pick-166570-by-pytorch_bot_bot_ -> origin/cherry-pick-166570-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7859985Z * [new branch] cherry-pick-166993-by-pytorch_bot_bot_ -> origin/cherry-pick-166993-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7861346Z * [new branch] cherry-pick-167111-by-pytorch_bot_bot_ -> origin/cherry-pick-167111-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7862703Z * [new branch] cherry-pick-167478-by-pytorch_bot_bot_ -> origin/cherry-pick-167478-by-pytorch_bot_bot_ 2025-12-04T09:33:39.7863801Z * [new branch] cherry_pick_166036_166040 -> origin/cherry_pick_166036_166040 2025-12-04T09:33:39.7865154Z * [new branch] cherry_pick_166457 -> origin/cherry_pick_166457 2025-12-04T09:33:39.7866518Z * [new branch] cherrypick_166338 -> origin/cherrypick_166338 2025-12-04T09:33:39.7867872Z * [new branch] cherrypick_166458 -> origin/cherrypick_166458 2025-12-04T09:33:39.7869096Z * [new branch] cherrypick_166586 -> origin/cherrypick_166586 2025-12-04T09:33:39.7870406Z * [new branch] cherrypick_166956 -> origin/cherrypick_166956 2025-12-04T09:33:39.7871703Z * [new branch] ci_attn -> origin/ci_attn 2025-12-04T09:33:39.7872966Z * [new branch] codex-testing -> origin/codex-testing 2025-12-04T09:33:39.7875155Z * [new branch] codex/add-check_memory_overlap-helper-functions -> origin/codex/add-check_memory_overlap-helper-functions 2025-12-04T09:33:39.7876461Z * [new branch] codex/fix-issue-121219-in-pytorch -> origin/codex/fix-issue-121219-in-pytorch 2025-12-04T09:33:39.7878105Z * [new branch] codex/investigate-segfaults-in-get_tensor_storage_id -> origin/codex/investigate-segfaults-in-get_tensor_storage_id 2025-12-04T09:33:39.7879658Z * [new branch] codex/refactor-lintrunner-config-to-use-uv-run -> origin/codex/refactor-lintrunner-config-to-use-uv-run 2025-12-04T09:33:39.7880878Z * [new branch] compatiblpy39util -> origin/compatiblpy39util 2025-12-04T09:33:39.7881783Z * [new branch] cond_hop_device -> origin/cond_hop_device 2025-12-04T09:33:39.7883152Z * [new branch] context_test -> origin/context_test 2025-12-04T09:33:39.7885099Z * [new branch] copilot/code-style-cleanup-python-pip -> origin/copilot/code-style-cleanup-python-pip 2025-12-04T09:33:39.7886644Z * [new branch] cpio/fix_new_ami_tests -> origin/cpio/fix_new_ami_tests 2025-12-04T09:33:39.7888202Z * [new branch] cpp-docs-dependency-upgrade -> origin/cpp-docs-dependency-upgrade 2025-12-04T09:33:39.7890114Z * [new branch] crpa/typo-in-inductor_comm_lowering -> origin/crpa/typo-in-inductor_comm_lowering 2025-12-04T09:33:39.7891654Z * [new branch] csl/always_produce_xml -> origin/csl/always_produce_xml 2025-12-04T09:33:39.7892832Z * [new branch] csl/build_test_more_procs -> origin/csl/build_test_more_procs 2025-12-04T09:33:39.7894141Z * [new branch] csl/build_test_more_procs2 -> origin/csl/build_test_more_procs2 2025-12-04T09:33:39.7895400Z * [new branch] csl/clean_up -> origin/csl/clean_up 2025-12-04T09:33:39.7897060Z * [new branch] csl/fix_retry_segfault_exit -> origin/csl/fix_retry_segfault_exit 2025-12-04T09:33:39.7897874Z * [new branch] csl/katex -> origin/csl/katex 2025-12-04T09:33:39.7899611Z * [new branch] csl/larger_runner -> origin/csl/larger_runner 2025-12-04T09:33:39.7901259Z * [new branch] csl/lint_testing -> origin/csl/lint_testing 2025-12-04T09:33:39.7902783Z * [new branch] csl/lint_thing -> origin/csl/lint_thing 2025-12-04T09:33:39.7904201Z * [new branch] csl/lintrunner_stuff -> origin/csl/lintrunner_stuff 2025-12-04T09:33:39.7905477Z * [new branch] csl/manually_gen_json -> origin/csl/manually_gen_json 2025-12-04T09:33:39.7906760Z * [new branch] csl/mps_sharding -> origin/csl/mps_sharding 2025-12-04T09:33:39.7908067Z * [new branch] csl/multistage_docker -> origin/csl/multistage_docker 2025-12-04T09:33:39.7909367Z * [new branch] csl/print_timing -> origin/csl/print_timing 2025-12-04T09:33:39.7910647Z * [new branch] csl/remove_experiment -> origin/csl/remove_experiment 2025-12-04T09:33:39.7911999Z * [new branch] csl/remove_maybe_unused_var -> origin/csl/remove_maybe_unused_var 2025-12-04T09:33:39.7913457Z * [new branch] csl/remove_repo_specific_autolabel -> origin/csl/remove_repo_specific_autolabel 2025-12-04T09:33:39.7914712Z * [new branch] csl/remove_run_parallel -> origin/csl/remove_run_parallel 2025-12-04T09:33:39.7915862Z * [new branch] csl/remove_unused_vars -> origin/csl/remove_unused_vars 2025-12-04T09:33:39.7917141Z * [new branch] csl/revert_open -> origin/csl/revert_open 2025-12-04T09:33:39.7918580Z * [new branch] csl/skip_build -> origin/csl/skip_build 2025-12-04T09:33:39.7919727Z * [new branch] csl/smaller_avx_amx_runenrs -> origin/csl/smaller_avx_amx_runenrs 2025-12-04T09:33:39.7920889Z * [new branch] csl/td_job_level -> origin/csl/td_job_level 2025-12-04T09:33:39.7922238Z * [new branch] csl/test_cuda_build_large_runner -> origin/csl/test_cuda_build_large_runner 2025-12-04T09:33:39.7923725Z * [new branch] csl/test_owners_autograd_dispatch_nn -> origin/csl/test_owners_autograd_dispatch_nn 2025-12-04T09:33:39.7924903Z * [new branch] csl/test_owners_higher_confidence -> origin/csl/test_owners_higher_confidence 2025-12-04T09:33:39.7926127Z * [new branch] csl/upload_json_running -> origin/csl/upload_json_running 2025-12-04T09:33:39.7927408Z * [new branch] csl/win_sccache -> origin/csl/win_sccache 2025-12-04T09:33:39.7928606Z * [new branch] csl/xml_stuff -> origin/csl/xml_stuff 2025-12-04T09:33:39.7930051Z * [new branch] cublasrelax2 -> origin/cublasrelax2 2025-12-04T09:33:39.7931254Z * [new branch] cuda_mempool -> origin/cuda_mempool 2025-12-04T09:33:39.7932697Z * [new branch] custom_lowering_dict -> origin/custom_lowering_dict 2025-12-04T09:33:39.7934406Z * [new branch] d4l3k/debug_plane_frtrace -> origin/d4l3k/debug_plane_frtrace 2025-12-04T09:33:39.7936109Z * [new branch] daxia6/2.8o3 -> origin/daxia6/2.8o3 2025-12-04T09:33:39.7937272Z * [new branch] debug-guard -> origin/debug-guard 2025-12-04T09:33:39.7938883Z * [new branch] delete-quant-docs -> origin/delete-quant-docs 2025-12-04T09:33:39.7943105Z * [new branch] dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.0 -> origin/dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.0 2025-12-04T09:33:39.7944924Z * [new branch] dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.1 -> origin/dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.1 2025-12-04T09:33:39.7946428Z * [new branch] desertfire/test_cpp_wrapper -> origin/desertfire/test_cpp_wrapper 2025-12-04T09:33:39.7947519Z * [new branch] desertfire/triton-cpu-for-aarch64 -> origin/desertfire/triton-cpu-for-aarch64 2025-12-04T09:33:39.7949597Z * [new branch] dev/dhruva/flex_attn_opt -> origin/dev/dhruva/flex_attn_opt 2025-12-04T09:33:39.7951692Z * [new branch] dev/joona/MPSNDArrayAdd -> origin/dev/joona/MPSNDArrayAdd 2025-12-04T09:33:39.7953298Z * [new branch] dev/joona/Unranked -> origin/dev/joona/Unranked 2025-12-04T09:33:39.7954963Z * [new branch] dev/joona/cat -> origin/dev/joona/cat 2025-12-04T09:33:39.7956119Z * [new branch] dev/joona/embeddingbag -> origin/dev/joona/embeddingbag 2025-12-04T09:33:39.7957576Z * [new branch] dev/joona/fix_sdpa_memtest -> origin/dev/joona/fix_sdpa_memtest 2025-12-04T09:33:39.7959101Z * [new branch] dev/joona/getTensorsString -> origin/dev/joona/getTensorsString 2025-12-04T09:33:39.7960676Z * [new branch] dev/joona/mps_linear_macos14 -> origin/dev/joona/mps_linear_macos14 2025-12-04T09:33:39.7962520Z * [new branch] dev/joona/scalar_clamp -> origin/dev/joona/scalar_clamp 2025-12-04T09:33:39.7964379Z * [new branch] dev/joona/sdpa -> origin/dev/joona/sdpa 2025-12-04T09:33:39.7966414Z * [new branch] dev/joona/sdpa_api -> origin/dev/joona/sdpa_api 2025-12-04T09:33:39.7967910Z * [new branch] dev/joona/type_inf -> origin/dev/joona/type_inf 2025-12-04T09:33:39.7969583Z * [new branch] dev/joona/ulpAssertClose -> origin/dev/joona/ulpAssertClose 2025-12-04T09:33:39.7970745Z * [new branch] dev/joona/upsize3d -> origin/dev/joona/upsize3d 2025-12-04T09:33:39.7971991Z * [new branch] disp_counter -> origin/disp_counter 2025-12-04T09:33:39.7973496Z * [new branch] divyanshk-patch-1 -> origin/divyanshk-patch-1 2025-12-04T09:33:39.7974563Z * [new branch] docs -> origin/docs 2025-12-04T09:33:39.7976049Z * [new branch] documentation -> origin/documentation 2025-12-04T09:33:39.7977233Z * [new branch] eager_model_benchmarks -> origin/eager_model_benchmarks 2025-12-04T09:33:39.7979318Z * [new branch] embg/test_inductor_ci_control -> origin/embg/test_inductor_ci_control 2025-12-04T09:33:39.7980418Z * [new branch] embg/triton_l2_prefetch_128B -> origin/embg/triton_l2_prefetch_128B 2025-12-04T09:33:39.7981569Z * [new branch] embg/triton_l2_prefetch_256B -> origin/embg/triton_l2_prefetch_256B 2025-12-04T09:33:39.7982886Z * [new branch] eqy-patch-1 -> origin/eqy-patch-1 2025-12-04T09:33:39.7984415Z * [new branch] eqy-patch-2 -> origin/eqy-patch-2 2025-12-04T09:33:39.7985774Z * [new branch] eqy-patch-3 -> origin/eqy-patch-3 2025-12-04T09:33:39.7987017Z * [new branch] eqy-patch-4 -> origin/eqy-patch-4 2025-12-04T09:33:39.7988664Z * [new branch] eqy-patch-5 -> origin/eqy-patch-5 2025-12-04T09:33:39.7989789Z * [new branch] eqy-patch-6 -> origin/eqy-patch-6 2025-12-04T09:33:39.7991708Z * [new branch] exclamaforte/amd-ma -> origin/exclamaforte/amd-ma 2025-12-04T09:33:39.7993064Z * [new branch] exclamaforte/combo-kernels-perf-run -> origin/exclamaforte/combo-kernels-perf-run 2025-12-04T09:33:39.7994268Z * [new branch] exclamaforte/do_bench_refactor -> origin/exclamaforte/do_bench_refactor 2025-12-04T09:33:39.7995542Z * [new branch] exclamaforte/enable-mem-dep-fusion -> origin/exclamaforte/enable-mem-dep-fusion 2025-12-04T09:33:39.7996903Z * [new branch] exclamaforte/fix-exhaustive-autotuning -> origin/exclamaforte/fix-exhaustive-autotuning 2025-12-04T09:33:39.7998412Z * [new branch] exclamaforte/fix-trace-parsing-fx-svg -> origin/exclamaforte/fix-trace-parsing-fx-svg 2025-12-04T09:33:39.8000043Z * [new branch] exclamaforte/force-pointwise-cat-perf-run -> origin/exclamaforte/force-pointwise-cat-perf-run 2025-12-04T09:33:39.8001171Z * [new branch] exclamaforte/fusion-data -> origin/exclamaforte/fusion-data 2025-12-04T09:33:39.8002848Z * [new branch] exclamaforte/gemm-benchmark-run -> origin/exclamaforte/gemm-benchmark-run 2025-12-04T09:33:39.8003901Z * [new branch] exclamaforte/gemm-export-model -> origin/exclamaforte/gemm-export-model 2025-12-04T09:33:39.8005154Z * [new branch] exclamaforte/gemm-model -> origin/exclamaforte/gemm-model 2025-12-04T09:33:39.8006649Z * [new branch] exclamaforte/gemm-model-all-data-collection -> origin/exclamaforte/gemm-model-all-data-collection 2025-12-04T09:33:39.8007708Z * [new branch] exclamaforte/gemm-to-amd -> origin/exclamaforte/gemm-to-amd 2025-12-04T09:33:39.8009016Z * [new branch] exclamaforte/just-gemm-model -> origin/exclamaforte/just-gemm-model 2025-12-04T09:33:39.8010484Z * [new branch] exclamaforte/just-gemm-model-no-refactor -> origin/exclamaforte/just-gemm-model-no-refactor 2025-12-04T09:33:39.8011712Z * [new branch] exclamaforte/profile-diff-algo -> origin/exclamaforte/profile-diff-algo 2025-12-04T09:33:39.8013015Z * [new branch] exclamaforte/profiler-visualization -> origin/exclamaforte/profiler-visualization 2025-12-04T09:33:39.8014326Z * [new branch] exclamaforte/test_cpp_wrapper_mode -> origin/exclamaforte/test_cpp_wrapper_mode 2025-12-04T09:33:39.8015700Z * [new branch] exclamaforte/update-autotune-configs -> origin/exclamaforte/update-autotune-configs 2025-12-04T09:33:39.8017034Z * [new branch] exclamaforte/update-autotune-configs-2 -> origin/exclamaforte/update-autotune-configs-2 2025-12-04T09:33:39.8018228Z * [new branch] exec -> origin/exec 2025-12-04T09:33:39.8019921Z * [new branch] experimental-mosaic -> origin/experimental-mosaic 2025-12-04T09:33:39.8021197Z * [new branch] export-D61047529 -> origin/export-D61047529 2025-12-04T09:33:39.8022599Z * [new branch] export-D71412006 -> origin/export-D71412006 2025-12-04T09:33:39.8024069Z * [new branch] export-D73042989 -> origin/export-D73042989 2025-12-04T09:33:39.8025258Z * [new branch] export-D78957093 -> origin/export-D78957093 2025-12-04T09:33:39.8026535Z * [new branch] export-D78996107 -> origin/export-D78996107 2025-12-04T09:33:39.8027816Z * [new branch] export-D80823877 -> origin/export-D80823877 2025-12-04T09:33:39.8029326Z * [new branch] export-D80958642 -> origin/export-D80958642 2025-12-04T09:33:39.8030496Z * [new branch] export-D81054193 -> origin/export-D81054193 2025-12-04T09:33:39.8031726Z * [new branch] export-D81204584 -> origin/export-D81204584 2025-12-04T09:33:39.8032991Z * [new branch] export-D81429090 -> origin/export-D81429090 2025-12-04T09:33:39.8034539Z * [new branch] export-D82250826 -> origin/export-D82250826 2025-12-04T09:33:39.8035733Z * [new branch] export-D82253817 -> origin/export-D82253817 2025-12-04T09:33:39.8037085Z * [new branch] export-D83541846 -> origin/export-D83541846 2025-12-04T09:33:39.8038293Z * [new branch] export-D83627170 -> origin/export-D83627170 2025-12-04T09:33:39.8039740Z * [new branch] export-D83766701 -> origin/export-D83766701 2025-12-04T09:33:39.8040879Z * [new branch] export-D83768878 -> origin/export-D83768878 2025-12-04T09:33:39.8042528Z * [new branch] export-D83769447 -> origin/export-D83769447 2025-12-04T09:33:39.8043746Z * [new branch] export-D84089824 -> origin/export-D84089824 2025-12-04T09:33:39.8044950Z * [new branch] export-D84213020 -> origin/export-D84213020 2025-12-04T09:33:39.8046821Z * [new branch] export-D84373821 -> origin/export-D84373821 2025-12-04T09:33:39.8048275Z * [new branch] export-D84612194 -> origin/export-D84612194 2025-12-04T09:33:39.8049558Z * [new branch] export-D84890985 -> origin/export-D84890985 2025-12-04T09:33:39.8050797Z * [new branch] export-D85122326 -> origin/export-D85122326 2025-12-04T09:33:39.8052154Z * [new branch] export-D86256198 -> origin/export-D86256198 2025-12-04T09:33:39.8053423Z * [new branch] export-D86460608 -> origin/export-D86460608 2025-12-04T09:33:39.8054865Z * [new branch] export-D86474796 -> origin/export-D86474796 2025-12-04T09:33:39.8056510Z * [new branch] export-D86712396 -> origin/export-D86712396 2025-12-04T09:33:39.8057652Z * [new branch] export-D87022129 -> origin/export-D87022129 2025-12-04T09:33:39.8059276Z * [new branch] export-D87838959 -> origin/export-D87838959 2025-12-04T09:33:39.8060595Z * [new branch] export-D88319437 -> origin/export-D88319437 2025-12-04T09:33:39.8062155Z * [new branch] exported-model-train-idempotent -> origin/exported-model-train-idempotent 2025-12-04T09:33:39.8063397Z * [new branch] ezyang-titan-october -> origin/ezyang-titan-october 2025-12-04T09:33:39.8064705Z * [new branch] ezyang-titan-october2 -> origin/ezyang-titan-october2 2025-12-04T09:33:39.8065934Z * [new branch] ezyang-war -> origin/ezyang-war 2025-12-04T09:33:39.8067782Z * [new branch] ezyang/wip-aot-descriptors -> origin/ezyang/wip-aot-descriptors 2025-12-04T09:33:39.8068903Z * [new branch] fa_u8_brgemm -> origin/fa_u8_brgemm 2025-12-04T09:33:39.8070763Z * [new branch] fadeputr/sequence_fbgemm -> origin/fadeputr/sequence_fbgemm 2025-12-04T09:33:39.8072034Z * [new branch] fastmath_baseline -> origin/fastmath_baseline 2025-12-04T09:33:39.8073882Z * [new branch] fbcode/warm -> origin/fbcode/warm 2025-12-04T09:33:39.8075314Z * [new branch] fca -> origin/fca 2025-12-04T09:33:39.8076613Z * [new branch] fca2_ca5984c -> origin/fca2_ca5984c 2025-12-04T09:33:39.8077905Z * [new branch] fca5 -> origin/fca5 2025-12-04T09:33:39.8079708Z * [new branch] feature/justknobs-cpp -> origin/feature/justknobs-cpp 2025-12-04T09:33:39.8081020Z * [new branch] feature/numa-forkserver -> origin/feature/numa-forkserver 2025-12-04T09:33:39.8082824Z * [new branch] ffast_math_baseline -> origin/ffast_math_baseline 2025-12-04T09:33:39.8084044Z * [new branch] ffast_math_target -> origin/ffast_math_target 2025-12-04T09:33:39.8085790Z * [new branch] findhao/base_commit -> origin/findhao/base_commit 2025-12-04T09:33:39.8087142Z * [new branch] findhao/base_commit1 -> origin/findhao/base_commit1 2025-12-04T09:33:39.8088575Z * [new branch] findhao/multistream2 -> origin/findhao/multistream2 2025-12-04T09:33:39.8089806Z * [new branch] findhao/multistream5 -> origin/findhao/multistream5 2025-12-04T09:33:39.8090961Z * [new branch] findhao/multistream6 -> origin/findhao/multistream6 2025-12-04T09:33:39.8092209Z * [new branch] findhao/operatorbench3 -> origin/findhao/operatorbench3 2025-12-04T09:33:39.8093473Z * [new branch] findhao/operatorbench5 -> origin/findhao/operatorbench5 2025-12-04T09:33:39.8094608Z * [new branch] findhao/tritonparse -> origin/findhao/tritonparse 2025-12-04T09:33:39.8096055Z * [new branch] fix-ck-gemm-template-format -> origin/fix-ck-gemm-template-format 2025-12-04T09:33:39.8098507Z * [new branch] fix-config-ignore -> origin/fix-config-ignore 2025-12-04T09:33:39.8099848Z * [new branch] fix-dict-guard -> origin/fix-dict-guard 2025-12-04T09:33:39.8101095Z * [new branch] fix_addmm_issue -> origin/fix_addmm_issue 2025-12-04T09:33:39.8102440Z * [new branch] fix_amd_missing_cluster_dims -> origin/fix_amd_missing_cluster_dims 2025-12-04T09:33:39.8103688Z * [new branch] fix_bench_bwd_pass -> origin/fix_bench_bwd_pass 2025-12-04T09:33:39.8104984Z * [new branch] fix_mem_profiler_config -> origin/fix_mem_profiler_config 2025-12-04T09:33:39.8106242Z * [new branch] fix_nvrtc_discovery -> origin/fix_nvrtc_discovery 2025-12-04T09:33:39.8107432Z * [new branch] fix_op_runner -> origin/fix_op_runner 2025-12-04T09:33:39.8108756Z * [new branch] fix_ubn_159469 -> origin/fix_ubn_159469 2025-12-04T09:33:39.8110158Z * [new branch] fixes-triage -> origin/fixes-triage 2025-12-04T09:33:39.8111465Z * [new branch] fixflashinfer -> origin/fixflashinfer 2025-12-04T09:33:39.8112773Z * [new branch] flash_decoding_cpu -> origin/flash_decoding_cpu 2025-12-04T09:33:39.8114205Z * [new branch] flex-flash -> origin/flex-flash 2025-12-04T09:33:39.8115743Z * [new branch] flex_attention_functorch_grad -> origin/flex_attention_functorch_grad 2025-12-04T09:33:39.8116837Z * [new branch] flex_flash -> origin/flex_flash 2025-12-04T09:33:39.8141421Z * [new branch] fmassa/fix_memeff_sharding_rule -> origin/fmassa/fix_memeff_sharding_rule 2025-12-04T09:33:39.8142571Z * [new branch] fmassa/tests_comm_compute_scheduler -> origin/fmassa/tests_comm_compute_scheduler 2025-12-04T09:33:39.8143337Z * [new branch] forkserver_fix -> origin/forkserver_fix 2025-12-04T09:33:39.8144207Z * [new branch] fsdp2_trace_rules -> origin/fsdp2_trace_rules 2025-12-04T09:33:39.8144804Z * [new branch] fx_cpp -> origin/fx_cpp 2025-12-04T09:33:39.8145583Z * [new branch] fy/fix-win -> origin/fy/fix-win 2025-12-04T09:33:39.8146169Z * [new branch] galv-patch-1 -> origin/galv-patch-1 2025-12-04T09:33:39.8147183Z * [new branch] galv/cudagraphs-conditional-nodes-4 -> origin/galv/cudagraphs-conditional-nodes-4 2025-12-04T09:33:39.8148155Z * [new branch] georgehong/cmakelists-patch -> origin/georgehong/cmakelists-patch 2025-12-04T09:33:39.8149014Z * [new branch] gh/AlnisM/1/base -> origin/gh/AlnisM/1/base 2025-12-04T09:33:39.8149872Z * [new branch] gh/AlnisM/1/head -> origin/gh/AlnisM/1/head 2025-12-04T09:33:39.8150597Z * [new branch] gh/EikanWang/67/base -> origin/gh/EikanWang/67/base 2025-12-04T09:33:39.8151447Z * [new branch] gh/EikanWang/67/head -> origin/gh/EikanWang/67/head 2025-12-04T09:33:39.8152114Z * [new branch] gh/Gasoonjia/1/base -> origin/gh/Gasoonjia/1/base 2025-12-04T09:33:39.8153000Z * [new branch] gh/Gasoonjia/1/head -> origin/gh/Gasoonjia/1/head 2025-12-04T09:33:39.8153825Z * [new branch] gh/H-Huang/131/base -> origin/gh/H-Huang/131/base 2025-12-04T09:33:39.8154536Z * [new branch] gh/H-Huang/131/head -> origin/gh/H-Huang/131/head 2025-12-04T09:33:39.8155383Z * [new branch] gh/H-Huang/131/orig -> origin/gh/H-Huang/131/orig 2025-12-04T09:33:39.8156003Z * [new branch] gh/H-Huang/132/base -> origin/gh/H-Huang/132/base 2025-12-04T09:33:39.8156866Z * [new branch] gh/H-Huang/132/head -> origin/gh/H-Huang/132/head 2025-12-04T09:33:39.8157502Z * [new branch] gh/H-Huang/132/orig -> origin/gh/H-Huang/132/orig 2025-12-04T09:33:39.8158505Z * [new branch] gh/H-Huang/180/base -> origin/gh/H-Huang/180/base 2025-12-04T09:33:39.8159321Z * [new branch] gh/H-Huang/180/head -> origin/gh/H-Huang/180/head 2025-12-04T09:33:39.8159999Z * [new branch] gh/H-Huang/180/orig -> origin/gh/H-Huang/180/orig 2025-12-04T09:33:39.8160625Z * [new branch] gh/H-Huang/182/base -> origin/gh/H-Huang/182/base 2025-12-04T09:33:39.8161473Z * [new branch] gh/H-Huang/182/head -> origin/gh/H-Huang/182/head 2025-12-04T09:33:39.8162106Z * [new branch] gh/H-Huang/182/orig -> origin/gh/H-Huang/182/orig 2025-12-04T09:33:39.8162951Z * [new branch] gh/H-Huang/226/base -> origin/gh/H-Huang/226/base 2025-12-04T09:33:39.8163633Z * [new branch] gh/H-Huang/226/head -> origin/gh/H-Huang/226/head 2025-12-04T09:33:39.8164456Z * [new branch] gh/H-Huang/226/orig -> origin/gh/H-Huang/226/orig 2025-12-04T09:33:39.8165971Z * [new branch] gh/H-Huang/228/base -> origin/gh/H-Huang/228/base 2025-12-04T09:33:39.8167189Z * [new branch] gh/H-Huang/228/head -> origin/gh/H-Huang/228/head 2025-12-04T09:33:39.8168470Z * [new branch] gh/H-Huang/228/orig -> origin/gh/H-Huang/228/orig 2025-12-04T09:33:39.8170654Z * [new branch] gh/IvanKobzarev/150/base -> origin/gh/IvanKobzarev/150/base 2025-12-04T09:33:39.8171868Z * [new branch] gh/IvanKobzarev/150/head -> origin/gh/IvanKobzarev/150/head 2025-12-04T09:33:39.8173054Z * [new branch] gh/IvanKobzarev/150/orig -> origin/gh/IvanKobzarev/150/orig 2025-12-04T09:33:39.8174968Z * [new branch] gh/IvanKobzarev/157/base -> origin/gh/IvanKobzarev/157/base 2025-12-04T09:33:39.8176308Z * [new branch] gh/IvanKobzarev/157/head -> origin/gh/IvanKobzarev/157/head 2025-12-04T09:33:39.8177609Z * [new branch] gh/IvanKobzarev/157/orig -> origin/gh/IvanKobzarev/157/orig 2025-12-04T09:33:39.8179612Z * [new branch] gh/IvanKobzarev/159/base -> origin/gh/IvanKobzarev/159/base 2025-12-04T09:33:39.8180890Z * [new branch] gh/IvanKobzarev/159/head -> origin/gh/IvanKobzarev/159/head 2025-12-04T09:33:39.8182244Z * [new branch] gh/IvanKobzarev/159/orig -> origin/gh/IvanKobzarev/159/orig 2025-12-04T09:33:39.8184004Z * [new branch] gh/IvanKobzarev/162/base -> origin/gh/IvanKobzarev/162/base 2025-12-04T09:33:39.8185393Z * [new branch] gh/IvanKobzarev/162/head -> origin/gh/IvanKobzarev/162/head 2025-12-04T09:33:39.8186628Z * [new branch] gh/IvanKobzarev/162/orig -> origin/gh/IvanKobzarev/162/orig 2025-12-04T09:33:39.8188664Z * [new branch] gh/IvanKobzarev/163/base -> origin/gh/IvanKobzarev/163/base 2025-12-04T09:33:39.8189792Z * [new branch] gh/IvanKobzarev/163/head -> origin/gh/IvanKobzarev/163/head 2025-12-04T09:33:39.8191013Z * [new branch] gh/IvanKobzarev/163/orig -> origin/gh/IvanKobzarev/163/orig 2025-12-04T09:33:39.8192875Z * [new branch] gh/IvanKobzarev/166/base -> origin/gh/IvanKobzarev/166/base 2025-12-04T09:33:39.8194118Z * [new branch] gh/IvanKobzarev/166/head -> origin/gh/IvanKobzarev/166/head 2025-12-04T09:33:39.8195471Z * [new branch] gh/IvanKobzarev/166/orig -> origin/gh/IvanKobzarev/166/orig 2025-12-04T09:33:39.8197290Z * [new branch] gh/IvanKobzarev/167/base -> origin/gh/IvanKobzarev/167/base 2025-12-04T09:33:39.8198452Z * [new branch] gh/IvanKobzarev/167/head -> origin/gh/IvanKobzarev/167/head 2025-12-04T09:33:39.8199750Z * [new branch] gh/IvanKobzarev/167/orig -> origin/gh/IvanKobzarev/167/orig 2025-12-04T09:33:39.8201450Z * [new branch] gh/IvanKobzarev/168/base -> origin/gh/IvanKobzarev/168/base 2025-12-04T09:33:39.8202827Z * [new branch] gh/IvanKobzarev/168/head -> origin/gh/IvanKobzarev/168/head 2025-12-04T09:33:39.8203894Z * [new branch] gh/IvanKobzarev/168/orig -> origin/gh/IvanKobzarev/168/orig 2025-12-04T09:33:39.8205729Z * [new branch] gh/IvanKobzarev/169/base -> origin/gh/IvanKobzarev/169/base 2025-12-04T09:33:39.8206931Z * [new branch] gh/IvanKobzarev/169/head -> origin/gh/IvanKobzarev/169/head 2025-12-04T09:33:39.8208220Z * [new branch] gh/IvanKobzarev/169/orig -> origin/gh/IvanKobzarev/169/orig 2025-12-04T09:33:39.8209926Z * [new branch] gh/IvanKobzarev/170/base -> origin/gh/IvanKobzarev/170/base 2025-12-04T09:33:39.8211103Z * [new branch] gh/IvanKobzarev/170/head -> origin/gh/IvanKobzarev/170/head 2025-12-04T09:33:39.8212419Z * [new branch] gh/IvanKobzarev/170/orig -> origin/gh/IvanKobzarev/170/orig 2025-12-04T09:33:39.8214400Z * [new branch] gh/IvanKobzarev/171/base -> origin/gh/IvanKobzarev/171/base 2025-12-04T09:33:39.8215675Z * [new branch] gh/IvanKobzarev/171/head -> origin/gh/IvanKobzarev/171/head 2025-12-04T09:33:39.8216909Z * [new branch] gh/IvanKobzarev/171/orig -> origin/gh/IvanKobzarev/171/orig 2025-12-04T09:33:39.8218751Z * [new branch] gh/IvanKobzarev/172/base -> origin/gh/IvanKobzarev/172/base 2025-12-04T09:33:39.8220311Z * [new branch] gh/IvanKobzarev/172/head -> origin/gh/IvanKobzarev/172/head 2025-12-04T09:33:39.8221449Z * [new branch] gh/IvanKobzarev/172/orig -> origin/gh/IvanKobzarev/172/orig 2025-12-04T09:33:39.8223277Z * [new branch] gh/IvanKobzarev/173/base -> origin/gh/IvanKobzarev/173/base 2025-12-04T09:33:39.8224418Z * [new branch] gh/IvanKobzarev/173/head -> origin/gh/IvanKobzarev/173/head 2025-12-04T09:33:39.8225707Z * [new branch] gh/IvanKobzarev/173/orig -> origin/gh/IvanKobzarev/173/orig 2025-12-04T09:33:39.8227506Z * [new branch] gh/IvanKobzarev/174/base -> origin/gh/IvanKobzarev/174/base 2025-12-04T09:33:39.8228836Z * [new branch] gh/IvanKobzarev/174/head -> origin/gh/IvanKobzarev/174/head 2025-12-04T09:33:39.8230141Z * [new branch] gh/IvanKobzarev/174/orig -> origin/gh/IvanKobzarev/174/orig 2025-12-04T09:33:39.8231942Z * [new branch] gh/IvanKobzarev/175/base -> origin/gh/IvanKobzarev/175/base 2025-12-04T09:33:39.8233341Z * [new branch] gh/IvanKobzarev/175/head -> origin/gh/IvanKobzarev/175/head 2025-12-04T09:33:39.8234663Z * [new branch] gh/IvanKobzarev/175/orig -> origin/gh/IvanKobzarev/175/orig 2025-12-04T09:33:39.8236552Z * [new branch] gh/IvanKobzarev/176/base -> origin/gh/IvanKobzarev/176/base 2025-12-04T09:33:39.8237916Z * [new branch] gh/IvanKobzarev/176/head -> origin/gh/IvanKobzarev/176/head 2025-12-04T09:33:39.8239113Z * [new branch] gh/IvanKobzarev/176/orig -> origin/gh/IvanKobzarev/176/orig 2025-12-04T09:33:39.8241174Z * [new branch] gh/IvanKobzarev/177/base -> origin/gh/IvanKobzarev/177/base 2025-12-04T09:33:39.8242462Z * [new branch] gh/IvanKobzarev/177/head -> origin/gh/IvanKobzarev/177/head 2025-12-04T09:33:39.8243737Z * [new branch] gh/IvanKobzarev/177/orig -> origin/gh/IvanKobzarev/177/orig 2025-12-04T09:33:39.8245635Z * [new branch] gh/IvanKobzarev/178/base -> origin/gh/IvanKobzarev/178/base 2025-12-04T09:33:39.8246977Z * [new branch] gh/IvanKobzarev/178/head -> origin/gh/IvanKobzarev/178/head 2025-12-04T09:33:39.8248284Z * [new branch] gh/IvanKobzarev/178/orig -> origin/gh/IvanKobzarev/178/orig 2025-12-04T09:33:39.8250073Z * [new branch] gh/IvanKobzarev/179/base -> origin/gh/IvanKobzarev/179/base 2025-12-04T09:33:39.8251276Z * [new branch] gh/IvanKobzarev/179/head -> origin/gh/IvanKobzarev/179/head 2025-12-04T09:33:39.8252677Z * [new branch] gh/IvanKobzarev/179/orig -> origin/gh/IvanKobzarev/179/orig 2025-12-04T09:33:39.8254441Z * [new branch] gh/IvanKobzarev/180/base -> origin/gh/IvanKobzarev/180/base 2025-12-04T09:33:39.8255679Z * [new branch] gh/IvanKobzarev/180/head -> origin/gh/IvanKobzarev/180/head 2025-12-04T09:33:39.8256976Z * [new branch] gh/IvanKobzarev/180/orig -> origin/gh/IvanKobzarev/180/orig 2025-12-04T09:33:39.8259191Z * [new branch] gh/IvanKobzarev/181/base -> origin/gh/IvanKobzarev/181/base 2025-12-04T09:33:39.8260432Z * [new branch] gh/IvanKobzarev/181/head -> origin/gh/IvanKobzarev/181/head 2025-12-04T09:33:39.8261688Z * [new branch] gh/IvanKobzarev/181/orig -> origin/gh/IvanKobzarev/181/orig 2025-12-04T09:33:39.8263716Z * [new branch] gh/IvanKobzarev/182/base -> origin/gh/IvanKobzarev/182/base 2025-12-04T09:33:39.8265039Z * [new branch] gh/IvanKobzarev/182/head -> origin/gh/IvanKobzarev/182/head 2025-12-04T09:33:39.8266260Z * [new branch] gh/IvanKobzarev/182/orig -> origin/gh/IvanKobzarev/182/orig 2025-12-04T09:33:39.8268189Z * [new branch] gh/IvanKobzarev/183/base -> origin/gh/IvanKobzarev/183/base 2025-12-04T09:33:39.8269583Z * [new branch] gh/IvanKobzarev/183/head -> origin/gh/IvanKobzarev/183/head 2025-12-04T09:33:39.8270805Z * [new branch] gh/IvanKobzarev/183/orig -> origin/gh/IvanKobzarev/183/orig 2025-12-04T09:33:39.8272652Z * [new branch] gh/IvanKobzarev/184/base -> origin/gh/IvanKobzarev/184/base 2025-12-04T09:33:39.8273919Z * [new branch] gh/IvanKobzarev/184/head -> origin/gh/IvanKobzarev/184/head 2025-12-04T09:33:39.8275230Z * [new branch] gh/IvanKobzarev/184/orig -> origin/gh/IvanKobzarev/184/orig 2025-12-04T09:33:39.8277281Z * [new branch] gh/NikhilAPatel/1/base -> origin/gh/NikhilAPatel/1/base 2025-12-04T09:33:39.8278762Z * [new branch] gh/NikhilAPatel/1/head -> origin/gh/NikhilAPatel/1/head 2025-12-04T09:33:39.8280370Z * [new branch] gh/NikhilAPatel/2/base -> origin/gh/NikhilAPatel/2/base 2025-12-04T09:33:39.8281563Z * [new branch] gh/NikhilAPatel/2/head -> origin/gh/NikhilAPatel/2/head 2025-12-04T09:33:39.8283482Z * [new branch] gh/NikhilAPatel/4/base -> origin/gh/NikhilAPatel/4/base 2025-12-04T09:33:39.8284852Z * [new branch] gh/NikhilAPatel/4/head -> origin/gh/NikhilAPatel/4/head 2025-12-04T09:33:39.8286592Z * [new branch] gh/NikhilAPatel/5/base -> origin/gh/NikhilAPatel/5/base 2025-12-04T09:33:39.8288028Z * [new branch] gh/NikhilAPatel/5/head -> origin/gh/NikhilAPatel/5/head 2025-12-04T09:33:39.8289375Z * [new branch] gh/NikhilAPatel/5/orig -> origin/gh/NikhilAPatel/5/orig 2025-12-04T09:33:39.8291382Z * [new branch] gh/PaliC/17/base -> origin/gh/PaliC/17/base 2025-12-04T09:33:39.8292645Z * [new branch] gh/PaliC/17/head -> origin/gh/PaliC/17/head 2025-12-04T09:33:39.8293913Z * [new branch] gh/PaliC/17/orig -> origin/gh/PaliC/17/orig 2025-12-04T09:33:39.8295634Z * [new branch] gh/PaliC/18/base -> origin/gh/PaliC/18/base 2025-12-04T09:33:39.8296858Z * [new branch] gh/PaliC/18/head -> origin/gh/PaliC/18/head 2025-12-04T09:33:39.8298281Z * [new branch] gh/PaliC/18/orig -> origin/gh/PaliC/18/orig 2025-12-04T09:33:39.8300043Z * [new branch] gh/PaliC/20/base -> origin/gh/PaliC/20/base 2025-12-04T09:33:39.8301283Z * [new branch] gh/PaliC/20/head -> origin/gh/PaliC/20/head 2025-12-04T09:33:39.8302579Z * [new branch] gh/PaliC/20/orig -> origin/gh/PaliC/20/orig 2025-12-04T09:33:39.8304355Z * [new branch] gh/PaliC/21/base -> origin/gh/PaliC/21/base 2025-12-04T09:33:39.8305758Z * [new branch] gh/PaliC/21/head -> origin/gh/PaliC/21/head 2025-12-04T09:33:39.8306888Z * [new branch] gh/PaliC/21/orig -> origin/gh/PaliC/21/orig 2025-12-04T09:33:39.8308574Z * [new branch] gh/PaliC/23/base -> origin/gh/PaliC/23/base 2025-12-04T09:33:39.8309740Z * [new branch] gh/PaliC/23/head -> origin/gh/PaliC/23/head 2025-12-04T09:33:39.8311039Z * [new branch] gh/PaliC/23/orig -> origin/gh/PaliC/23/orig 2025-12-04T09:33:39.8312735Z * [new branch] gh/PaliC/24/base -> origin/gh/PaliC/24/base 2025-12-04T09:33:39.8313979Z * [new branch] gh/PaliC/24/head -> origin/gh/PaliC/24/head 2025-12-04T09:33:39.8315229Z * [new branch] gh/PaliC/24/orig -> origin/gh/PaliC/24/orig 2025-12-04T09:33:39.8316916Z * [new branch] gh/PaliC/25/head -> origin/gh/PaliC/25/head 2025-12-04T09:33:39.8318206Z * [new branch] gh/PaliC/25/next -> origin/gh/PaliC/25/next 2025-12-04T09:33:39.8319424Z * [new branch] gh/PaliC/25/orig -> origin/gh/PaliC/25/orig 2025-12-04T09:33:39.8321135Z * [new branch] gh/PaliC/26/head -> origin/gh/PaliC/26/head 2025-12-04T09:33:39.8322219Z * [new branch] gh/PaliC/26/next -> origin/gh/PaliC/26/next 2025-12-04T09:33:39.8323581Z * [new branch] gh/PaliC/26/orig -> origin/gh/PaliC/26/orig 2025-12-04T09:33:39.8325249Z * [new branch] gh/PaliC/27/next -> origin/gh/PaliC/27/next 2025-12-04T09:33:39.8326907Z * [new branch] gh/PaliC/28/head -> origin/gh/PaliC/28/head 2025-12-04T09:33:39.8327984Z * [new branch] gh/PaliC/28/next -> origin/gh/PaliC/28/next 2025-12-04T09:33:39.8329317Z * [new branch] gh/PaliC/28/orig -> origin/gh/PaliC/28/orig 2025-12-04T09:33:39.8331037Z * [new branch] gh/PaliC/29/head -> origin/gh/PaliC/29/head 2025-12-04T09:33:39.8332064Z * [new branch] gh/PaliC/29/next -> origin/gh/PaliC/29/next 2025-12-04T09:33:39.8333375Z * [new branch] gh/PaliC/29/orig -> origin/gh/PaliC/29/orig 2025-12-04T09:33:39.8335137Z * [new branch] gh/PaliC/30/head -> origin/gh/PaliC/30/head 2025-12-04T09:33:39.8336175Z * [new branch] gh/PaliC/30/next -> origin/gh/PaliC/30/next 2025-12-04T09:33:39.8337474Z * [new branch] gh/PaliC/30/orig -> origin/gh/PaliC/30/orig 2025-12-04T09:33:39.8339288Z * [new branch] gh/PaliC/31/head -> origin/gh/PaliC/31/head 2025-12-04T09:33:39.8340398Z * [new branch] gh/PaliC/31/next -> origin/gh/PaliC/31/next 2025-12-04T09:33:39.8342009Z * [new branch] gh/PaliC/31/orig -> origin/gh/PaliC/31/orig 2025-12-04T09:33:39.8343984Z * [new branch] gh/PaulZhang12/25/base -> origin/gh/PaulZhang12/25/base 2025-12-04T09:33:39.8345427Z * [new branch] gh/PaulZhang12/25/head -> origin/gh/PaulZhang12/25/head 2025-12-04T09:33:39.8346717Z * [new branch] gh/PaulZhang12/25/orig -> origin/gh/PaulZhang12/25/orig 2025-12-04T09:33:39.8348521Z * [new branch] gh/PaulZhang12/28/base -> origin/gh/PaulZhang12/28/base 2025-12-04T09:33:39.8349819Z * [new branch] gh/PaulZhang12/28/head -> origin/gh/PaulZhang12/28/head 2025-12-04T09:33:39.8351132Z * [new branch] gh/PaulZhang12/28/orig -> origin/gh/PaulZhang12/28/orig 2025-12-04T09:33:39.8353276Z * [new branch] gh/PaulZhang12/31/base -> origin/gh/PaulZhang12/31/base 2025-12-04T09:33:39.8357057Z * [new branch] gh/PaulZhang12/31/head -> origin/gh/PaulZhang12/31/head 2025-12-04T09:33:39.8357911Z * [new branch] gh/PaulZhang12/31/orig -> origin/gh/PaulZhang12/31/orig 2025-12-04T09:33:39.8358599Z * [new branch] gh/PaulZhang12/37/base -> origin/gh/PaulZhang12/37/base 2025-12-04T09:33:39.8359274Z * [new branch] gh/PaulZhang12/37/head -> origin/gh/PaulZhang12/37/head 2025-12-04T09:33:39.8359971Z * [new branch] gh/PaulZhang12/37/orig -> origin/gh/PaulZhang12/37/orig 2025-12-04T09:33:39.8361579Z * [new branch] gh/PaulZhang12/40/base -> origin/gh/PaulZhang12/40/base 2025-12-04T09:33:39.8362653Z * [new branch] gh/PaulZhang12/40/head -> origin/gh/PaulZhang12/40/head 2025-12-04T09:33:39.8363926Z * [new branch] gh/PaulZhang12/40/orig -> origin/gh/PaulZhang12/40/orig 2025-12-04T09:33:39.8365753Z * [new branch] gh/PaulZhang12/42/base -> origin/gh/PaulZhang12/42/base 2025-12-04T09:33:39.8366901Z * [new branch] gh/PaulZhang12/42/head -> origin/gh/PaulZhang12/42/head 2025-12-04T09:33:39.8368717Z * [new branch] gh/PaulZhang12/43/base -> origin/gh/PaulZhang12/43/base 2025-12-04T09:33:39.8369869Z * [new branch] gh/PaulZhang12/43/head -> origin/gh/PaulZhang12/43/head 2025-12-04T09:33:39.8371196Z * [new branch] gh/PaulZhang12/43/orig -> origin/gh/PaulZhang12/43/orig 2025-12-04T09:33:39.8372871Z * [new branch] gh/PaulZhang12/44/base -> origin/gh/PaulZhang12/44/base 2025-12-04T09:33:39.8374027Z * [new branch] gh/PaulZhang12/44/head -> origin/gh/PaulZhang12/44/head 2025-12-04T09:33:39.8376003Z * [new branch] gh/PaulZhang12/45/base -> origin/gh/PaulZhang12/45/base 2025-12-04T09:33:39.8377072Z * [new branch] gh/PaulZhang12/45/head -> origin/gh/PaulZhang12/45/head 2025-12-04T09:33:39.8378355Z * [new branch] gh/PaulZhang12/45/orig -> origin/gh/PaulZhang12/45/orig 2025-12-04T09:33:39.8380296Z * [new branch] gh/PaulZhang12/46/base -> origin/gh/PaulZhang12/46/base 2025-12-04T09:33:39.8381476Z * [new branch] gh/PaulZhang12/46/head -> origin/gh/PaulZhang12/46/head 2025-12-04T09:33:39.8382813Z * [new branch] gh/PaulZhang12/46/orig -> origin/gh/PaulZhang12/46/orig 2025-12-04T09:33:39.8384727Z * [new branch] gh/PaulZhang12/47/base -> origin/gh/PaulZhang12/47/base 2025-12-04T09:33:39.8385950Z * [new branch] gh/PaulZhang12/47/head -> origin/gh/PaulZhang12/47/head 2025-12-04T09:33:39.8387264Z * [new branch] gh/PaulZhang12/47/orig -> origin/gh/PaulZhang12/47/orig 2025-12-04T09:33:39.8389151Z * [new branch] gh/PaulZhang12/48/base -> origin/gh/PaulZhang12/48/base 2025-12-04T09:33:39.8390277Z * [new branch] gh/PaulZhang12/48/head -> origin/gh/PaulZhang12/48/head 2025-12-04T09:33:39.8391603Z * [new branch] gh/PaulZhang12/48/orig -> origin/gh/PaulZhang12/48/orig 2025-12-04T09:33:39.8393722Z * [new branch] gh/SamGinzburg/11/base -> origin/gh/SamGinzburg/11/base 2025-12-04T09:33:39.8394902Z * [new branch] gh/SamGinzburg/11/head -> origin/gh/SamGinzburg/11/head 2025-12-04T09:33:39.8397200Z * [new branch] gh/SherlockNoMad/1/base -> origin/gh/SherlockNoMad/1/base 2025-12-04T09:33:39.8398428Z * [new branch] gh/SherlockNoMad/1/head -> origin/gh/SherlockNoMad/1/head 2025-12-04T09:33:39.8400283Z * [new branch] gh/SherlockNoMad/10/base -> origin/gh/SherlockNoMad/10/base 2025-12-04T09:33:39.8401456Z * [new branch] gh/SherlockNoMad/10/head -> origin/gh/SherlockNoMad/10/head 2025-12-04T09:33:39.8402803Z * [new branch] gh/SherlockNoMad/10/orig -> origin/gh/SherlockNoMad/10/orig 2025-12-04T09:33:39.8404469Z * [new branch] gh/SherlockNoMad/11/base -> origin/gh/SherlockNoMad/11/base 2025-12-04T09:33:39.8405674Z * [new branch] gh/SherlockNoMad/11/head -> origin/gh/SherlockNoMad/11/head 2025-12-04T09:33:39.8407155Z * [new branch] gh/SherlockNoMad/11/orig -> origin/gh/SherlockNoMad/11/orig 2025-12-04T09:33:39.8408511Z * [new branch] gh/SherlockNoMad/12/base -> origin/gh/SherlockNoMad/12/base 2025-12-04T09:33:39.8409804Z * [new branch] gh/SherlockNoMad/12/head -> origin/gh/SherlockNoMad/12/head 2025-12-04T09:33:39.8411045Z * [new branch] gh/SherlockNoMad/12/orig -> origin/gh/SherlockNoMad/12/orig 2025-12-04T09:33:39.8412931Z * [new branch] gh/SherlockNoMad/15/base -> origin/gh/SherlockNoMad/15/base 2025-12-04T09:33:39.8414104Z * [new branch] gh/SherlockNoMad/15/head -> origin/gh/SherlockNoMad/15/head 2025-12-04T09:33:39.8415333Z * [new branch] gh/SherlockNoMad/15/orig -> origin/gh/SherlockNoMad/15/orig 2025-12-04T09:33:39.8417139Z * [new branch] gh/SherlockNoMad/17/base -> origin/gh/SherlockNoMad/17/base 2025-12-04T09:33:39.8418368Z * [new branch] gh/SherlockNoMad/17/head -> origin/gh/SherlockNoMad/17/head 2025-12-04T09:33:39.8419733Z * [new branch] gh/SherlockNoMad/17/orig -> origin/gh/SherlockNoMad/17/orig 2025-12-04T09:33:39.8421670Z * [new branch] gh/SherlockNoMad/18/base -> origin/gh/SherlockNoMad/18/base 2025-12-04T09:33:39.8422872Z * [new branch] gh/SherlockNoMad/18/head -> origin/gh/SherlockNoMad/18/head 2025-12-04T09:33:39.8424218Z * [new branch] gh/SherlockNoMad/18/orig -> origin/gh/SherlockNoMad/18/orig 2025-12-04T09:33:39.8425887Z * [new branch] gh/SherlockNoMad/19/base -> origin/gh/SherlockNoMad/19/base 2025-12-04T09:33:39.8427142Z * [new branch] gh/SherlockNoMad/19/head -> origin/gh/SherlockNoMad/19/head 2025-12-04T09:33:39.8428428Z * [new branch] gh/SherlockNoMad/19/orig -> origin/gh/SherlockNoMad/19/orig 2025-12-04T09:33:39.8430089Z * [new branch] gh/SherlockNoMad/2/base -> origin/gh/SherlockNoMad/2/base 2025-12-04T09:33:39.8431181Z * [new branch] gh/SherlockNoMad/2/head -> origin/gh/SherlockNoMad/2/head 2025-12-04T09:33:39.8432847Z * [new branch] gh/SherlockNoMad/20/base -> origin/gh/SherlockNoMad/20/base 2025-12-04T09:33:39.8434132Z * [new branch] gh/SherlockNoMad/20/head -> origin/gh/SherlockNoMad/20/head 2025-12-04T09:33:39.8435320Z * [new branch] gh/SherlockNoMad/20/orig -> origin/gh/SherlockNoMad/20/orig 2025-12-04T09:33:39.8437473Z * [new branch] gh/SherlockNoMad/21/base -> origin/gh/SherlockNoMad/21/base 2025-12-04T09:33:39.8438689Z * [new branch] gh/SherlockNoMad/21/head -> origin/gh/SherlockNoMad/21/head 2025-12-04T09:33:39.8439883Z * [new branch] gh/SherlockNoMad/21/orig -> origin/gh/SherlockNoMad/21/orig 2025-12-04T09:33:39.8441584Z * [new branch] gh/SherlockNoMad/3/base -> origin/gh/SherlockNoMad/3/base 2025-12-04T09:33:39.8442714Z * [new branch] gh/SherlockNoMad/3/head -> origin/gh/SherlockNoMad/3/head 2025-12-04T09:33:39.8444362Z * [new branch] gh/SherlockNoMad/4/base -> origin/gh/SherlockNoMad/4/base 2025-12-04T09:33:39.8445474Z * [new branch] gh/SherlockNoMad/4/head -> origin/gh/SherlockNoMad/4/head 2025-12-04T09:33:39.8447146Z * [new branch] gh/SherlockNoMad/5/base -> origin/gh/SherlockNoMad/5/base 2025-12-04T09:33:39.8448237Z * [new branch] gh/SherlockNoMad/5/head -> origin/gh/SherlockNoMad/5/head 2025-12-04T09:33:39.8450875Z * [new branch] gh/Sidharth123-cpu/24/base -> origin/gh/Sidharth123-cpu/24/base 2025-12-04T09:33:39.8452401Z * [new branch] gh/Sidharth123-cpu/25/base -> origin/gh/Sidharth123-cpu/25/base 2025-12-04T09:33:39.8453911Z * [new branch] gh/Sidharth123-cpu/26/base -> origin/gh/Sidharth123-cpu/26/base 2025-12-04T09:33:39.8455802Z * [new branch] gh/Sidharth123-cpu/27/base -> origin/gh/Sidharth123-cpu/27/base 2025-12-04T09:33:39.8457968Z * [new branch] gh/StrongerXi/1/base -> origin/gh/StrongerXi/1/base 2025-12-04T09:33:39.8459271Z * [new branch] gh/StrongerXi/1/head -> origin/gh/StrongerXi/1/head 2025-12-04T09:33:39.8461030Z * [new branch] gh/StrongerXi/71/base -> origin/gh/StrongerXi/71/base 2025-12-04T09:33:39.8462171Z * [new branch] gh/StrongerXi/71/head -> origin/gh/StrongerXi/71/head 2025-12-04T09:33:39.8463877Z * [new branch] gh/StrongerXi/72/base -> origin/gh/StrongerXi/72/base 2025-12-04T09:33:39.8465071Z * [new branch] gh/StrongerXi/72/head -> origin/gh/StrongerXi/72/head 2025-12-04T09:33:39.8466776Z * [new branch] gh/StrongerXi/73/base -> origin/gh/StrongerXi/73/base 2025-12-04T09:33:39.8467923Z * [new branch] gh/StrongerXi/73/head -> origin/gh/StrongerXi/73/head 2025-12-04T09:33:39.8469249Z * [new branch] gh/StrongerXi/73/orig -> origin/gh/StrongerXi/73/orig 2025-12-04T09:33:39.8471580Z * [new branch] gh/XilunWu/160/base -> origin/gh/XilunWu/160/base 2025-12-04T09:33:39.8472651Z * [new branch] gh/XilunWu/160/head -> origin/gh/XilunWu/160/head 2025-12-04T09:33:39.8473968Z * [new branch] gh/XilunWu/160/orig -> origin/gh/XilunWu/160/orig 2025-12-04T09:33:39.8475806Z * [new branch] gh/XilunWu/163/base -> origin/gh/XilunWu/163/base 2025-12-04T09:33:39.8477044Z * [new branch] gh/XilunWu/163/head -> origin/gh/XilunWu/163/head 2025-12-04T09:33:39.8478265Z * [new branch] gh/XilunWu/163/orig -> origin/gh/XilunWu/163/orig 2025-12-04T09:33:39.8480230Z * [new branch] gh/XilunWu/168/base -> origin/gh/XilunWu/168/base 2025-12-04T09:33:39.8481343Z * [new branch] gh/XilunWu/168/head -> origin/gh/XilunWu/168/head 2025-12-04T09:33:39.8482599Z * [new branch] gh/XilunWu/168/orig -> origin/gh/XilunWu/168/orig 2025-12-04T09:33:39.8484432Z * [new branch] gh/XilunWu/169/base -> origin/gh/XilunWu/169/base 2025-12-04T09:33:39.8485598Z * [new branch] gh/XilunWu/169/head -> origin/gh/XilunWu/169/head 2025-12-04T09:33:39.8487020Z * [new branch] gh/XilunWu/169/orig -> origin/gh/XilunWu/169/orig 2025-12-04T09:33:39.8488771Z * [new branch] gh/XilunWu/170/base -> origin/gh/XilunWu/170/base 2025-12-04T09:33:39.8489946Z * [new branch] gh/XilunWu/170/head -> origin/gh/XilunWu/170/head 2025-12-04T09:33:39.8491186Z * [new branch] gh/XilunWu/170/orig -> origin/gh/XilunWu/170/orig 2025-12-04T09:33:39.8493161Z * [new branch] gh/XilunWu/171/base -> origin/gh/XilunWu/171/base 2025-12-04T09:33:39.8494326Z * [new branch] gh/XilunWu/171/head -> origin/gh/XilunWu/171/head 2025-12-04T09:33:39.8495613Z * [new branch] gh/XilunWu/171/orig -> origin/gh/XilunWu/171/orig 2025-12-04T09:33:39.8497370Z * [new branch] gh/XilunWu/173/base -> origin/gh/XilunWu/173/base 2025-12-04T09:33:39.8498604Z * [new branch] gh/XilunWu/173/head -> origin/gh/XilunWu/173/head 2025-12-04T09:33:39.8500035Z * [new branch] gh/XilunWu/173/orig -> origin/gh/XilunWu/173/orig 2025-12-04T09:33:39.8501818Z * [new branch] gh/XilunWu/175/base -> origin/gh/XilunWu/175/base 2025-12-04T09:33:39.8503048Z * [new branch] gh/XilunWu/175/head -> origin/gh/XilunWu/175/head 2025-12-04T09:33:39.8504362Z * [new branch] gh/XilunWu/175/orig -> origin/gh/XilunWu/175/orig 2025-12-04T09:33:39.8506204Z * [new branch] gh/XilunWu/176/base -> origin/gh/XilunWu/176/base 2025-12-04T09:33:39.8507382Z * [new branch] gh/XilunWu/176/head -> origin/gh/XilunWu/176/head 2025-12-04T09:33:39.8508836Z * [new branch] gh/XilunWu/176/orig -> origin/gh/XilunWu/176/orig 2025-12-04T09:33:39.8510892Z * [new branch] gh/XuehaiPan/14/base -> origin/gh/XuehaiPan/14/base 2025-12-04T09:33:39.8512060Z * [new branch] gh/XuehaiPan/14/head -> origin/gh/XuehaiPan/14/head 2025-12-04T09:33:39.8513334Z * [new branch] gh/XuehaiPan/14/orig -> origin/gh/XuehaiPan/14/orig 2025-12-04T09:33:39.8515266Z * [new branch] gh/XuehaiPan/179/base -> origin/gh/XuehaiPan/179/base 2025-12-04T09:33:39.8516382Z * [new branch] gh/XuehaiPan/179/head -> origin/gh/XuehaiPan/179/head 2025-12-04T09:33:39.8517730Z * [new branch] gh/XuehaiPan/179/orig -> origin/gh/XuehaiPan/179/orig 2025-12-04T09:33:39.8519518Z * [new branch] gh/XuehaiPan/249/base -> origin/gh/XuehaiPan/249/base 2025-12-04T09:33:39.8520733Z * [new branch] gh/XuehaiPan/249/head -> origin/gh/XuehaiPan/249/head 2025-12-04T09:33:39.8522084Z * [new branch] gh/XuehaiPan/249/orig -> origin/gh/XuehaiPan/249/orig 2025-12-04T09:33:39.8523881Z * [new branch] gh/XuehaiPan/253/base -> origin/gh/XuehaiPan/253/base 2025-12-04T09:33:39.8525104Z * [new branch] gh/XuehaiPan/253/head -> origin/gh/XuehaiPan/253/head 2025-12-04T09:33:39.8526360Z * [new branch] gh/XuehaiPan/253/orig -> origin/gh/XuehaiPan/253/orig 2025-12-04T09:33:39.8528205Z * [new branch] gh/XuehaiPan/254/base -> origin/gh/XuehaiPan/254/base 2025-12-04T09:33:39.8529433Z * [new branch] gh/XuehaiPan/254/head -> origin/gh/XuehaiPan/254/head 2025-12-04T09:33:39.8530701Z * [new branch] gh/XuehaiPan/254/orig -> origin/gh/XuehaiPan/254/orig 2025-12-04T09:33:39.8532404Z * [new branch] gh/XuehaiPan/255/base -> origin/gh/XuehaiPan/255/base 2025-12-04T09:33:39.8533583Z * [new branch] gh/XuehaiPan/255/head -> origin/gh/XuehaiPan/255/head 2025-12-04T09:33:39.8534873Z * [new branch] gh/XuehaiPan/255/orig -> origin/gh/XuehaiPan/255/orig 2025-12-04T09:33:39.8536642Z * [new branch] gh/XuehaiPan/271/base -> origin/gh/XuehaiPan/271/base 2025-12-04T09:33:39.8537782Z * [new branch] gh/XuehaiPan/271/head -> origin/gh/XuehaiPan/271/head 2025-12-04T09:33:39.8539372Z * [new branch] gh/XuehaiPan/271/orig -> origin/gh/XuehaiPan/271/orig 2025-12-04T09:33:39.8541009Z * [new branch] gh/XuehaiPan/343/base -> origin/gh/XuehaiPan/343/base 2025-12-04T09:33:39.8542149Z * [new branch] gh/XuehaiPan/343/head -> origin/gh/XuehaiPan/343/head 2025-12-04T09:33:39.8543427Z * [new branch] gh/XuehaiPan/343/orig -> origin/gh/XuehaiPan/343/orig 2025-12-04T09:33:39.8545314Z * [new branch] gh/XuehaiPan/347/base -> origin/gh/XuehaiPan/347/base 2025-12-04T09:33:39.8546494Z * [new branch] gh/XuehaiPan/347/head -> origin/gh/XuehaiPan/347/head 2025-12-04T09:33:39.8547767Z * [new branch] gh/XuehaiPan/347/orig -> origin/gh/XuehaiPan/347/orig 2025-12-04T09:33:39.8549553Z * [new branch] gh/XuehaiPan/348/base -> origin/gh/XuehaiPan/348/base 2025-12-04T09:33:39.8550742Z * [new branch] gh/XuehaiPan/348/head -> origin/gh/XuehaiPan/348/head 2025-12-04T09:33:39.8552011Z * [new branch] gh/XuehaiPan/348/orig -> origin/gh/XuehaiPan/348/orig 2025-12-04T09:33:39.8553787Z * [new branch] gh/XuehaiPan/350/base -> origin/gh/XuehaiPan/350/base 2025-12-04T09:33:39.8554982Z * [new branch] gh/XuehaiPan/350/head -> origin/gh/XuehaiPan/350/head 2025-12-04T09:33:39.8556210Z * [new branch] gh/XuehaiPan/350/orig -> origin/gh/XuehaiPan/350/orig 2025-12-04T09:33:39.8558202Z * [new branch] gh/XuehaiPan/365/base -> origin/gh/XuehaiPan/365/base 2025-12-04T09:33:39.8559320Z * [new branch] gh/XuehaiPan/365/head -> origin/gh/XuehaiPan/365/head 2025-12-04T09:33:39.8560562Z * [new branch] gh/XuehaiPan/365/orig -> origin/gh/XuehaiPan/365/orig 2025-12-04T09:33:39.8562429Z * [new branch] gh/XuehaiPan/366/base -> origin/gh/XuehaiPan/366/base 2025-12-04T09:33:39.8563582Z * [new branch] gh/XuehaiPan/366/head -> origin/gh/XuehaiPan/366/head 2025-12-04T09:33:39.8565681Z * [new branch] gh/XuehaiPan/370/base -> origin/gh/XuehaiPan/370/base 2025-12-04T09:33:39.8566837Z * [new branch] gh/XuehaiPan/370/head -> origin/gh/XuehaiPan/370/head 2025-12-04T09:33:39.8568149Z * [new branch] gh/XuehaiPan/370/orig -> origin/gh/XuehaiPan/370/orig 2025-12-04T09:33:39.8570008Z * [new branch] gh/XuehaiPan/390/base -> origin/gh/XuehaiPan/390/base 2025-12-04T09:33:39.8571221Z * [new branch] gh/XuehaiPan/390/head -> origin/gh/XuehaiPan/390/head 2025-12-04T09:33:39.8572493Z * [new branch] gh/XuehaiPan/390/orig -> origin/gh/XuehaiPan/390/orig 2025-12-04T09:33:39.8574337Z * [new branch] gh/XuehaiPan/391/base -> origin/gh/XuehaiPan/391/base 2025-12-04T09:33:39.8575483Z * [new branch] gh/XuehaiPan/391/head -> origin/gh/XuehaiPan/391/head 2025-12-04T09:33:39.8576772Z * [new branch] gh/XuehaiPan/391/orig -> origin/gh/XuehaiPan/391/orig 2025-12-04T09:33:39.8578640Z * [new branch] gh/XuehaiPan/392/base -> origin/gh/XuehaiPan/392/base 2025-12-04T09:33:39.8579863Z * [new branch] gh/XuehaiPan/392/head -> origin/gh/XuehaiPan/392/head 2025-12-04T09:33:39.8581124Z * [new branch] gh/XuehaiPan/392/orig -> origin/gh/XuehaiPan/392/orig 2025-12-04T09:33:39.8583509Z * [new branch] gh/XuehaiPan/394/base -> origin/gh/XuehaiPan/394/base 2025-12-04T09:33:39.8584676Z * [new branch] gh/XuehaiPan/394/head -> origin/gh/XuehaiPan/394/head 2025-12-04T09:33:39.8585960Z * [new branch] gh/XuehaiPan/394/orig -> origin/gh/XuehaiPan/394/orig 2025-12-04T09:33:39.8587906Z * [new branch] gh/XuehaiPan/397/base -> origin/gh/XuehaiPan/397/base 2025-12-04T09:33:39.8591303Z * [new branch] gh/XuehaiPan/397/head -> origin/gh/XuehaiPan/397/head 2025-12-04T09:33:39.8592504Z * [new branch] gh/XuehaiPan/397/orig -> origin/gh/XuehaiPan/397/orig 2025-12-04T09:33:39.8594337Z * [new branch] gh/XuehaiPan/398/base -> origin/gh/XuehaiPan/398/base 2025-12-04T09:33:39.8595540Z * [new branch] gh/XuehaiPan/398/head -> origin/gh/XuehaiPan/398/head 2025-12-04T09:33:39.8596777Z * [new branch] gh/XuehaiPan/398/orig -> origin/gh/XuehaiPan/398/orig 2025-12-04T09:33:39.8598558Z * [new branch] gh/XuehaiPan/399/base -> origin/gh/XuehaiPan/399/base 2025-12-04T09:33:39.8599754Z * [new branch] gh/XuehaiPan/399/head -> origin/gh/XuehaiPan/399/head 2025-12-04T09:33:39.8601031Z * [new branch] gh/XuehaiPan/399/orig -> origin/gh/XuehaiPan/399/orig 2025-12-04T09:33:39.8602883Z * [new branch] gh/XuehaiPan/400/base -> origin/gh/XuehaiPan/400/base 2025-12-04T09:33:39.8604088Z * [new branch] gh/XuehaiPan/400/head -> origin/gh/XuehaiPan/400/head 2025-12-04T09:33:39.8605382Z * [new branch] gh/XuehaiPan/400/orig -> origin/gh/XuehaiPan/400/orig 2025-12-04T09:33:39.8607502Z * [new branch] gh/ZhiweiYan-96/39/base -> origin/gh/ZhiweiYan-96/39/base 2025-12-04T09:33:39.8608691Z * [new branch] gh/ZhiweiYan-96/39/head -> origin/gh/ZhiweiYan-96/39/head 2025-12-04T09:33:39.8610160Z * [new branch] gh/ZhiweiYan-96/39/orig -> origin/gh/ZhiweiYan-96/39/orig 2025-12-04T09:33:39.8612204Z * [new branch] gh/ZhiweiYan-96/44/base -> origin/gh/ZhiweiYan-96/44/base 2025-12-04T09:33:39.8613129Z * [new branch] gh/ZhiweiYan-96/44/head -> origin/gh/ZhiweiYan-96/44/head 2025-12-04T09:33:39.8614846Z * [new branch] gh/ZhiweiYan-96/45/base -> origin/gh/ZhiweiYan-96/45/base 2025-12-04T09:33:39.8616017Z * [new branch] gh/ZhiweiYan-96/45/head -> origin/gh/ZhiweiYan-96/45/head 2025-12-04T09:33:39.8617872Z * [new branch] gh/ZhiweiYan-96/49/base -> origin/gh/ZhiweiYan-96/49/base 2025-12-04T09:33:39.8619173Z * [new branch] gh/ZhiweiYan-96/49/head -> origin/gh/ZhiweiYan-96/49/head 2025-12-04T09:33:39.8620892Z * [new branch] gh/ZhiweiYan-96/62/base -> origin/gh/ZhiweiYan-96/62/base 2025-12-04T09:33:39.8622095Z * [new branch] gh/ZhiweiYan-96/62/head -> origin/gh/ZhiweiYan-96/62/head 2025-12-04T09:33:39.8623981Z * [new branch] gh/ZhiweiYan-96/66/base -> origin/gh/ZhiweiYan-96/66/base 2025-12-04T09:33:39.8625138Z * [new branch] gh/ZhiweiYan-96/66/head -> origin/gh/ZhiweiYan-96/66/head 2025-12-04T09:33:39.8626837Z * [new branch] gh/ZhiweiYan-96/67/base -> origin/gh/ZhiweiYan-96/67/base 2025-12-04T09:33:39.8627922Z * [new branch] gh/ZhiweiYan-96/67/head -> origin/gh/ZhiweiYan-96/67/head 2025-12-04T09:33:39.8629633Z * [new branch] gh/ZhiweiYan-96/68/base -> origin/gh/ZhiweiYan-96/68/base 2025-12-04T09:33:39.8630724Z * [new branch] gh/ZhiweiYan-96/68/head -> origin/gh/ZhiweiYan-96/68/head 2025-12-04T09:33:39.8631962Z * [new branch] gh/ZhiweiYan-96/68/orig -> origin/gh/ZhiweiYan-96/68/orig 2025-12-04T09:33:39.8634158Z * [new branch] gh/aakhundov/1/base -> origin/gh/aakhundov/1/base 2025-12-04T09:33:39.8635408Z * [new branch] gh/aakhundov/1/head -> origin/gh/aakhundov/1/head 2025-12-04T09:33:39.8637072Z * [new branch] gh/aakhundov/2/base -> origin/gh/aakhundov/2/base 2025-12-04T09:33:39.8638340Z * [new branch] gh/aakhundov/2/head -> origin/gh/aakhundov/2/head 2025-12-04T09:33:39.8640151Z * [new branch] gh/aditew01/openblas -> origin/gh/aditew01/openblas 2025-12-04T09:33:39.8641275Z * [new branch] gh/aditew01/sbgemm -> origin/gh/aditew01/sbgemm 2025-12-04T09:33:39.8642555Z * [new branch] gh/aditew01/vecbf16 -> origin/gh/aditew01/vecbf16 2025-12-04T09:33:39.8644651Z * [new branch] gh/albanD/4/base -> origin/gh/albanD/4/base 2025-12-04T09:33:39.8645769Z * [new branch] gh/albanD/4/head -> origin/gh/albanD/4/head 2025-12-04T09:33:39.8647189Z * [new branch] gh/albanD/4/orig -> origin/gh/albanD/4/orig 2025-12-04T09:33:39.8649247Z * [new branch] gh/alexbrauckmann/paddedtensor_faketensor_init -> origin/gh/alexbrauckmann/paddedtensor_faketensor_init 2025-12-04T09:33:39.8651138Z * [new branch] gh/alexsamardzic/12/base -> origin/gh/alexsamardzic/12/base 2025-12-04T09:33:39.8652216Z * [new branch] gh/alexsamardzic/12/head -> origin/gh/alexsamardzic/12/head 2025-12-04T09:33:39.8653462Z * [new branch] gh/alexsamardzic/12/orig -> origin/gh/alexsamardzic/12/orig 2025-12-04T09:33:39.8655277Z * [new branch] gh/alexsamardzic/14/base -> origin/gh/alexsamardzic/14/base 2025-12-04T09:33:39.8656442Z * [new branch] gh/alexsamardzic/14/head -> origin/gh/alexsamardzic/14/head 2025-12-04T09:33:39.8657752Z * [new branch] gh/alexsamardzic/14/orig -> origin/gh/alexsamardzic/14/orig 2025-12-04T09:33:39.8659731Z * [new branch] gh/alexsamardzic/15/base -> origin/gh/alexsamardzic/15/base 2025-12-04T09:33:39.8660908Z * [new branch] gh/alexsamardzic/15/head -> origin/gh/alexsamardzic/15/head 2025-12-04T09:33:39.8662322Z * [new branch] gh/alexsamardzic/15/orig -> origin/gh/alexsamardzic/15/orig 2025-12-04T09:33:39.8664388Z * [new branch] gh/amjames/18/base -> origin/gh/amjames/18/base 2025-12-04T09:33:39.8665538Z * [new branch] gh/amjames/18/head -> origin/gh/amjames/18/head 2025-12-04T09:33:39.8666820Z * [new branch] gh/amjames/18/orig -> origin/gh/amjames/18/orig 2025-12-04T09:33:39.8669217Z * [new branch] gh/andrewor14/35/base -> origin/gh/andrewor14/35/base 2025-12-04T09:33:39.8670576Z * [new branch] gh/andrewor14/35/head -> origin/gh/andrewor14/35/head 2025-12-04T09:33:39.8672065Z * [new branch] gh/andrewor14/35/orig -> origin/gh/andrewor14/35/orig 2025-12-04T09:33:39.8673869Z * [new branch] gh/andrewor14/50/base -> origin/gh/andrewor14/50/base 2025-12-04T09:33:39.8675112Z * [new branch] gh/andrewor14/50/head -> origin/gh/andrewor14/50/head 2025-12-04T09:33:39.8676570Z * [new branch] gh/andrewor14/50/orig -> origin/gh/andrewor14/50/orig 2025-12-04T09:33:39.8678703Z * [new branch] gh/andyanwang/30/base -> origin/gh/andyanwang/30/base 2025-12-04T09:33:39.8680210Z * [new branch] gh/andyanwang/30/orig -> origin/gh/andyanwang/30/orig 2025-12-04T09:33:39.8682010Z * [new branch] gh/andyanwang/31/base -> origin/gh/andyanwang/31/base 2025-12-04T09:33:39.8683468Z * [new branch] gh/andyanwang/31/orig -> origin/gh/andyanwang/31/orig 2025-12-04T09:33:39.8685268Z * [new branch] gh/andyanwang/39/base -> origin/gh/andyanwang/39/base 2025-12-04T09:33:39.8686536Z * [new branch] gh/andyanwang/39/head -> origin/gh/andyanwang/39/head 2025-12-04T09:33:39.8687914Z * [new branch] gh/andyanwang/39/orig -> origin/gh/andyanwang/39/orig 2025-12-04T09:33:39.8690141Z * [new branch] gh/andyanwang/42/base -> origin/gh/andyanwang/42/base 2025-12-04T09:33:39.8691246Z * [new branch] gh/andyanwang/42/head -> origin/gh/andyanwang/42/head 2025-12-04T09:33:39.8692557Z * [new branch] gh/andyanwang/42/orig -> origin/gh/andyanwang/42/orig 2025-12-04T09:33:39.8694493Z * [new branch] gh/andyanwang/45/base -> origin/gh/andyanwang/45/base 2025-12-04T09:33:39.8695796Z * [new branch] gh/andyanwang/45/head -> origin/gh/andyanwang/45/head 2025-12-04T09:33:39.8697086Z * [new branch] gh/andyanwang/45/orig -> origin/gh/andyanwang/45/orig 2025-12-04T09:33:39.8699353Z * [new branch] gh/angelayi/107/base -> origin/gh/angelayi/107/base 2025-12-04T09:33:39.8700485Z * [new branch] gh/angelayi/107/head -> origin/gh/angelayi/107/head 2025-12-04T09:33:39.8702349Z * [new branch] gh/angelayi/114/base -> origin/gh/angelayi/114/base 2025-12-04T09:33:39.8703665Z * [new branch] gh/angelayi/114/head -> origin/gh/angelayi/114/head 2025-12-04T09:33:39.8704945Z * [new branch] gh/angelayi/114/orig -> origin/gh/angelayi/114/orig 2025-12-04T09:33:39.8706718Z * [new branch] gh/angelayi/116/base -> origin/gh/angelayi/116/base 2025-12-04T09:33:39.8707866Z * [new branch] gh/angelayi/116/head -> origin/gh/angelayi/116/head 2025-12-04T09:33:39.8709135Z * [new branch] gh/angelayi/116/orig -> origin/gh/angelayi/116/orig 2025-12-04T09:33:39.8711041Z * [new branch] gh/angelayi/122/base -> origin/gh/angelayi/122/base 2025-12-04T09:33:39.8712169Z * [new branch] gh/angelayi/122/head -> origin/gh/angelayi/122/head 2025-12-04T09:33:39.8713502Z * [new branch] gh/angelayi/122/orig -> origin/gh/angelayi/122/orig 2025-12-04T09:33:39.8715716Z * [new branch] gh/angelayi/124/base -> origin/gh/angelayi/124/base 2025-12-04T09:33:39.8716672Z * [new branch] gh/angelayi/124/head -> origin/gh/angelayi/124/head 2025-12-04T09:33:39.8717876Z * [new branch] gh/angelayi/124/orig -> origin/gh/angelayi/124/orig 2025-12-04T09:33:39.8719760Z * [new branch] gh/angelayi/128/base -> origin/gh/angelayi/128/base 2025-12-04T09:33:39.8720955Z * [new branch] gh/angelayi/128/head -> origin/gh/angelayi/128/head 2025-12-04T09:33:39.8722241Z * [new branch] gh/angelayi/128/orig -> origin/gh/angelayi/128/orig 2025-12-04T09:33:39.8724078Z * [new branch] gh/angelayi/131/base -> origin/gh/angelayi/131/base 2025-12-04T09:33:39.8725253Z * [new branch] gh/angelayi/131/head -> origin/gh/angelayi/131/head 2025-12-04T09:33:39.8726587Z * [new branch] gh/angelayi/131/orig -> origin/gh/angelayi/131/orig 2025-12-04T09:33:39.8728738Z * [new branch] gh/angelayi/132/base -> origin/gh/angelayi/132/base 2025-12-04T09:33:39.8730248Z * [new branch] gh/angelayi/132/head -> origin/gh/angelayi/132/head 2025-12-04T09:33:39.8731676Z * [new branch] gh/angelayi/132/orig -> origin/gh/angelayi/132/orig 2025-12-04T09:33:39.8733372Z * [new branch] gh/angelayi/133/base -> origin/gh/angelayi/133/base 2025-12-04T09:33:39.8734585Z * [new branch] gh/angelayi/133/head -> origin/gh/angelayi/133/head 2025-12-04T09:33:39.8735876Z * [new branch] gh/angelayi/133/orig -> origin/gh/angelayi/133/orig 2025-12-04T09:33:39.8737981Z * [new branch] gh/angelayi/134/base -> origin/gh/angelayi/134/base 2025-12-04T09:33:39.8739598Z * [new branch] gh/angelayi/134/head -> origin/gh/angelayi/134/head 2025-12-04T09:33:39.8740763Z * [new branch] gh/angelayi/134/orig -> origin/gh/angelayi/134/orig 2025-12-04T09:33:39.8742841Z * [new branch] gh/angelayi/135/base -> origin/gh/angelayi/135/base 2025-12-04T09:33:39.8744081Z * [new branch] gh/angelayi/135/head -> origin/gh/angelayi/135/head 2025-12-04T09:33:39.8745338Z * [new branch] gh/angelayi/135/orig -> origin/gh/angelayi/135/orig 2025-12-04T09:33:39.8747173Z * [new branch] gh/angelayi/136/base -> origin/gh/angelayi/136/base 2025-12-04T09:33:39.8748378Z * [new branch] gh/angelayi/136/head -> origin/gh/angelayi/136/head 2025-12-04T09:33:39.8749670Z * [new branch] gh/angelayi/136/orig -> origin/gh/angelayi/136/orig 2025-12-04T09:33:39.8751523Z * [new branch] gh/angelayi/137/base -> origin/gh/angelayi/137/base 2025-12-04T09:33:39.8752670Z * [new branch] gh/angelayi/137/head -> origin/gh/angelayi/137/head 2025-12-04T09:33:39.8754232Z * [new branch] gh/angelayi/137/orig -> origin/gh/angelayi/137/orig 2025-12-04T09:33:39.8755897Z * [new branch] gh/angelayi/138/base -> origin/gh/angelayi/138/base 2025-12-04T09:33:39.8757033Z * [new branch] gh/angelayi/138/head -> origin/gh/angelayi/138/head 2025-12-04T09:33:39.8758268Z * [new branch] gh/angelayi/138/orig -> origin/gh/angelayi/138/orig 2025-12-04T09:33:39.8760061Z * [new branch] gh/angelayi/139/base -> origin/gh/angelayi/139/base 2025-12-04T09:33:39.8761288Z * [new branch] gh/angelayi/139/head -> origin/gh/angelayi/139/head 2025-12-04T09:33:39.8762562Z * [new branch] gh/angelayi/139/orig -> origin/gh/angelayi/139/orig 2025-12-04T09:33:39.8764509Z * [new branch] gh/angelayi/140/base -> origin/gh/angelayi/140/base 2025-12-04T09:33:39.8765765Z * [new branch] gh/angelayi/140/head -> origin/gh/angelayi/140/head 2025-12-04T09:33:39.8767068Z * [new branch] gh/angelayi/140/orig -> origin/gh/angelayi/140/orig 2025-12-04T09:33:39.8769432Z * [new branch] gh/angelayi/141/base -> origin/gh/angelayi/141/base 2025-12-04T09:33:39.8770580Z * [new branch] gh/angelayi/141/head -> origin/gh/angelayi/141/head 2025-12-04T09:33:39.8771841Z * [new branch] gh/angelayi/141/orig -> origin/gh/angelayi/141/orig 2025-12-04T09:33:39.8773800Z * [new branch] gh/angelayi/142/base -> origin/gh/angelayi/142/base 2025-12-04T09:33:39.8774971Z * [new branch] gh/angelayi/142/head -> origin/gh/angelayi/142/head 2025-12-04T09:33:39.8776216Z * [new branch] gh/angelayi/142/orig -> origin/gh/angelayi/142/orig 2025-12-04T09:33:39.8778061Z * [new branch] gh/angelayi/143/base -> origin/gh/angelayi/143/base 2025-12-04T09:33:39.8779361Z * [new branch] gh/angelayi/143/head -> origin/gh/angelayi/143/head 2025-12-04T09:33:39.8780665Z * [new branch] gh/angelayi/143/orig -> origin/gh/angelayi/143/orig 2025-12-04T09:33:39.8782536Z * [new branch] gh/angelayi/144/base -> origin/gh/angelayi/144/base 2025-12-04T09:33:39.8783953Z * [new branch] gh/angelayi/144/head -> origin/gh/angelayi/144/head 2025-12-04T09:33:39.8785112Z * [new branch] gh/angelayi/144/orig -> origin/gh/angelayi/144/orig 2025-12-04T09:33:39.8787519Z * [new branch] gh/anijain2305/753/base -> origin/gh/anijain2305/753/base 2025-12-04T09:33:39.8788846Z * [new branch] gh/anijain2305/753/head -> origin/gh/anijain2305/753/head 2025-12-04T09:33:39.8790116Z * [new branch] gh/anijain2305/753/orig -> origin/gh/anijain2305/753/orig 2025-12-04T09:33:39.8792104Z * [new branch] gh/anijain2305/810/base -> origin/gh/anijain2305/810/base 2025-12-04T09:33:39.8793308Z * [new branch] gh/anijain2305/810/head -> origin/gh/anijain2305/810/head 2025-12-04T09:33:39.8794576Z * [new branch] gh/anijain2305/810/orig -> origin/gh/anijain2305/810/orig 2025-12-04T09:33:39.8796401Z * [new branch] gh/anijain2305/854/base -> origin/gh/anijain2305/854/base 2025-12-04T09:33:39.8798072Z * [new branch] gh/anijain2305/854/head -> origin/gh/anijain2305/854/head 2025-12-04T09:33:39.8799272Z * [new branch] gh/anijain2305/854/orig -> origin/gh/anijain2305/854/orig 2025-12-04T09:33:39.8801242Z * [new branch] gh/anijain2305/864/base -> origin/gh/anijain2305/864/base 2025-12-04T09:33:39.8802434Z * [new branch] gh/anijain2305/864/head -> origin/gh/anijain2305/864/head 2025-12-04T09:33:39.8803698Z * [new branch] gh/anijain2305/864/orig -> origin/gh/anijain2305/864/orig 2025-12-04T09:33:39.8805593Z * [new branch] gh/anijain2305/870/base -> origin/gh/anijain2305/870/base 2025-12-04T09:33:39.8806759Z * [new branch] gh/anijain2305/870/head -> origin/gh/anijain2305/870/head 2025-12-04T09:33:39.8808079Z * [new branch] gh/anijain2305/870/orig -> origin/gh/anijain2305/870/orig 2025-12-04T09:33:39.8810036Z * [new branch] gh/anijain2305/873/base -> origin/gh/anijain2305/873/base 2025-12-04T09:33:39.8811183Z * [new branch] gh/anijain2305/873/head -> origin/gh/anijain2305/873/head 2025-12-04T09:33:39.8812442Z * [new branch] gh/anijain2305/873/orig -> origin/gh/anijain2305/873/orig 2025-12-04T09:33:39.8814285Z * [new branch] gh/anijain2305/894/base -> origin/gh/anijain2305/894/base 2025-12-04T09:33:39.8815485Z * [new branch] gh/anijain2305/894/head -> origin/gh/anijain2305/894/head 2025-12-04T09:33:39.8816798Z * [new branch] gh/anijain2305/894/orig -> origin/gh/anijain2305/894/orig 2025-12-04T09:33:39.8818758Z * [new branch] gh/anijain2305/895/base -> origin/gh/anijain2305/895/base 2025-12-04T09:33:39.8820073Z * [new branch] gh/anijain2305/895/head -> origin/gh/anijain2305/895/head 2025-12-04T09:33:39.8821481Z * [new branch] gh/anijain2305/895/orig -> origin/gh/anijain2305/895/orig 2025-12-04T09:33:39.8823252Z * [new branch] gh/anijain2305/910/base -> origin/gh/anijain2305/910/base 2025-12-04T09:33:39.8824427Z * [new branch] gh/anijain2305/910/head -> origin/gh/anijain2305/910/head 2025-12-04T09:33:39.8825751Z * [new branch] gh/anijain2305/910/orig -> origin/gh/anijain2305/910/orig 2025-12-04T09:33:39.8827702Z * [new branch] gh/anijain2305/919/base -> origin/gh/anijain2305/919/base 2025-12-04T09:33:39.8828940Z * [new branch] gh/anijain2305/919/head -> origin/gh/anijain2305/919/head 2025-12-04T09:33:39.8830242Z * [new branch] gh/anijain2305/919/orig -> origin/gh/anijain2305/919/orig 2025-12-04T09:33:39.8832093Z * [new branch] gh/anijain2305/922/base -> origin/gh/anijain2305/922/base 2025-12-04T09:33:39.8833357Z * [new branch] gh/anijain2305/922/head -> origin/gh/anijain2305/922/head 2025-12-04T09:33:39.8834636Z * [new branch] gh/anijain2305/922/orig -> origin/gh/anijain2305/922/orig 2025-12-04T09:33:39.8836497Z * [new branch] gh/anijain2305/932/base -> origin/gh/anijain2305/932/base 2025-12-04T09:33:39.8837874Z * [new branch] gh/anijain2305/932/head -> origin/gh/anijain2305/932/head 2025-12-04T09:33:39.8839224Z * [new branch] gh/anijain2305/932/orig -> origin/gh/anijain2305/932/orig 2025-12-04T09:33:39.8841027Z * [new branch] gh/anijain2305/940/base -> origin/gh/anijain2305/940/base 2025-12-04T09:33:39.8842169Z * [new branch] gh/anijain2305/940/head -> origin/gh/anijain2305/940/head 2025-12-04T09:33:39.8843450Z * [new branch] gh/anijain2305/940/orig -> origin/gh/anijain2305/940/orig 2025-12-04T09:33:39.8845308Z * [new branch] gh/anijain2305/941/base -> origin/gh/anijain2305/941/base 2025-12-04T09:33:39.8846532Z * [new branch] gh/anijain2305/941/head -> origin/gh/anijain2305/941/head 2025-12-04T09:33:39.8847802Z * [new branch] gh/anijain2305/941/orig -> origin/gh/anijain2305/941/orig 2025-12-04T09:33:39.8849675Z * [new branch] gh/anijain2305/942/base -> origin/gh/anijain2305/942/base 2025-12-04T09:33:39.8850959Z * [new branch] gh/anijain2305/942/head -> origin/gh/anijain2305/942/head 2025-12-04T09:33:39.8852323Z * [new branch] gh/anijain2305/942/orig -> origin/gh/anijain2305/942/orig 2025-12-04T09:33:39.8854154Z * [new branch] gh/anijain2305/943/base -> origin/gh/anijain2305/943/base 2025-12-04T09:33:39.8855321Z * [new branch] gh/anijain2305/943/head -> origin/gh/anijain2305/943/head 2025-12-04T09:33:39.8856642Z * [new branch] gh/anijain2305/943/orig -> origin/gh/anijain2305/943/orig 2025-12-04T09:33:39.8859247Z * [new branch] gh/anijain2305/944/base -> origin/gh/anijain2305/944/base 2025-12-04T09:33:39.8860391Z * [new branch] gh/anijain2305/944/head -> origin/gh/anijain2305/944/head 2025-12-04T09:33:39.8862504Z * [new branch] gh/anijain2305/944/orig -> origin/gh/anijain2305/944/orig 2025-12-04T09:33:39.8864276Z * [new branch] gh/anijain2305/945/base -> origin/gh/anijain2305/945/base 2025-12-04T09:33:39.8865552Z * [new branch] gh/anijain2305/945/head -> origin/gh/anijain2305/945/head 2025-12-04T09:33:39.8866846Z * [new branch] gh/anijain2305/945/orig -> origin/gh/anijain2305/945/orig 2025-12-04T09:33:39.8868752Z * [new branch] gh/anijain2305/946/base -> origin/gh/anijain2305/946/base 2025-12-04T09:33:39.8869915Z * [new branch] gh/anijain2305/946/head -> origin/gh/anijain2305/946/head 2025-12-04T09:33:39.8871192Z * [new branch] gh/anijain2305/946/orig -> origin/gh/anijain2305/946/orig 2025-12-04T09:33:39.8873177Z * [new branch] gh/anijain2305/947/base -> origin/gh/anijain2305/947/base 2025-12-04T09:33:39.8874460Z * [new branch] gh/anijain2305/947/head -> origin/gh/anijain2305/947/head 2025-12-04T09:33:39.8875523Z * [new branch] gh/anijain2305/947/orig -> origin/gh/anijain2305/947/orig 2025-12-04T09:33:39.8877373Z * [new branch] gh/anijain2305/948/base -> origin/gh/anijain2305/948/base 2025-12-04T09:33:39.8878675Z * [new branch] gh/anijain2305/948/head -> origin/gh/anijain2305/948/head 2025-12-04T09:33:39.8879946Z * [new branch] gh/anijain2305/948/orig -> origin/gh/anijain2305/948/orig 2025-12-04T09:33:39.8881808Z * [new branch] gh/anijain2305/949/base -> origin/gh/anijain2305/949/base 2025-12-04T09:33:39.8883034Z * [new branch] gh/anijain2305/949/head -> origin/gh/anijain2305/949/head 2025-12-04T09:33:39.8884252Z * [new branch] gh/anijain2305/949/orig -> origin/gh/anijain2305/949/orig 2025-12-04T09:33:39.8886146Z * [new branch] gh/anijain2305/950/base -> origin/gh/anijain2305/950/base 2025-12-04T09:33:39.8887319Z * [new branch] gh/anijain2305/950/head -> origin/gh/anijain2305/950/head 2025-12-04T09:33:39.8888788Z * [new branch] gh/anijain2305/950/orig -> origin/gh/anijain2305/950/orig 2025-12-04T09:33:39.8890701Z * [new branch] gh/anijain2305/951/base -> origin/gh/anijain2305/951/base 2025-12-04T09:33:39.8891873Z * [new branch] gh/anijain2305/951/head -> origin/gh/anijain2305/951/head 2025-12-04T09:33:39.8893326Z * [new branch] gh/anijain2305/951/orig -> origin/gh/anijain2305/951/orig 2025-12-04T09:33:39.8895021Z * [new branch] gh/anijain2305/952/base -> origin/gh/anijain2305/952/base 2025-12-04T09:33:39.8896551Z * [new branch] gh/anijain2305/952/head -> origin/gh/anijain2305/952/head 2025-12-04T09:33:39.8897621Z * [new branch] gh/anijain2305/952/orig -> origin/gh/anijain2305/952/orig 2025-12-04T09:33:39.8899614Z * [new branch] gh/anijain2305/953/base -> origin/gh/anijain2305/953/base 2025-12-04T09:33:39.8900761Z * [new branch] gh/anijain2305/953/head -> origin/gh/anijain2305/953/head 2025-12-04T09:33:39.8902342Z * [new branch] gh/anijain2305/953/orig -> origin/gh/anijain2305/953/orig 2025-12-04T09:33:39.8904221Z * [new branch] gh/anijain2305/954/base -> origin/gh/anijain2305/954/base 2025-12-04T09:33:39.8905694Z * [new branch] gh/anijain2305/954/head -> origin/gh/anijain2305/954/head 2025-12-04T09:33:39.8906989Z * [new branch] gh/anijain2305/954/orig -> origin/gh/anijain2305/954/orig 2025-12-04T09:33:39.8908931Z * [new branch] gh/anijain2305/955/base -> origin/gh/anijain2305/955/base 2025-12-04T09:33:39.8910147Z * [new branch] gh/anijain2305/955/head -> origin/gh/anijain2305/955/head 2025-12-04T09:33:39.8911405Z * [new branch] gh/anijain2305/955/orig -> origin/gh/anijain2305/955/orig 2025-12-04T09:33:39.8913654Z * [new branch] gh/anijain2305/956/base -> origin/gh/anijain2305/956/base 2025-12-04T09:33:39.8915277Z * [new branch] gh/anijain2305/956/head -> origin/gh/anijain2305/956/head 2025-12-04T09:33:39.8916141Z * [new branch] gh/anijain2305/956/orig -> origin/gh/anijain2305/956/orig 2025-12-04T09:33:39.8918120Z * [new branch] gh/anijain2305/957/base -> origin/gh/anijain2305/957/base 2025-12-04T09:33:39.8919317Z * [new branch] gh/anijain2305/957/head -> origin/gh/anijain2305/957/head 2025-12-04T09:33:39.8920614Z * [new branch] gh/anijain2305/957/orig -> origin/gh/anijain2305/957/orig 2025-12-04T09:33:39.8922464Z * [new branch] gh/anijain2305/958/base -> origin/gh/anijain2305/958/base 2025-12-04T09:33:39.8923925Z * [new branch] gh/anijain2305/958/head -> origin/gh/anijain2305/958/head 2025-12-04T09:33:39.8925125Z * [new branch] gh/anijain2305/958/orig -> origin/gh/anijain2305/958/orig 2025-12-04T09:33:39.8926959Z * [new branch] gh/anijain2305/959/base -> origin/gh/anijain2305/959/base 2025-12-04T09:33:39.8928146Z * [new branch] gh/anijain2305/959/head -> origin/gh/anijain2305/959/head 2025-12-04T09:33:39.8929409Z * [new branch] gh/anijain2305/959/orig -> origin/gh/anijain2305/959/orig 2025-12-04T09:33:39.8931336Z * [new branch] gh/anijain2305/960/base -> origin/gh/anijain2305/960/base 2025-12-04T09:33:39.8932686Z * [new branch] gh/anijain2305/960/head -> origin/gh/anijain2305/960/head 2025-12-04T09:33:39.8933954Z * [new branch] gh/anijain2305/960/orig -> origin/gh/anijain2305/960/orig 2025-12-04T09:33:39.8935929Z * [new branch] gh/anijain2305/961/base -> origin/gh/anijain2305/961/base 2025-12-04T09:33:39.8937146Z * [new branch] gh/anijain2305/961/head -> origin/gh/anijain2305/961/head 2025-12-04T09:33:39.8938529Z * [new branch] gh/anijain2305/961/orig -> origin/gh/anijain2305/961/orig 2025-12-04T09:33:39.8940468Z * [new branch] gh/anijain2305/962/base -> origin/gh/anijain2305/962/base 2025-12-04T09:33:39.8941633Z * [new branch] gh/anijain2305/962/head -> origin/gh/anijain2305/962/head 2025-12-04T09:33:39.8942933Z * [new branch] gh/anijain2305/962/orig -> origin/gh/anijain2305/962/orig 2025-12-04T09:33:39.8945190Z * [new branch] gh/anijain2305/963/base -> origin/gh/anijain2305/963/base 2025-12-04T09:33:39.8946546Z * [new branch] gh/anijain2305/963/head -> origin/gh/anijain2305/963/head 2025-12-04T09:33:39.8947889Z * [new branch] gh/anijain2305/963/orig -> origin/gh/anijain2305/963/orig 2025-12-04T09:33:39.8949776Z * [new branch] gh/anijain2305/964/base -> origin/gh/anijain2305/964/base 2025-12-04T09:33:39.8951127Z * [new branch] gh/anijain2305/964/head -> origin/gh/anijain2305/964/head 2025-12-04T09:33:39.8952982Z * [new branch] gh/anijain2305/964/orig -> origin/gh/anijain2305/964/orig 2025-12-04T09:33:39.8954787Z * [new branch] gh/anijain2305/965/base -> origin/gh/anijain2305/965/base 2025-12-04T09:33:39.8956112Z * [new branch] gh/anijain2305/965/head -> origin/gh/anijain2305/965/head 2025-12-04T09:33:39.8957482Z * [new branch] gh/anijain2305/965/orig -> origin/gh/anijain2305/965/orig 2025-12-04T09:33:39.8959297Z * [new branch] gh/anijain2305/966/base -> origin/gh/anijain2305/966/base 2025-12-04T09:33:39.8960453Z * [new branch] gh/anijain2305/966/head -> origin/gh/anijain2305/966/head 2025-12-04T09:33:39.8961713Z * [new branch] gh/anijain2305/966/orig -> origin/gh/anijain2305/966/orig 2025-12-04T09:33:39.8963594Z * [new branch] gh/anijain2305/967/base -> origin/gh/anijain2305/967/base 2025-12-04T09:33:39.8964779Z * [new branch] gh/anijain2305/967/head -> origin/gh/anijain2305/967/head 2025-12-04T09:33:39.8966303Z * [new branch] gh/anijain2305/967/orig -> origin/gh/anijain2305/967/orig 2025-12-04T09:33:39.8968012Z * [new branch] gh/anijain2305/968/base -> origin/gh/anijain2305/968/base 2025-12-04T09:33:39.8969257Z * [new branch] gh/anijain2305/968/head -> origin/gh/anijain2305/968/head 2025-12-04T09:33:39.8970584Z * [new branch] gh/anijain2305/968/orig -> origin/gh/anijain2305/968/orig 2025-12-04T09:33:39.8972394Z * [new branch] gh/anijain2305/969/base -> origin/gh/anijain2305/969/base 2025-12-04T09:33:39.8973619Z * [new branch] gh/anijain2305/969/head -> origin/gh/anijain2305/969/head 2025-12-04T09:33:39.8975013Z * [new branch] gh/anijain2305/969/orig -> origin/gh/anijain2305/969/orig 2025-12-04T09:33:39.8976842Z * [new branch] gh/anijain2305/970/base -> origin/gh/anijain2305/970/base 2025-12-04T09:33:39.8978114Z * [new branch] gh/anijain2305/970/head -> origin/gh/anijain2305/970/head 2025-12-04T09:33:39.8979644Z * [new branch] gh/anijain2305/970/orig -> origin/gh/anijain2305/970/orig 2025-12-04T09:33:39.8981846Z * [new branch] gh/anjali411/216/base -> origin/gh/anjali411/216/base 2025-12-04T09:33:39.8983067Z * [new branch] gh/anjali411/216/head -> origin/gh/anjali411/216/head 2025-12-04T09:33:39.8984295Z * [new branch] gh/anjali411/216/orig -> origin/gh/anjali411/216/orig 2025-12-04T09:33:39.8986785Z * [new branch] gh/anshul-si/1/base -> origin/gh/anshul-si/1/base 2025-12-04T09:33:39.8988049Z * [new branch] gh/anshul-si/1/head -> origin/gh/anshul-si/1/head 2025-12-04T09:33:39.8989777Z * [new branch] gh/anshul-si/2/base -> origin/gh/anshul-si/2/base 2025-12-04T09:33:39.8990890Z * [new branch] gh/anshul-si/2/head -> origin/gh/anshul-si/2/head 2025-12-04T09:33:39.8992523Z * [new branch] gh/anshul-si/3/base -> origin/gh/anshul-si/3/base 2025-12-04T09:33:39.8993647Z * [new branch] gh/anshul-si/3/head -> origin/gh/anshul-si/3/head 2025-12-04T09:33:39.8995228Z * [new branch] gh/anshul-si/4/base -> origin/gh/anshul-si/4/base 2025-12-04T09:33:39.8996339Z * [new branch] gh/anshul-si/4/head -> origin/gh/anshul-si/4/head 2025-12-04T09:33:39.8998397Z * [new branch] gh/anshul-si/5/base -> origin/gh/anshul-si/5/base 2025-12-04T09:33:39.8999190Z * [new branch] gh/anshul-si/5/head -> origin/gh/anshul-si/5/head 2025-12-04T09:33:39.9001278Z * [new branch] gh/anshul-si/53/base -> origin/gh/anshul-si/53/base 2025-12-04T09:33:39.9002544Z * [new branch] gh/anshul-si/53/head -> origin/gh/anshul-si/53/head 2025-12-04T09:33:39.9004420Z * [new branch] gh/anshul-si/58/base -> origin/gh/anshul-si/58/base 2025-12-04T09:33:39.9005606Z * [new branch] gh/anshul-si/58/head -> origin/gh/anshul-si/58/head 2025-12-04T09:33:39.9007288Z * [new branch] gh/anshul-si/66/base -> origin/gh/anshul-si/66/base 2025-12-04T09:33:39.9008466Z * [new branch] gh/anshul-si/66/head -> origin/gh/anshul-si/66/head 2025-12-04T09:33:39.9009770Z * [new branch] gh/anshul-si/66/orig -> origin/gh/anshul-si/66/orig 2025-12-04T09:33:39.9011431Z * [new branch] gh/anshul-si/67/base -> origin/gh/anshul-si/67/base 2025-12-04T09:33:39.9012564Z * [new branch] gh/anshul-si/67/head -> origin/gh/anshul-si/67/head 2025-12-04T09:33:39.9013821Z * [new branch] gh/anshul-si/67/orig -> origin/gh/anshul-si/67/orig 2025-12-04T09:33:39.9015848Z * [new branch] gh/anshul-si/68/base -> origin/gh/anshul-si/68/base 2025-12-04T09:33:39.9016971Z * [new branch] gh/anshul-si/68/head -> origin/gh/anshul-si/68/head 2025-12-04T09:33:39.9018575Z * [new branch] gh/anshul-si/68/orig -> origin/gh/anshul-si/68/orig 2025-12-04T09:33:39.9020534Z * [new branch] gh/anshul-si/69/base -> origin/gh/anshul-si/69/base 2025-12-04T09:33:39.9021665Z * [new branch] gh/anshul-si/69/head -> origin/gh/anshul-si/69/head 2025-12-04T09:33:39.9022964Z * [new branch] gh/anshul-si/69/orig -> origin/gh/anshul-si/69/orig 2025-12-04T09:33:39.9025232Z * [new branch] gh/anshul-si/70/base -> origin/gh/anshul-si/70/base 2025-12-04T09:33:39.9026532Z * [new branch] gh/anshul-si/70/head -> origin/gh/anshul-si/70/head 2025-12-04T09:33:39.9028371Z * [new branch] gh/anshul-si/70/orig -> origin/gh/anshul-si/70/orig 2025-12-04T09:33:39.9029986Z * [new branch] gh/anshul-si/71/base -> origin/gh/anshul-si/71/base 2025-12-04T09:33:39.9031220Z * [new branch] gh/anshul-si/71/head -> origin/gh/anshul-si/71/head 2025-12-04T09:33:39.9032503Z * [new branch] gh/anshul-si/71/orig -> origin/gh/anshul-si/71/orig 2025-12-04T09:33:39.9034377Z * [new branch] gh/anshul-si/72/base -> origin/gh/anshul-si/72/base 2025-12-04T09:33:39.9035655Z * [new branch] gh/anshul-si/72/head -> origin/gh/anshul-si/72/head 2025-12-04T09:33:39.9037097Z * [new branch] gh/anshul-si/72/orig -> origin/gh/anshul-si/72/orig 2025-12-04T09:33:39.9038839Z * [new branch] gh/anshul-si/73/base -> origin/gh/anshul-si/73/base 2025-12-04T09:33:39.9040119Z * [new branch] gh/anshul-si/73/head -> origin/gh/anshul-si/73/head 2025-12-04T09:33:39.9041402Z * [new branch] gh/anshul-si/73/orig -> origin/gh/anshul-si/73/orig 2025-12-04T09:33:39.9043734Z * [new branch] gh/aorenste/132/base -> origin/gh/aorenste/132/base 2025-12-04T09:33:39.9045032Z * [new branch] gh/aorenste/132/head -> origin/gh/aorenste/132/head 2025-12-04T09:33:39.9047013Z * [new branch] gh/aorenste/134/base -> origin/gh/aorenste/134/base 2025-12-04T09:33:39.9048563Z * [new branch] gh/aorenste/134/head -> origin/gh/aorenste/134/head 2025-12-04T09:33:39.9049648Z * [new branch] gh/aorenste/134/orig -> origin/gh/aorenste/134/orig 2025-12-04T09:33:39.9051510Z * [new branch] gh/aorenste/139/base -> origin/gh/aorenste/139/base 2025-12-04T09:33:39.9052710Z * [new branch] gh/aorenste/139/head -> origin/gh/aorenste/139/head 2025-12-04T09:33:39.9054208Z * [new branch] gh/aorenste/139/orig -> origin/gh/aorenste/139/orig 2025-12-04T09:33:39.9055910Z * [new branch] gh/aorenste/141/base -> origin/gh/aorenste/141/base 2025-12-04T09:33:39.9057080Z * [new branch] gh/aorenste/141/head -> origin/gh/aorenste/141/head 2025-12-04T09:33:39.9059393Z * [new branch] gh/aorenste/145/base -> origin/gh/aorenste/145/base 2025-12-04T09:33:39.9060546Z * [new branch] gh/aorenste/145/head -> origin/gh/aorenste/145/head 2025-12-04T09:33:39.9062124Z * [new branch] gh/aorenste/145/orig -> origin/gh/aorenste/145/orig 2025-12-04T09:33:39.9063929Z * [new branch] gh/aorenste/146/base -> origin/gh/aorenste/146/base 2025-12-04T09:33:39.9065234Z * [new branch] gh/aorenste/146/head -> origin/gh/aorenste/146/head 2025-12-04T09:33:39.9066544Z * [new branch] gh/aorenste/146/orig -> origin/gh/aorenste/146/orig 2025-12-04T09:33:39.9068470Z * [new branch] gh/aorenste/147/base -> origin/gh/aorenste/147/base 2025-12-04T09:33:39.9069864Z * [new branch] gh/aorenste/147/head -> origin/gh/aorenste/147/head 2025-12-04T09:33:39.9071160Z * [new branch] gh/aorenste/147/orig -> origin/gh/aorenste/147/orig 2025-12-04T09:33:39.9073083Z * [new branch] gh/aorenste/148/base -> origin/gh/aorenste/148/base 2025-12-04T09:33:39.9074327Z * [new branch] gh/aorenste/148/head -> origin/gh/aorenste/148/head 2025-12-04T09:33:39.9075752Z * [new branch] gh/aorenste/148/orig -> origin/gh/aorenste/148/orig 2025-12-04T09:33:39.9077492Z * [new branch] gh/aorenste/149/base -> origin/gh/aorenste/149/base 2025-12-04T09:33:39.9078666Z * [new branch] gh/aorenste/149/head -> origin/gh/aorenste/149/head 2025-12-04T09:33:39.9079900Z * [new branch] gh/aorenste/149/orig -> origin/gh/aorenste/149/orig 2025-12-04T09:33:39.9081784Z * [new branch] gh/aorenste/150/base -> origin/gh/aorenste/150/base 2025-12-04T09:33:39.9082909Z * [new branch] gh/aorenste/150/head -> origin/gh/aorenste/150/head 2025-12-04T09:33:39.9084219Z * [new branch] gh/aorenste/150/orig -> origin/gh/aorenste/150/orig 2025-12-04T09:33:39.9085952Z * [new branch] gh/aorenste/151/base -> origin/gh/aorenste/151/base 2025-12-04T09:33:39.9087149Z * [new branch] gh/aorenste/151/head -> origin/gh/aorenste/151/head 2025-12-04T09:33:39.9092339Z * [new branch] gh/aorenste/151/orig -> origin/gh/aorenste/151/orig 2025-12-04T09:33:39.9094376Z * [new branch] gh/aorenste/152/base -> origin/gh/aorenste/152/base 2025-12-04T09:33:39.9095587Z * [new branch] gh/aorenste/152/head -> origin/gh/aorenste/152/head 2025-12-04T09:33:39.9096929Z * [new branch] gh/aorenste/152/orig -> origin/gh/aorenste/152/orig 2025-12-04T09:33:39.9098645Z * [new branch] gh/aorenste/153/base -> origin/gh/aorenste/153/base 2025-12-04T09:33:39.9099864Z * [new branch] gh/aorenste/153/head -> origin/gh/aorenste/153/head 2025-12-04T09:33:39.9101131Z * [new branch] gh/aorenste/153/orig -> origin/gh/aorenste/153/orig 2025-12-04T09:33:39.9103605Z * [new branch] gh/aorenste/154/base -> origin/gh/aorenste/154/base 2025-12-04T09:33:39.9104636Z * [new branch] gh/aorenste/154/head -> origin/gh/aorenste/154/head 2025-12-04T09:33:39.9105818Z * [new branch] gh/aorenste/154/orig -> origin/gh/aorenste/154/orig 2025-12-04T09:33:39.9107428Z * [new branch] gh/aorenste/155/base -> origin/gh/aorenste/155/base 2025-12-04T09:33:39.9108655Z * [new branch] gh/aorenste/155/head -> origin/gh/aorenste/155/head 2025-12-04T09:33:39.9110133Z * [new branch] gh/aorenste/155/orig -> origin/gh/aorenste/155/orig 2025-12-04T09:33:39.9111580Z * [new branch] gh/aorenste/156/base -> origin/gh/aorenste/156/base 2025-12-04T09:33:39.9112772Z * [new branch] gh/aorenste/156/head -> origin/gh/aorenste/156/head 2025-12-04T09:33:39.9113927Z * [new branch] gh/aorenste/156/orig -> origin/gh/aorenste/156/orig 2025-12-04T09:33:39.9116034Z * [new branch] gh/aorenste/157/base -> origin/gh/aorenste/157/base 2025-12-04T09:33:39.9117291Z * [new branch] gh/aorenste/157/head -> origin/gh/aorenste/157/head 2025-12-04T09:33:39.9118540Z * [new branch] gh/aorenste/157/orig -> origin/gh/aorenste/157/orig 2025-12-04T09:33:39.9120217Z * [new branch] gh/aorenste/158/base -> origin/gh/aorenste/158/base 2025-12-04T09:33:39.9121456Z * [new branch] gh/aorenste/158/head -> origin/gh/aorenste/158/head 2025-12-04T09:33:39.9122602Z * [new branch] gh/aorenste/158/orig -> origin/gh/aorenste/158/orig 2025-12-04T09:33:39.9124288Z * [new branch] gh/aorenste/159/base -> origin/gh/aorenste/159/base 2025-12-04T09:33:39.9125519Z * [new branch] gh/aorenste/159/head -> origin/gh/aorenste/159/head 2025-12-04T09:33:39.9126686Z * [new branch] gh/aorenste/159/orig -> origin/gh/aorenste/159/orig 2025-12-04T09:33:39.9128959Z * [new branch] gh/avikchaudhuri/1/base -> origin/gh/avikchaudhuri/1/base 2025-12-04T09:33:39.9130223Z * [new branch] gh/avikchaudhuri/1/head -> origin/gh/avikchaudhuri/1/head 2025-12-04T09:33:39.9131869Z * [new branch] gh/avikchaudhuri/2/base -> origin/gh/avikchaudhuri/2/base 2025-12-04T09:33:39.9133052Z * [new branch] gh/avikchaudhuri/2/head -> origin/gh/avikchaudhuri/2/head 2025-12-04T09:33:39.9134232Z * [new branch] gh/avikchaudhuri/2/orig -> origin/gh/avikchaudhuri/2/orig 2025-12-04T09:33:39.9136844Z * [new branch] gh/bdhirsh/666/base -> origin/gh/bdhirsh/666/base 2025-12-04T09:33:39.9137970Z * [new branch] gh/bdhirsh/666/head -> origin/gh/bdhirsh/666/head 2025-12-04T09:33:39.9139757Z * [new branch] gh/bdhirsh/666/orig -> origin/gh/bdhirsh/666/orig 2025-12-04T09:33:39.9141153Z * [new branch] gh/bdhirsh/668/base -> origin/gh/bdhirsh/668/base 2025-12-04T09:33:39.9142414Z * [new branch] gh/bdhirsh/668/head -> origin/gh/bdhirsh/668/head 2025-12-04T09:33:39.9143835Z * [new branch] gh/bdhirsh/668/orig -> origin/gh/bdhirsh/668/orig 2025-12-04T09:33:39.9145786Z * [new branch] gh/bdhirsh/669/base -> origin/gh/bdhirsh/669/base 2025-12-04T09:33:39.9146931Z * [new branch] gh/bdhirsh/669/head -> origin/gh/bdhirsh/669/head 2025-12-04T09:33:39.9148140Z * [new branch] gh/bdhirsh/669/orig -> origin/gh/bdhirsh/669/orig 2025-12-04T09:33:39.9150195Z * [new branch] gh/bdhirsh/670/base -> origin/gh/bdhirsh/670/base 2025-12-04T09:33:39.9151492Z * [new branch] gh/bdhirsh/670/head -> origin/gh/bdhirsh/670/head 2025-12-04T09:33:39.9152910Z * [new branch] gh/bdhirsh/670/orig -> origin/gh/bdhirsh/670/orig 2025-12-04T09:33:39.9154762Z * [new branch] gh/bdhirsh/672/base -> origin/gh/bdhirsh/672/base 2025-12-04T09:33:39.9155868Z * [new branch] gh/bdhirsh/672/head -> origin/gh/bdhirsh/672/head 2025-12-04T09:33:39.9157121Z * [new branch] gh/bdhirsh/672/orig -> origin/gh/bdhirsh/672/orig 2025-12-04T09:33:39.9159235Z * [new branch] gh/bdhirsh/675/base -> origin/gh/bdhirsh/675/base 2025-12-04T09:33:39.9160690Z * [new branch] gh/bdhirsh/675/head -> origin/gh/bdhirsh/675/head 2025-12-04T09:33:39.9161998Z * [new branch] gh/bdhirsh/675/orig -> origin/gh/bdhirsh/675/orig 2025-12-04T09:33:39.9163827Z * [new branch] gh/bdhirsh/676/base -> origin/gh/bdhirsh/676/base 2025-12-04T09:33:39.9165104Z * [new branch] gh/bdhirsh/676/head -> origin/gh/bdhirsh/676/head 2025-12-04T09:33:39.9166564Z * [new branch] gh/bdhirsh/676/orig -> origin/gh/bdhirsh/676/orig 2025-12-04T09:33:39.9168168Z * [new branch] gh/bdhirsh/677/base -> origin/gh/bdhirsh/677/base 2025-12-04T09:33:39.9169892Z * [new branch] gh/bdhirsh/677/head -> origin/gh/bdhirsh/677/head 2025-12-04T09:33:39.9171201Z * [new branch] gh/bdhirsh/677/orig -> origin/gh/bdhirsh/677/orig 2025-12-04T09:33:39.9173047Z * [new branch] gh/bdhirsh/678/base -> origin/gh/bdhirsh/678/base 2025-12-04T09:33:39.9174466Z * [new branch] gh/bdhirsh/678/head -> origin/gh/bdhirsh/678/head 2025-12-04T09:33:39.9175747Z * [new branch] gh/bdhirsh/678/orig -> origin/gh/bdhirsh/678/orig 2025-12-04T09:33:39.9177648Z * [new branch] gh/bdhirsh/679/base -> origin/gh/bdhirsh/679/base 2025-12-04T09:33:39.9179123Z * [new branch] gh/bdhirsh/679/head -> origin/gh/bdhirsh/679/head 2025-12-04T09:33:39.9180514Z * [new branch] gh/bdhirsh/679/orig -> origin/gh/bdhirsh/679/orig 2025-12-04T09:33:39.9182263Z * [new branch] gh/bdhirsh/680/base -> origin/gh/bdhirsh/680/base 2025-12-04T09:33:39.9183664Z * [new branch] gh/bdhirsh/680/head -> origin/gh/bdhirsh/680/head 2025-12-04T09:33:39.9184917Z * [new branch] gh/bdhirsh/680/orig -> origin/gh/bdhirsh/680/orig 2025-12-04T09:33:39.9186481Z * [new branch] gh/bdhirsh/681/base -> origin/gh/bdhirsh/681/base 2025-12-04T09:33:39.9188066Z * [new branch] gh/bdhirsh/681/head -> origin/gh/bdhirsh/681/head 2025-12-04T09:33:39.9189562Z * [new branch] gh/bdhirsh/681/orig -> origin/gh/bdhirsh/681/orig 2025-12-04T09:33:39.9191588Z * [new branch] gh/benjaminglass1/101/base -> origin/gh/benjaminglass1/101/base 2025-12-04T09:33:39.9192857Z * [new branch] gh/benjaminglass1/101/head -> origin/gh/benjaminglass1/101/head 2025-12-04T09:33:39.9194211Z * [new branch] gh/benjaminglass1/101/orig -> origin/gh/benjaminglass1/101/orig 2025-12-04T09:33:39.9195920Z * [new branch] gh/benjaminglass1/102/base -> origin/gh/benjaminglass1/102/base 2025-12-04T09:33:39.9197307Z * [new branch] gh/benjaminglass1/102/head -> origin/gh/benjaminglass1/102/head 2025-12-04T09:33:39.9198625Z * [new branch] gh/benjaminglass1/102/orig -> origin/gh/benjaminglass1/102/orig 2025-12-04T09:33:39.9200373Z * [new branch] gh/benjaminglass1/106/base -> origin/gh/benjaminglass1/106/base 2025-12-04T09:33:39.9201630Z * [new branch] gh/benjaminglass1/106/head -> origin/gh/benjaminglass1/106/head 2025-12-04T09:33:39.9202897Z * [new branch] gh/benjaminglass1/106/orig -> origin/gh/benjaminglass1/106/orig 2025-12-04T09:33:39.9204564Z * [new branch] gh/benjaminglass1/107/base -> origin/gh/benjaminglass1/107/base 2025-12-04T09:33:39.9205832Z * [new branch] gh/benjaminglass1/107/head -> origin/gh/benjaminglass1/107/head 2025-12-04T09:33:39.9207120Z * [new branch] gh/benjaminglass1/107/orig -> origin/gh/benjaminglass1/107/orig 2025-12-04T09:33:39.9208816Z * [new branch] gh/benjaminglass1/108/base -> origin/gh/benjaminglass1/108/base 2025-12-04T09:33:39.9210066Z * [new branch] gh/benjaminglass1/108/head -> origin/gh/benjaminglass1/108/head 2025-12-04T09:33:39.9211329Z * [new branch] gh/benjaminglass1/108/orig -> origin/gh/benjaminglass1/108/orig 2025-12-04T09:33:39.9212984Z * [new branch] gh/benjaminglass1/109/base -> origin/gh/benjaminglass1/109/base 2025-12-04T09:33:39.9214347Z * [new branch] gh/benjaminglass1/109/head -> origin/gh/benjaminglass1/109/head 2025-12-04T09:33:39.9215692Z * [new branch] gh/benjaminglass1/109/orig -> origin/gh/benjaminglass1/109/orig 2025-12-04T09:33:39.9217307Z * [new branch] gh/benjaminglass1/97/base -> origin/gh/benjaminglass1/97/base 2025-12-04T09:33:39.9218646Z * [new branch] gh/benjaminglass1/97/head -> origin/gh/benjaminglass1/97/head 2025-12-04T09:33:39.9219981Z * [new branch] gh/benjaminglass1/97/orig -> origin/gh/benjaminglass1/97/orig 2025-12-04T09:33:39.9222506Z * [new branch] gh/bobrenjc93/570/base -> origin/gh/bobrenjc93/570/base 2025-12-04T09:33:39.9223816Z * [new branch] gh/bobrenjc93/570/head -> origin/gh/bobrenjc93/570/head 2025-12-04T09:33:39.9225078Z * [new branch] gh/bobrenjc93/570/orig -> origin/gh/bobrenjc93/570/orig 2025-12-04T09:33:39.9226690Z * [new branch] gh/bobrenjc93/604/base -> origin/gh/bobrenjc93/604/base 2025-12-04T09:33:39.9228039Z * [new branch] gh/bobrenjc93/604/head -> origin/gh/bobrenjc93/604/head 2025-12-04T09:33:39.9229335Z * [new branch] gh/bobrenjc93/604/orig -> origin/gh/bobrenjc93/604/orig 2025-12-04T09:33:39.9231094Z * [new branch] gh/bobrenjc93/638/base -> origin/gh/bobrenjc93/638/base 2025-12-04T09:33:39.9232364Z * [new branch] gh/bobrenjc93/638/head -> origin/gh/bobrenjc93/638/head 2025-12-04T09:33:39.9233674Z * [new branch] gh/bobrenjc93/638/orig -> origin/gh/bobrenjc93/638/orig 2025-12-04T09:33:39.9235352Z * [new branch] gh/bobrenjc93/653/base -> origin/gh/bobrenjc93/653/base 2025-12-04T09:33:39.9236643Z * [new branch] gh/bobrenjc93/653/head -> origin/gh/bobrenjc93/653/head 2025-12-04T09:33:39.9237869Z * [new branch] gh/bobrenjc93/653/orig -> origin/gh/bobrenjc93/653/orig 2025-12-04T09:33:39.9239716Z * [new branch] gh/bobrenjc93/654/base -> origin/gh/bobrenjc93/654/base 2025-12-04T09:33:39.9240939Z * [new branch] gh/bobrenjc93/654/head -> origin/gh/bobrenjc93/654/head 2025-12-04T09:33:39.9242234Z * [new branch] gh/bobrenjc93/654/orig -> origin/gh/bobrenjc93/654/orig 2025-12-04T09:33:39.9243905Z * [new branch] gh/bobrenjc93/657/base -> origin/gh/bobrenjc93/657/base 2025-12-04T09:33:39.9245138Z * [new branch] gh/bobrenjc93/657/head -> origin/gh/bobrenjc93/657/head 2025-12-04T09:33:39.9246380Z * [new branch] gh/bobrenjc93/657/orig -> origin/gh/bobrenjc93/657/orig 2025-12-04T09:33:39.9248214Z * [new branch] gh/bobrenjc93/672/base -> origin/gh/bobrenjc93/672/base 2025-12-04T09:33:39.9249401Z * [new branch] gh/bobrenjc93/672/head -> origin/gh/bobrenjc93/672/head 2025-12-04T09:33:39.9250709Z * [new branch] gh/bobrenjc93/672/orig -> origin/gh/bobrenjc93/672/orig 2025-12-04T09:33:39.9254390Z * [new branch] gh/bobrenjc93/679/base -> origin/gh/bobrenjc93/679/base 2025-12-04T09:33:39.9254669Z * [new branch] gh/bobrenjc93/679/head -> origin/gh/bobrenjc93/679/head 2025-12-04T09:33:39.9255039Z * [new branch] gh/bobrenjc93/679/orig -> origin/gh/bobrenjc93/679/orig 2025-12-04T09:33:39.9256906Z * [new branch] gh/bobrenjc93/680/base -> origin/gh/bobrenjc93/680/base 2025-12-04T09:33:39.9258131Z * [new branch] gh/bobrenjc93/680/head -> origin/gh/bobrenjc93/680/head 2025-12-04T09:33:39.9259877Z * [new branch] gh/bobrenjc93/680/orig -> origin/gh/bobrenjc93/680/orig 2025-12-04T09:33:39.9261376Z * [new branch] gh/bobrenjc93/681/base -> origin/gh/bobrenjc93/681/base 2025-12-04T09:33:39.9262659Z * [new branch] gh/bobrenjc93/681/head -> origin/gh/bobrenjc93/681/head 2025-12-04T09:33:39.9264029Z * [new branch] gh/bobrenjc93/681/orig -> origin/gh/bobrenjc93/681/orig 2025-12-04T09:33:39.9265621Z * [new branch] gh/bobrenjc93/682/base -> origin/gh/bobrenjc93/682/base 2025-12-04T09:33:39.9266896Z * [new branch] gh/bobrenjc93/682/head -> origin/gh/bobrenjc93/682/head 2025-12-04T09:33:39.9268148Z * [new branch] gh/bobrenjc93/682/orig -> origin/gh/bobrenjc93/682/orig 2025-12-04T09:33:39.9269849Z * [new branch] gh/bobrenjc93/683/base -> origin/gh/bobrenjc93/683/base 2025-12-04T09:33:39.9271107Z * [new branch] gh/bobrenjc93/683/head -> origin/gh/bobrenjc93/683/head 2025-12-04T09:33:39.9272405Z * [new branch] gh/bobrenjc93/683/orig -> origin/gh/bobrenjc93/683/orig 2025-12-04T09:33:39.9274580Z * [new branch] gh/bobrenjc93/684/base -> origin/gh/bobrenjc93/684/base 2025-12-04T09:33:39.9276092Z * [new branch] gh/bobrenjc93/684/head -> origin/gh/bobrenjc93/684/head 2025-12-04T09:33:39.9277559Z * [new branch] gh/bobrenjc93/684/orig -> origin/gh/bobrenjc93/684/orig 2025-12-04T09:33:39.9279101Z * [new branch] gh/bobrenjc93/685/base -> origin/gh/bobrenjc93/685/base 2025-12-04T09:33:39.9280773Z * [new branch] gh/bobrenjc93/685/head -> origin/gh/bobrenjc93/685/head 2025-12-04T09:33:39.9282387Z * [new branch] gh/bobrenjc93/685/orig -> origin/gh/bobrenjc93/685/orig 2025-12-04T09:33:39.9284404Z * [new branch] gh/bobrenjc93/686/base -> origin/gh/bobrenjc93/686/base 2025-12-04T09:33:39.9285625Z * [new branch] gh/bobrenjc93/686/head -> origin/gh/bobrenjc93/686/head 2025-12-04T09:33:39.9286977Z * [new branch] gh/bobrenjc93/686/orig -> origin/gh/bobrenjc93/686/orig 2025-12-04T09:33:39.9289177Z * [new branch] gh/bobrenjc93/687/base -> origin/gh/bobrenjc93/687/base 2025-12-04T09:33:39.9290437Z * [new branch] gh/bobrenjc93/687/head -> origin/gh/bobrenjc93/687/head 2025-12-04T09:33:39.9291666Z * [new branch] gh/bobrenjc93/687/orig -> origin/gh/bobrenjc93/687/orig 2025-12-04T09:33:39.9293808Z * [new branch] gh/bobrenjc93/688/base -> origin/gh/bobrenjc93/688/base 2025-12-04T09:33:39.9295121Z * [new branch] gh/bobrenjc93/688/head -> origin/gh/bobrenjc93/688/head 2025-12-04T09:33:39.9296430Z * [new branch] gh/bobrenjc93/688/orig -> origin/gh/bobrenjc93/688/orig 2025-12-04T09:33:39.9298028Z * [new branch] gh/bobrenjc93/689/base -> origin/gh/bobrenjc93/689/base 2025-12-04T09:33:39.9299534Z * [new branch] gh/bobrenjc93/689/head -> origin/gh/bobrenjc93/689/head 2025-12-04T09:33:39.9300845Z * [new branch] gh/bobrenjc93/689/orig -> origin/gh/bobrenjc93/689/orig 2025-12-04T09:33:39.9302543Z * [new branch] gh/bobrenjc93/690/base -> origin/gh/bobrenjc93/690/base 2025-12-04T09:33:39.9303831Z * [new branch] gh/bobrenjc93/690/head -> origin/gh/bobrenjc93/690/head 2025-12-04T09:33:39.9305111Z * [new branch] gh/bobrenjc93/690/orig -> origin/gh/bobrenjc93/690/orig 2025-12-04T09:33:39.9307588Z * [new branch] gh/bobrenjc93/691/base -> origin/gh/bobrenjc93/691/base 2025-12-04T09:33:39.9309224Z * [new branch] gh/bobrenjc93/691/head -> origin/gh/bobrenjc93/691/head 2025-12-04T09:33:39.9311012Z * [new branch] gh/bobrenjc93/691/orig -> origin/gh/bobrenjc93/691/orig 2025-12-04T09:33:39.9313514Z * [new branch] gh/bobrenjc93/692/base -> origin/gh/bobrenjc93/692/base 2025-12-04T09:33:39.9314819Z * [new branch] gh/bobrenjc93/692/head -> origin/gh/bobrenjc93/692/head 2025-12-04T09:33:39.9316094Z * [new branch] gh/bobrenjc93/692/orig -> origin/gh/bobrenjc93/692/orig 2025-12-04T09:33:39.9317732Z * [new branch] gh/bobrenjc93/693/base -> origin/gh/bobrenjc93/693/base 2025-12-04T09:33:39.9318978Z * [new branch] gh/bobrenjc93/693/head -> origin/gh/bobrenjc93/693/head 2025-12-04T09:33:39.9320357Z * [new branch] gh/bobrenjc93/693/orig -> origin/gh/bobrenjc93/693/orig 2025-12-04T09:33:39.9322200Z * [new branch] gh/bobrenjc93/694/base -> origin/gh/bobrenjc93/694/base 2025-12-04T09:33:39.9323538Z * [new branch] gh/bobrenjc93/694/head -> origin/gh/bobrenjc93/694/head 2025-12-04T09:33:39.9324853Z * [new branch] gh/bobrenjc93/694/orig -> origin/gh/bobrenjc93/694/orig 2025-12-04T09:33:39.9326428Z * [new branch] gh/bobrenjc93/695/base -> origin/gh/bobrenjc93/695/base 2025-12-04T09:33:39.9327733Z * [new branch] gh/bobrenjc93/695/head -> origin/gh/bobrenjc93/695/head 2025-12-04T09:33:39.9329093Z * [new branch] gh/bobrenjc93/695/orig -> origin/gh/bobrenjc93/695/orig 2025-12-04T09:33:39.9331131Z * [new branch] gh/c00w/23/base -> origin/gh/c00w/23/base 2025-12-04T09:33:39.9332509Z * [new branch] gh/c00w/23/head -> origin/gh/c00w/23/head 2025-12-04T09:33:39.9334269Z * [new branch] gh/c00w/53/base -> origin/gh/c00w/53/base 2025-12-04T09:33:39.9335469Z * [new branch] gh/c00w/53/head -> origin/gh/c00w/53/head 2025-12-04T09:33:39.9336703Z * [new branch] gh/c00w/53/orig -> origin/gh/c00w/53/orig 2025-12-04T09:33:39.9338408Z * [new branch] gh/c00w/54/base -> origin/gh/c00w/54/base 2025-12-04T09:33:39.9339851Z * [new branch] gh/c00w/54/head -> origin/gh/c00w/54/head 2025-12-04T09:33:39.9341118Z * [new branch] gh/c00w/54/orig -> origin/gh/c00w/54/orig 2025-12-04T09:33:39.9342792Z * [new branch] gh/c00w/56/base -> origin/gh/c00w/56/base 2025-12-04T09:33:39.9344124Z * [new branch] gh/c00w/56/head -> origin/gh/c00w/56/head 2025-12-04T09:33:39.9345341Z * [new branch] gh/c00w/56/orig -> origin/gh/c00w/56/orig 2025-12-04T09:33:39.9347000Z * [new branch] gh/c00w/57/base -> origin/gh/c00w/57/base 2025-12-04T09:33:39.9348259Z * [new branch] gh/c00w/57/head -> origin/gh/c00w/57/head 2025-12-04T09:33:39.9349580Z * [new branch] gh/c00w/57/orig -> origin/gh/c00w/57/orig 2025-12-04T09:33:39.9351157Z * [new branch] gh/c00w/58/base -> origin/gh/c00w/58/base 2025-12-04T09:33:39.9352466Z * [new branch] gh/c00w/58/head -> origin/gh/c00w/58/head 2025-12-04T09:33:39.9353709Z * [new branch] gh/c00w/58/orig -> origin/gh/c00w/58/orig 2025-12-04T09:33:39.9355862Z * [new branch] gh/clee2000/1/base -> origin/gh/clee2000/1/base 2025-12-04T09:33:39.9357324Z * [new branch] gh/clee2000/1/head -> origin/gh/clee2000/1/head 2025-12-04T09:33:39.9358554Z * [new branch] gh/clee2000/1/orig -> origin/gh/clee2000/1/orig 2025-12-04T09:33:39.9360709Z * [new branch] gh/coconutruben/1/base -> origin/gh/coconutruben/1/base 2025-12-04T09:33:39.9362080Z * [new branch] gh/coconutruben/1/head -> origin/gh/coconutruben/1/head 2025-12-04T09:33:39.9364132Z * [new branch] gh/coconutruben/55/base -> origin/gh/coconutruben/55/base 2025-12-04T09:33:39.9365399Z * [new branch] gh/coconutruben/55/head -> origin/gh/coconutruben/55/head 2025-12-04T09:33:39.9366729Z * [new branch] gh/coconutruben/55/orig -> origin/gh/coconutruben/55/orig 2025-12-04T09:33:39.9368618Z * [new branch] gh/coconutruben/57/base -> origin/gh/coconutruben/57/base 2025-12-04T09:33:39.9370062Z * [new branch] gh/coconutruben/57/head -> origin/gh/coconutruben/57/head 2025-12-04T09:33:39.9371482Z * [new branch] gh/coconutruben/57/orig -> origin/gh/coconutruben/57/orig 2025-12-04T09:33:39.9373352Z * [new branch] gh/coconutruben/70/base -> origin/gh/coconutruben/70/base 2025-12-04T09:33:39.9374718Z * [new branch] gh/coconutruben/70/head -> origin/gh/coconutruben/70/head 2025-12-04T09:33:39.9376112Z * [new branch] gh/coconutruben/70/orig -> origin/gh/coconutruben/70/orig 2025-12-04T09:33:39.9377632Z * [new branch] gh/coconutruben/71/base -> origin/gh/coconutruben/71/base 2025-12-04T09:33:39.9379131Z * [new branch] gh/coconutruben/71/head -> origin/gh/coconutruben/71/head 2025-12-04T09:33:39.9380420Z * [new branch] gh/coconutruben/71/orig -> origin/gh/coconutruben/71/orig 2025-12-04T09:33:39.9381950Z * [new branch] gh/coconutruben/72/base -> origin/gh/coconutruben/72/base 2025-12-04T09:33:39.9383481Z * [new branch] gh/coconutruben/72/head -> origin/gh/coconutruben/72/head 2025-12-04T09:33:39.9384710Z * [new branch] gh/coconutruben/72/orig -> origin/gh/coconutruben/72/orig 2025-12-04T09:33:39.9386261Z * [new branch] gh/coconutruben/73/base -> origin/gh/coconutruben/73/base 2025-12-04T09:33:39.9387471Z * [new branch] gh/coconutruben/73/head -> origin/gh/coconutruben/73/head 2025-12-04T09:33:39.9389732Z * [new branch] gh/coconutruben/73/orig -> origin/gh/coconutruben/73/orig 2025-12-04T09:33:39.9391710Z * [new branch] gh/coconutruben/74/base -> origin/gh/coconutruben/74/base 2025-12-04T09:33:39.9393152Z * [new branch] gh/coconutruben/74/head -> origin/gh/coconutruben/74/head 2025-12-04T09:33:39.9394429Z * [new branch] gh/coconutruben/74/orig -> origin/gh/coconutruben/74/orig 2025-12-04T09:33:39.9396265Z * [new branch] gh/coconutruben/79/base -> origin/gh/coconutruben/79/base 2025-12-04T09:33:39.9397924Z * [new branch] gh/coconutruben/79/head -> origin/gh/coconutruben/79/head 2025-12-04T09:33:39.9399108Z * [new branch] gh/coconutruben/79/orig -> origin/gh/coconutruben/79/orig 2025-12-04T09:33:39.9400884Z * [new branch] gh/coconutruben/80/base -> origin/gh/coconutruben/80/base 2025-12-04T09:33:39.9402133Z * [new branch] gh/coconutruben/80/head -> origin/gh/coconutruben/80/head 2025-12-04T09:33:39.9403508Z * [new branch] gh/coconutruben/80/orig -> origin/gh/coconutruben/80/orig 2025-12-04T09:33:39.9405330Z * [new branch] gh/coconutruben/82/base -> origin/gh/coconutruben/82/base 2025-12-04T09:33:39.9406516Z * [new branch] gh/coconutruben/82/head -> origin/gh/coconutruben/82/head 2025-12-04T09:33:39.9407763Z * [new branch] gh/coconutruben/82/orig -> origin/gh/coconutruben/82/orig 2025-12-04T09:33:39.9409780Z * [new branch] gh/coconutruben/83/base -> origin/gh/coconutruben/83/base 2025-12-04T09:33:39.9410998Z * [new branch] gh/coconutruben/83/head -> origin/gh/coconutruben/83/head 2025-12-04T09:33:39.9412216Z * [new branch] gh/coconutruben/83/orig -> origin/gh/coconutruben/83/orig 2025-12-04T09:33:39.9414051Z * [new branch] gh/coconutruben/84/base -> origin/gh/coconutruben/84/base 2025-12-04T09:33:39.9415413Z * [new branch] gh/coconutruben/84/head -> origin/gh/coconutruben/84/head 2025-12-04T09:33:39.9416769Z * [new branch] gh/coconutruben/84/orig -> origin/gh/coconutruben/84/orig 2025-12-04T09:33:39.9418500Z * [new branch] gh/coconutruben/85/base -> origin/gh/coconutruben/85/base 2025-12-04T09:33:39.9419946Z * [new branch] gh/coconutruben/85/head -> origin/gh/coconutruben/85/head 2025-12-04T09:33:39.9421634Z * [new branch] gh/coconutruben/85/orig -> origin/gh/coconutruben/85/orig 2025-12-04T09:33:39.9423016Z * [new branch] gh/coconutruben/86/base -> origin/gh/coconutruben/86/base 2025-12-04T09:33:39.9424284Z * [new branch] gh/coconutruben/86/head -> origin/gh/coconutruben/86/head 2025-12-04T09:33:39.9425564Z * [new branch] gh/coconutruben/86/orig -> origin/gh/coconutruben/86/orig 2025-12-04T09:33:39.9427771Z * [new branch] gh/colinchan15/1/base -> origin/gh/colinchan15/1/base 2025-12-04T09:33:39.9429063Z * [new branch] gh/colinchan15/1/head -> origin/gh/colinchan15/1/head 2025-12-04T09:33:39.9430573Z * [new branch] gh/colinchan15/2/base -> origin/gh/colinchan15/2/base 2025-12-04T09:33:39.9431820Z * [new branch] gh/colinchan15/2/head -> origin/gh/colinchan15/2/head 2025-12-04T09:33:39.9433347Z * [new branch] gh/colinchan15/3/base -> origin/gh/colinchan15/3/base 2025-12-04T09:33:39.9434606Z * [new branch] gh/colinchan15/3/head -> origin/gh/colinchan15/3/head 2025-12-04T09:33:39.9436105Z * [new branch] gh/colinchan15/6/base -> origin/gh/colinchan15/6/base 2025-12-04T09:33:39.9437371Z * [new branch] gh/colinchan15/6/head -> origin/gh/colinchan15/6/head 2025-12-04T09:33:39.9439345Z * [new branch] gh/d4l3k/1/base -> origin/gh/d4l3k/1/base 2025-12-04T09:33:39.9440734Z * [new branch] gh/d4l3k/1/head -> origin/gh/d4l3k/1/head 2025-12-04T09:33:39.9442403Z * [new branch] gh/d4l3k/2/base -> origin/gh/d4l3k/2/base 2025-12-04T09:33:39.9443652Z * [new branch] gh/d4l3k/2/head -> origin/gh/d4l3k/2/head 2025-12-04T09:33:39.9444893Z * [new branch] gh/d4l3k/2/orig -> origin/gh/d4l3k/2/orig 2025-12-04T09:33:39.9446540Z * [new branch] gh/d4l3k/3/base -> origin/gh/d4l3k/3/base 2025-12-04T09:33:39.9447793Z * [new branch] gh/d4l3k/3/head -> origin/gh/d4l3k/3/head 2025-12-04T09:33:39.9449152Z * [new branch] gh/d4l3k/3/orig -> origin/gh/d4l3k/3/orig 2025-12-04T09:33:39.9450729Z * [new branch] gh/d4l3k/4/base -> origin/gh/d4l3k/4/base 2025-12-04T09:33:39.9452027Z * [new branch] gh/d4l3k/4/head -> origin/gh/d4l3k/4/head 2025-12-04T09:33:39.9453311Z * [new branch] gh/d4l3k/4/orig -> origin/gh/d4l3k/4/orig 2025-12-04T09:33:39.9454938Z * [new branch] gh/d4l3k/5/base -> origin/gh/d4l3k/5/base 2025-12-04T09:33:39.9456234Z * [new branch] gh/d4l3k/5/orig -> origin/gh/d4l3k/5/orig 2025-12-04T09:33:39.9458511Z * [new branch] gh/davidberard98/392/base -> origin/gh/davidberard98/392/base 2025-12-04T09:33:39.9459889Z * [new branch] gh/davidberard98/392/head -> origin/gh/davidberard98/392/head 2025-12-04T09:33:39.9461154Z * [new branch] gh/davidberard98/392/orig -> origin/gh/davidberard98/392/orig 2025-12-04T09:33:39.9462975Z * [new branch] gh/davidberard98/399/base -> origin/gh/davidberard98/399/base 2025-12-04T09:33:39.9464353Z * [new branch] gh/davidberard98/399/head -> origin/gh/davidberard98/399/head 2025-12-04T09:33:39.9465667Z * [new branch] gh/davidberard98/399/orig -> origin/gh/davidberard98/399/orig 2025-12-04T09:33:39.9467644Z * [new branch] gh/desertfire/605/base -> origin/gh/desertfire/605/base 2025-12-04T09:33:39.9468903Z * [new branch] gh/desertfire/605/head -> origin/gh/desertfire/605/head 2025-12-04T09:33:39.9470256Z * [new branch] gh/desertfire/605/orig -> origin/gh/desertfire/605/orig 2025-12-04T09:33:39.9471898Z * [new branch] gh/desertfire/606/base -> origin/gh/desertfire/606/base 2025-12-04T09:33:39.9473279Z * [new branch] gh/desertfire/606/head -> origin/gh/desertfire/606/head 2025-12-04T09:33:39.9474725Z * [new branch] gh/desertfire/606/orig -> origin/gh/desertfire/606/orig 2025-12-04T09:33:39.9476381Z * [new branch] gh/desertfire/607/base -> origin/gh/desertfire/607/base 2025-12-04T09:33:39.9477595Z * [new branch] gh/desertfire/607/head -> origin/gh/desertfire/607/head 2025-12-04T09:33:39.9478928Z * [new branch] gh/desertfire/607/orig -> origin/gh/desertfire/607/orig 2025-12-04T09:33:39.9480636Z * [new branch] gh/desertfire/608/base -> origin/gh/desertfire/608/base 2025-12-04T09:33:39.9481886Z * [new branch] gh/desertfire/608/head -> origin/gh/desertfire/608/head 2025-12-04T09:33:39.9483202Z * [new branch] gh/desertfire/608/orig -> origin/gh/desertfire/608/orig 2025-12-04T09:33:39.9484855Z * [new branch] gh/desertfire/609/base -> origin/gh/desertfire/609/base 2025-12-04T09:33:39.9486116Z * [new branch] gh/desertfire/609/head -> origin/gh/desertfire/609/head 2025-12-04T09:33:39.9487415Z * [new branch] gh/desertfire/609/orig -> origin/gh/desertfire/609/orig 2025-12-04T09:33:39.9489515Z * [new branch] gh/desertfire/610/base -> origin/gh/desertfire/610/base 2025-12-04T09:33:39.9491272Z * [new branch] gh/desertfire/610/head -> origin/gh/desertfire/610/head 2025-12-04T09:33:39.9492564Z * [new branch] gh/desertfire/610/orig -> origin/gh/desertfire/610/orig 2025-12-04T09:33:39.9507124Z * [new branch] gh/desertfire/611/base -> origin/gh/desertfire/611/base 2025-12-04T09:33:39.9507813Z * [new branch] gh/desertfire/611/head -> origin/gh/desertfire/611/head 2025-12-04T09:33:39.9508292Z * [new branch] gh/desertfire/611/orig -> origin/gh/desertfire/611/orig 2025-12-04T09:33:39.9508573Z * [new branch] gh/desertfire/612/base -> origin/gh/desertfire/612/base 2025-12-04T09:33:39.9509064Z * [new branch] gh/desertfire/612/head -> origin/gh/desertfire/612/head 2025-12-04T09:33:39.9509612Z * [new branch] gh/desertfire/612/orig -> origin/gh/desertfire/612/orig 2025-12-04T09:33:39.9510033Z * [new branch] gh/desertfire/613/base -> origin/gh/desertfire/613/base 2025-12-04T09:33:39.9510435Z * [new branch] gh/desertfire/613/head -> origin/gh/desertfire/613/head 2025-12-04T09:33:39.9510969Z * [new branch] gh/desertfire/613/orig -> origin/gh/desertfire/613/orig 2025-12-04T09:33:39.9511342Z * [new branch] gh/desertfire/614/base -> origin/gh/desertfire/614/base 2025-12-04T09:33:39.9511619Z * [new branch] gh/desertfire/614/head -> origin/gh/desertfire/614/head 2025-12-04T09:33:39.9511877Z * [new branch] gh/desertfire/614/orig -> origin/gh/desertfire/614/orig 2025-12-04T09:33:39.9512446Z * [new branch] gh/desertfire/615/base -> origin/gh/desertfire/615/base 2025-12-04T09:33:39.9514069Z * [new branch] gh/desertfire/615/head -> origin/gh/desertfire/615/head 2025-12-04T09:33:39.9515267Z * [new branch] gh/desertfire/615/orig -> origin/gh/desertfire/615/orig 2025-12-04T09:33:39.9516912Z * [new branch] gh/desertfire/616/base -> origin/gh/desertfire/616/base 2025-12-04T09:33:39.9518161Z * [new branch] gh/desertfire/616/head -> origin/gh/desertfire/616/head 2025-12-04T09:33:39.9519370Z * [new branch] gh/desertfire/616/orig -> origin/gh/desertfire/616/orig 2025-12-04T09:33:39.9520954Z * [new branch] gh/desertfire/617/base -> origin/gh/desertfire/617/base 2025-12-04T09:33:39.9522256Z * [new branch] gh/desertfire/617/head -> origin/gh/desertfire/617/head 2025-12-04T09:33:39.9523457Z * [new branch] gh/desertfire/617/orig -> origin/gh/desertfire/617/orig 2025-12-04T09:33:39.9525621Z * [new branch] gh/dharakk/1/base -> origin/gh/dharakk/1/base 2025-12-04T09:33:39.9526943Z * [new branch] gh/dharakk/1/head -> origin/gh/dharakk/1/head 2025-12-04T09:33:39.9529009Z * [new branch] gh/drisspg/170/base -> origin/gh/drisspg/170/base 2025-12-04T09:33:39.9530241Z * [new branch] gh/drisspg/170/head -> origin/gh/drisspg/170/head 2025-12-04T09:33:39.9531519Z * [new branch] gh/drisspg/170/orig -> origin/gh/drisspg/170/orig 2025-12-04T09:33:39.9533301Z * [new branch] gh/drisspg/182/base -> origin/gh/drisspg/182/base 2025-12-04T09:33:39.9534536Z * [new branch] gh/drisspg/182/head -> origin/gh/drisspg/182/head 2025-12-04T09:33:39.9536119Z * [new branch] gh/drisspg/183/base -> origin/gh/drisspg/183/base 2025-12-04T09:33:39.9537244Z * [new branch] gh/drisspg/183/head -> origin/gh/drisspg/183/head 2025-12-04T09:33:39.9538867Z * [new branch] gh/drisspg/184/base -> origin/gh/drisspg/184/base 2025-12-04T09:33:39.9540023Z * [new branch] gh/drisspg/184/head -> origin/gh/drisspg/184/head 2025-12-04T09:33:39.9541906Z * [new branch] gh/drisspg/185/base -> origin/gh/drisspg/185/base 2025-12-04T09:33:39.9543086Z * [new branch] gh/drisspg/185/head -> origin/gh/drisspg/185/head 2025-12-04T09:33:39.9544837Z * [new branch] gh/drisspg/194/base -> origin/gh/drisspg/194/base 2025-12-04T09:33:39.9546052Z * [new branch] gh/drisspg/194/head -> origin/gh/drisspg/194/head 2025-12-04T09:33:39.9547309Z * [new branch] gh/drisspg/194/orig -> origin/gh/drisspg/194/orig 2025-12-04T09:33:39.9549008Z * [new branch] gh/drisspg/200/base -> origin/gh/drisspg/200/base 2025-12-04T09:33:39.9550172Z * [new branch] gh/drisspg/200/head -> origin/gh/drisspg/200/head 2025-12-04T09:33:39.9551563Z * [new branch] gh/drisspg/200/orig -> origin/gh/drisspg/200/orig 2025-12-04T09:33:39.9553168Z * [new branch] gh/drisspg/218/base -> origin/gh/drisspg/218/base 2025-12-04T09:33:39.9554361Z * [new branch] gh/drisspg/218/head -> origin/gh/drisspg/218/head 2025-12-04T09:33:39.9555622Z * [new branch] gh/drisspg/218/orig -> origin/gh/drisspg/218/orig 2025-12-04T09:33:39.9557477Z * [new branch] gh/drisspg/219/base -> origin/gh/drisspg/219/base 2025-12-04T09:33:39.9558657Z * [new branch] gh/drisspg/219/head -> origin/gh/drisspg/219/head 2025-12-04T09:33:39.9560115Z * [new branch] gh/drisspg/219/orig -> origin/gh/drisspg/219/orig 2025-12-04T09:33:39.9561765Z * [new branch] gh/drisspg/220/base -> origin/gh/drisspg/220/base 2025-12-04T09:33:39.9562932Z * [new branch] gh/drisspg/220/head -> origin/gh/drisspg/220/head 2025-12-04T09:33:39.9564227Z * [new branch] gh/drisspg/220/orig -> origin/gh/drisspg/220/orig 2025-12-04T09:33:39.9565938Z * [new branch] gh/drisspg/221/base -> origin/gh/drisspg/221/base 2025-12-04T09:33:39.9567132Z * [new branch] gh/drisspg/221/head -> origin/gh/drisspg/221/head 2025-12-04T09:33:39.9568368Z * [new branch] gh/drisspg/221/orig -> origin/gh/drisspg/221/orig 2025-12-04T09:33:39.9570108Z * [new branch] gh/drisspg/222/base -> origin/gh/drisspg/222/base 2025-12-04T09:33:39.9571301Z * [new branch] gh/drisspg/222/head -> origin/gh/drisspg/222/head 2025-12-04T09:33:39.9572551Z * [new branch] gh/drisspg/222/orig -> origin/gh/drisspg/222/orig 2025-12-04T09:33:39.9574398Z * [new branch] gh/drisspg/223/base -> origin/gh/drisspg/223/base 2025-12-04T09:33:39.9575568Z * [new branch] gh/drisspg/223/head -> origin/gh/drisspg/223/head 2025-12-04T09:33:39.9576847Z * [new branch] gh/drisspg/223/orig -> origin/gh/drisspg/223/orig 2025-12-04T09:33:39.9578628Z * [new branch] gh/drisspg/224/base -> origin/gh/drisspg/224/base 2025-12-04T09:33:39.9579893Z * [new branch] gh/drisspg/224/head -> origin/gh/drisspg/224/head 2025-12-04T09:33:39.9581191Z * [new branch] gh/drisspg/224/orig -> origin/gh/drisspg/224/orig 2025-12-04T09:33:39.9582954Z * [new branch] gh/drisspg/225/base -> origin/gh/drisspg/225/base 2025-12-04T09:33:39.9584118Z * [new branch] gh/drisspg/225/head -> origin/gh/drisspg/225/head 2025-12-04T09:33:39.9585417Z * [new branch] gh/drisspg/225/orig -> origin/gh/drisspg/225/orig 2025-12-04T09:33:39.9587169Z * [new branch] gh/drisspg/226/base -> origin/gh/drisspg/226/base 2025-12-04T09:33:39.9588387Z * [new branch] gh/drisspg/226/head -> origin/gh/drisspg/226/head 2025-12-04T09:33:39.9589835Z * [new branch] gh/drisspg/226/orig -> origin/gh/drisspg/226/orig 2025-12-04T09:33:39.9592069Z * [new branch] gh/drisspg/227/base -> origin/gh/drisspg/227/base 2025-12-04T09:33:39.9593225Z * [new branch] gh/drisspg/227/head -> origin/gh/drisspg/227/head 2025-12-04T09:33:39.9594553Z * [new branch] gh/drisspg/227/orig -> origin/gh/drisspg/227/orig 2025-12-04T09:33:39.9596330Z * [new branch] gh/drisspg/228/base -> origin/gh/drisspg/228/base 2025-12-04T09:33:39.9597519Z * [new branch] gh/drisspg/228/head -> origin/gh/drisspg/228/head 2025-12-04T09:33:39.9598787Z * [new branch] gh/drisspg/228/orig -> origin/gh/drisspg/228/orig 2025-12-04T09:33:39.9600583Z * [new branch] gh/drisspg/229/base -> origin/gh/drisspg/229/base 2025-12-04T09:33:39.9601759Z * [new branch] gh/drisspg/229/head -> origin/gh/drisspg/229/head 2025-12-04T09:33:39.9603174Z * [new branch] gh/drisspg/229/orig -> origin/gh/drisspg/229/orig 2025-12-04T09:33:39.9604948Z * [new branch] gh/drisspg/230/base -> origin/gh/drisspg/230/base 2025-12-04T09:33:39.9606121Z * [new branch] gh/drisspg/230/head -> origin/gh/drisspg/230/head 2025-12-04T09:33:39.9607396Z * [new branch] gh/drisspg/230/orig -> origin/gh/drisspg/230/orig 2025-12-04T09:33:39.9609674Z * [new branch] gh/dsjohns2/1/base -> origin/gh/dsjohns2/1/base 2025-12-04T09:33:39.9610960Z * [new branch] gh/dsjohns2/1/head -> origin/gh/dsjohns2/1/head 2025-12-04T09:33:39.9613146Z * [new branch] gh/dzmitry-huba/1/base -> origin/gh/dzmitry-huba/1/base 2025-12-04T09:33:39.9614364Z * [new branch] gh/dzmitry-huba/1/head -> origin/gh/dzmitry-huba/1/head 2025-12-04T09:33:39.9616417Z * [new branch] gh/dzmitry-huba/12/base -> origin/gh/dzmitry-huba/12/base 2025-12-04T09:33:39.9617711Z * [new branch] gh/dzmitry-huba/12/head -> origin/gh/dzmitry-huba/12/head 2025-12-04T09:33:39.9619100Z * [new branch] gh/dzmitry-huba/12/orig -> origin/gh/dzmitry-huba/12/orig 2025-12-04T09:33:39.9621056Z * [new branch] gh/dzmitry-huba/13/base -> origin/gh/dzmitry-huba/13/base 2025-12-04T09:33:39.9622312Z * [new branch] gh/dzmitry-huba/13/head -> origin/gh/dzmitry-huba/13/head 2025-12-04T09:33:39.9623578Z * [new branch] gh/dzmitry-huba/13/orig -> origin/gh/dzmitry-huba/13/orig 2025-12-04T09:33:39.9625361Z * [new branch] gh/dzmitry-huba/14/base -> origin/gh/dzmitry-huba/14/base 2025-12-04T09:33:39.9626628Z * [new branch] gh/dzmitry-huba/14/head -> origin/gh/dzmitry-huba/14/head 2025-12-04T09:33:39.9627893Z * [new branch] gh/dzmitry-huba/14/orig -> origin/gh/dzmitry-huba/14/orig 2025-12-04T09:33:39.9629779Z * [new branch] gh/dzmitry-huba/15/base -> origin/gh/dzmitry-huba/15/base 2025-12-04T09:33:39.9630966Z * [new branch] gh/dzmitry-huba/15/head -> origin/gh/dzmitry-huba/15/head 2025-12-04T09:33:39.9632170Z * [new branch] gh/dzmitry-huba/15/orig -> origin/gh/dzmitry-huba/15/orig 2025-12-04T09:33:39.9634129Z * [new branch] gh/dzmitry-huba/16/base -> origin/gh/dzmitry-huba/16/base 2025-12-04T09:33:39.9635393Z * [new branch] gh/dzmitry-huba/16/head -> origin/gh/dzmitry-huba/16/head 2025-12-04T09:33:39.9636836Z * [new branch] gh/dzmitry-huba/16/orig -> origin/gh/dzmitry-huba/16/orig 2025-12-04T09:33:39.9638508Z * [new branch] gh/dzmitry-huba/17/base -> origin/gh/dzmitry-huba/17/base 2025-12-04T09:33:39.9639726Z * [new branch] gh/dzmitry-huba/17/head -> origin/gh/dzmitry-huba/17/head 2025-12-04T09:33:39.9640997Z * [new branch] gh/dzmitry-huba/17/orig -> origin/gh/dzmitry-huba/17/orig 2025-12-04T09:33:39.9642708Z * [new branch] gh/dzmitry-huba/2/base -> origin/gh/dzmitry-huba/2/base 2025-12-04T09:33:39.9643869Z * [new branch] gh/dzmitry-huba/2/head -> origin/gh/dzmitry-huba/2/head 2025-12-04T09:33:39.9645464Z * [new branch] gh/dzmitry-huba/3/base -> origin/gh/dzmitry-huba/3/base 2025-12-04T09:33:39.9646578Z * [new branch] gh/dzmitry-huba/3/head -> origin/gh/dzmitry-huba/3/head 2025-12-04T09:33:39.9648745Z * [new branch] gh/eellison/808/base -> origin/gh/eellison/808/base 2025-12-04T09:33:39.9650115Z * [new branch] gh/eellison/808/head -> origin/gh/eellison/808/head 2025-12-04T09:33:39.9651394Z * [new branch] gh/eellison/808/orig -> origin/gh/eellison/808/orig 2025-12-04T09:33:39.9653503Z * [new branch] gh/eellison/822/base -> origin/gh/eellison/822/base 2025-12-04T09:33:39.9654810Z * [new branch] gh/eellison/822/head -> origin/gh/eellison/822/head 2025-12-04T09:33:39.9655991Z * [new branch] gh/eellison/822/orig -> origin/gh/eellison/822/orig 2025-12-04T09:33:39.9657798Z * [new branch] gh/eellison/823/base -> origin/gh/eellison/823/base 2025-12-04T09:33:39.9659164Z * [new branch] gh/eellison/823/head -> origin/gh/eellison/823/head 2025-12-04T09:33:39.9660478Z * [new branch] gh/eellison/823/orig -> origin/gh/eellison/823/orig 2025-12-04T09:33:39.9662214Z * [new branch] gh/eellison/862/base -> origin/gh/eellison/862/base 2025-12-04T09:33:39.9663386Z * [new branch] gh/eellison/862/head -> origin/gh/eellison/862/head 2025-12-04T09:33:39.9664656Z * [new branch] gh/eellison/862/orig -> origin/gh/eellison/862/orig 2025-12-04T09:33:39.9666378Z * [new branch] gh/eellison/863/base -> origin/gh/eellison/863/base 2025-12-04T09:33:39.9667583Z * [new branch] gh/eellison/863/head -> origin/gh/eellison/863/head 2025-12-04T09:33:39.9668891Z * [new branch] gh/eellison/863/orig -> origin/gh/eellison/863/orig 2025-12-04T09:33:39.9670576Z * [new branch] gh/eellison/864/base -> origin/gh/eellison/864/base 2025-12-04T09:33:39.9671799Z * [new branch] gh/eellison/864/head -> origin/gh/eellison/864/head 2025-12-04T09:33:39.9673191Z * [new branch] gh/eellison/864/orig -> origin/gh/eellison/864/orig 2025-12-04T09:33:39.9675074Z * [new branch] gh/eellison/865/base -> origin/gh/eellison/865/base 2025-12-04T09:33:39.9676321Z * [new branch] gh/eellison/865/head -> origin/gh/eellison/865/head 2025-12-04T09:33:39.9677624Z * [new branch] gh/eellison/865/orig -> origin/gh/eellison/865/orig 2025-12-04T09:33:39.9679340Z * [new branch] gh/eellison/866/base -> origin/gh/eellison/866/base 2025-12-04T09:33:39.9680603Z * [new branch] gh/eellison/866/head -> origin/gh/eellison/866/head 2025-12-04T09:33:39.9681719Z * [new branch] gh/eellison/866/orig -> origin/gh/eellison/866/orig 2025-12-04T09:33:39.9683617Z * [new branch] gh/eellison/867/base -> origin/gh/eellison/867/base 2025-12-04T09:33:39.9684753Z * [new branch] gh/eellison/867/head -> origin/gh/eellison/867/head 2025-12-04T09:33:39.9686037Z * [new branch] gh/eellison/867/orig -> origin/gh/eellison/867/orig 2025-12-04T09:33:39.9688202Z * [new branch] gh/eellison/868/base -> origin/gh/eellison/868/base 2025-12-04T09:33:39.9689804Z * [new branch] gh/eellison/868/head -> origin/gh/eellison/868/head 2025-12-04T09:33:39.9690960Z * [new branch] gh/eellison/868/orig -> origin/gh/eellison/868/orig 2025-12-04T09:33:39.9692796Z * [new branch] gh/eellison/869/base -> origin/gh/eellison/869/base 2025-12-04T09:33:39.9694014Z * [new branch] gh/eellison/869/head -> origin/gh/eellison/869/head 2025-12-04T09:33:39.9695262Z * [new branch] gh/eellison/869/orig -> origin/gh/eellison/869/orig 2025-12-04T09:33:39.9697085Z * [new branch] gh/eellison/870/base -> origin/gh/eellison/870/base 2025-12-04T09:33:39.9698327Z * [new branch] gh/eellison/870/head -> origin/gh/eellison/870/head 2025-12-04T09:33:39.9699587Z * [new branch] gh/eellison/870/orig -> origin/gh/eellison/870/orig 2025-12-04T09:33:39.9701454Z * [new branch] gh/eellison/871/base -> origin/gh/eellison/871/base 2025-12-04T09:33:39.9702618Z * [new branch] gh/eellison/871/head -> origin/gh/eellison/871/head 2025-12-04T09:33:39.9703862Z * [new branch] gh/eellison/871/orig -> origin/gh/eellison/871/orig 2025-12-04T09:33:39.9705821Z * [new branch] gh/eellison/872/base -> origin/gh/eellison/872/base 2025-12-04T09:33:39.9706807Z * [new branch] gh/eellison/872/head -> origin/gh/eellison/872/head 2025-12-04T09:33:39.9708058Z * [new branch] gh/eellison/872/orig -> origin/gh/eellison/872/orig 2025-12-04T09:33:39.9710064Z * [new branch] gh/eellison/873/base -> origin/gh/eellison/873/base 2025-12-04T09:33:39.9711312Z * [new branch] gh/eellison/873/head -> origin/gh/eellison/873/head 2025-12-04T09:33:39.9712609Z * [new branch] gh/eellison/873/orig -> origin/gh/eellison/873/orig 2025-12-04T09:33:39.9714409Z * [new branch] gh/eellison/874/base -> origin/gh/eellison/874/base 2025-12-04T09:33:39.9715925Z * [new branch] gh/eellison/874/head -> origin/gh/eellison/874/head 2025-12-04T09:33:39.9717142Z * [new branch] gh/eellison/874/orig -> origin/gh/eellison/874/orig 2025-12-04T09:33:39.9719650Z * [new branch] gh/eellison/875/base -> origin/gh/eellison/875/base 2025-12-04T09:33:39.9721302Z * [new branch] gh/eellison/875/head -> origin/gh/eellison/875/head 2025-12-04T09:33:39.9722515Z * [new branch] gh/eellison/875/orig -> origin/gh/eellison/875/orig 2025-12-04T09:33:39.9724859Z * [new branch] gh/eellison/876/base -> origin/gh/eellison/876/base 2025-12-04T09:33:39.9725610Z * [new branch] gh/eellison/876/head -> origin/gh/eellison/876/head 2025-12-04T09:33:39.9727014Z * [new branch] gh/eellison/876/orig -> origin/gh/eellison/876/orig 2025-12-04T09:33:39.9728695Z * [new branch] gh/eellison/877/base -> origin/gh/eellison/877/base 2025-12-04T09:33:39.9729979Z * [new branch] gh/eellison/877/head -> origin/gh/eellison/877/head 2025-12-04T09:33:39.9731313Z * [new branch] gh/eellison/877/orig -> origin/gh/eellison/877/orig 2025-12-04T09:33:39.9733160Z * [new branch] gh/eellison/878/base -> origin/gh/eellison/878/base 2025-12-04T09:33:39.9734334Z * [new branch] gh/eellison/878/head -> origin/gh/eellison/878/head 2025-12-04T09:33:39.9735570Z * [new branch] gh/eellison/878/orig -> origin/gh/eellison/878/orig 2025-12-04T09:33:39.9737477Z * [new branch] gh/eellison/879/base -> origin/gh/eellison/879/base 2025-12-04T09:33:39.9738684Z * [new branch] gh/eellison/879/head -> origin/gh/eellison/879/head 2025-12-04T09:33:39.9740111Z * [new branch] gh/eellison/879/orig -> origin/gh/eellison/879/orig 2025-12-04T09:33:39.9741679Z * [new branch] gh/eellison/880/base -> origin/gh/eellison/880/base 2025-12-04T09:33:39.9743042Z * [new branch] gh/eellison/880/head -> origin/gh/eellison/880/head 2025-12-04T09:33:39.9744329Z * [new branch] gh/eellison/880/orig -> origin/gh/eellison/880/orig 2025-12-04T09:33:39.9746169Z * [new branch] gh/eellison/881/base -> origin/gh/eellison/881/base 2025-12-04T09:33:39.9747582Z * [new branch] gh/eellison/881/head -> origin/gh/eellison/881/head 2025-12-04T09:33:39.9748752Z * [new branch] gh/eellison/881/orig -> origin/gh/eellison/881/orig 2025-12-04T09:33:39.9750503Z * [new branch] gh/eellison/882/base -> origin/gh/eellison/882/base 2025-12-04T09:33:39.9751703Z * [new branch] gh/eellison/882/head -> origin/gh/eellison/882/head 2025-12-04T09:33:39.9753205Z * [new branch] gh/eellison/882/orig -> origin/gh/eellison/882/orig 2025-12-04T09:33:39.9754836Z * [new branch] gh/eellison/883/base -> origin/gh/eellison/883/base 2025-12-04T09:33:39.9756023Z * [new branch] gh/eellison/883/head -> origin/gh/eellison/883/head 2025-12-04T09:33:39.9757392Z * [new branch] gh/eellison/883/orig -> origin/gh/eellison/883/orig 2025-12-04T09:33:39.9758957Z * [new branch] gh/eellison/884/base -> origin/gh/eellison/884/base 2025-12-04T09:33:39.9760199Z * [new branch] gh/eellison/884/head -> origin/gh/eellison/884/head 2025-12-04T09:33:39.9761390Z * [new branch] gh/eellison/884/orig -> origin/gh/eellison/884/orig 2025-12-04T09:33:39.9763545Z * [new branch] gh/etaf/147/base -> origin/gh/etaf/147/base 2025-12-04T09:33:39.9764809Z * [new branch] gh/etaf/147/head -> origin/gh/etaf/147/head 2025-12-04T09:33:39.9766790Z * [new branch] gh/etaf/154/base -> origin/gh/etaf/154/base 2025-12-04T09:33:39.9768145Z * [new branch] gh/etaf/154/head -> origin/gh/etaf/154/head 2025-12-04T09:33:39.9769242Z * [new branch] gh/etaf/154/orig -> origin/gh/etaf/154/orig 2025-12-04T09:33:39.9770946Z * [new branch] gh/etaf/156/base -> origin/gh/etaf/156/base 2025-12-04T09:33:39.9772206Z * [new branch] gh/etaf/156/head -> origin/gh/etaf/156/head 2025-12-04T09:33:39.9773451Z * [new branch] gh/etaf/156/orig -> origin/gh/etaf/156/orig 2025-12-04T09:33:39.9775916Z * [new branch] gh/etaf/157/base -> origin/gh/etaf/157/base 2025-12-04T09:33:39.9777180Z * [new branch] gh/etaf/157/head -> origin/gh/etaf/157/head 2025-12-04T09:33:39.9778630Z * [new branch] gh/etaf/157/orig -> origin/gh/etaf/157/orig 2025-12-04T09:33:39.9780392Z * [new branch] gh/etaf/158/base -> origin/gh/etaf/158/base 2025-12-04T09:33:39.9781816Z * [new branch] gh/etaf/158/head -> origin/gh/etaf/158/head 2025-12-04T09:33:39.9783026Z * [new branch] gh/etaf/158/orig -> origin/gh/etaf/158/orig 2025-12-04T09:33:39.9784859Z * [new branch] gh/etaf/159/base -> origin/gh/etaf/159/base 2025-12-04T09:33:39.9786116Z * [new branch] gh/etaf/159/head -> origin/gh/etaf/159/head 2025-12-04T09:33:39.9787377Z * [new branch] gh/etaf/159/orig -> origin/gh/etaf/159/orig 2025-12-04T09:33:39.9789606Z * [new branch] gh/etaf/160/base -> origin/gh/etaf/160/base 2025-12-04T09:33:39.9790842Z * [new branch] gh/etaf/160/head -> origin/gh/etaf/160/head 2025-12-04T09:33:39.9792223Z * [new branch] gh/etaf/160/orig -> origin/gh/etaf/160/orig 2025-12-04T09:33:39.9793966Z * [new branch] gh/etaf/161/base -> origin/gh/etaf/161/base 2025-12-04T09:33:39.9795345Z * [new branch] gh/etaf/161/head -> origin/gh/etaf/161/head 2025-12-04T09:33:39.9796547Z * [new branch] gh/etaf/161/orig -> origin/gh/etaf/161/orig 2025-12-04T09:33:39.9798320Z * [new branch] gh/etaf/166/base -> origin/gh/etaf/166/base 2025-12-04T09:33:39.9799910Z * [new branch] gh/etaf/166/head -> origin/gh/etaf/166/head 2025-12-04T09:33:39.9801091Z * [new branch] gh/etaf/166/orig -> origin/gh/etaf/166/orig 2025-12-04T09:33:39.9802787Z * [new branch] gh/etaf/167/base -> origin/gh/etaf/167/base 2025-12-04T09:33:39.9804035Z * [new branch] gh/etaf/167/head -> origin/gh/etaf/167/head 2025-12-04T09:33:39.9805252Z * [new branch] gh/etaf/167/orig -> origin/gh/etaf/167/orig 2025-12-04T09:33:39.9807167Z * [new branch] gh/etaf/168/base -> origin/gh/etaf/168/base 2025-12-04T09:33:39.9808446Z * [new branch] gh/etaf/168/head -> origin/gh/etaf/168/head 2025-12-04T09:33:39.9809812Z * [new branch] gh/etaf/168/orig -> origin/gh/etaf/168/orig 2025-12-04T09:33:39.9811857Z * [new branch] gh/etaf/172/base -> origin/gh/etaf/172/base 2025-12-04T09:33:39.9812818Z * [new branch] gh/etaf/172/head -> origin/gh/etaf/172/head 2025-12-04T09:33:39.9814257Z * [new branch] gh/etaf/172/orig -> origin/gh/etaf/172/orig 2025-12-04T09:33:39.9816154Z * [new branch] gh/etaf/173/base -> origin/gh/etaf/173/base 2025-12-04T09:33:39.9817641Z * [new branch] gh/etaf/173/head -> origin/gh/etaf/173/head 2025-12-04T09:33:39.9818928Z * [new branch] gh/etaf/173/orig -> origin/gh/etaf/173/orig 2025-12-04T09:33:39.9820823Z * [new branch] gh/etaf/174/base -> origin/gh/etaf/174/base 2025-12-04T09:33:39.9821985Z * [new branch] gh/etaf/174/head -> origin/gh/etaf/174/head 2025-12-04T09:33:39.9823794Z * [new branch] gh/etaf/175/base -> origin/gh/etaf/175/base 2025-12-04T09:33:39.9825204Z * [new branch] gh/etaf/175/head -> origin/gh/etaf/175/head 2025-12-04T09:33:39.9826193Z * [new branch] gh/etaf/175/orig -> origin/gh/etaf/175/orig 2025-12-04T09:33:39.9828064Z * [new branch] gh/etaf/176/base -> origin/gh/etaf/176/base 2025-12-04T09:33:39.9829367Z * [new branch] gh/etaf/176/head -> origin/gh/etaf/176/head 2025-12-04T09:33:39.9830742Z * [new branch] gh/etaf/176/orig -> origin/gh/etaf/176/orig 2025-12-04T09:33:39.9832908Z * [new branch] gh/etaf/177/base -> origin/gh/etaf/177/base 2025-12-04T09:33:39.9834415Z * [new branch] gh/etaf/177/head -> origin/gh/etaf/177/head 2025-12-04T09:33:39.9835884Z * [new branch] gh/etaf/177/orig -> origin/gh/etaf/177/orig 2025-12-04T09:33:39.9837835Z * [new branch] gh/etaf/178/base -> origin/gh/etaf/178/base 2025-12-04T09:33:39.9839254Z * [new branch] gh/etaf/178/head -> origin/gh/etaf/178/head 2025-12-04T09:33:39.9840499Z * [new branch] gh/etaf/178/orig -> origin/gh/etaf/178/orig 2025-12-04T09:33:39.9842310Z * [new branch] gh/etaf/179/base -> origin/gh/etaf/179/base 2025-12-04T09:33:39.9843535Z * [new branch] gh/etaf/179/head -> origin/gh/etaf/179/head 2025-12-04T09:33:39.9844988Z * [new branch] gh/etaf/179/orig -> origin/gh/etaf/179/orig 2025-12-04T09:33:39.9846514Z * [new branch] gh/etaf/180/base -> origin/gh/etaf/180/base 2025-12-04T09:33:39.9847689Z * [new branch] gh/etaf/180/head -> origin/gh/etaf/180/head 2025-12-04T09:33:39.9848948Z * [new branch] gh/etaf/180/orig -> origin/gh/etaf/180/orig 2025-12-04T09:33:39.9851656Z * [new branch] gh/exclamaforte/1/base -> origin/gh/exclamaforte/1/base 2025-12-04T09:33:39.9853154Z * [new branch] gh/exclamaforte/1/head -> origin/gh/exclamaforte/1/head 2025-12-04T09:33:39.9854636Z * [new branch] gh/exclamaforte/2/base -> origin/gh/exclamaforte/2/base 2025-12-04T09:33:39.9855707Z * [new branch] gh/exclamaforte/2/head -> origin/gh/exclamaforte/2/head 2025-12-04T09:33:39.9857442Z * [new branch] gh/exclamaforte/3/base -> origin/gh/exclamaforte/3/base 2025-12-04T09:33:39.9858717Z * [new branch] gh/exclamaforte/3/head -> origin/gh/exclamaforte/3/head 2025-12-04T09:33:39.9860588Z * [new branch] gh/exclamaforte/4/base -> origin/gh/exclamaforte/4/base 2025-12-04T09:33:39.9861800Z * [new branch] gh/exclamaforte/4/head -> origin/gh/exclamaforte/4/head 2025-12-04T09:33:39.9863888Z * [new branch] gh/ezyang/2374/base -> origin/gh/ezyang/2374/base 2025-12-04T09:33:39.9865147Z * [new branch] gh/ezyang/2374/head -> origin/gh/ezyang/2374/head 2025-12-04T09:33:39.9866548Z * [new branch] gh/ezyang/2374/orig -> origin/gh/ezyang/2374/orig 2025-12-04T09:33:39.9868294Z * [new branch] gh/ezyang/2973/base -> origin/gh/ezyang/2973/base 2025-12-04T09:33:39.9869470Z * [new branch] gh/ezyang/2973/head -> origin/gh/ezyang/2973/head 2025-12-04T09:33:39.9870868Z * [new branch] gh/ezyang/2973/orig -> origin/gh/ezyang/2973/orig 2025-12-04T09:33:39.9872552Z * [new branch] gh/ezyang/2974/base -> origin/gh/ezyang/2974/base 2025-12-04T09:33:39.9873692Z * [new branch] gh/ezyang/2974/head -> origin/gh/ezyang/2974/head 2025-12-04T09:33:39.9875108Z * [new branch] gh/ezyang/2974/orig -> origin/gh/ezyang/2974/orig 2025-12-04T09:33:39.9876753Z * [new branch] gh/ezyang/3131/base -> origin/gh/ezyang/3131/base 2025-12-04T09:33:39.9877954Z * [new branch] gh/ezyang/3131/head -> origin/gh/ezyang/3131/head 2025-12-04T09:33:39.9879231Z * [new branch] gh/ezyang/3131/orig -> origin/gh/ezyang/3131/orig 2025-12-04T09:33:39.9880952Z * [new branch] gh/ezyang/3139/base -> origin/gh/ezyang/3139/base 2025-12-04T09:33:39.9882170Z * [new branch] gh/ezyang/3139/head -> origin/gh/ezyang/3139/head 2025-12-04T09:33:39.9883443Z * [new branch] gh/ezyang/3139/orig -> origin/gh/ezyang/3139/orig 2025-12-04T09:33:39.9885283Z * [new branch] gh/ezyang/3140/base -> origin/gh/ezyang/3140/base 2025-12-04T09:33:39.9886430Z * [new branch] gh/ezyang/3140/head -> origin/gh/ezyang/3140/head 2025-12-04T09:33:39.9887981Z * [new branch] gh/ezyang/3140/orig -> origin/gh/ezyang/3140/orig 2025-12-04T09:33:39.9892751Z * [new branch] gh/ezyang/3143/base -> origin/gh/ezyang/3143/base 2025-12-04T09:33:39.9893934Z * [new branch] gh/ezyang/3143/head -> origin/gh/ezyang/3143/head 2025-12-04T09:33:39.9895388Z * [new branch] gh/ezyang/3143/orig -> origin/gh/ezyang/3143/orig 2025-12-04T09:33:39.9897082Z * [new branch] gh/ezyang/3144/base -> origin/gh/ezyang/3144/base 2025-12-04T09:33:39.9898446Z * [new branch] gh/ezyang/3144/head -> origin/gh/ezyang/3144/head 2025-12-04T09:33:39.9899845Z * [new branch] gh/ezyang/3144/orig -> origin/gh/ezyang/3144/orig 2025-12-04T09:33:39.9901522Z * [new branch] gh/ezyang/3167/base -> origin/gh/ezyang/3167/base 2025-12-04T09:33:39.9902719Z * [new branch] gh/ezyang/3167/head -> origin/gh/ezyang/3167/head 2025-12-04T09:33:39.9903997Z * [new branch] gh/ezyang/3167/orig -> origin/gh/ezyang/3167/orig 2025-12-04T09:33:39.9906416Z * [new branch] gh/ezyang/3173/base -> origin/gh/ezyang/3173/base 2025-12-04T09:33:39.9907556Z * [new branch] gh/ezyang/3173/head -> origin/gh/ezyang/3173/head 2025-12-04T09:33:39.9909071Z * [new branch] gh/ezyang/3173/orig -> origin/gh/ezyang/3173/orig 2025-12-04T09:33:39.9910736Z * [new branch] gh/ezyang/3175/base -> origin/gh/ezyang/3175/base 2025-12-04T09:33:39.9912500Z * [new branch] gh/ezyang/3175/head -> origin/gh/ezyang/3175/head 2025-12-04T09:33:39.9913751Z * [new branch] gh/ezyang/3175/orig -> origin/gh/ezyang/3175/orig 2025-12-04T09:33:39.9915456Z * [new branch] gh/ezyang/3182/base -> origin/gh/ezyang/3182/base 2025-12-04T09:33:39.9916681Z * [new branch] gh/ezyang/3182/head -> origin/gh/ezyang/3182/head 2025-12-04T09:33:39.9917891Z * [new branch] gh/ezyang/3182/orig -> origin/gh/ezyang/3182/orig 2025-12-04T09:33:39.9919653Z * [new branch] gh/ezyang/3185/base -> origin/gh/ezyang/3185/base 2025-12-04T09:33:39.9920986Z * [new branch] gh/ezyang/3185/head -> origin/gh/ezyang/3185/head 2025-12-04T09:33:39.9922091Z * [new branch] gh/ezyang/3185/orig -> origin/gh/ezyang/3185/orig 2025-12-04T09:33:39.9923932Z * [new branch] gh/ezyang/3189/base -> origin/gh/ezyang/3189/base 2025-12-04T09:33:39.9925101Z * [new branch] gh/ezyang/3189/head -> origin/gh/ezyang/3189/head 2025-12-04T09:33:39.9926414Z * [new branch] gh/ezyang/3189/orig -> origin/gh/ezyang/3189/orig 2025-12-04T09:33:39.9928123Z * [new branch] gh/ezyang/3191/base -> origin/gh/ezyang/3191/base 2025-12-04T09:33:39.9929872Z * [new branch] gh/ezyang/3191/head -> origin/gh/ezyang/3191/head 2025-12-04T09:33:39.9931066Z * [new branch] gh/ezyang/3191/orig -> origin/gh/ezyang/3191/orig 2025-12-04T09:33:39.9933364Z * [new branch] gh/ezyang/3192/base -> origin/gh/ezyang/3192/base 2025-12-04T09:33:39.9934581Z * [new branch] gh/ezyang/3192/head -> origin/gh/ezyang/3192/head 2025-12-04T09:33:39.9936008Z * [new branch] gh/ezyang/3192/orig -> origin/gh/ezyang/3192/orig 2025-12-04T09:33:39.9937749Z * [new branch] gh/ezyang/3193/base -> origin/gh/ezyang/3193/base 2025-12-04T09:33:39.9939043Z * [new branch] gh/ezyang/3193/head -> origin/gh/ezyang/3193/head 2025-12-04T09:33:39.9940470Z * [new branch] gh/ezyang/3193/orig -> origin/gh/ezyang/3193/orig 2025-12-04T09:33:39.9942342Z * [new branch] gh/ezyang/3194/base -> origin/gh/ezyang/3194/base 2025-12-04T09:33:39.9943504Z * [new branch] gh/ezyang/3194/head -> origin/gh/ezyang/3194/head 2025-12-04T09:33:39.9944775Z * [new branch] gh/ezyang/3194/orig -> origin/gh/ezyang/3194/orig 2025-12-04T09:33:39.9946507Z * [new branch] gh/ezyang/3195/base -> origin/gh/ezyang/3195/base 2025-12-04T09:33:39.9948018Z * [new branch] gh/ezyang/3195/head -> origin/gh/ezyang/3195/head 2025-12-04T09:33:39.9949777Z * [new branch] gh/ezyang/3195/orig -> origin/gh/ezyang/3195/orig 2025-12-04T09:33:39.9951971Z * [new branch] gh/ezyang/3196/base -> origin/gh/ezyang/3196/base 2025-12-04T09:33:39.9953219Z * [new branch] gh/ezyang/3196/head -> origin/gh/ezyang/3196/head 2025-12-04T09:33:39.9954494Z * [new branch] gh/ezyang/3196/orig -> origin/gh/ezyang/3196/orig 2025-12-04T09:33:39.9956304Z * [new branch] gh/ezyang/3197/base -> origin/gh/ezyang/3197/base 2025-12-04T09:33:39.9957456Z * [new branch] gh/ezyang/3197/head -> origin/gh/ezyang/3197/head 2025-12-04T09:33:39.9958790Z * [new branch] gh/ezyang/3197/orig -> origin/gh/ezyang/3197/orig 2025-12-04T09:33:39.9960627Z * [new branch] gh/ezyang/3198/base -> origin/gh/ezyang/3198/base 2025-12-04T09:33:39.9961852Z * [new branch] gh/ezyang/3198/head -> origin/gh/ezyang/3198/head 2025-12-04T09:33:39.9963283Z * [new branch] gh/ezyang/3198/orig -> origin/gh/ezyang/3198/orig 2025-12-04T09:33:39.9964901Z * [new branch] gh/ezyang/3199/base -> origin/gh/ezyang/3199/base 2025-12-04T09:33:39.9966101Z * [new branch] gh/ezyang/3199/head -> origin/gh/ezyang/3199/head 2025-12-04T09:33:39.9967483Z * [new branch] gh/ezyang/3199/orig -> origin/gh/ezyang/3199/orig 2025-12-04T09:33:39.9969190Z * [new branch] gh/ezyang/3200/base -> origin/gh/ezyang/3200/base 2025-12-04T09:33:39.9970319Z * [new branch] gh/ezyang/3200/head -> origin/gh/ezyang/3200/head 2025-12-04T09:33:39.9971597Z * [new branch] gh/ezyang/3200/orig -> origin/gh/ezyang/3200/orig 2025-12-04T09:33:39.9973571Z * [new branch] gh/ezyang/3201/base -> origin/gh/ezyang/3201/base 2025-12-04T09:33:39.9974769Z * [new branch] gh/ezyang/3201/head -> origin/gh/ezyang/3201/head 2025-12-04T09:33:39.9975895Z * [new branch] gh/ezyang/3201/orig -> origin/gh/ezyang/3201/orig 2025-12-04T09:33:39.9978092Z * [new branch] gh/ezyang/3202/base -> origin/gh/ezyang/3202/base 2025-12-04T09:33:39.9979163Z * [new branch] gh/ezyang/3202/head -> origin/gh/ezyang/3202/head 2025-12-04T09:33:39.9980370Z * [new branch] gh/ezyang/3202/orig -> origin/gh/ezyang/3202/orig 2025-12-04T09:33:39.9982220Z * [new branch] gh/ezyang/3203/base -> origin/gh/ezyang/3203/base 2025-12-04T09:33:39.9983384Z * [new branch] gh/ezyang/3203/head -> origin/gh/ezyang/3203/head 2025-12-04T09:33:39.9984876Z * [new branch] gh/ezyang/3203/orig -> origin/gh/ezyang/3203/orig 2025-12-04T09:33:39.9986578Z * [new branch] gh/ezyang/3204/base -> origin/gh/ezyang/3204/base 2025-12-04T09:33:39.9987912Z * [new branch] gh/ezyang/3204/head -> origin/gh/ezyang/3204/head 2025-12-04T09:33:39.9989231Z * [new branch] gh/ezyang/3204/orig -> origin/gh/ezyang/3204/orig 2025-12-04T09:33:39.9991511Z * [new branch] gh/ezyang/3205/base -> origin/gh/ezyang/3205/base 2025-12-04T09:33:39.9992727Z * [new branch] gh/ezyang/3205/head -> origin/gh/ezyang/3205/head 2025-12-04T09:33:39.9993954Z * [new branch] gh/ezyang/3205/orig -> origin/gh/ezyang/3205/orig 2025-12-04T09:33:39.9995831Z * [new branch] gh/ezyang/3206/base -> origin/gh/ezyang/3206/base 2025-12-04T09:33:39.9996960Z * [new branch] gh/ezyang/3206/head -> origin/gh/ezyang/3206/head 2025-12-04T09:33:39.9998361Z * [new branch] gh/ezyang/3206/orig -> origin/gh/ezyang/3206/orig 2025-12-04T09:33:40.0000033Z * [new branch] gh/ezyang/3207/base -> origin/gh/ezyang/3207/base 2025-12-04T09:33:40.0001189Z * [new branch] gh/ezyang/3207/head -> origin/gh/ezyang/3207/head 2025-12-04T09:33:40.0002606Z * [new branch] gh/ezyang/3207/orig -> origin/gh/ezyang/3207/orig 2025-12-04T09:33:40.0004264Z * [new branch] gh/ezyang/3208/base -> origin/gh/ezyang/3208/base 2025-12-04T09:33:40.0005478Z * [new branch] gh/ezyang/3208/head -> origin/gh/ezyang/3208/head 2025-12-04T09:33:40.0006858Z * [new branch] gh/ezyang/3208/orig -> origin/gh/ezyang/3208/orig 2025-12-04T09:33:40.0008607Z * [new branch] gh/ezyang/3209/base -> origin/gh/ezyang/3209/base 2025-12-04T09:33:40.0009786Z * [new branch] gh/ezyang/3209/head -> origin/gh/ezyang/3209/head 2025-12-04T09:33:40.0011055Z * [new branch] gh/ezyang/3209/orig -> origin/gh/ezyang/3209/orig 2025-12-04T09:33:40.0013240Z * [new branch] gh/fadara01/3/base -> origin/gh/fadara01/3/base 2025-12-04T09:33:40.0014375Z * [new branch] gh/fadara01/3/head -> origin/gh/fadara01/3/head 2025-12-04T09:33:40.0015803Z * [new branch] gh/fadara01/3/orig -> origin/gh/fadara01/3/orig 2025-12-04T09:33:40.0017544Z * [new branch] gh/fadara01/5/base -> origin/gh/fadara01/5/base 2025-12-04T09:33:40.0018753Z * [new branch] gh/fadara01/5/head -> origin/gh/fadara01/5/head 2025-12-04T09:33:40.0020228Z * [new branch] gh/fadara01/5/orig -> origin/gh/fadara01/5/orig 2025-12-04T09:33:40.0021887Z * [new branch] gh/fadara01/6/base -> origin/gh/fadara01/6/base 2025-12-04T09:33:40.0023135Z * [new branch] gh/fadara01/6/head -> origin/gh/fadara01/6/head 2025-12-04T09:33:40.0024475Z * [new branch] gh/fadara01/6/orig -> origin/gh/fadara01/6/orig 2025-12-04T09:33:40.0026387Z * [new branch] gh/fadara01/7/base -> origin/gh/fadara01/7/base 2025-12-04T09:33:40.0027360Z * [new branch] gh/fadara01/7/head -> origin/gh/fadara01/7/head 2025-12-04T09:33:40.0028776Z * [new branch] gh/fadara01/7/orig -> origin/gh/fadara01/7/orig 2025-12-04T09:33:40.0030558Z * [new branch] gh/fadara01/8/base -> origin/gh/fadara01/8/base 2025-12-04T09:33:40.0031725Z * [new branch] gh/fadara01/8/head -> origin/gh/fadara01/8/head 2025-12-04T09:33:40.0033113Z * [new branch] gh/fadara01/8/orig -> origin/gh/fadara01/8/orig 2025-12-04T09:33:40.0034732Z * [new branch] gh/fadara01/9/base -> origin/gh/fadara01/9/base 2025-12-04T09:33:40.0035935Z * [new branch] gh/fadara01/9/head -> origin/gh/fadara01/9/head 2025-12-04T09:33:40.0037366Z * [new branch] gh/fadara01/9/orig -> origin/gh/fadara01/9/orig 2025-12-04T09:33:40.0039348Z * [new branch] gh/fduwjj/182/base -> origin/gh/fduwjj/182/base 2025-12-04T09:33:40.0040572Z * [new branch] gh/fduwjj/182/head -> origin/gh/fduwjj/182/head 2025-12-04T09:33:40.0041790Z * [new branch] gh/fduwjj/182/orig -> origin/gh/fduwjj/182/orig 2025-12-04T09:33:40.0043635Z * [new branch] gh/fduwjj/211/base -> origin/gh/fduwjj/211/base 2025-12-04T09:33:40.0044802Z * [new branch] gh/fduwjj/211/head -> origin/gh/fduwjj/211/head 2025-12-04T09:33:40.0046332Z * [new branch] gh/fduwjj/211/orig -> origin/gh/fduwjj/211/orig 2025-12-04T09:33:40.0047990Z * [new branch] gh/fduwjj/212/base -> origin/gh/fduwjj/212/base 2025-12-04T09:33:40.0049133Z * [new branch] gh/fduwjj/212/head -> origin/gh/fduwjj/212/head 2025-12-04T09:33:40.0050404Z * [new branch] gh/fduwjj/212/orig -> origin/gh/fduwjj/212/orig 2025-12-04T09:33:40.0052166Z * [new branch] gh/fduwjj/213/base -> origin/gh/fduwjj/213/base 2025-12-04T09:33:40.0053402Z * [new branch] gh/fduwjj/213/head -> origin/gh/fduwjj/213/head 2025-12-04T09:33:40.0054609Z * [new branch] gh/fduwjj/213/orig -> origin/gh/fduwjj/213/orig 2025-12-04T09:33:40.0056522Z * [new branch] gh/fduwjj/226/base -> origin/gh/fduwjj/226/base 2025-12-04T09:33:40.0057598Z * [new branch] gh/fduwjj/226/head -> origin/gh/fduwjj/226/head 2025-12-04T09:33:40.0058920Z * [new branch] gh/fduwjj/226/orig -> origin/gh/fduwjj/226/orig 2025-12-04T09:33:40.0060940Z * [new branch] gh/fduwjj/229/base -> origin/gh/fduwjj/229/base 2025-12-04T09:33:40.0062035Z * [new branch] gh/fduwjj/229/head -> origin/gh/fduwjj/229/head 2025-12-04T09:33:40.0063542Z * [new branch] gh/fduwjj/229/orig -> origin/gh/fduwjj/229/orig 2025-12-04T09:33:40.0065259Z * [new branch] gh/fduwjj/233/base -> origin/gh/fduwjj/233/base 2025-12-04T09:33:40.0066438Z * [new branch] gh/fduwjj/233/head -> origin/gh/fduwjj/233/head 2025-12-04T09:33:40.0067677Z * [new branch] gh/fduwjj/233/orig -> origin/gh/fduwjj/233/orig 2025-12-04T09:33:40.0069552Z * [new branch] gh/fduwjj/234/base -> origin/gh/fduwjj/234/base 2025-12-04T09:33:40.0070769Z * [new branch] gh/fduwjj/234/head -> origin/gh/fduwjj/234/head 2025-12-04T09:33:40.0071982Z * [new branch] gh/fduwjj/234/orig -> origin/gh/fduwjj/234/orig 2025-12-04T09:33:40.0073782Z * [new branch] gh/fduwjj/235/base -> origin/gh/fduwjj/235/base 2025-12-04T09:33:40.0074939Z * [new branch] gh/fduwjj/235/head -> origin/gh/fduwjj/235/head 2025-12-04T09:33:40.0076385Z * [new branch] gh/fduwjj/235/orig -> origin/gh/fduwjj/235/orig 2025-12-04T09:33:40.0078019Z * [new branch] gh/fduwjj/236/base -> origin/gh/fduwjj/236/base 2025-12-04T09:33:40.0079039Z * [new branch] gh/fduwjj/236/head -> origin/gh/fduwjj/236/head 2025-12-04T09:33:40.0080524Z * [new branch] gh/fduwjj/236/orig -> origin/gh/fduwjj/236/orig 2025-12-04T09:33:40.0082011Z * [new branch] gh/fduwjj/237/base -> origin/gh/fduwjj/237/base 2025-12-04T09:33:40.0083182Z * [new branch] gh/fduwjj/237/head -> origin/gh/fduwjj/237/head 2025-12-04T09:33:40.0084397Z * [new branch] gh/fduwjj/237/orig -> origin/gh/fduwjj/237/orig 2025-12-04T09:33:40.0086198Z * [new branch] gh/fduwjj/238/base -> origin/gh/fduwjj/238/base 2025-12-04T09:33:40.0087433Z * [new branch] gh/fduwjj/238/head -> origin/gh/fduwjj/238/head 2025-12-04T09:33:40.0088785Z * [new branch] gh/fduwjj/238/orig -> origin/gh/fduwjj/238/orig 2025-12-04T09:33:40.0090605Z * [new branch] gh/fduwjj/239/base -> origin/gh/fduwjj/239/base 2025-12-04T09:33:40.0091853Z * [new branch] gh/fduwjj/239/head -> origin/gh/fduwjj/239/head 2025-12-04T09:33:40.0093111Z * [new branch] gh/fduwjj/239/orig -> origin/gh/fduwjj/239/orig 2025-12-04T09:33:40.0095244Z * [new branch] gh/fegin/332/base -> origin/gh/fegin/332/base 2025-12-04T09:33:40.0096636Z * [new branch] gh/fegin/332/head -> origin/gh/fegin/332/head 2025-12-04T09:33:40.0097895Z * [new branch] gh/fegin/332/orig -> origin/gh/fegin/332/orig 2025-12-04T09:33:40.0099852Z * [new branch] gh/fegin/333/base -> origin/gh/fegin/333/base 2025-12-04T09:33:40.0101072Z * [new branch] gh/fegin/333/head -> origin/gh/fegin/333/head 2025-12-04T09:33:40.0102488Z * [new branch] gh/fegin/333/orig -> origin/gh/fegin/333/orig 2025-12-04T09:33:40.0104158Z * [new branch] gh/fegin/334/base -> origin/gh/fegin/334/base 2025-12-04T09:33:40.0105274Z * [new branch] gh/fegin/334/head -> origin/gh/fegin/334/head 2025-12-04T09:33:40.0106786Z * [new branch] gh/fegin/334/orig -> origin/gh/fegin/334/orig 2025-12-04T09:33:40.0108458Z * [new branch] gh/fegin/335/base -> origin/gh/fegin/335/base 2025-12-04T09:33:40.0109602Z * [new branch] gh/fegin/335/head -> origin/gh/fegin/335/head 2025-12-04T09:33:40.0110886Z * [new branch] gh/fegin/335/orig -> origin/gh/fegin/335/orig 2025-12-04T09:33:40.0113076Z * [new branch] gh/fffrog/160/base -> origin/gh/fffrog/160/base 2025-12-04T09:33:40.0114307Z * [new branch] gh/fffrog/160/head -> origin/gh/fffrog/160/head 2025-12-04T09:33:40.0116080Z * [new branch] gh/fffrog/177/base -> origin/gh/fffrog/177/base 2025-12-04T09:33:40.0117246Z * [new branch] gh/fffrog/177/head -> origin/gh/fffrog/177/head 2025-12-04T09:33:40.0118666Z * [new branch] gh/fffrog/177/orig -> origin/gh/fffrog/177/orig 2025-12-04T09:33:40.0120285Z * [new branch] gh/fffrog/178/base -> origin/gh/fffrog/178/base 2025-12-04T09:33:40.0122106Z * [new branch] gh/fffrog/178/head -> origin/gh/fffrog/178/head 2025-12-04T09:33:40.0123363Z * [new branch] gh/fffrog/178/orig -> origin/gh/fffrog/178/orig 2025-12-04T09:33:40.0125069Z * [new branch] gh/fffrog/181/base -> origin/gh/fffrog/181/base 2025-12-04T09:33:40.0126217Z * [new branch] gh/fffrog/181/head -> origin/gh/fffrog/181/head 2025-12-04T09:33:40.0127703Z * [new branch] gh/fffrog/181/orig -> origin/gh/fffrog/181/orig 2025-12-04T09:33:40.0129610Z * [new branch] gh/fffrog/183/base -> origin/gh/fffrog/183/base 2025-12-04T09:33:40.0130643Z * [new branch] gh/fffrog/183/head -> origin/gh/fffrog/183/head 2025-12-04T09:33:40.0131891Z * [new branch] gh/fffrog/183/orig -> origin/gh/fffrog/183/orig 2025-12-04T09:33:40.0134074Z * [new branch] gh/fxdawnn/10/base -> origin/gh/fxdawnn/10/base 2025-12-04T09:33:40.0135142Z * [new branch] gh/fxdawnn/10/head -> origin/gh/fxdawnn/10/head 2025-12-04T09:33:40.0136723Z * [new branch] gh/fxdawnn/10/orig -> origin/gh/fxdawnn/10/orig 2025-12-04T09:33:40.0138522Z * [new branch] gh/fxdawnn/11/base -> origin/gh/fxdawnn/11/base 2025-12-04T09:33:40.0139745Z * [new branch] gh/fxdawnn/11/head -> origin/gh/fxdawnn/11/head 2025-12-04T09:33:40.0140997Z * [new branch] gh/fxdawnn/11/orig -> origin/gh/fxdawnn/11/orig 2025-12-04T09:33:40.0142856Z * [new branch] gh/fxdawnn/12/base -> origin/gh/fxdawnn/12/base 2025-12-04T09:33:40.0144442Z * [new branch] gh/fxdawnn/12/head -> origin/gh/fxdawnn/12/head 2025-12-04T09:33:40.0145309Z * [new branch] gh/fxdawnn/12/orig -> origin/gh/fxdawnn/12/orig 2025-12-04T09:33:40.0147217Z * [new branch] gh/fxdawnn/13/base -> origin/gh/fxdawnn/13/base 2025-12-04T09:33:40.0148347Z * [new branch] gh/fxdawnn/13/head -> origin/gh/fxdawnn/13/head 2025-12-04T09:33:40.0149756Z * [new branch] gh/fxdawnn/13/orig -> origin/gh/fxdawnn/13/orig 2025-12-04T09:33:40.0151613Z * [new branch] gh/fxdawnn/14/base -> origin/gh/fxdawnn/14/base 2025-12-04T09:33:40.0152705Z * [new branch] gh/fxdawnn/14/head -> origin/gh/fxdawnn/14/head 2025-12-04T09:33:40.0153892Z * [new branch] gh/fxdawnn/14/orig -> origin/gh/fxdawnn/14/orig 2025-12-04T09:33:40.0155707Z * [new branch] gh/fxdawnn/15/base -> origin/gh/fxdawnn/15/base 2025-12-04T09:33:40.0156806Z * [new branch] gh/fxdawnn/15/head -> origin/gh/fxdawnn/15/head 2025-12-04T09:33:40.0158043Z * [new branch] gh/fxdawnn/15/orig -> origin/gh/fxdawnn/15/orig 2025-12-04T09:33:40.0159909Z * [new branch] gh/fxdawnn/6/base -> origin/gh/fxdawnn/6/base 2025-12-04T09:33:40.0161068Z * [new branch] gh/fxdawnn/6/head -> origin/gh/fxdawnn/6/head 2025-12-04T09:33:40.0162473Z * [new branch] gh/fxdawnn/6/orig -> origin/gh/fxdawnn/6/orig 2025-12-04T09:33:40.0164258Z * [new branch] gh/fxdawnn/7/base -> origin/gh/fxdawnn/7/base 2025-12-04T09:33:40.0165466Z * [new branch] gh/fxdawnn/7/head -> origin/gh/fxdawnn/7/head 2025-12-04T09:33:40.0166690Z * [new branch] gh/fxdawnn/7/orig -> origin/gh/fxdawnn/7/orig 2025-12-04T09:33:40.0168633Z * [new branch] gh/fxdawnn/9/base -> origin/gh/fxdawnn/9/base 2025-12-04T09:33:40.0169631Z * [new branch] gh/fxdawnn/9/head -> origin/gh/fxdawnn/9/head 2025-12-04T09:33:40.0171166Z * [new branch] gh/fxdawnn/9/orig -> origin/gh/fxdawnn/9/orig 2025-12-04T09:33:40.0173193Z * [new branch] gh/galv/1/base -> origin/gh/galv/1/base 2025-12-04T09:33:40.0174325Z * [new branch] gh/galv/1/head -> origin/gh/galv/1/head 2025-12-04T09:33:40.0175753Z * [new branch] gh/galv/1/orig -> origin/gh/galv/1/orig 2025-12-04T09:33:40.0177463Z * [new branch] gh/galv/2/base -> origin/gh/galv/2/base 2025-12-04T09:33:40.0178694Z * [new branch] gh/galv/2/head -> origin/gh/galv/2/head 2025-12-04T09:33:40.0180224Z * [new branch] gh/galv/2/orig -> origin/gh/galv/2/orig 2025-12-04T09:33:40.0182093Z * [new branch] gh/galv/3/base -> origin/gh/galv/3/base 2025-12-04T09:33:40.0183170Z * [new branch] gh/galv/3/head -> origin/gh/galv/3/head 2025-12-04T09:33:40.0184707Z * [new branch] gh/galv/3/orig -> origin/gh/galv/3/orig 2025-12-04T09:33:40.0186750Z * [new branch] gh/guangyey/134/base -> origin/gh/guangyey/134/base 2025-12-04T09:33:40.0188087Z * [new branch] gh/guangyey/134/head -> origin/gh/guangyey/134/head 2025-12-04T09:33:40.0189571Z * [new branch] gh/guangyey/134/orig -> origin/gh/guangyey/134/orig 2025-12-04T09:33:40.0191222Z * [new branch] gh/guangyey/163/base -> origin/gh/guangyey/163/base 2025-12-04T09:33:40.0192379Z * [new branch] gh/guangyey/163/head -> origin/gh/guangyey/163/head 2025-12-04T09:33:40.0193646Z * [new branch] gh/guangyey/163/orig -> origin/gh/guangyey/163/orig 2025-12-04T09:33:40.0195908Z * [new branch] gh/guangyey/168/base -> origin/gh/guangyey/168/base 2025-12-04T09:33:40.0196590Z * [new branch] gh/guangyey/168/head -> origin/gh/guangyey/168/head 2025-12-04T09:33:40.0198056Z * [new branch] gh/guangyey/168/orig -> origin/gh/guangyey/168/orig 2025-12-04T09:33:40.0199709Z * [new branch] gh/guangyey/169/base -> origin/gh/guangyey/169/base 2025-12-04T09:33:40.0200881Z * [new branch] gh/guangyey/169/head -> origin/gh/guangyey/169/head 2025-12-04T09:33:40.0202123Z * [new branch] gh/guangyey/169/orig -> origin/gh/guangyey/169/orig 2025-12-04T09:33:40.0203900Z * [new branch] gh/guangyey/170/base -> origin/gh/guangyey/170/base 2025-12-04T09:33:40.0205120Z * [new branch] gh/guangyey/170/head -> origin/gh/guangyey/170/head 2025-12-04T09:33:40.0206397Z * [new branch] gh/guangyey/170/orig -> origin/gh/guangyey/170/orig 2025-12-04T09:33:40.0208194Z * [new branch] gh/guangyey/171/base -> origin/gh/guangyey/171/base 2025-12-04T09:33:40.0209346Z * [new branch] gh/guangyey/171/head -> origin/gh/guangyey/171/head 2025-12-04T09:33:40.0210598Z * [new branch] gh/guangyey/171/orig -> origin/gh/guangyey/171/orig 2025-12-04T09:33:40.0212348Z * [new branch] gh/guangyey/178/base -> origin/gh/guangyey/178/base 2025-12-04T09:33:40.0213611Z * [new branch] gh/guangyey/178/head -> origin/gh/guangyey/178/head 2025-12-04T09:33:40.0214888Z * [new branch] gh/guangyey/178/orig -> origin/gh/guangyey/178/orig 2025-12-04T09:33:40.0216628Z * [new branch] gh/guangyey/182/base -> origin/gh/guangyey/182/base 2025-12-04T09:33:40.0217872Z * [new branch] gh/guangyey/182/head -> origin/gh/guangyey/182/head 2025-12-04T09:33:40.0219823Z * [new branch] gh/guangyey/182/orig -> origin/gh/guangyey/182/orig 2025-12-04T09:33:40.0221435Z * [new branch] gh/guangyey/183/base -> origin/gh/guangyey/183/base 2025-12-04T09:33:40.0222610Z * [new branch] gh/guangyey/183/head -> origin/gh/guangyey/183/head 2025-12-04T09:33:40.0224018Z * [new branch] gh/guangyey/183/orig -> origin/gh/guangyey/183/orig 2025-12-04T09:33:40.0225699Z * [new branch] gh/guangyey/185/base -> origin/gh/guangyey/185/base 2025-12-04T09:33:40.0226883Z * [new branch] gh/guangyey/185/head -> origin/gh/guangyey/185/head 2025-12-04T09:33:40.0228158Z * [new branch] gh/guangyey/185/orig -> origin/gh/guangyey/185/orig 2025-12-04T09:33:40.0229960Z * [new branch] gh/guangyey/186/base -> origin/gh/guangyey/186/base 2025-12-04T09:33:40.0231168Z * [new branch] gh/guangyey/186/head -> origin/gh/guangyey/186/head 2025-12-04T09:33:40.0232639Z * [new branch] gh/guangyey/186/orig -> origin/gh/guangyey/186/orig 2025-12-04T09:33:40.0234250Z * [new branch] gh/guangyey/187/base -> origin/gh/guangyey/187/base 2025-12-04T09:33:40.0235380Z * [new branch] gh/guangyey/187/head -> origin/gh/guangyey/187/head 2025-12-04T09:33:40.0236659Z * [new branch] gh/guangyey/187/orig -> origin/gh/guangyey/187/orig 2025-12-04T09:33:40.0238445Z * [new branch] gh/guangyey/188/base -> origin/gh/guangyey/188/base 2025-12-04T09:33:40.0239572Z * [new branch] gh/guangyey/188/head -> origin/gh/guangyey/188/head 2025-12-04T09:33:40.0240884Z * [new branch] gh/guangyey/188/orig -> origin/gh/guangyey/188/orig 2025-12-04T09:33:40.0242642Z * [new branch] gh/guangyey/190/base -> origin/gh/guangyey/190/base 2025-12-04T09:33:40.0243841Z * [new branch] gh/guangyey/190/head -> origin/gh/guangyey/190/head 2025-12-04T09:33:40.0245080Z * [new branch] gh/guangyey/190/orig -> origin/gh/guangyey/190/orig 2025-12-04T09:33:40.0246830Z * [new branch] gh/guangyey/208/base -> origin/gh/guangyey/208/base 2025-12-04T09:33:40.0248045Z * [new branch] gh/guangyey/208/head -> origin/gh/guangyey/208/head 2025-12-04T09:33:40.0249539Z * [new branch] gh/guangyey/208/orig -> origin/gh/guangyey/208/orig 2025-12-04T09:33:40.0251190Z * [new branch] gh/guangyey/228/base -> origin/gh/guangyey/228/base 2025-12-04T09:33:40.0252418Z * [new branch] gh/guangyey/228/head -> origin/gh/guangyey/228/head 2025-12-04T09:33:40.0253650Z * [new branch] gh/guangyey/228/orig -> origin/gh/guangyey/228/orig 2025-12-04T09:33:40.0255902Z * [new branch] gh/guangyey/230/base -> origin/gh/guangyey/230/base 2025-12-04T09:33:40.0257068Z * [new branch] gh/guangyey/230/head -> origin/gh/guangyey/230/head 2025-12-04T09:33:40.0258412Z * [new branch] gh/guangyey/230/orig -> origin/gh/guangyey/230/orig 2025-12-04T09:33:40.0260273Z * [new branch] gh/guangyey/231/base -> origin/gh/guangyey/231/base 2025-12-04T09:33:40.0261466Z * [new branch] gh/guangyey/231/head -> origin/gh/guangyey/231/head 2025-12-04T09:33:40.0262776Z * [new branch] gh/guangyey/231/orig -> origin/gh/guangyey/231/orig 2025-12-04T09:33:40.0264582Z * [new branch] gh/guangyey/232/base -> origin/gh/guangyey/232/base 2025-12-04T09:33:40.0266329Z * [new branch] gh/guangyey/232/head -> origin/gh/guangyey/232/head 2025-12-04T09:33:40.0267628Z * [new branch] gh/guangyey/232/orig -> origin/gh/guangyey/232/orig 2025-12-04T09:33:40.0269433Z * [new branch] gh/guangyey/233/base -> origin/gh/guangyey/233/base 2025-12-04T09:33:40.0270581Z * [new branch] gh/guangyey/233/head -> origin/gh/guangyey/233/head 2025-12-04T09:33:40.0271859Z * [new branch] gh/guangyey/233/orig -> origin/gh/guangyey/233/orig 2025-12-04T09:33:40.0273611Z * [new branch] gh/guangyey/234/base -> origin/gh/guangyey/234/base 2025-12-04T09:33:40.0274816Z * [new branch] gh/guangyey/234/head -> origin/gh/guangyey/234/head 2025-12-04T09:33:40.0276081Z * [new branch] gh/guangyey/234/orig -> origin/gh/guangyey/234/orig 2025-12-04T09:33:40.0277870Z * [new branch] gh/guangyey/235/base -> origin/gh/guangyey/235/base 2025-12-04T09:33:40.0279081Z * [new branch] gh/guangyey/235/head -> origin/gh/guangyey/235/head 2025-12-04T09:33:40.0280341Z * [new branch] gh/guangyey/235/orig -> origin/gh/guangyey/235/orig 2025-12-04T09:33:40.0282137Z * [new branch] gh/guangyey/236/base -> origin/gh/guangyey/236/base 2025-12-04T09:33:40.0283475Z * [new branch] gh/guangyey/236/head -> origin/gh/guangyey/236/head 2025-12-04T09:33:40.0284716Z * [new branch] gh/guangyey/236/orig -> origin/gh/guangyey/236/orig 2025-12-04T09:33:40.0286514Z * [new branch] gh/guangyey/237/base -> origin/gh/guangyey/237/base 2025-12-04T09:33:40.0287697Z * [new branch] gh/guangyey/237/head -> origin/gh/guangyey/237/head 2025-12-04T09:33:40.0291111Z * [new branch] gh/guangyey/237/orig -> origin/gh/guangyey/237/orig 2025-12-04T09:33:40.0292705Z * [new branch] gh/guangyey/238/base -> origin/gh/guangyey/238/base 2025-12-04T09:33:40.0294044Z * [new branch] gh/guangyey/238/head -> origin/gh/guangyey/238/head 2025-12-04T09:33:40.0295704Z * [new branch] gh/guangyey/239/base -> origin/gh/guangyey/239/base 2025-12-04T09:33:40.0296994Z * [new branch] gh/guangyey/239/head -> origin/gh/guangyey/239/head 2025-12-04T09:33:40.0298319Z * [new branch] gh/guangyey/239/orig -> origin/gh/guangyey/239/orig 2025-12-04T09:33:40.0300075Z * [new branch] gh/guangyey/240/base -> origin/gh/guangyey/240/base 2025-12-04T09:33:40.0301379Z * [new branch] gh/guangyey/240/head -> origin/gh/guangyey/240/head 2025-12-04T09:33:40.0302620Z * [new branch] gh/guangyey/240/orig -> origin/gh/guangyey/240/orig 2025-12-04T09:33:40.0304441Z * [new branch] gh/guangyey/241/base -> origin/gh/guangyey/241/base 2025-12-04T09:33:40.0305702Z * [new branch] gh/guangyey/241/head -> origin/gh/guangyey/241/head 2025-12-04T09:33:40.0306961Z * [new branch] gh/guangyey/241/orig -> origin/gh/guangyey/241/orig 2025-12-04T09:33:40.0309153Z * [new branch] gh/guangyey/242/base -> origin/gh/guangyey/242/base 2025-12-04T09:33:40.0310409Z * [new branch] gh/guangyey/242/head -> origin/gh/guangyey/242/head 2025-12-04T09:33:40.0311707Z * [new branch] gh/guangyey/242/orig -> origin/gh/guangyey/242/orig 2025-12-04T09:33:40.0313391Z * [new branch] gh/guangyey/243/base -> origin/gh/guangyey/243/base 2025-12-04T09:33:40.0314658Z * [new branch] gh/guangyey/243/head -> origin/gh/guangyey/243/head 2025-12-04T09:33:40.0315945Z * [new branch] gh/guangyey/243/orig -> origin/gh/guangyey/243/orig 2025-12-04T09:33:40.0317722Z * [new branch] gh/guangyey/244/base -> origin/gh/guangyey/244/base 2025-12-04T09:33:40.0319002Z * [new branch] gh/guangyey/244/head -> origin/gh/guangyey/244/head 2025-12-04T09:33:40.0320273Z * [new branch] gh/guangyey/244/orig -> origin/gh/guangyey/244/orig 2025-12-04T09:33:40.0322079Z * [new branch] gh/guangyey/245/base -> origin/gh/guangyey/245/base 2025-12-04T09:33:40.0323362Z * [new branch] gh/guangyey/245/head -> origin/gh/guangyey/245/head 2025-12-04T09:33:40.0324645Z * [new branch] gh/guangyey/245/orig -> origin/gh/guangyey/245/orig 2025-12-04T09:33:40.0326357Z * [new branch] gh/guangyey/246/base -> origin/gh/guangyey/246/base 2025-12-04T09:33:40.0327650Z * [new branch] gh/guangyey/246/head -> origin/gh/guangyey/246/head 2025-12-04T09:33:40.0328939Z * [new branch] gh/guangyey/246/orig -> origin/gh/guangyey/246/orig 2025-12-04T09:33:40.0330707Z * [new branch] gh/guangyey/247/base -> origin/gh/guangyey/247/base 2025-12-04T09:33:40.0331998Z * [new branch] gh/guangyey/247/head -> origin/gh/guangyey/247/head 2025-12-04T09:33:40.0333249Z * [new branch] gh/guangyey/247/orig -> origin/gh/guangyey/247/orig 2025-12-04T09:33:40.0334988Z * [new branch] gh/guangyey/248/base -> origin/gh/guangyey/248/base 2025-12-04T09:33:40.0336305Z * [new branch] gh/guangyey/248/head -> origin/gh/guangyey/248/head 2025-12-04T09:33:40.0337468Z * [new branch] gh/guangyey/248/orig -> origin/gh/guangyey/248/orig 2025-12-04T09:33:40.0339425Z * [new branch] gh/guangyey/249/base -> origin/gh/guangyey/249/base 2025-12-04T09:33:40.0340772Z * [new branch] gh/guangyey/249/head -> origin/gh/guangyey/249/head 2025-12-04T09:33:40.0342110Z * [new branch] gh/guangyey/249/orig -> origin/gh/guangyey/249/orig 2025-12-04T09:33:40.0343806Z * [new branch] gh/guangyey/250/base -> origin/gh/guangyey/250/base 2025-12-04T09:33:40.0345173Z * [new branch] gh/guangyey/250/head -> origin/gh/guangyey/250/head 2025-12-04T09:33:40.0346387Z * [new branch] gh/guangyey/250/orig -> origin/gh/guangyey/250/orig 2025-12-04T09:33:40.0348020Z * [new branch] gh/guangyey/251/base -> origin/gh/guangyey/251/base 2025-12-04T09:33:40.0349317Z * [new branch] gh/guangyey/251/head -> origin/gh/guangyey/251/head 2025-12-04T09:33:40.0350612Z * [new branch] gh/guangyey/251/orig -> origin/gh/guangyey/251/orig 2025-12-04T09:33:40.0352310Z * [new branch] gh/guangyey/252/base -> origin/gh/guangyey/252/base 2025-12-04T09:33:40.0354020Z * [new branch] gh/guangyey/252/head -> origin/gh/guangyey/252/head 2025-12-04T09:33:40.0355311Z * [new branch] gh/guangyey/252/orig -> origin/gh/guangyey/252/orig 2025-12-04T09:33:40.0357141Z * [new branch] gh/guangyey/253/base -> origin/gh/guangyey/253/base 2025-12-04T09:33:40.0358409Z * [new branch] gh/guangyey/253/head -> origin/gh/guangyey/253/head 2025-12-04T09:33:40.0359685Z * [new branch] gh/guangyey/253/orig -> origin/gh/guangyey/253/orig 2025-12-04T09:33:40.0361441Z * [new branch] gh/guangyey/254/base -> origin/gh/guangyey/254/base 2025-12-04T09:33:40.0362764Z * [new branch] gh/guangyey/254/head -> origin/gh/guangyey/254/head 2025-12-04T09:33:40.0364016Z * [new branch] gh/guangyey/254/orig -> origin/gh/guangyey/254/orig 2025-12-04T09:33:40.0365707Z * [new branch] gh/guangyey/255/base -> origin/gh/guangyey/255/base 2025-12-04T09:33:40.0366970Z * [new branch] gh/guangyey/255/head -> origin/gh/guangyey/255/head 2025-12-04T09:33:40.0368211Z * [new branch] gh/guangyey/255/orig -> origin/gh/guangyey/255/orig 2025-12-04T09:33:40.0370404Z * [new branch] gh/guilhermeleobas/107/base -> origin/gh/guilhermeleobas/107/base 2025-12-04T09:33:40.0371707Z * [new branch] gh/guilhermeleobas/107/head -> origin/gh/guilhermeleobas/107/head 2025-12-04T09:33:40.0373318Z * [new branch] gh/guilhermeleobas/107/orig -> origin/gh/guilhermeleobas/107/orig 2025-12-04T09:33:40.0375402Z * [new branch] gh/guilhermeleobas/108/base -> origin/gh/guilhermeleobas/108/base 2025-12-04T09:33:40.0376628Z * [new branch] gh/guilhermeleobas/108/head -> origin/gh/guilhermeleobas/108/head 2025-12-04T09:33:40.0377837Z * [new branch] gh/guilhermeleobas/108/orig -> origin/gh/guilhermeleobas/108/orig 2025-12-04T09:33:40.0379776Z * [new branch] gh/guilhermeleobas/150/base -> origin/gh/guilhermeleobas/150/base 2025-12-04T09:33:40.0382370Z * [new branch] gh/guilhermeleobas/150/head -> origin/gh/guilhermeleobas/150/head 2025-12-04T09:33:40.0383645Z * [new branch] gh/guilhermeleobas/150/orig -> origin/gh/guilhermeleobas/150/orig 2025-12-04T09:33:40.0385790Z * [new branch] gh/guilhermeleobas/168/base -> origin/gh/guilhermeleobas/168/base 2025-12-04T09:33:40.0387054Z * [new branch] gh/guilhermeleobas/168/head -> origin/gh/guilhermeleobas/168/head 2025-12-04T09:33:40.0389476Z * [new branch] gh/guilhermeleobas/168/orig -> origin/gh/guilhermeleobas/168/orig 2025-12-04T09:33:40.0390367Z * [new branch] gh/guilhermeleobas/169/base -> origin/gh/guilhermeleobas/169/base 2025-12-04T09:33:40.0391588Z * [new branch] gh/guilhermeleobas/169/head -> origin/gh/guilhermeleobas/169/head 2025-12-04T09:33:40.0395200Z * [new branch] gh/guilhermeleobas/169/orig -> origin/gh/guilhermeleobas/169/orig 2025-12-04T09:33:40.0396736Z * [new branch] gh/guilhermeleobas/170/base -> origin/gh/guilhermeleobas/170/base 2025-12-04T09:33:40.0397054Z * [new branch] gh/guilhermeleobas/170/head -> origin/gh/guilhermeleobas/170/head 2025-12-04T09:33:40.0397357Z * [new branch] gh/guilhermeleobas/170/orig -> origin/gh/guilhermeleobas/170/orig 2025-12-04T09:33:40.0399030Z * [new branch] gh/guilhermeleobas/171/base -> origin/gh/guilhermeleobas/171/base 2025-12-04T09:33:40.0400262Z * [new branch] gh/guilhermeleobas/171/head -> origin/gh/guilhermeleobas/171/head 2025-12-04T09:33:40.0401560Z * [new branch] gh/guilhermeleobas/171/orig -> origin/gh/guilhermeleobas/171/orig 2025-12-04T09:33:40.0403190Z * [new branch] gh/guilhermeleobas/173/base -> origin/gh/guilhermeleobas/173/base 2025-12-04T09:33:40.0404481Z * [new branch] gh/guilhermeleobas/173/head -> origin/gh/guilhermeleobas/173/head 2025-12-04T09:33:40.0405746Z * [new branch] gh/guilhermeleobas/173/orig -> origin/gh/guilhermeleobas/173/orig 2025-12-04T09:33:40.0407481Z * [new branch] gh/guilhermeleobas/193/base -> origin/gh/guilhermeleobas/193/base 2025-12-04T09:33:40.0408741Z * [new branch] gh/guilhermeleobas/193/head -> origin/gh/guilhermeleobas/193/head 2025-12-04T09:33:40.0410268Z * [new branch] gh/guilhermeleobas/193/orig -> origin/gh/guilhermeleobas/193/orig 2025-12-04T09:33:40.0411989Z * [new branch] gh/guilhermeleobas/204/base -> origin/gh/guilhermeleobas/204/base 2025-12-04T09:33:40.0413238Z * [new branch] gh/guilhermeleobas/204/head -> origin/gh/guilhermeleobas/204/head 2025-12-04T09:33:40.0414512Z * [new branch] gh/guilhermeleobas/204/orig -> origin/gh/guilhermeleobas/204/orig 2025-12-04T09:33:40.0416189Z * [new branch] gh/guilhermeleobas/211/base -> origin/gh/guilhermeleobas/211/base 2025-12-04T09:33:40.0417449Z * [new branch] gh/guilhermeleobas/211/head -> origin/gh/guilhermeleobas/211/head 2025-12-04T09:33:40.0418847Z * [new branch] gh/guilhermeleobas/211/orig -> origin/gh/guilhermeleobas/211/orig 2025-12-04T09:33:40.0420555Z * [new branch] gh/guilhermeleobas/226/base -> origin/gh/guilhermeleobas/226/base 2025-12-04T09:33:40.0421834Z * [new branch] gh/guilhermeleobas/226/head -> origin/gh/guilhermeleobas/226/head 2025-12-04T09:33:40.0423080Z * [new branch] gh/guilhermeleobas/226/orig -> origin/gh/guilhermeleobas/226/orig 2025-12-04T09:33:40.0424754Z * [new branch] gh/guilhermeleobas/236/base -> origin/gh/guilhermeleobas/236/base 2025-12-04T09:33:40.0425996Z * [new branch] gh/guilhermeleobas/236/head -> origin/gh/guilhermeleobas/236/head 2025-12-04T09:33:40.0427351Z * [new branch] gh/guilhermeleobas/236/orig -> origin/gh/guilhermeleobas/236/orig 2025-12-04T09:33:40.0429038Z * [new branch] gh/guilhermeleobas/247/base -> origin/gh/guilhermeleobas/247/base 2025-12-04T09:33:40.0430761Z * [new branch] gh/guilhermeleobas/247/head -> origin/gh/guilhermeleobas/247/head 2025-12-04T09:33:40.0432196Z * [new branch] gh/guilhermeleobas/247/orig -> origin/gh/guilhermeleobas/247/orig 2025-12-04T09:33:40.0434242Z * [new branch] gh/guilhermeleobas/248/base -> origin/gh/guilhermeleobas/248/base 2025-12-04T09:33:40.0435540Z * [new branch] gh/guilhermeleobas/248/head -> origin/gh/guilhermeleobas/248/head 2025-12-04T09:33:40.0436887Z * [new branch] gh/guilhermeleobas/248/orig -> origin/gh/guilhermeleobas/248/orig 2025-12-04T09:33:40.0438633Z * [new branch] gh/guilhermeleobas/250/base -> origin/gh/guilhermeleobas/250/base 2025-12-04T09:33:40.0439792Z * [new branch] gh/guilhermeleobas/250/head -> origin/gh/guilhermeleobas/250/head 2025-12-04T09:33:40.0441170Z * [new branch] gh/guilhermeleobas/250/orig -> origin/gh/guilhermeleobas/250/orig 2025-12-04T09:33:40.0443279Z * [new branch] gh/guilhermeleobas/253/base -> origin/gh/guilhermeleobas/253/base 2025-12-04T09:33:40.0444523Z * [new branch] gh/guilhermeleobas/253/head -> origin/gh/guilhermeleobas/253/head 2025-12-04T09:33:40.0445964Z * [new branch] gh/guilhermeleobas/253/orig -> origin/gh/guilhermeleobas/253/orig 2025-12-04T09:33:40.0447730Z * [new branch] gh/guilhermeleobas/254/base -> origin/gh/guilhermeleobas/254/base 2025-12-04T09:33:40.0448981Z * [new branch] gh/guilhermeleobas/254/head -> origin/gh/guilhermeleobas/254/head 2025-12-04T09:33:40.0450237Z * [new branch] gh/guilhermeleobas/254/orig -> origin/gh/guilhermeleobas/254/orig 2025-12-04T09:33:40.0451957Z * [new branch] gh/guilhermeleobas/255/base -> origin/gh/guilhermeleobas/255/base 2025-12-04T09:33:40.0453239Z * [new branch] gh/guilhermeleobas/255/head -> origin/gh/guilhermeleobas/255/head 2025-12-04T09:33:40.0454660Z * [new branch] gh/guilhermeleobas/255/orig -> origin/gh/guilhermeleobas/255/orig 2025-12-04T09:33:40.0456370Z * [new branch] gh/guilhermeleobas/256/base -> origin/gh/guilhermeleobas/256/base 2025-12-04T09:33:40.0457909Z * [new branch] gh/guilhermeleobas/256/head -> origin/gh/guilhermeleobas/256/head 2025-12-04T09:33:40.0459058Z * [new branch] gh/guilhermeleobas/256/orig -> origin/gh/guilhermeleobas/256/orig 2025-12-04T09:33:40.0460801Z * [new branch] gh/guilhermeleobas/257/base -> origin/gh/guilhermeleobas/257/base 2025-12-04T09:33:40.0462023Z * [new branch] gh/guilhermeleobas/257/head -> origin/gh/guilhermeleobas/257/head 2025-12-04T09:33:40.0463581Z * [new branch] gh/guilhermeleobas/257/orig -> origin/gh/guilhermeleobas/257/orig 2025-12-04T09:33:40.0465580Z * [new branch] gh/guilhermeleobas/258/base -> origin/gh/guilhermeleobas/258/base 2025-12-04T09:33:40.0466422Z * [new branch] gh/guilhermeleobas/258/head -> origin/gh/guilhermeleobas/258/head 2025-12-04T09:33:40.0467801Z * [new branch] gh/guilhermeleobas/258/orig -> origin/gh/guilhermeleobas/258/orig 2025-12-04T09:33:40.0469553Z * [new branch] gh/guilhermeleobas/259/base -> origin/gh/guilhermeleobas/259/base 2025-12-04T09:33:40.0470797Z * [new branch] gh/guilhermeleobas/259/head -> origin/gh/guilhermeleobas/259/head 2025-12-04T09:33:40.0472107Z * [new branch] gh/guilhermeleobas/259/orig -> origin/gh/guilhermeleobas/259/orig 2025-12-04T09:33:40.0473863Z * [new branch] gh/guilhermeleobas/260/base -> origin/gh/guilhermeleobas/260/base 2025-12-04T09:33:40.0475108Z * [new branch] gh/guilhermeleobas/260/head -> origin/gh/guilhermeleobas/260/head 2025-12-04T09:33:40.0476379Z * [new branch] gh/guilhermeleobas/260/orig -> origin/gh/guilhermeleobas/260/orig 2025-12-04T09:33:40.0478132Z * [new branch] gh/guilhermeleobas/261/base -> origin/gh/guilhermeleobas/261/base 2025-12-04T09:33:40.0479400Z * [new branch] gh/guilhermeleobas/261/head -> origin/gh/guilhermeleobas/261/head 2025-12-04T09:33:40.0480744Z * [new branch] gh/guilhermeleobas/261/orig -> origin/gh/guilhermeleobas/261/orig 2025-12-04T09:33:40.0482518Z * [new branch] gh/guilhermeleobas/262/base -> origin/gh/guilhermeleobas/262/base 2025-12-04T09:33:40.0483850Z * [new branch] gh/guilhermeleobas/262/head -> origin/gh/guilhermeleobas/262/head 2025-12-04T09:33:40.0485091Z * [new branch] gh/guilhermeleobas/262/orig -> origin/gh/guilhermeleobas/262/orig 2025-12-04T09:33:40.0486882Z * [new branch] gh/guilhermeleobas/263/base -> origin/gh/guilhermeleobas/263/base 2025-12-04T09:33:40.0488231Z * [new branch] gh/guilhermeleobas/263/head -> origin/gh/guilhermeleobas/263/head 2025-12-04T09:33:40.0489635Z * [new branch] gh/guilhermeleobas/263/orig -> origin/gh/guilhermeleobas/263/orig 2025-12-04T09:33:40.0491421Z * [new branch] gh/guilhermeleobas/264/base -> origin/gh/guilhermeleobas/264/base 2025-12-04T09:33:40.0492707Z * [new branch] gh/guilhermeleobas/264/head -> origin/gh/guilhermeleobas/264/head 2025-12-04T09:33:40.0493941Z * [new branch] gh/guilhermeleobas/264/orig -> origin/gh/guilhermeleobas/264/orig 2025-12-04T09:33:40.0495692Z * [new branch] gh/guilhermeleobas/265/base -> origin/gh/guilhermeleobas/265/base 2025-12-04T09:33:40.0496992Z * [new branch] gh/guilhermeleobas/265/head -> origin/gh/guilhermeleobas/265/head 2025-12-04T09:33:40.0498555Z * [new branch] gh/guilhermeleobas/265/orig -> origin/gh/guilhermeleobas/265/orig 2025-12-04T09:33:40.0500364Z * [new branch] gh/guilhermeleobas/266/base -> origin/gh/guilhermeleobas/266/base 2025-12-04T09:33:40.0501632Z * [new branch] gh/guilhermeleobas/266/head -> origin/gh/guilhermeleobas/266/head 2025-12-04T09:33:40.0502856Z * [new branch] gh/guilhermeleobas/266/orig -> origin/gh/guilhermeleobas/266/orig 2025-12-04T09:33:40.0504656Z * [new branch] gh/guilhermeleobas/267/base -> origin/gh/guilhermeleobas/267/base 2025-12-04T09:33:40.0505903Z * [new branch] gh/guilhermeleobas/267/head -> origin/gh/guilhermeleobas/267/head 2025-12-04T09:33:40.0507209Z * [new branch] gh/guilhermeleobas/267/orig -> origin/gh/guilhermeleobas/267/orig 2025-12-04T09:33:40.0509266Z * [new branch] gh/hameerabbasi/1/base -> origin/gh/hameerabbasi/1/base 2025-12-04T09:33:40.0511038Z * [new branch] gh/hameerabbasi/1/head -> origin/gh/hameerabbasi/1/head 2025-12-04T09:33:40.0512134Z * [new branch] gh/hameerabbasi/2/base -> origin/gh/hameerabbasi/2/base 2025-12-04T09:33:40.0513406Z * [new branch] gh/hameerabbasi/2/head -> origin/gh/hameerabbasi/2/head 2025-12-04T09:33:40.0514824Z * [new branch] gh/hameerabbasi/2/orig -> origin/gh/hameerabbasi/2/orig 2025-12-04T09:33:40.0516451Z * [new branch] gh/hameerabbasi/3/base -> origin/gh/hameerabbasi/3/base 2025-12-04T09:33:40.0517702Z * [new branch] gh/hameerabbasi/3/head -> origin/gh/hameerabbasi/3/head 2025-12-04T09:33:40.0519160Z * [new branch] gh/hameerabbasi/3/orig -> origin/gh/hameerabbasi/3/orig 2025-12-04T09:33:40.0520759Z * [new branch] gh/hameerabbasi/4/base -> origin/gh/hameerabbasi/4/base 2025-12-04T09:33:40.0522049Z * [new branch] gh/hameerabbasi/4/head -> origin/gh/hameerabbasi/4/head 2025-12-04T09:33:40.0523269Z * [new branch] gh/hameerabbasi/4/orig -> origin/gh/hameerabbasi/4/orig 2025-12-04T09:33:40.0525239Z * [new branch] gh/huydhn/1/next -> origin/gh/huydhn/1/next 2025-12-04T09:33:40.0526843Z * [new branch] gh/huydhn/2/next -> origin/gh/huydhn/2/next 2025-12-04T09:33:40.0528493Z * [new branch] gh/huydhn/3/next -> origin/gh/huydhn/3/next 2025-12-04T09:33:40.0530263Z * [new branch] gh/huydhn/4/next -> origin/gh/huydhn/4/next 2025-12-04T09:33:40.0531869Z * [new branch] gh/huydhn/5/next -> origin/gh/huydhn/5/next 2025-12-04T09:33:40.0533528Z * [new branch] gh/huydhn/6/next -> origin/gh/huydhn/6/next 2025-12-04T09:33:40.0535448Z * [new branch] gh/int3/97/base -> origin/gh/int3/97/base 2025-12-04T09:33:40.0536758Z * [new branch] gh/int3/97/head -> origin/gh/int3/97/head 2025-12-04T09:33:40.0539056Z * [new branch] gh/isuruf/101/base -> origin/gh/isuruf/101/base 2025-12-04T09:33:40.0540238Z * [new branch] gh/isuruf/101/head -> origin/gh/isuruf/101/head 2025-12-04T09:33:40.0541903Z * [new branch] gh/isuruf/146/base -> origin/gh/isuruf/146/base 2025-12-04T09:33:40.0543251Z * [new branch] gh/isuruf/146/head -> origin/gh/isuruf/146/head 2025-12-04T09:33:40.0544518Z * [new branch] gh/isuruf/146/orig -> origin/gh/isuruf/146/orig 2025-12-04T09:33:40.0546201Z * [new branch] gh/isuruf/158/base -> origin/gh/isuruf/158/base 2025-12-04T09:33:40.0547437Z * [new branch] gh/isuruf/158/head -> origin/gh/isuruf/158/head 2025-12-04T09:33:40.0548999Z * [new branch] gh/isuruf/159/base -> origin/gh/isuruf/159/base 2025-12-04T09:33:40.0550279Z * [new branch] gh/isuruf/159/head -> origin/gh/isuruf/159/head 2025-12-04T09:33:40.0551957Z * [new branch] gh/isuruf/160/base -> origin/gh/isuruf/160/base 2025-12-04T09:33:40.0553693Z * [new branch] gh/isuruf/160/head -> origin/gh/isuruf/160/head 2025-12-04T09:33:40.0555021Z * [new branch] gh/isuruf/160/orig -> origin/gh/isuruf/160/orig 2025-12-04T09:33:40.0556705Z * [new branch] gh/isuruf/81/base -> origin/gh/isuruf/81/base 2025-12-04T09:33:40.0557973Z * [new branch] gh/isuruf/81/head -> origin/gh/isuruf/81/head 2025-12-04T09:33:40.0559248Z * [new branch] gh/isuruf/81/orig -> origin/gh/isuruf/81/orig 2025-12-04T09:33:40.0561321Z * [new branch] gh/jamesjwu/176/base -> origin/gh/jamesjwu/176/base 2025-12-04T09:33:40.0563135Z * [new branch] gh/jamesjwu/176/head -> origin/gh/jamesjwu/176/head 2025-12-04T09:33:40.0564381Z * [new branch] gh/jamesjwu/176/orig -> origin/gh/jamesjwu/176/orig 2025-12-04T09:33:40.0566052Z * [new branch] gh/jamesjwu/187/base -> origin/gh/jamesjwu/187/base 2025-12-04T09:33:40.0567305Z * [new branch] gh/jamesjwu/187/head -> origin/gh/jamesjwu/187/head 2025-12-04T09:33:40.0568553Z * [new branch] gh/jamesjwu/187/orig -> origin/gh/jamesjwu/187/orig 2025-12-04T09:33:40.0570266Z * [new branch] gh/jamesjwu/196/base -> origin/gh/jamesjwu/196/base 2025-12-04T09:33:40.0571523Z * [new branch] gh/jamesjwu/196/head -> origin/gh/jamesjwu/196/head 2025-12-04T09:33:40.0572836Z * [new branch] gh/jamesjwu/196/orig -> origin/gh/jamesjwu/196/orig 2025-12-04T09:33:40.0574452Z * [new branch] gh/jamesjwu/198/base -> origin/gh/jamesjwu/198/base 2025-12-04T09:33:40.0575752Z * [new branch] gh/jamesjwu/198/head -> origin/gh/jamesjwu/198/head 2025-12-04T09:33:40.0577122Z * [new branch] gh/jamesjwu/198/orig -> origin/gh/jamesjwu/198/orig 2025-12-04T09:33:40.0578894Z * [new branch] gh/jamesjwu/207/base -> origin/gh/jamesjwu/207/base 2025-12-04T09:33:40.0580390Z * [new branch] gh/jamesjwu/207/head -> origin/gh/jamesjwu/207/head 2025-12-04T09:33:40.0581668Z * [new branch] gh/jamesjwu/207/orig -> origin/gh/jamesjwu/207/orig 2025-12-04T09:33:40.0583487Z * [new branch] gh/jamesjwu/208/base -> origin/gh/jamesjwu/208/base 2025-12-04T09:33:40.0584768Z * [new branch] gh/jamesjwu/208/head -> origin/gh/jamesjwu/208/head 2025-12-04T09:33:40.0586042Z * [new branch] gh/jamesjwu/208/orig -> origin/gh/jamesjwu/208/orig 2025-12-04T09:33:40.0587935Z * [new branch] gh/jamesjwu/52/base -> origin/gh/jamesjwu/52/base 2025-12-04T09:33:40.0589580Z * [new branch] gh/jamesjwu/52/head -> origin/gh/jamesjwu/52/head 2025-12-04T09:33:40.0591103Z * [new branch] gh/jamesjwu/53/base -> origin/gh/jamesjwu/53/base 2025-12-04T09:33:40.0592175Z * [new branch] gh/jamesjwu/53/head -> origin/gh/jamesjwu/53/head 2025-12-04T09:33:40.0593785Z * [new branch] gh/jamesjwu/54/base -> origin/gh/jamesjwu/54/base 2025-12-04T09:33:40.0594919Z * [new branch] gh/jamesjwu/54/head -> origin/gh/jamesjwu/54/head 2025-12-04T09:33:40.0596513Z * [new branch] gh/jamesjwu/55/base -> origin/gh/jamesjwu/55/base 2025-12-04T09:33:40.0597705Z * [new branch] gh/jamesjwu/55/head -> origin/gh/jamesjwu/55/head 2025-12-04T09:33:40.0599215Z * [new branch] gh/jamesjwu/56/base -> origin/gh/jamesjwu/56/base 2025-12-04T09:33:40.0600458Z * [new branch] gh/jamesjwu/56/head -> origin/gh/jamesjwu/56/head 2025-12-04T09:33:40.0601965Z * [new branch] gh/jamesjwu/57/base -> origin/gh/jamesjwu/57/base 2025-12-04T09:33:40.0603180Z * [new branch] gh/jamesjwu/57/head -> origin/gh/jamesjwu/57/head 2025-12-04T09:33:40.0604694Z * [new branch] gh/jamesjwu/58/base -> origin/gh/jamesjwu/58/base 2025-12-04T09:33:40.0605942Z * [new branch] gh/jamesjwu/58/head -> origin/gh/jamesjwu/58/head 2025-12-04T09:33:40.0607474Z * [new branch] gh/jamesjwu/59/base -> origin/gh/jamesjwu/59/base 2025-12-04T09:33:40.0608875Z * [new branch] gh/jamesjwu/59/head -> origin/gh/jamesjwu/59/head 2025-12-04T09:33:40.0610418Z * [new branch] gh/jamesjwu/60/base -> origin/gh/jamesjwu/60/base 2025-12-04T09:33:40.0611601Z * [new branch] gh/jamesjwu/60/head -> origin/gh/jamesjwu/60/head 2025-12-04T09:33:40.0613116Z * [new branch] gh/jamesjwu/61/base -> origin/gh/jamesjwu/61/base 2025-12-04T09:33:40.0614328Z * [new branch] gh/jamesjwu/61/head -> origin/gh/jamesjwu/61/head 2025-12-04T09:33:40.0615887Z * [new branch] gh/jamesjwu/62/base -> origin/gh/jamesjwu/62/base 2025-12-04T09:33:40.0617042Z * [new branch] gh/jamesjwu/62/head -> origin/gh/jamesjwu/62/head 2025-12-04T09:33:40.0618613Z * [new branch] gh/jamesjwu/63/base -> origin/gh/jamesjwu/63/base 2025-12-04T09:33:40.0620061Z * [new branch] gh/jamesjwu/63/head -> origin/gh/jamesjwu/63/head 2025-12-04T09:33:40.0622225Z * [new branch] gh/jamesjwu/64/base -> origin/gh/jamesjwu/64/base 2025-12-04T09:33:40.0623510Z * [new branch] gh/jamesjwu/64/head -> origin/gh/jamesjwu/64/head 2025-12-04T09:33:40.0625449Z * [new branch] gh/jamesjwu/65/base -> origin/gh/jamesjwu/65/base 2025-12-04T09:33:40.0626698Z * [new branch] gh/jamesjwu/65/head -> origin/gh/jamesjwu/65/head 2025-12-04T09:33:40.0629331Z * [new branch] gh/janeyx99/165/base -> origin/gh/janeyx99/165/base 2025-12-04T09:33:40.0630665Z * [new branch] gh/janeyx99/165/head -> origin/gh/janeyx99/165/head 2025-12-04T09:33:40.0631986Z * [new branch] gh/janeyx99/165/orig -> origin/gh/janeyx99/165/orig 2025-12-04T09:33:40.0633550Z * [new branch] gh/janeyx99/201/base -> origin/gh/janeyx99/201/base 2025-12-04T09:33:40.0634804Z * [new branch] gh/janeyx99/201/head -> origin/gh/janeyx99/201/head 2025-12-04T09:33:40.0636101Z * [new branch] gh/janeyx99/201/orig -> origin/gh/janeyx99/201/orig 2025-12-04T09:33:40.0638091Z * [new branch] gh/janeyx99/225/base -> origin/gh/janeyx99/225/base 2025-12-04T09:33:40.0639481Z * [new branch] gh/janeyx99/225/head -> origin/gh/janeyx99/225/head 2025-12-04T09:33:40.0640722Z * [new branch] gh/janeyx99/225/orig -> origin/gh/janeyx99/225/orig 2025-12-04T09:33:40.0642532Z * [new branch] gh/janeyx99/299/base -> origin/gh/janeyx99/299/base 2025-12-04T09:33:40.0643857Z * [new branch] gh/janeyx99/299/head -> origin/gh/janeyx99/299/head 2025-12-04T09:33:40.0645032Z * [new branch] gh/janeyx99/299/orig -> origin/gh/janeyx99/299/orig 2025-12-04T09:33:40.0647151Z * [new branch] gh/janeyx99/302/base -> origin/gh/janeyx99/302/base 2025-12-04T09:33:40.0648460Z * [new branch] gh/janeyx99/302/head -> origin/gh/janeyx99/302/head 2025-12-04T09:33:40.0650048Z * [new branch] gh/janeyx99/303/base -> origin/gh/janeyx99/303/base 2025-12-04T09:33:40.0651266Z * [new branch] gh/janeyx99/303/head -> origin/gh/janeyx99/303/head 2025-12-04T09:33:40.0652939Z * [new branch] gh/janeyx99/305/base -> origin/gh/janeyx99/305/base 2025-12-04T09:33:40.0654218Z * [new branch] gh/janeyx99/305/head -> origin/gh/janeyx99/305/head 2025-12-04T09:33:40.0655788Z * [new branch] gh/janeyx99/306/base -> origin/gh/janeyx99/306/base 2025-12-04T09:33:40.0656945Z * [new branch] gh/janeyx99/306/head -> origin/gh/janeyx99/306/head 2025-12-04T09:33:40.0658804Z * [new branch] gh/janeyx99/314/base -> origin/gh/janeyx99/314/base 2025-12-04T09:33:40.0660222Z * [new branch] gh/janeyx99/314/head -> origin/gh/janeyx99/314/head 2025-12-04T09:33:40.0661519Z * [new branch] gh/janeyx99/314/orig -> origin/gh/janeyx99/314/orig 2025-12-04T09:33:40.0663195Z * [new branch] gh/janeyx99/315/base -> origin/gh/janeyx99/315/base 2025-12-04T09:33:40.0664500Z * [new branch] gh/janeyx99/315/head -> origin/gh/janeyx99/315/head 2025-12-04T09:33:40.0665817Z * [new branch] gh/janeyx99/315/orig -> origin/gh/janeyx99/315/orig 2025-12-04T09:33:40.0667594Z * [new branch] gh/janeyx99/316/base -> origin/gh/janeyx99/316/base 2025-12-04T09:33:40.0669335Z * [new branch] gh/janeyx99/316/head -> origin/gh/janeyx99/316/head 2025-12-04T09:33:40.0670630Z * [new branch] gh/janeyx99/316/orig -> origin/gh/janeyx99/316/orig 2025-12-04T09:33:40.0672496Z * [new branch] gh/janeyx99/317/base -> origin/gh/janeyx99/317/base 2025-12-04T09:33:40.0673717Z * [new branch] gh/janeyx99/317/head -> origin/gh/janeyx99/317/head 2025-12-04T09:33:40.0674924Z * [new branch] gh/janeyx99/317/orig -> origin/gh/janeyx99/317/orig 2025-12-04T09:33:40.0676762Z * [new branch] gh/janeyx99/325/base -> origin/gh/janeyx99/325/base 2025-12-04T09:33:40.0678061Z * [new branch] gh/janeyx99/325/head -> origin/gh/janeyx99/325/head 2025-12-04T09:33:40.0679296Z * [new branch] gh/janeyx99/325/orig -> origin/gh/janeyx99/325/orig 2025-12-04T09:33:40.0681000Z * [new branch] gh/janeyx99/327/base -> origin/gh/janeyx99/327/base 2025-12-04T09:33:40.0682302Z * [new branch] gh/janeyx99/327/head -> origin/gh/janeyx99/327/head 2025-12-04T09:33:40.0683578Z * [new branch] gh/janeyx99/327/orig -> origin/gh/janeyx99/327/orig 2025-12-04T09:33:40.0685295Z * [new branch] gh/janeyx99/328/base -> origin/gh/janeyx99/328/base 2025-12-04T09:33:40.0686628Z * [new branch] gh/janeyx99/328/head -> origin/gh/janeyx99/328/head 2025-12-04T09:33:40.0688111Z * [new branch] gh/janeyx99/328/orig -> origin/gh/janeyx99/328/orig 2025-12-04T09:33:40.0692600Z * [new branch] gh/janeyx99/329/base -> origin/gh/janeyx99/329/base 2025-12-04T09:33:40.0693971Z * [new branch] gh/janeyx99/329/head -> origin/gh/janeyx99/329/head 2025-12-04T09:33:40.0695232Z * [new branch] gh/janeyx99/329/orig -> origin/gh/janeyx99/329/orig 2025-12-04T09:33:40.0697849Z * [new branch] gh/janeyx99/330/base -> origin/gh/janeyx99/330/base 2025-12-04T09:33:40.0699189Z * [new branch] gh/janeyx99/330/head -> origin/gh/janeyx99/330/head 2025-12-04T09:33:40.0700295Z * [new branch] gh/janeyx99/330/orig -> origin/gh/janeyx99/330/orig 2025-12-04T09:33:40.0702375Z * [new branch] gh/janeyx99/331/base -> origin/gh/janeyx99/331/base 2025-12-04T09:33:40.0703317Z * [new branch] gh/janeyx99/331/head -> origin/gh/janeyx99/331/head 2025-12-04T09:33:40.0704581Z * [new branch] gh/janeyx99/331/orig -> origin/gh/janeyx99/331/orig 2025-12-04T09:33:40.0706505Z * [new branch] gh/janeyx99/332/base -> origin/gh/janeyx99/332/base 2025-12-04T09:33:40.0707627Z * [new branch] gh/janeyx99/332/head -> origin/gh/janeyx99/332/head 2025-12-04T09:33:40.0708866Z * [new branch] gh/janeyx99/332/orig -> origin/gh/janeyx99/332/orig 2025-12-04T09:33:40.0710567Z * [new branch] gh/janeyx99/333/base -> origin/gh/janeyx99/333/base 2025-12-04T09:33:40.0711716Z * [new branch] gh/janeyx99/333/head -> origin/gh/janeyx99/333/head 2025-12-04T09:33:40.0712980Z * [new branch] gh/janeyx99/333/orig -> origin/gh/janeyx99/333/orig 2025-12-04T09:33:40.0715243Z * [new branch] gh/janeyx99/88/base -> origin/gh/janeyx99/88/base 2025-12-04T09:33:40.0716246Z * [new branch] gh/janeyx99/88/head -> origin/gh/janeyx99/88/head 2025-12-04T09:33:40.0717530Z * [new branch] gh/janeyx99/88/orig -> origin/gh/janeyx99/88/orig 2025-12-04T09:33:40.0719604Z * [new branch] gh/jansel/360/base -> origin/gh/jansel/360/base 2025-12-04T09:33:40.0720784Z * [new branch] gh/jansel/360/head -> origin/gh/jansel/360/head 2025-12-04T09:33:40.0722517Z * [new branch] gh/jansel/451/base -> origin/gh/jansel/451/base 2025-12-04T09:33:40.0723748Z * [new branch] gh/jansel/451/head -> origin/gh/jansel/451/head 2025-12-04T09:33:40.0724965Z * [new branch] gh/jansel/451/orig -> origin/gh/jansel/451/orig 2025-12-04T09:33:40.0726734Z * [new branch] gh/jansel/462/base -> origin/gh/jansel/462/base 2025-12-04T09:33:40.0727898Z * [new branch] gh/jansel/462/head -> origin/gh/jansel/462/head 2025-12-04T09:33:40.0729222Z * [new branch] gh/jansel/462/orig -> origin/gh/jansel/462/orig 2025-12-04T09:33:40.0731051Z * [new branch] gh/jansel/533/base -> origin/gh/jansel/533/base 2025-12-04T09:33:40.0732157Z * [new branch] gh/jansel/533/head -> origin/gh/jansel/533/head 2025-12-04T09:33:40.0733405Z * [new branch] gh/jansel/533/orig -> origin/gh/jansel/533/orig 2025-12-04T09:33:40.0735168Z * [new branch] gh/jansel/552/base -> origin/gh/jansel/552/base 2025-12-04T09:33:40.0736304Z * [new branch] gh/jansel/552/head -> origin/gh/jansel/552/head 2025-12-04T09:33:40.0737529Z * [new branch] gh/jansel/552/orig -> origin/gh/jansel/552/orig 2025-12-04T09:33:40.0739414Z * [new branch] gh/jansel/553/base -> origin/gh/jansel/553/base 2025-12-04T09:33:40.0740597Z * [new branch] gh/jansel/553/head -> origin/gh/jansel/553/head 2025-12-04T09:33:40.0742465Z * [new branch] gh/jansel/553/orig -> origin/gh/jansel/553/orig 2025-12-04T09:33:40.0744075Z * [new branch] gh/jansel/554/base -> origin/gh/jansel/554/base 2025-12-04T09:33:40.0745222Z * [new branch] gh/jansel/554/head -> origin/gh/jansel/554/head 2025-12-04T09:33:40.0746460Z * [new branch] gh/jansel/554/orig -> origin/gh/jansel/554/orig 2025-12-04T09:33:40.0748370Z * [new branch] gh/jansel/555/base -> origin/gh/jansel/555/base 2025-12-04T09:33:40.0749653Z * [new branch] gh/jansel/555/head -> origin/gh/jansel/555/head 2025-12-04T09:33:40.0750838Z * [new branch] gh/jansel/555/orig -> origin/gh/jansel/555/orig 2025-12-04T09:33:40.0752618Z * [new branch] gh/jansel/556/base -> origin/gh/jansel/556/base 2025-12-04T09:33:40.0753781Z * [new branch] gh/jansel/556/head -> origin/gh/jansel/556/head 2025-12-04T09:33:40.0755025Z * [new branch] gh/jansel/556/orig -> origin/gh/jansel/556/orig 2025-12-04T09:33:40.0756956Z * [new branch] gh/jansel/557/base -> origin/gh/jansel/557/base 2025-12-04T09:33:40.0758048Z * [new branch] gh/jansel/557/head -> origin/gh/jansel/557/head 2025-12-04T09:33:40.0759315Z * [new branch] gh/jansel/557/orig -> origin/gh/jansel/557/orig 2025-12-04T09:33:40.0761676Z * [new branch] gh/jansel/558/base -> origin/gh/jansel/558/base 2025-12-04T09:33:40.0762821Z * [new branch] gh/jansel/558/head -> origin/gh/jansel/558/head 2025-12-04T09:33:40.0764039Z * [new branch] gh/jansel/558/orig -> origin/gh/jansel/558/orig 2025-12-04T09:33:40.0765901Z * [new branch] gh/jansel/559/base -> origin/gh/jansel/559/base 2025-12-04T09:33:40.0767048Z * [new branch] gh/jansel/559/head -> origin/gh/jansel/559/head 2025-12-04T09:33:40.0768315Z * [new branch] gh/jansel/559/orig -> origin/gh/jansel/559/orig 2025-12-04T09:33:40.0770033Z * [new branch] gh/jansel/560/base -> origin/gh/jansel/560/base 2025-12-04T09:33:40.0771204Z * [new branch] gh/jansel/560/head -> origin/gh/jansel/560/head 2025-12-04T09:33:40.0772445Z * [new branch] gh/jansel/560/orig -> origin/gh/jansel/560/orig 2025-12-04T09:33:40.0774211Z * [new branch] gh/jansel/561/base -> origin/gh/jansel/561/base 2025-12-04T09:33:40.0775321Z * [new branch] gh/jansel/561/head -> origin/gh/jansel/561/head 2025-12-04T09:33:40.0776536Z * [new branch] gh/jansel/561/orig -> origin/gh/jansel/561/orig 2025-12-04T09:33:40.0778416Z * [new branch] gh/jansel/562/base -> origin/gh/jansel/562/base 2025-12-04T09:33:40.0779682Z * [new branch] gh/jansel/562/head -> origin/gh/jansel/562/head 2025-12-04T09:33:40.0780881Z * [new branch] gh/jansel/562/orig -> origin/gh/jansel/562/orig 2025-12-04T09:33:40.0782794Z * [new branch] gh/jansel/563/base -> origin/gh/jansel/563/base 2025-12-04T09:33:40.0783944Z * [new branch] gh/jansel/563/head -> origin/gh/jansel/563/head 2025-12-04T09:33:40.0785197Z * [new branch] gh/jansel/563/orig -> origin/gh/jansel/563/orig 2025-12-04T09:33:40.0787869Z * [new branch] gh/jansel/564/base -> origin/gh/jansel/564/base 2025-12-04T09:33:40.0789328Z * [new branch] gh/jansel/564/head -> origin/gh/jansel/564/head 2025-12-04T09:33:40.0790502Z * [new branch] gh/jansel/564/orig -> origin/gh/jansel/564/orig 2025-12-04T09:33:40.0792332Z * [new branch] gh/jansel/565/base -> origin/gh/jansel/565/base 2025-12-04T09:33:40.0793532Z * [new branch] gh/jansel/565/head -> origin/gh/jansel/565/head 2025-12-04T09:33:40.0794796Z * [new branch] gh/jansel/565/orig -> origin/gh/jansel/565/orig 2025-12-04T09:33:40.0796761Z * [new branch] gh/jansel/566/base -> origin/gh/jansel/566/base 2025-12-04T09:33:40.0797859Z * [new branch] gh/jansel/566/head -> origin/gh/jansel/566/head 2025-12-04T09:33:40.0799099Z * [new branch] gh/jansel/566/orig -> origin/gh/jansel/566/orig 2025-12-04T09:33:40.0801036Z * [new branch] gh/jansel/567/base -> origin/gh/jansel/567/base 2025-12-04T09:33:40.0802278Z * [new branch] gh/jansel/567/head -> origin/gh/jansel/567/head 2025-12-04T09:33:40.0803399Z * [new branch] gh/jansel/567/orig -> origin/gh/jansel/567/orig 2025-12-04T09:33:40.0805293Z * [new branch] gh/jansel/568/base -> origin/gh/jansel/568/base 2025-12-04T09:33:40.0806534Z * [new branch] gh/jansel/568/head -> origin/gh/jansel/568/head 2025-12-04T09:33:40.0807792Z * [new branch] gh/jansel/568/orig -> origin/gh/jansel/568/orig 2025-12-04T09:33:40.0809562Z * [new branch] gh/jansel/569/base -> origin/gh/jansel/569/base 2025-12-04T09:33:40.0810731Z * [new branch] gh/jansel/569/head -> origin/gh/jansel/569/head 2025-12-04T09:33:40.0811989Z * [new branch] gh/jansel/569/orig -> origin/gh/jansel/569/orig 2025-12-04T09:33:40.0813749Z * [new branch] gh/jansel/570/base -> origin/gh/jansel/570/base 2025-12-04T09:33:40.0814943Z * [new branch] gh/jansel/570/head -> origin/gh/jansel/570/head 2025-12-04T09:33:40.0816204Z * [new branch] gh/jansel/570/orig -> origin/gh/jansel/570/orig 2025-12-04T09:33:40.0818085Z * [new branch] gh/jansel/571/base -> origin/gh/jansel/571/base 2025-12-04T09:33:40.0819454Z * [new branch] gh/jansel/571/head -> origin/gh/jansel/571/head 2025-12-04T09:33:40.0820698Z * [new branch] gh/jansel/571/orig -> origin/gh/jansel/571/orig 2025-12-04T09:33:40.0822417Z * [new branch] gh/jansel/572/base -> origin/gh/jansel/572/base 2025-12-04T09:33:40.0823605Z * [new branch] gh/jansel/572/head -> origin/gh/jansel/572/head 2025-12-04T09:33:40.0824868Z * [new branch] gh/jansel/572/orig -> origin/gh/jansel/572/orig 2025-12-04T09:33:40.0826694Z * [new branch] gh/jansel/573/base -> origin/gh/jansel/573/base 2025-12-04T09:33:40.0827857Z * [new branch] gh/jansel/573/head -> origin/gh/jansel/573/head 2025-12-04T09:33:40.0829144Z * [new branch] gh/jansel/573/orig -> origin/gh/jansel/573/orig 2025-12-04T09:33:40.0831005Z * [new branch] gh/jansel/574/base -> origin/gh/jansel/574/base 2025-12-04T09:33:40.0832162Z * [new branch] gh/jansel/574/head -> origin/gh/jansel/574/head 2025-12-04T09:33:40.0833452Z * [new branch] gh/jansel/574/orig -> origin/gh/jansel/574/orig 2025-12-04T09:33:40.0835416Z * [new branch] gh/jansel/575/base -> origin/gh/jansel/575/base 2025-12-04T09:33:40.0836681Z * [new branch] gh/jansel/575/head -> origin/gh/jansel/575/head 2025-12-04T09:33:40.0837920Z * [new branch] gh/jansel/575/orig -> origin/gh/jansel/575/orig 2025-12-04T09:33:40.0839804Z * [new branch] gh/jansel/576/base -> origin/gh/jansel/576/base 2025-12-04T09:33:40.0840951Z * [new branch] gh/jansel/576/head -> origin/gh/jansel/576/head 2025-12-04T09:33:40.0842201Z * [new branch] gh/jansel/576/orig -> origin/gh/jansel/576/orig 2025-12-04T09:33:40.0844403Z * [new branch] gh/jbschlosser/247/base -> origin/gh/jbschlosser/247/base 2025-12-04T09:33:40.0845607Z * [new branch] gh/jbschlosser/247/head -> origin/gh/jbschlosser/247/head 2025-12-04T09:33:40.0846891Z * [new branch] gh/jbschlosser/247/orig -> origin/gh/jbschlosser/247/orig 2025-12-04T09:33:40.0848728Z * [new branch] gh/jbschlosser/250/base -> origin/gh/jbschlosser/250/base 2025-12-04T09:33:40.0849830Z * [new branch] gh/jbschlosser/250/head -> origin/gh/jbschlosser/250/head 2025-12-04T09:33:40.0851385Z * [new branch] gh/jbschlosser/250/orig -> origin/gh/jbschlosser/250/orig 2025-12-04T09:33:40.0853896Z * [new branch] gh/jerryzh168/1/base -> origin/gh/jerryzh168/1/base 2025-12-04T09:33:40.0854970Z * [new branch] gh/jerryzh168/1/head -> origin/gh/jerryzh168/1/head 2025-12-04T09:33:40.0856175Z * [new branch] gh/jerryzh168/1/orig -> origin/gh/jerryzh168/1/orig 2025-12-04T09:33:40.0858348Z * [new branch] gh/jiayisunx/59/base -> origin/gh/jiayisunx/59/base 2025-12-04T09:33:40.0859616Z * [new branch] gh/jiayisunx/59/head -> origin/gh/jiayisunx/59/head 2025-12-04T09:33:40.0860886Z * [new branch] gh/jiayisunx/59/orig -> origin/gh/jiayisunx/59/orig 2025-12-04T09:33:40.0862587Z * [new branch] gh/jiayisunx/61/base -> origin/gh/jiayisunx/61/base 2025-12-04T09:33:40.0863742Z * [new branch] gh/jiayisunx/61/head -> origin/gh/jiayisunx/61/head 2025-12-04T09:33:40.0865035Z * [new branch] gh/jiayisunx/61/orig -> origin/gh/jiayisunx/61/orig 2025-12-04T09:33:40.0866935Z * [new branch] gh/jiayisunx/68/base -> origin/gh/jiayisunx/68/base 2025-12-04T09:33:40.0868029Z * [new branch] gh/jiayisunx/68/head -> origin/gh/jiayisunx/68/head 2025-12-04T09:33:40.0869286Z * [new branch] gh/jiayisunx/68/orig -> origin/gh/jiayisunx/68/orig 2025-12-04T09:33:40.0871033Z * [new branch] gh/jiayisunx/77/base -> origin/gh/jiayisunx/77/base 2025-12-04T09:33:40.0872194Z * [new branch] gh/jiayisunx/77/head -> origin/gh/jiayisunx/77/head 2025-12-04T09:33:40.0873433Z * [new branch] gh/jiayisunx/77/orig -> origin/gh/jiayisunx/77/orig 2025-12-04T09:33:40.0875193Z * [new branch] gh/jiayisunx/78/base -> origin/gh/jiayisunx/78/base 2025-12-04T09:33:40.0876375Z * [new branch] gh/jiayisunx/78/head -> origin/gh/jiayisunx/78/head 2025-12-04T09:33:40.0877646Z * [new branch] gh/jiayisunx/78/orig -> origin/gh/jiayisunx/78/orig 2025-12-04T09:33:40.0879441Z * [new branch] gh/jiayisunx/79/base -> origin/gh/jiayisunx/79/base 2025-12-04T09:33:40.0880617Z * [new branch] gh/jiayisunx/79/head -> origin/gh/jiayisunx/79/head 2025-12-04T09:33:40.0881911Z * [new branch] gh/jiayisunx/79/orig -> origin/gh/jiayisunx/79/orig 2025-12-04T09:33:40.0883810Z * [new branch] gh/jiayisunx/82/base -> origin/gh/jiayisunx/82/base 2025-12-04T09:33:40.0884967Z * [new branch] gh/jiayisunx/82/head -> origin/gh/jiayisunx/82/head 2025-12-04T09:33:40.0886221Z * [new branch] gh/jiayisunx/82/orig -> origin/gh/jiayisunx/82/orig 2025-12-04T09:33:40.0888051Z * [new branch] gh/jiayisunx/83/base -> origin/gh/jiayisunx/83/base 2025-12-04T09:33:40.0889291Z * [new branch] gh/jiayisunx/83/head -> origin/gh/jiayisunx/83/head 2025-12-04T09:33:40.0890594Z * [new branch] gh/jiayisunx/83/orig -> origin/gh/jiayisunx/83/orig 2025-12-04T09:33:40.0892260Z * [new branch] gh/jiayisunx/84/base -> origin/gh/jiayisunx/84/base 2025-12-04T09:33:40.0893378Z * [new branch] gh/jiayisunx/84/head -> origin/gh/jiayisunx/84/head 2025-12-04T09:33:40.0894651Z * [new branch] gh/jiayisunx/84/orig -> origin/gh/jiayisunx/84/orig 2025-12-04T09:33:40.0896453Z * [new branch] gh/jiayisunx/85/base -> origin/gh/jiayisunx/85/base 2025-12-04T09:33:40.0897639Z * [new branch] gh/jiayisunx/85/head -> origin/gh/jiayisunx/85/head 2025-12-04T09:33:40.0899195Z * [new branch] gh/jiayisunx/85/orig -> origin/gh/jiayisunx/85/orig 2025-12-04T09:33:40.0900864Z * [new branch] gh/jiayisunx/86/base -> origin/gh/jiayisunx/86/base 2025-12-04T09:33:40.0902672Z * [new branch] gh/jiayisunx/86/head -> origin/gh/jiayisunx/86/head 2025-12-04T09:33:40.0904076Z * [new branch] gh/jiayisunx/86/orig -> origin/gh/jiayisunx/86/orig 2025-12-04T09:33:40.0905743Z * [new branch] gh/jiayisunx/87/base -> origin/gh/jiayisunx/87/base 2025-12-04T09:33:40.0906877Z * [new branch] gh/jiayisunx/87/head -> origin/gh/jiayisunx/87/head 2025-12-04T09:33:40.0908144Z * [new branch] gh/jiayisunx/87/orig -> origin/gh/jiayisunx/87/orig 2025-12-04T09:33:40.0909929Z * [new branch] gh/jiayisunx/88/base -> origin/gh/jiayisunx/88/base 2025-12-04T09:33:40.0911100Z * [new branch] gh/jiayisunx/88/head -> origin/gh/jiayisunx/88/head 2025-12-04T09:33:40.0912339Z * [new branch] gh/jiayisunx/88/orig -> origin/gh/jiayisunx/88/orig 2025-12-04T09:33:40.0914077Z * [new branch] gh/jiayisunx/89/base -> origin/gh/jiayisunx/89/base 2025-12-04T09:33:40.0915231Z * [new branch] gh/jiayisunx/89/head -> origin/gh/jiayisunx/89/head 2025-12-04T09:33:40.0916514Z * [new branch] gh/jiayisunx/89/orig -> origin/gh/jiayisunx/89/orig 2025-12-04T09:33:40.0918300Z * [new branch] gh/jiayisunx/90/base -> origin/gh/jiayisunx/90/base 2025-12-04T09:33:40.0919452Z * [new branch] gh/jiayisunx/90/head -> origin/gh/jiayisunx/90/head 2025-12-04T09:33:40.0920724Z * [new branch] gh/jiayisunx/90/orig -> origin/gh/jiayisunx/90/orig 2025-12-04T09:33:40.0922741Z * [new branch] gh/jjwu@meta.com/1/base -> origin/gh/jjwu@meta.com/1/base 2025-12-04T09:33:40.0923824Z * [new branch] gh/jjwu@meta.com/1/head -> origin/gh/jjwu@meta.com/1/head 2025-12-04T09:33:40.0925837Z * [new branch] gh/jturney/1/base -> origin/gh/jturney/1/base 2025-12-04T09:33:40.0926994Z * [new branch] gh/jturney/1/head -> origin/gh/jturney/1/head 2025-12-04T09:33:40.0928287Z * [new branch] gh/jturney/1/orig -> origin/gh/jturney/1/orig 2025-12-04T09:33:40.0930038Z * [new branch] gh/jturney/2/base -> origin/gh/jturney/2/base 2025-12-04T09:33:40.0931168Z * [new branch] gh/jturney/2/head -> origin/gh/jturney/2/head 2025-12-04T09:33:40.0932447Z * [new branch] gh/jturney/2/orig -> origin/gh/jturney/2/orig 2025-12-04T09:33:40.0934902Z * [new branch] gh/karthickai/10/base -> origin/gh/karthickai/10/base 2025-12-04T09:33:40.0936130Z * [new branch] gh/karthickai/10/head -> origin/gh/karthickai/10/head 2025-12-04T09:33:40.0937403Z * [new branch] gh/karthickai/10/orig -> origin/gh/karthickai/10/orig 2025-12-04T09:33:40.0939363Z * [new branch] gh/karthickai/11/base -> origin/gh/karthickai/11/base 2025-12-04T09:33:40.0940594Z * [new branch] gh/karthickai/11/head -> origin/gh/karthickai/11/head 2025-12-04T09:33:40.0941953Z * [new branch] gh/karthickai/11/orig -> origin/gh/karthickai/11/orig 2025-12-04T09:33:40.0944182Z * [new branch] gh/karthickai/12/base -> origin/gh/karthickai/12/base 2025-12-04T09:33:40.0945421Z * [new branch] gh/karthickai/12/head -> origin/gh/karthickai/12/head 2025-12-04T09:33:40.0946754Z * [new branch] gh/karthickai/12/orig -> origin/gh/karthickai/12/orig 2025-12-04T09:33:40.0948568Z * [new branch] gh/karthickai/13/base -> origin/gh/karthickai/13/base 2025-12-04T09:33:40.0949837Z * [new branch] gh/karthickai/13/head -> origin/gh/karthickai/13/head 2025-12-04T09:33:40.0951179Z * [new branch] gh/karthickai/13/orig -> origin/gh/karthickai/13/orig 2025-12-04T09:33:40.0953265Z * [new branch] gh/karthickai/14/base -> origin/gh/karthickai/14/base 2025-12-04T09:33:40.0954575Z * [new branch] gh/karthickai/14/head -> origin/gh/karthickai/14/head 2025-12-04T09:33:40.0955935Z * [new branch] gh/karthickai/14/orig -> origin/gh/karthickai/14/orig 2025-12-04T09:33:40.0957909Z * [new branch] gh/karthickai/15/base -> origin/gh/karthickai/15/base 2025-12-04T09:33:40.0959113Z * [new branch] gh/karthickai/15/head -> origin/gh/karthickai/15/head 2025-12-04T09:33:40.0960373Z * [new branch] gh/karthickai/15/orig -> origin/gh/karthickai/15/orig 2025-12-04T09:33:40.0962098Z * [new branch] gh/karthickai/16/base -> origin/gh/karthickai/16/base 2025-12-04T09:33:40.0963432Z * [new branch] gh/karthickai/16/head -> origin/gh/karthickai/16/head 2025-12-04T09:33:40.0965345Z * [new branch] gh/karthickai/16/orig -> origin/gh/karthickai/16/orig 2025-12-04T09:33:40.0966947Z * [new branch] gh/karthickai/17/base -> origin/gh/karthickai/17/base 2025-12-04T09:33:40.0968092Z * [new branch] gh/karthickai/17/head -> origin/gh/karthickai/17/head 2025-12-04T09:33:40.0969418Z * [new branch] gh/karthickai/17/orig -> origin/gh/karthickai/17/orig 2025-12-04T09:33:40.0971335Z * [new branch] gh/karthickai/18/base -> origin/gh/karthickai/18/base 2025-12-04T09:33:40.0972851Z * [new branch] gh/karthickai/18/head -> origin/gh/karthickai/18/head 2025-12-04T09:33:40.0974184Z * [new branch] gh/karthickai/18/orig -> origin/gh/karthickai/18/orig 2025-12-04T09:33:40.0976112Z * [new branch] gh/karthickai/19/base -> origin/gh/karthickai/19/base 2025-12-04T09:33:40.0977270Z * [new branch] gh/karthickai/19/head -> origin/gh/karthickai/19/head 2025-12-04T09:33:40.0978641Z * [new branch] gh/karthickai/19/orig -> origin/gh/karthickai/19/orig 2025-12-04T09:33:40.0981322Z * [new branch] gh/karthickai/20/base -> origin/gh/karthickai/20/base 2025-12-04T09:33:40.0983272Z * [new branch] gh/karthickai/20/head -> origin/gh/karthickai/20/head 2025-12-04T09:33:40.0984463Z * [new branch] gh/karthickai/20/orig -> origin/gh/karthickai/20/orig 2025-12-04T09:33:40.0986343Z * [new branch] gh/karthickai/21/base -> origin/gh/karthickai/21/base 2025-12-04T09:33:40.0987903Z * [new branch] gh/karthickai/21/head -> origin/gh/karthickai/21/head 2025-12-04T09:33:40.0992100Z * [new branch] gh/karthickai/21/orig -> origin/gh/karthickai/21/orig 2025-12-04T09:33:40.0993941Z * [new branch] gh/karthickai/22/base -> origin/gh/karthickai/22/base 2025-12-04T09:33:40.0995049Z * [new branch] gh/karthickai/22/head -> origin/gh/karthickai/22/head 2025-12-04T09:33:40.0996268Z * [new branch] gh/karthickai/22/orig -> origin/gh/karthickai/22/orig 2025-12-04T09:33:40.0998312Z * [new branch] gh/karthickai/23/base -> origin/gh/karthickai/23/base 2025-12-04T09:33:40.0999634Z * [new branch] gh/karthickai/23/head -> origin/gh/karthickai/23/head 2025-12-04T09:33:40.1000898Z * [new branch] gh/karthickai/23/orig -> origin/gh/karthickai/23/orig 2025-12-04T09:33:40.1002727Z * [new branch] gh/karthickai/24/base -> origin/gh/karthickai/24/base 2025-12-04T09:33:40.1003966Z * [new branch] gh/karthickai/24/head -> origin/gh/karthickai/24/head 2025-12-04T09:33:40.1005209Z * [new branch] gh/karthickai/24/orig -> origin/gh/karthickai/24/orig 2025-12-04T09:33:40.1007544Z * [new branch] gh/karthickai/25/base -> origin/gh/karthickai/25/base 2025-12-04T09:33:40.1008817Z * [new branch] gh/karthickai/25/head -> origin/gh/karthickai/25/head 2025-12-04T09:33:40.1010331Z * [new branch] gh/karthickai/25/orig -> origin/gh/karthickai/25/orig 2025-12-04T09:33:40.1011946Z * [new branch] gh/karthickai/26/base -> origin/gh/karthickai/26/base 2025-12-04T09:33:40.1013363Z * [new branch] gh/karthickai/26/head -> origin/gh/karthickai/26/head 2025-12-04T09:33:40.1014538Z * [new branch] gh/karthickai/26/orig -> origin/gh/karthickai/26/orig 2025-12-04T09:33:40.1018018Z * [new branch] gh/karthickai/6/base -> origin/gh/karthickai/6/base 2025-12-04T09:33:40.1020599Z * [new branch] gh/karthickai/6/head -> origin/gh/karthickai/6/head 2025-12-04T09:33:40.1021815Z * [new branch] gh/karthickai/6/orig -> origin/gh/karthickai/6/orig 2025-12-04T09:33:40.1023919Z * [new branch] gh/krocki/1/base -> origin/gh/krocki/1/base 2025-12-04T09:33:40.1025099Z * [new branch] gh/krocki/1/head -> origin/gh/krocki/1/head 2025-12-04T09:33:40.1026367Z * [new branch] gh/krocki/1/orig -> origin/gh/krocki/1/orig 2025-12-04T09:33:40.1028164Z * [new branch] gh/krocki/2/base -> origin/gh/krocki/2/base 2025-12-04T09:33:40.1029553Z * [new branch] gh/krocki/2/head -> origin/gh/krocki/2/head 2025-12-04T09:33:40.1030735Z * [new branch] gh/krocki/2/orig -> origin/gh/krocki/2/orig 2025-12-04T09:33:40.1032878Z * [new branch] gh/kurtamohler/60/base -> origin/gh/kurtamohler/60/base 2025-12-04T09:33:40.1034059Z * [new branch] gh/kurtamohler/60/head -> origin/gh/kurtamohler/60/head 2025-12-04T09:33:40.1035362Z * [new branch] gh/kurtamohler/60/orig -> origin/gh/kurtamohler/60/orig 2025-12-04T09:33:40.1037171Z * [new branch] gh/kurtamohler/61/base -> origin/gh/kurtamohler/61/base 2025-12-04T09:33:40.1038328Z * [new branch] gh/kurtamohler/61/head -> origin/gh/kurtamohler/61/head 2025-12-04T09:33:40.1039637Z * [new branch] gh/kurtamohler/61/orig -> origin/gh/kurtamohler/61/orig 2025-12-04T09:33:40.1041404Z * [new branch] gh/kurtamohler/62/base -> origin/gh/kurtamohler/62/base 2025-12-04T09:33:40.1042547Z * [new branch] gh/kurtamohler/62/head -> origin/gh/kurtamohler/62/head 2025-12-04T09:33:40.1043811Z * [new branch] gh/kurtamohler/62/orig -> origin/gh/kurtamohler/62/orig 2025-12-04T09:33:40.1045716Z * [new branch] gh/kurtamohler/63/base -> origin/gh/kurtamohler/63/base 2025-12-04T09:33:40.1046889Z * [new branch] gh/kurtamohler/63/head -> origin/gh/kurtamohler/63/head 2025-12-04T09:33:40.1048134Z * [new branch] gh/kurtamohler/63/orig -> origin/gh/kurtamohler/63/orig 2025-12-04T09:33:40.1049966Z * [new branch] gh/kurtamohler/64/base -> origin/gh/kurtamohler/64/base 2025-12-04T09:33:40.1051127Z * [new branch] gh/kurtamohler/64/head -> origin/gh/kurtamohler/64/head 2025-12-04T09:33:40.1052440Z * [new branch] gh/kurtamohler/64/orig -> origin/gh/kurtamohler/64/orig 2025-12-04T09:33:40.1054309Z * [new branch] gh/kurtamohler/65/base -> origin/gh/kurtamohler/65/base 2025-12-04T09:33:40.1055491Z * [new branch] gh/kurtamohler/65/head -> origin/gh/kurtamohler/65/head 2025-12-04T09:33:40.1056772Z * [new branch] gh/kurtamohler/65/orig -> origin/gh/kurtamohler/65/orig 2025-12-04T09:33:40.1058615Z * [new branch] gh/kurtamohler/66/base -> origin/gh/kurtamohler/66/base 2025-12-04T09:33:40.1059962Z * [new branch] gh/kurtamohler/66/head -> origin/gh/kurtamohler/66/head 2025-12-04T09:33:40.1061157Z * [new branch] gh/kurtamohler/66/orig -> origin/gh/kurtamohler/66/orig 2025-12-04T09:33:40.1063017Z * [new branch] gh/kurtamohler/67/base -> origin/gh/kurtamohler/67/base 2025-12-04T09:33:40.1064168Z * [new branch] gh/kurtamohler/67/head -> origin/gh/kurtamohler/67/head 2025-12-04T09:33:40.1065493Z * [new branch] gh/kurtamohler/67/orig -> origin/gh/kurtamohler/67/orig 2025-12-04T09:33:40.1067583Z * [new branch] gh/kwen2501/130/base -> origin/gh/kwen2501/130/base 2025-12-04T09:33:40.1068906Z * [new branch] gh/kwen2501/130/head -> origin/gh/kwen2501/130/head 2025-12-04T09:33:40.1070375Z * [new branch] gh/kwen2501/130/orig -> origin/gh/kwen2501/130/orig 2025-12-04T09:33:40.1072092Z * [new branch] gh/kwen2501/170/base -> origin/gh/kwen2501/170/base 2025-12-04T09:33:40.1073228Z * [new branch] gh/kwen2501/170/head -> origin/gh/kwen2501/170/head 2025-12-04T09:33:40.1075113Z * [new branch] gh/kwen2501/187/base -> origin/gh/kwen2501/187/base 2025-12-04T09:33:40.1076313Z * [new branch] gh/kwen2501/187/head -> origin/gh/kwen2501/187/head 2025-12-04T09:33:40.1077625Z * [new branch] gh/kwen2501/187/orig -> origin/gh/kwen2501/187/orig 2025-12-04T09:33:40.1079488Z * [new branch] gh/kwen2501/188/base -> origin/gh/kwen2501/188/base 2025-12-04T09:33:40.1080642Z * [new branch] gh/kwen2501/188/head -> origin/gh/kwen2501/188/head 2025-12-04T09:33:40.1081882Z * [new branch] gh/kwen2501/188/orig -> origin/gh/kwen2501/188/orig 2025-12-04T09:33:40.1083662Z * [new branch] gh/kwen2501/211/base -> origin/gh/kwen2501/211/base 2025-12-04T09:33:40.1084813Z * [new branch] gh/kwen2501/211/head -> origin/gh/kwen2501/211/head 2025-12-04T09:33:40.1086728Z * [new branch] gh/kwen2501/224/base -> origin/gh/kwen2501/224/base 2025-12-04T09:33:40.1087992Z * [new branch] gh/kwen2501/224/head -> origin/gh/kwen2501/224/head 2025-12-04T09:33:40.1089430Z * [new branch] gh/kwen2501/224/orig -> origin/gh/kwen2501/224/orig 2025-12-04T09:33:40.1091117Z * [new branch] gh/kwen2501/228/base -> origin/gh/kwen2501/228/base 2025-12-04T09:33:40.1092269Z * [new branch] gh/kwen2501/228/head -> origin/gh/kwen2501/228/head 2025-12-04T09:33:40.1093538Z * [new branch] gh/kwen2501/228/orig -> origin/gh/kwen2501/228/orig 2025-12-04T09:33:40.1095442Z * [new branch] gh/kwen2501/234/base -> origin/gh/kwen2501/234/base 2025-12-04T09:33:40.1096746Z * [new branch] gh/kwen2501/234/head -> origin/gh/kwen2501/234/head 2025-12-04T09:33:40.1098001Z * [new branch] gh/kwen2501/234/orig -> origin/gh/kwen2501/234/orig 2025-12-04T09:33:40.1099927Z * [new branch] gh/kwen2501/235/base -> origin/gh/kwen2501/235/base 2025-12-04T09:33:40.1101080Z * [new branch] gh/kwen2501/235/head -> origin/gh/kwen2501/235/head 2025-12-04T09:33:40.1102350Z * [new branch] gh/kwen2501/235/orig -> origin/gh/kwen2501/235/orig 2025-12-04T09:33:40.1104068Z * [new branch] gh/kwen2501/236/base -> origin/gh/kwen2501/236/base 2025-12-04T09:33:40.1105261Z * [new branch] gh/kwen2501/236/head -> origin/gh/kwen2501/236/head 2025-12-04T09:33:40.1106478Z * [new branch] gh/kwen2501/236/orig -> origin/gh/kwen2501/236/orig 2025-12-04T09:33:40.1108230Z * [new branch] gh/kwen2501/237/base -> origin/gh/kwen2501/237/base 2025-12-04T09:33:40.1109556Z * [new branch] gh/kwen2501/237/head -> origin/gh/kwen2501/237/head 2025-12-04T09:33:40.1110754Z * [new branch] gh/kwen2501/237/orig -> origin/gh/kwen2501/237/orig 2025-12-04T09:33:40.1112534Z * [new branch] gh/kwen2501/238/base -> origin/gh/kwen2501/238/base 2025-12-04T09:33:40.1113745Z * [new branch] gh/kwen2501/238/head -> origin/gh/kwen2501/238/head 2025-12-04T09:33:40.1115215Z * [new branch] gh/kwen2501/238/orig -> origin/gh/kwen2501/238/orig 2025-12-04T09:33:40.1117367Z * [new branch] gh/kwen2501/240/base -> origin/gh/kwen2501/240/base 2025-12-04T09:33:40.1118393Z * [new branch] gh/kwen2501/240/head -> origin/gh/kwen2501/240/head 2025-12-04T09:33:40.1119705Z * [new branch] gh/kwen2501/240/orig -> origin/gh/kwen2501/240/orig 2025-12-04T09:33:40.1121546Z * [new branch] gh/kwen2501/241/base -> origin/gh/kwen2501/241/base 2025-12-04T09:33:40.1122638Z * [new branch] gh/kwen2501/241/head -> origin/gh/kwen2501/241/head 2025-12-04T09:33:40.1123885Z * [new branch] gh/kwen2501/241/orig -> origin/gh/kwen2501/241/orig 2025-12-04T09:33:40.1125643Z * [new branch] gh/kwen2501/247/base -> origin/gh/kwen2501/247/base 2025-12-04T09:33:40.1127251Z * [new branch] gh/kwen2501/247/head -> origin/gh/kwen2501/247/head 2025-12-04T09:33:40.1128017Z * [new branch] gh/kwen2501/247/orig -> origin/gh/kwen2501/247/orig 2025-12-04T09:33:40.1129742Z * [new branch] gh/kwen2501/252/base -> origin/gh/kwen2501/252/base 2025-12-04T09:33:40.1131007Z * [new branch] gh/kwen2501/252/head -> origin/gh/kwen2501/252/head 2025-12-04T09:33:40.1132274Z * [new branch] gh/kwen2501/252/orig -> origin/gh/kwen2501/252/orig 2025-12-04T09:33:40.1134558Z * [new branch] gh/kwen2501/259/base -> origin/gh/kwen2501/259/base 2025-12-04T09:33:40.1135871Z * [new branch] gh/kwen2501/259/head -> origin/gh/kwen2501/259/head 2025-12-04T09:33:40.1137157Z * [new branch] gh/kwen2501/259/orig -> origin/gh/kwen2501/259/orig 2025-12-04T09:33:40.1139232Z * [new branch] gh/kwen2501/260/base -> origin/gh/kwen2501/260/base 2025-12-04T09:33:40.1140520Z * [new branch] gh/kwen2501/260/head -> origin/gh/kwen2501/260/head 2025-12-04T09:33:40.1141832Z * [new branch] gh/kwen2501/260/orig -> origin/gh/kwen2501/260/orig 2025-12-04T09:33:40.1143646Z * [new branch] gh/kwen2501/268/base -> origin/gh/kwen2501/268/base 2025-12-04T09:33:40.1144827Z * [new branch] gh/kwen2501/268/head -> origin/gh/kwen2501/268/head 2025-12-04T09:33:40.1146102Z * [new branch] gh/kwen2501/268/orig -> origin/gh/kwen2501/268/orig 2025-12-04T09:33:40.1147919Z * [new branch] gh/kwen2501/269/base -> origin/gh/kwen2501/269/base 2025-12-04T09:33:40.1149232Z * [new branch] gh/kwen2501/269/head -> origin/gh/kwen2501/269/head 2025-12-04T09:33:40.1150521Z * [new branch] gh/kwen2501/269/orig -> origin/gh/kwen2501/269/orig 2025-12-04T09:33:40.1152402Z * [new branch] gh/kwen2501/270/base -> origin/gh/kwen2501/270/base 2025-12-04T09:33:40.1153691Z * [new branch] gh/kwen2501/270/head -> origin/gh/kwen2501/270/head 2025-12-04T09:33:40.1154924Z * [new branch] gh/kwen2501/270/orig -> origin/gh/kwen2501/270/orig 2025-12-04T09:33:40.1156796Z * [new branch] gh/kwen2501/271/base -> origin/gh/kwen2501/271/base 2025-12-04T09:33:40.1158017Z * [new branch] gh/kwen2501/271/head -> origin/gh/kwen2501/271/head 2025-12-04T09:33:40.1159323Z * [new branch] gh/kwen2501/271/orig -> origin/gh/kwen2501/271/orig 2025-12-04T09:33:40.1161216Z * [new branch] gh/kwen2501/274/base -> origin/gh/kwen2501/274/base 2025-12-04T09:33:40.1162659Z * [new branch] gh/kwen2501/274/head -> origin/gh/kwen2501/274/head 2025-12-04T09:33:40.1163803Z * [new branch] gh/kwen2501/274/orig -> origin/gh/kwen2501/274/orig 2025-12-04T09:33:40.1165745Z * [new branch] gh/kwen2501/275/base -> origin/gh/kwen2501/275/base 2025-12-04T09:33:40.1167339Z * [new branch] gh/kwen2501/275/head -> origin/gh/kwen2501/275/head 2025-12-04T09:33:40.1168574Z * [new branch] gh/kwen2501/275/orig -> origin/gh/kwen2501/275/orig 2025-12-04T09:33:40.1170283Z * [new branch] gh/kwen2501/276/base -> origin/gh/kwen2501/276/base 2025-12-04T09:33:40.1171458Z * [new branch] gh/kwen2501/276/head -> origin/gh/kwen2501/276/head 2025-12-04T09:33:40.1172733Z * [new branch] gh/kwen2501/276/orig -> origin/gh/kwen2501/276/orig 2025-12-04T09:33:40.1174499Z * [new branch] gh/kwen2501/277/base -> origin/gh/kwen2501/277/base 2025-12-04T09:33:40.1176252Z * [new branch] gh/kwen2501/277/head -> origin/gh/kwen2501/277/head 2025-12-04T09:33:40.1177431Z * [new branch] gh/kwen2501/277/orig -> origin/gh/kwen2501/277/orig 2025-12-04T09:33:40.1179535Z * [new branch] gh/kwen2501/278/base -> origin/gh/kwen2501/278/base 2025-12-04T09:33:40.1180592Z * [new branch] gh/kwen2501/278/head -> origin/gh/kwen2501/278/head 2025-12-04T09:33:40.1182055Z * [new branch] gh/kwen2501/278/orig -> origin/gh/kwen2501/278/orig 2025-12-04T09:33:40.1183841Z * [new branch] gh/kwen2501/279/base -> origin/gh/kwen2501/279/base 2025-12-04T09:33:40.1185351Z * [new branch] gh/kwen2501/279/head -> origin/gh/kwen2501/279/head 2025-12-04T09:33:40.1187257Z * [new branch] gh/kwen2501/279/orig -> origin/gh/kwen2501/279/orig 2025-12-04T09:33:40.1189250Z * [new branch] gh/kwen2501/280/base -> origin/gh/kwen2501/280/base 2025-12-04T09:33:40.1190439Z * [new branch] gh/kwen2501/280/head -> origin/gh/kwen2501/280/head 2025-12-04T09:33:40.1191765Z * [new branch] gh/kwen2501/280/orig -> origin/gh/kwen2501/280/orig 2025-12-04T09:33:40.1193630Z * [new branch] gh/kwen2501/281/base -> origin/gh/kwen2501/281/base 2025-12-04T09:33:40.1194746Z * [new branch] gh/kwen2501/281/head -> origin/gh/kwen2501/281/head 2025-12-04T09:33:40.1196611Z * [new branch] gh/kwen2501/281/orig -> origin/gh/kwen2501/281/orig 2025-12-04T09:33:40.1198409Z * [new branch] gh/kwen2501/282/base -> origin/gh/kwen2501/282/base 2025-12-04T09:33:40.1199660Z * [new branch] gh/kwen2501/282/head -> origin/gh/kwen2501/282/head 2025-12-04T09:33:40.1200922Z * [new branch] gh/kwen2501/282/orig -> origin/gh/kwen2501/282/orig 2025-12-04T09:33:40.1202732Z * [new branch] gh/kwen2501/283/base -> origin/gh/kwen2501/283/base 2025-12-04T09:33:40.1204219Z * [new branch] gh/kwen2501/283/head -> origin/gh/kwen2501/283/head 2025-12-04T09:33:40.1205395Z * [new branch] gh/kwen2501/283/orig -> origin/gh/kwen2501/283/orig 2025-12-04T09:33:40.1207199Z * [new branch] gh/kwen2501/284/base -> origin/gh/kwen2501/284/base 2025-12-04T09:33:40.1208434Z * [new branch] gh/kwen2501/284/head -> origin/gh/kwen2501/284/head 2025-12-04T09:33:40.1209752Z * [new branch] gh/kwen2501/284/orig -> origin/gh/kwen2501/284/orig 2025-12-04T09:33:40.1211613Z * [new branch] gh/kwen2501/285/base -> origin/gh/kwen2501/285/base 2025-12-04T09:33:40.1212798Z * [new branch] gh/kwen2501/285/head -> origin/gh/kwen2501/285/head 2025-12-04T09:33:40.1214064Z * [new branch] gh/kwen2501/285/orig -> origin/gh/kwen2501/285/orig 2025-12-04T09:33:40.1215916Z * [new branch] gh/kwen2501/286/base -> origin/gh/kwen2501/286/base 2025-12-04T09:33:40.1217136Z * [new branch] gh/kwen2501/286/head -> origin/gh/kwen2501/286/head 2025-12-04T09:33:40.1218486Z * [new branch] gh/kwen2501/286/orig -> origin/gh/kwen2501/286/orig 2025-12-04T09:33:40.1220799Z * [new branch] gh/kwen2501/287/base -> origin/gh/kwen2501/287/base 2025-12-04T09:33:40.1222206Z * [new branch] gh/kwen2501/287/head -> origin/gh/kwen2501/287/head 2025-12-04T09:33:40.1223366Z * [new branch] gh/kwen2501/287/orig -> origin/gh/kwen2501/287/orig 2025-12-04T09:33:40.1226143Z * [new branch] gh/kwen2501/288/base -> origin/gh/kwen2501/288/base 2025-12-04T09:33:40.1227438Z * [new branch] gh/kwen2501/288/head -> origin/gh/kwen2501/288/head 2025-12-04T09:33:40.1228763Z * [new branch] gh/kwen2501/288/orig -> origin/gh/kwen2501/288/orig 2025-12-04T09:33:40.1230885Z * [new branch] gh/laithsakka/251/base -> origin/gh/laithsakka/251/base 2025-12-04T09:33:40.1232066Z * [new branch] gh/laithsakka/251/head -> origin/gh/laithsakka/251/head 2025-12-04T09:33:40.1233347Z * [new branch] gh/laithsakka/251/orig -> origin/gh/laithsakka/251/orig 2025-12-04T09:33:40.1235072Z * [new branch] gh/laithsakka/276/base -> origin/gh/laithsakka/276/base 2025-12-04T09:33:40.1236194Z * [new branch] gh/laithsakka/276/head -> origin/gh/laithsakka/276/head 2025-12-04T09:33:40.1237432Z * [new branch] gh/laithsakka/276/orig -> origin/gh/laithsakka/276/orig 2025-12-04T09:33:40.1239414Z * [new branch] gh/laithsakka/28/base -> origin/gh/laithsakka/28/base 2025-12-04T09:33:40.1240795Z * [new branch] gh/laithsakka/29/base -> origin/gh/laithsakka/29/base 2025-12-04T09:33:40.1242387Z * [new branch] gh/laithsakka/30/base -> origin/gh/laithsakka/30/base 2025-12-04T09:33:40.1243559Z * [new branch] gh/laithsakka/30/head -> origin/gh/laithsakka/30/head 2025-12-04T09:33:40.1245164Z * [new branch] gh/laithsakka/31/base -> origin/gh/laithsakka/31/base 2025-12-04T09:33:40.1246252Z * [new branch] gh/laithsakka/31/head -> origin/gh/laithsakka/31/head 2025-12-04T09:33:40.1248219Z * [new branch] gh/laithsakka/313/base -> origin/gh/laithsakka/313/base 2025-12-04T09:33:40.1249948Z * [new branch] gh/laithsakka/313/head -> origin/gh/laithsakka/313/head 2025-12-04T09:33:40.1251165Z * [new branch] gh/laithsakka/313/orig -> origin/gh/laithsakka/313/orig 2025-12-04T09:33:40.1253661Z * [new branch] gh/laithsakka/316/base -> origin/gh/laithsakka/316/base 2025-12-04T09:33:40.1254767Z * [new branch] gh/laithsakka/316/head -> origin/gh/laithsakka/316/head 2025-12-04T09:33:40.1256348Z * [new branch] gh/laithsakka/316/orig -> origin/gh/laithsakka/316/orig 2025-12-04T09:33:40.1258097Z * [new branch] gh/laithsakka/317/base -> origin/gh/laithsakka/317/base 2025-12-04T09:33:40.1297736Z * [new branch] gh/laithsakka/317/head -> origin/gh/laithsakka/317/head 2025-12-04T09:33:40.1298607Z * [new branch] gh/laithsakka/317/orig -> origin/gh/laithsakka/317/orig 2025-12-04T09:33:40.1299313Z * [new branch] gh/laithsakka/319/base -> origin/gh/laithsakka/319/base 2025-12-04T09:33:40.1300000Z * [new branch] gh/laithsakka/319/head -> origin/gh/laithsakka/319/head 2025-12-04T09:33:40.1300677Z * [new branch] gh/laithsakka/319/orig -> origin/gh/laithsakka/319/orig 2025-12-04T09:33:40.1301363Z * [new branch] gh/laithsakka/32/base -> origin/gh/laithsakka/32/base 2025-12-04T09:33:40.1302023Z * [new branch] gh/laithsakka/32/head -> origin/gh/laithsakka/32/head 2025-12-04T09:33:40.1302714Z * [new branch] gh/laithsakka/320/base -> origin/gh/laithsakka/320/base 2025-12-04T09:33:40.1303386Z * [new branch] gh/laithsakka/320/head -> origin/gh/laithsakka/320/head 2025-12-04T09:33:40.1304067Z * [new branch] gh/laithsakka/320/orig -> origin/gh/laithsakka/320/orig 2025-12-04T09:33:40.1304729Z * [new branch] gh/laithsakka/321/base -> origin/gh/laithsakka/321/base 2025-12-04T09:33:40.1305573Z * [new branch] gh/laithsakka/321/head -> origin/gh/laithsakka/321/head 2025-12-04T09:33:40.1306255Z * [new branch] gh/laithsakka/321/orig -> origin/gh/laithsakka/321/orig 2025-12-04T09:33:40.1306934Z * [new branch] gh/laithsakka/322/base -> origin/gh/laithsakka/322/base 2025-12-04T09:33:40.1307606Z * [new branch] gh/laithsakka/322/head -> origin/gh/laithsakka/322/head 2025-12-04T09:33:40.1308287Z * [new branch] gh/laithsakka/322/orig -> origin/gh/laithsakka/322/orig 2025-12-04T09:33:40.1308967Z * [new branch] gh/laithsakka/323/base -> origin/gh/laithsakka/323/base 2025-12-04T09:33:40.1309633Z * [new branch] gh/laithsakka/323/head -> origin/gh/laithsakka/323/head 2025-12-04T09:33:40.1310315Z * [new branch] gh/laithsakka/323/orig -> origin/gh/laithsakka/323/orig 2025-12-04T09:33:40.1310994Z * [new branch] gh/laithsakka/324/base -> origin/gh/laithsakka/324/base 2025-12-04T09:33:40.1311677Z * [new branch] gh/laithsakka/324/head -> origin/gh/laithsakka/324/head 2025-12-04T09:33:40.1312342Z * [new branch] gh/laithsakka/324/orig -> origin/gh/laithsakka/324/orig 2025-12-04T09:33:40.1313016Z * [new branch] gh/laithsakka/325/base -> origin/gh/laithsakka/325/base 2025-12-04T09:33:40.1313773Z * [new branch] gh/laithsakka/325/head -> origin/gh/laithsakka/325/head 2025-12-04T09:33:40.1314447Z * [new branch] gh/laithsakka/325/orig -> origin/gh/laithsakka/325/orig 2025-12-04T09:33:40.1315108Z * [new branch] gh/laithsakka/326/base -> origin/gh/laithsakka/326/base 2025-12-04T09:33:40.1315784Z * [new branch] gh/laithsakka/326/head -> origin/gh/laithsakka/326/head 2025-12-04T09:33:40.1316458Z * [new branch] gh/laithsakka/326/orig -> origin/gh/laithsakka/326/orig 2025-12-04T09:33:40.1317138Z * [new branch] gh/laithsakka/327/base -> origin/gh/laithsakka/327/base 2025-12-04T09:33:40.1317800Z * [new branch] gh/laithsakka/327/head -> origin/gh/laithsakka/327/head 2025-12-04T09:33:40.1318481Z * [new branch] gh/laithsakka/327/orig -> origin/gh/laithsakka/327/orig 2025-12-04T09:33:40.1319162Z * [new branch] gh/laithsakka/328/base -> origin/gh/laithsakka/328/base 2025-12-04T09:33:40.1319835Z * [new branch] gh/laithsakka/328/head -> origin/gh/laithsakka/328/head 2025-12-04T09:33:40.1320499Z * [new branch] gh/laithsakka/328/orig -> origin/gh/laithsakka/328/orig 2025-12-04T09:33:40.1321162Z * [new branch] gh/liangel/4/base -> origin/gh/liangel/4/base 2025-12-04T09:33:40.1321789Z * [new branch] gh/liangel/4/head -> origin/gh/liangel/4/head 2025-12-04T09:33:40.1322398Z * [new branch] gh/liangel/4/orig -> origin/gh/liangel/4/orig 2025-12-04T09:33:40.1323055Z * [new branch] gh/lucaskabela/1/base -> origin/gh/lucaskabela/1/base 2025-12-04T09:33:40.1323732Z * [new branch] gh/lucaskabela/1/head -> origin/gh/lucaskabela/1/head 2025-12-04T09:33:40.1324364Z * [new branch] gh/lw/4/base -> origin/gh/lw/4/base 2025-12-04T09:33:40.1324929Z * [new branch] gh/lw/4/head -> origin/gh/lw/4/head 2025-12-04T09:33:40.1325498Z * [new branch] gh/lw/4/orig -> origin/gh/lw/4/orig 2025-12-04T09:33:40.1326065Z * [new branch] gh/lw/5/base -> origin/gh/lw/5/base 2025-12-04T09:33:40.1326625Z * [new branch] gh/lw/5/head -> origin/gh/lw/5/head 2025-12-04T09:33:40.1327230Z * [new branch] gh/lw/5/orig -> origin/gh/lw/5/orig 2025-12-04T09:33:40.1328990Z * [new branch] gh/lw/6/base -> origin/gh/lw/6/base 2025-12-04T09:33:40.1330245Z * [new branch] gh/lw/6/head -> origin/gh/lw/6/head 2025-12-04T09:33:40.1331383Z * [new branch] gh/lw/6/orig -> origin/gh/lw/6/orig 2025-12-04T09:33:40.1333450Z * [new branch] gh/malfet/14/base -> origin/gh/malfet/14/base 2025-12-04T09:33:40.1335101Z * [new branch] gh/malfet/417/base -> origin/gh/malfet/417/base 2025-12-04T09:33:40.1336319Z * [new branch] gh/malfet/417/head -> origin/gh/malfet/417/head 2025-12-04T09:33:40.1337575Z * [new branch] gh/malfet/417/orig -> origin/gh/malfet/417/orig 2025-12-04T09:33:40.1339442Z * [new branch] gh/malfet/506/base -> origin/gh/malfet/506/base 2025-12-04T09:33:40.1340662Z * [new branch] gh/malfet/506/head -> origin/gh/malfet/506/head 2025-12-04T09:33:40.1342131Z * [new branch] gh/malfet/506/orig -> origin/gh/malfet/506/orig 2025-12-04T09:33:40.1343839Z * [new branch] gh/malfet/517/base -> origin/gh/malfet/517/base 2025-12-04T09:33:40.1345022Z * [new branch] gh/malfet/517/head -> origin/gh/malfet/517/head 2025-12-04T09:33:40.1346758Z * [new branch] gh/malfet/528/base -> origin/gh/malfet/528/base 2025-12-04T09:33:40.1347985Z * [new branch] gh/malfet/528/head -> origin/gh/malfet/528/head 2025-12-04T09:33:40.1349214Z * [new branch] gh/malfet/528/orig -> origin/gh/malfet/528/orig 2025-12-04T09:33:40.1350962Z * [new branch] gh/malfet/537/base -> origin/gh/malfet/537/base 2025-12-04T09:33:40.1352122Z * [new branch] gh/malfet/537/head -> origin/gh/malfet/537/head 2025-12-04T09:33:40.1353428Z * [new branch] gh/malfet/537/orig -> origin/gh/malfet/537/orig 2025-12-04T09:33:40.1355557Z * [new branch] gh/malfet/546/base -> origin/gh/malfet/546/base 2025-12-04T09:33:40.1356715Z * [new branch] gh/malfet/546/head -> origin/gh/malfet/546/head 2025-12-04T09:33:40.1357954Z * [new branch] gh/malfet/546/orig -> origin/gh/malfet/546/orig 2025-12-04T09:33:40.1359796Z * [new branch] gh/malfet/565/base -> origin/gh/malfet/565/base 2025-12-04T09:33:40.1360976Z * [new branch] gh/malfet/565/head -> origin/gh/malfet/565/head 2025-12-04T09:33:40.1362248Z * [new branch] gh/malfet/565/orig -> origin/gh/malfet/565/orig 2025-12-04T09:33:40.1364091Z * [new branch] gh/malfet/575/base -> origin/gh/malfet/575/base 2025-12-04T09:33:40.1365247Z * [new branch] gh/malfet/575/head -> origin/gh/malfet/575/head 2025-12-04T09:33:40.1366861Z * [new branch] gh/malfet/575/orig -> origin/gh/malfet/575/orig 2025-12-04T09:33:40.1368563Z * [new branch] gh/malfet/580/base -> origin/gh/malfet/580/base 2025-12-04T09:33:40.1369766Z * [new branch] gh/malfet/580/head -> origin/gh/malfet/580/head 2025-12-04T09:33:40.1371063Z * [new branch] gh/malfet/580/orig -> origin/gh/malfet/580/orig 2025-12-04T09:33:40.1372785Z * [new branch] gh/malfet/581/base -> origin/gh/malfet/581/base 2025-12-04T09:33:40.1373945Z * [new branch] gh/malfet/581/head -> origin/gh/malfet/581/head 2025-12-04T09:33:40.1375199Z * [new branch] gh/malfet/581/orig -> origin/gh/malfet/581/orig 2025-12-04T09:33:40.1376941Z * [new branch] gh/malfet/583/base -> origin/gh/malfet/583/base 2025-12-04T09:33:40.1378108Z * [new branch] gh/malfet/583/head -> origin/gh/malfet/583/head 2025-12-04T09:33:40.1379594Z * [new branch] gh/malfet/583/orig -> origin/gh/malfet/583/orig 2025-12-04T09:33:40.1381236Z * [new branch] gh/malfet/586/base -> origin/gh/malfet/586/base 2025-12-04T09:33:40.1382546Z * [new branch] gh/malfet/586/head -> origin/gh/malfet/586/head 2025-12-04T09:33:40.1383675Z * [new branch] gh/malfet/586/orig -> origin/gh/malfet/586/orig 2025-12-04T09:33:40.1385450Z * [new branch] gh/malfet/587/base -> origin/gh/malfet/587/base 2025-12-04T09:33:40.1386663Z * [new branch] gh/malfet/587/head -> origin/gh/malfet/587/head 2025-12-04T09:33:40.1388199Z * [new branch] gh/malfet/587/orig -> origin/gh/malfet/587/orig 2025-12-04T09:33:40.1389941Z * [new branch] gh/malfet/588/base -> origin/gh/malfet/588/base 2025-12-04T09:33:40.1391035Z * [new branch] gh/malfet/588/head -> origin/gh/malfet/588/head 2025-12-04T09:33:40.1392494Z * [new branch] gh/malfet/588/orig -> origin/gh/malfet/588/orig 2025-12-04T09:33:40.1394342Z * [new branch] gh/malfet/589/base -> origin/gh/malfet/589/base 2025-12-04T09:33:40.1395497Z * [new branch] gh/malfet/589/head -> origin/gh/malfet/589/head 2025-12-04T09:33:40.1396935Z * [new branch] gh/malfet/589/orig -> origin/gh/malfet/589/orig 2025-12-04T09:33:40.1398539Z * [new branch] gh/malfet/590/base -> origin/gh/malfet/590/base 2025-12-04T09:33:40.1399739Z * [new branch] gh/malfet/590/head -> origin/gh/malfet/590/head 2025-12-04T09:33:40.1401596Z * [new branch] gh/malfet/590/orig -> origin/gh/malfet/590/orig 2025-12-04T09:33:40.1403724Z * [new branch] gh/malfet/591/base -> origin/gh/malfet/591/base 2025-12-04T09:33:40.1404959Z * [new branch] gh/malfet/591/head -> origin/gh/malfet/591/head 2025-12-04T09:33:40.1406261Z * [new branch] gh/malfet/591/orig -> origin/gh/malfet/591/orig 2025-12-04T09:33:40.1408076Z * [new branch] gh/malfet/592/base -> origin/gh/malfet/592/base 2025-12-04T09:33:40.1409263Z * [new branch] gh/malfet/592/head -> origin/gh/malfet/592/head 2025-12-04T09:33:40.1410525Z * [new branch] gh/malfet/592/orig -> origin/gh/malfet/592/orig 2025-12-04T09:33:40.1412408Z * [new branch] gh/malfet/593/base -> origin/gh/malfet/593/base 2025-12-04T09:33:40.1413600Z * [new branch] gh/malfet/593/head -> origin/gh/malfet/593/head 2025-12-04T09:33:40.1414982Z * [new branch] gh/malfet/593/orig -> origin/gh/malfet/593/orig 2025-12-04T09:33:40.1416714Z * [new branch] gh/malfet/594/base -> origin/gh/malfet/594/base 2025-12-04T09:33:40.1417906Z * [new branch] gh/malfet/594/head -> origin/gh/malfet/594/head 2025-12-04T09:33:40.1419400Z * [new branch] gh/malfet/594/orig -> origin/gh/malfet/594/orig 2025-12-04T09:33:40.1420968Z * [new branch] gh/malfet/595/base -> origin/gh/malfet/595/base 2025-12-04T09:33:40.1422166Z * [new branch] gh/malfet/595/head -> origin/gh/malfet/595/head 2025-12-04T09:33:40.1423404Z * [new branch] gh/malfet/595/orig -> origin/gh/malfet/595/orig 2025-12-04T09:33:40.1425178Z * [new branch] gh/malfet/596/base -> origin/gh/malfet/596/base 2025-12-04T09:33:40.1426345Z * [new branch] gh/malfet/596/head -> origin/gh/malfet/596/head 2025-12-04T09:33:40.1427597Z * [new branch] gh/malfet/596/orig -> origin/gh/malfet/596/orig 2025-12-04T09:33:40.1429506Z * [new branch] gh/malfet/597/base -> origin/gh/malfet/597/base 2025-12-04T09:33:40.1430702Z * [new branch] gh/malfet/597/head -> origin/gh/malfet/597/head 2025-12-04T09:33:40.1432019Z * [new branch] gh/malfet/597/orig -> origin/gh/malfet/597/orig 2025-12-04T09:33:40.1433780Z * [new branch] gh/malfet/598/base -> origin/gh/malfet/598/base 2025-12-04T09:33:40.1435052Z * [new branch] gh/malfet/598/head -> origin/gh/malfet/598/head 2025-12-04T09:33:40.1436243Z * [new branch] gh/malfet/598/orig -> origin/gh/malfet/598/orig 2025-12-04T09:33:40.1438036Z * [new branch] gh/malfet/599/base -> origin/gh/malfet/599/base 2025-12-04T09:33:40.1439248Z * [new branch] gh/malfet/599/head -> origin/gh/malfet/599/head 2025-12-04T09:33:40.1440564Z * [new branch] gh/malfet/599/orig -> origin/gh/malfet/599/orig 2025-12-04T09:33:40.1442324Z * [new branch] gh/malfet/600/base -> origin/gh/malfet/600/base 2025-12-04T09:33:40.1443489Z * [new branch] gh/malfet/600/head -> origin/gh/malfet/600/head 2025-12-04T09:33:40.1444704Z * [new branch] gh/malfet/600/orig -> origin/gh/malfet/600/orig 2025-12-04T09:33:40.1446729Z * [new branch] gh/malfet/601/base -> origin/gh/malfet/601/base 2025-12-04T09:33:40.1448064Z * [new branch] gh/malfet/601/head -> origin/gh/malfet/601/head 2025-12-04T09:33:40.1449312Z * [new branch] gh/malfet/601/orig -> origin/gh/malfet/601/orig 2025-12-04T09:33:40.1451093Z * [new branch] gh/malfet/602/base -> origin/gh/malfet/602/base 2025-12-04T09:33:40.1452290Z * [new branch] gh/malfet/602/head -> origin/gh/malfet/602/head 2025-12-04T09:33:40.1453540Z * [new branch] gh/malfet/602/orig -> origin/gh/malfet/602/orig 2025-12-04T09:33:40.1455156Z * [new branch] gh/malfet/603/base -> origin/gh/malfet/603/base 2025-12-04T09:33:40.1456327Z * [new branch] gh/malfet/603/head -> origin/gh/malfet/603/head 2025-12-04T09:33:40.1457595Z * [new branch] gh/malfet/603/orig -> origin/gh/malfet/603/orig 2025-12-04T09:33:40.1459387Z * [new branch] gh/malfet/604/base -> origin/gh/malfet/604/base 2025-12-04T09:33:40.1460635Z * [new branch] gh/malfet/604/head -> origin/gh/malfet/604/head 2025-12-04T09:33:40.1461876Z * [new branch] gh/malfet/604/orig -> origin/gh/malfet/604/orig 2025-12-04T09:33:40.1463732Z * [new branch] gh/malfet/605/base -> origin/gh/malfet/605/base 2025-12-04T09:33:40.1464990Z * [new branch] gh/malfet/605/head -> origin/gh/malfet/605/head 2025-12-04T09:33:40.1466381Z * [new branch] gh/malfet/605/orig -> origin/gh/malfet/605/orig 2025-12-04T09:33:40.1468121Z * [new branch] gh/malfet/606/base -> origin/gh/malfet/606/base 2025-12-04T09:33:40.1469491Z * [new branch] gh/malfet/606/head -> origin/gh/malfet/606/head 2025-12-04T09:33:40.1470805Z * [new branch] gh/malfet/606/orig -> origin/gh/malfet/606/orig 2025-12-04T09:33:40.1472543Z * [new branch] gh/malfet/607/base -> origin/gh/malfet/607/base 2025-12-04T09:33:40.1473755Z * [new branch] gh/malfet/607/head -> origin/gh/malfet/607/head 2025-12-04T09:33:40.1475122Z * [new branch] gh/malfet/607/orig -> origin/gh/malfet/607/orig 2025-12-04T09:33:40.1476812Z * [new branch] gh/malfet/608/base -> origin/gh/malfet/608/base 2025-12-04T09:33:40.1478098Z * [new branch] gh/malfet/608/head -> origin/gh/malfet/608/head 2025-12-04T09:33:40.1479314Z * [new branch] gh/malfet/608/orig -> origin/gh/malfet/608/orig 2025-12-04T09:33:40.1481219Z * [new branch] gh/malfet/609/base -> origin/gh/malfet/609/base 2025-12-04T09:33:40.1482462Z * [new branch] gh/malfet/609/head -> origin/gh/malfet/609/head 2025-12-04T09:33:40.1483802Z * [new branch] gh/malfet/609/orig -> origin/gh/malfet/609/orig 2025-12-04T09:33:40.1485598Z * [new branch] gh/malfet/610/base -> origin/gh/malfet/610/base 2025-12-04T09:33:40.1486819Z * [new branch] gh/malfet/610/head -> origin/gh/malfet/610/head 2025-12-04T09:33:40.1488193Z * [new branch] gh/malfet/610/orig -> origin/gh/malfet/610/orig 2025-12-04T09:33:40.1494264Z * [new branch] gh/malfet/611/base -> origin/gh/malfet/611/base 2025-12-04T09:33:40.1495567Z * [new branch] gh/malfet/611/head -> origin/gh/malfet/611/head 2025-12-04T09:33:40.1496906Z * [new branch] gh/malfet/611/orig -> origin/gh/malfet/611/orig 2025-12-04T09:33:40.1498528Z * [new branch] gh/malfet/612/base -> origin/gh/malfet/612/base 2025-12-04T09:33:40.1499950Z * [new branch] gh/malfet/612/head -> origin/gh/malfet/612/head 2025-12-04T09:33:40.1501253Z * [new branch] gh/malfet/612/orig -> origin/gh/malfet/612/orig 2025-12-04T09:33:40.1503142Z * [new branch] gh/malfet/64/base -> origin/gh/malfet/64/base 2025-12-04T09:33:40.1504408Z * [new branch] gh/malfet/64/head -> origin/gh/malfet/64/head 2025-12-04T09:33:40.1506535Z * [new branch] gh/manuelcandales/11/base -> origin/gh/manuelcandales/11/base 2025-12-04T09:33:40.1507752Z * [new branch] gh/manuelcandales/11/head -> origin/gh/manuelcandales/11/head 2025-12-04T09:33:40.1508997Z * [new branch] gh/manuelcandales/11/orig -> origin/gh/manuelcandales/11/orig 2025-12-04T09:33:40.1511773Z * [new branch] gh/markkm/1/base -> origin/gh/markkm/1/base 2025-12-04T09:33:40.1513863Z * [new branch] gh/masnesral/1/base -> origin/gh/masnesral/1/base 2025-12-04T09:33:40.1515129Z * [new branch] gh/masnesral/1/head -> origin/gh/masnesral/1/head 2025-12-04T09:33:40.1516374Z * [new branch] gh/masnesral/1/orig -> origin/gh/masnesral/1/orig 2025-12-04T09:33:40.1518453Z * [new branch] gh/mhorowitz/0/base -> origin/gh/mhorowitz/0/base 2025-12-04T09:33:40.1519777Z * [new branch] gh/mhorowitz/0/head -> origin/gh/mhorowitz/0/head 2025-12-04T09:33:40.1521281Z * [new branch] gh/mhorowitz/1/base -> origin/gh/mhorowitz/1/base 2025-12-04T09:33:40.1523039Z * [new branch] gh/mhorowitz/1/head -> origin/gh/mhorowitz/1/head 2025-12-04T09:33:40.1524580Z * [new branch] gh/mhorowitz/2/base -> origin/gh/mhorowitz/2/base 2025-12-04T09:33:40.1525909Z * [new branch] gh/mhorowitz/2/head -> origin/gh/mhorowitz/2/head 2025-12-04T09:33:40.1527416Z * [new branch] gh/mhorowitz/3/base -> origin/gh/mhorowitz/3/base 2025-12-04T09:33:40.1528650Z * [new branch] gh/mhorowitz/3/head -> origin/gh/mhorowitz/3/head 2025-12-04T09:33:40.1530251Z * [new branch] gh/mhorowitz/4/base -> origin/gh/mhorowitz/4/base 2025-12-04T09:33:40.1531475Z * [new branch] gh/mhorowitz/4/head -> origin/gh/mhorowitz/4/head 2025-12-04T09:33:40.1533123Z * [new branch] gh/mhorowitz/5/base -> origin/gh/mhorowitz/5/base 2025-12-04T09:33:40.1534340Z * [new branch] gh/mhorowitz/5/head -> origin/gh/mhorowitz/5/head 2025-12-04T09:33:40.1535890Z * [new branch] gh/mhorowitz/6/base -> origin/gh/mhorowitz/6/base 2025-12-04T09:33:40.1537021Z * [new branch] gh/mhorowitz/6/head -> origin/gh/mhorowitz/6/head 2025-12-04T09:33:40.1539335Z * [new branch] gh/mikaylagawarecki/234/base -> origin/gh/mikaylagawarecki/234/base 2025-12-04T09:33:40.1540583Z * [new branch] gh/mikaylagawarecki/234/head -> origin/gh/mikaylagawarecki/234/head 2025-12-04T09:33:40.1542615Z * [new branch] gh/mikaylagawarecki/235/base -> origin/gh/mikaylagawarecki/235/base 2025-12-04T09:33:40.1543954Z * [new branch] gh/mikaylagawarecki/235/head -> origin/gh/mikaylagawarecki/235/head 2025-12-04T09:33:40.1545511Z * [new branch] gh/mikaylagawarecki/236/base -> origin/gh/mikaylagawarecki/236/base 2025-12-04T09:33:40.1546715Z * [new branch] gh/mikaylagawarecki/236/head -> origin/gh/mikaylagawarecki/236/head 2025-12-04T09:33:40.1549582Z * [new branch] gh/mikaylagawarecki/237/base -> origin/gh/mikaylagawarecki/237/base 2025-12-04T09:33:40.1550814Z * [new branch] gh/mikaylagawarecki/237/head -> origin/gh/mikaylagawarecki/237/head 2025-12-04T09:33:40.1552507Z * [new branch] gh/mikaylagawarecki/238/base -> origin/gh/mikaylagawarecki/238/base 2025-12-04T09:33:40.1553779Z * [new branch] gh/mikaylagawarecki/238/head -> origin/gh/mikaylagawarecki/238/head 2025-12-04T09:33:40.1555514Z * [new branch] gh/mikaylagawarecki/336/base -> origin/gh/mikaylagawarecki/336/base 2025-12-04T09:33:40.1556796Z * [new branch] gh/mikaylagawarecki/336/head -> origin/gh/mikaylagawarecki/336/head 2025-12-04T09:33:40.1558098Z * [new branch] gh/mikaylagawarecki/336/orig -> origin/gh/mikaylagawarecki/336/orig 2025-12-04T09:33:40.1560481Z * [new branch] gh/mikaylagawarecki/341/base -> origin/gh/mikaylagawarecki/341/base 2025-12-04T09:33:40.1561726Z * [new branch] gh/mikaylagawarecki/341/head -> origin/gh/mikaylagawarecki/341/head 2025-12-04T09:33:40.1563037Z * [new branch] gh/mikaylagawarecki/341/orig -> origin/gh/mikaylagawarecki/341/orig 2025-12-04T09:33:40.1564932Z * [new branch] gh/mikaylagawarecki/342/base -> origin/gh/mikaylagawarecki/342/base 2025-12-04T09:33:40.1566340Z * [new branch] gh/mikaylagawarecki/342/head -> origin/gh/mikaylagawarecki/342/head 2025-12-04T09:33:40.1567678Z * [new branch] gh/mikaylagawarecki/342/orig -> origin/gh/mikaylagawarecki/342/orig 2025-12-04T09:33:40.1569454Z * [new branch] gh/mikaylagawarecki/345/base -> origin/gh/mikaylagawarecki/345/base 2025-12-04T09:33:40.1570688Z * [new branch] gh/mikaylagawarecki/345/head -> origin/gh/mikaylagawarecki/345/head 2025-12-04T09:33:40.1572009Z * [new branch] gh/mikaylagawarecki/345/orig -> origin/gh/mikaylagawarecki/345/orig 2025-12-04T09:33:40.1573922Z * [new branch] gh/mikaylagawarecki/346/base -> origin/gh/mikaylagawarecki/346/base 2025-12-04T09:33:40.1575137Z * [new branch] gh/mikaylagawarecki/346/head -> origin/gh/mikaylagawarecki/346/head 2025-12-04T09:33:40.1576490Z * [new branch] gh/mikaylagawarecki/346/orig -> origin/gh/mikaylagawarecki/346/orig 2025-12-04T09:33:40.1578333Z * [new branch] gh/mikaylagawarecki/347/base -> origin/gh/mikaylagawarecki/347/base 2025-12-04T09:33:40.1579671Z * [new branch] gh/mikaylagawarecki/347/head -> origin/gh/mikaylagawarecki/347/head 2025-12-04T09:33:40.1580861Z * [new branch] gh/mikaylagawarecki/347/orig -> origin/gh/mikaylagawarecki/347/orig 2025-12-04T09:33:40.1583180Z * [new branch] gh/mikaylagawarecki/350/base -> origin/gh/mikaylagawarecki/350/base 2025-12-04T09:33:40.1584573Z * [new branch] gh/mikaylagawarecki/350/head -> origin/gh/mikaylagawarecki/350/head 2025-12-04T09:33:40.1585871Z * [new branch] gh/mikaylagawarecki/350/orig -> origin/gh/mikaylagawarecki/350/orig 2025-12-04T09:33:40.1588173Z * [new branch] gh/mikaylagawarecki/351/base -> origin/gh/mikaylagawarecki/351/base 2025-12-04T09:33:40.1589577Z * [new branch] gh/mikaylagawarecki/351/head -> origin/gh/mikaylagawarecki/351/head 2025-12-04T09:33:40.1590862Z * [new branch] gh/mikaylagawarecki/351/orig -> origin/gh/mikaylagawarecki/351/orig 2025-12-04T09:33:40.1592740Z * [new branch] gh/mikaylagawarecki/352/base -> origin/gh/mikaylagawarecki/352/base 2025-12-04T09:33:40.1594178Z * [new branch] gh/mikaylagawarecki/352/head -> origin/gh/mikaylagawarecki/352/head 2025-12-04T09:33:40.1595585Z * [new branch] gh/mikaylagawarecki/352/orig -> origin/gh/mikaylagawarecki/352/orig 2025-12-04T09:33:40.1597394Z * [new branch] gh/mikaylagawarecki/353/base -> origin/gh/mikaylagawarecki/353/base 2025-12-04T09:33:40.1599434Z * [new branch] gh/mikaylagawarecki/353/head -> origin/gh/mikaylagawarecki/353/head 2025-12-04T09:33:40.1600755Z * [new branch] gh/mikaylagawarecki/353/orig -> origin/gh/mikaylagawarecki/353/orig 2025-12-04T09:33:40.1602366Z * [new branch] gh/mikaylagawarecki/354/base -> origin/gh/mikaylagawarecki/354/base 2025-12-04T09:33:40.1603770Z * [new branch] gh/mikaylagawarecki/354/head -> origin/gh/mikaylagawarecki/354/head 2025-12-04T09:33:40.1605048Z * [new branch] gh/mikaylagawarecki/354/orig -> origin/gh/mikaylagawarecki/354/orig 2025-12-04T09:33:40.1607208Z * [new branch] gh/mikaylagawarecki/356/base -> origin/gh/mikaylagawarecki/356/base 2025-12-04T09:33:40.1608542Z * [new branch] gh/mikaylagawarecki/356/head -> origin/gh/mikaylagawarecki/356/head 2025-12-04T09:33:40.1609921Z * [new branch] gh/mikaylagawarecki/356/orig -> origin/gh/mikaylagawarecki/356/orig 2025-12-04T09:33:40.1611485Z * [new branch] gh/mikaylagawarecki/357/base -> origin/gh/mikaylagawarecki/357/base 2025-12-04T09:33:40.1612876Z * [new branch] gh/mikaylagawarecki/357/head -> origin/gh/mikaylagawarecki/357/head 2025-12-04T09:33:40.1614141Z * [new branch] gh/mikaylagawarecki/357/orig -> origin/gh/mikaylagawarecki/357/orig 2025-12-04T09:33:40.1616044Z * [new branch] gh/mikaylagawarecki/359/base -> origin/gh/mikaylagawarecki/359/base 2025-12-04T09:33:40.1617360Z * [new branch] gh/mikaylagawarecki/359/head -> origin/gh/mikaylagawarecki/359/head 2025-12-04T09:33:40.1618778Z * [new branch] gh/mikaylagawarecki/359/orig -> origin/gh/mikaylagawarecki/359/orig 2025-12-04T09:33:40.1621005Z * [new branch] gh/mikaylagawarecki/360/base -> origin/gh/mikaylagawarecki/360/base 2025-12-04T09:33:40.1622470Z * [new branch] gh/mikaylagawarecki/360/head -> origin/gh/mikaylagawarecki/360/head 2025-12-04T09:33:40.1623774Z * [new branch] gh/mikaylagawarecki/360/orig -> origin/gh/mikaylagawarecki/360/orig 2025-12-04T09:33:40.1625528Z * [new branch] gh/mikaylagawarecki/361/base -> origin/gh/mikaylagawarecki/361/base 2025-12-04T09:33:40.1626915Z * [new branch] gh/mikaylagawarecki/361/head -> origin/gh/mikaylagawarecki/361/head 2025-12-04T09:33:40.1628176Z * [new branch] gh/mikaylagawarecki/361/orig -> origin/gh/mikaylagawarecki/361/orig 2025-12-04T09:33:40.1629957Z * [new branch] gh/mikaylagawarecki/362/base -> origin/gh/mikaylagawarecki/362/base 2025-12-04T09:33:40.1631491Z * [new branch] gh/mikaylagawarecki/362/head -> origin/gh/mikaylagawarecki/362/head 2025-12-04T09:33:40.1632767Z * [new branch] gh/mikaylagawarecki/362/orig -> origin/gh/mikaylagawarecki/362/orig 2025-12-04T09:33:40.1634870Z * [new branch] gh/mikaylagawarecki/363/base -> origin/gh/mikaylagawarecki/363/base 2025-12-04T09:33:40.1636256Z * [new branch] gh/mikaylagawarecki/363/head -> origin/gh/mikaylagawarecki/363/head 2025-12-04T09:33:40.1637543Z * [new branch] gh/mikaylagawarecki/363/orig -> origin/gh/mikaylagawarecki/363/orig 2025-12-04T09:33:40.1639820Z * [new branch] gh/mikaylagawarecki/364/base -> origin/gh/mikaylagawarecki/364/base 2025-12-04T09:33:40.1641218Z * [new branch] gh/mikaylagawarecki/364/head -> origin/gh/mikaylagawarecki/364/head 2025-12-04T09:33:40.1642573Z * [new branch] gh/mikaylagawarecki/364/orig -> origin/gh/mikaylagawarecki/364/orig 2025-12-04T09:33:40.1644506Z * [new branch] gh/mikaylagawarecki/365/base -> origin/gh/mikaylagawarecki/365/base 2025-12-04T09:33:40.1645871Z * [new branch] gh/mikaylagawarecki/365/head -> origin/gh/mikaylagawarecki/365/head 2025-12-04T09:33:40.1647252Z * [new branch] gh/mikaylagawarecki/365/orig -> origin/gh/mikaylagawarecki/365/orig 2025-12-04T09:33:40.1649082Z * [new branch] gh/mikaylagawarecki/366/base -> origin/gh/mikaylagawarecki/366/base 2025-12-04T09:33:40.1650299Z * [new branch] gh/mikaylagawarecki/366/head -> origin/gh/mikaylagawarecki/366/head 2025-12-04T09:33:40.1651655Z * [new branch] gh/mikaylagawarecki/366/orig -> origin/gh/mikaylagawarecki/366/orig 2025-12-04T09:33:40.1653428Z * [new branch] gh/mikaylagawarecki/367/base -> origin/gh/mikaylagawarecki/367/base 2025-12-04T09:33:40.1654730Z * [new branch] gh/mikaylagawarecki/367/head -> origin/gh/mikaylagawarecki/367/head 2025-12-04T09:33:40.1656013Z * [new branch] gh/mikaylagawarecki/367/orig -> origin/gh/mikaylagawarecki/367/orig 2025-12-04T09:33:40.1658420Z * [new branch] gh/mikaylagawarecki/368/base -> origin/gh/mikaylagawarecki/368/base 2025-12-04T09:33:40.1659900Z * [new branch] gh/mikaylagawarecki/368/head -> origin/gh/mikaylagawarecki/368/head 2025-12-04T09:33:40.1661251Z * [new branch] gh/mikaylagawarecki/368/orig -> origin/gh/mikaylagawarecki/368/orig 2025-12-04T09:33:40.1662996Z * [new branch] gh/mikaylagawarecki/369/base -> origin/gh/mikaylagawarecki/369/base 2025-12-04T09:33:40.1664399Z * [new branch] gh/mikaylagawarecki/369/head -> origin/gh/mikaylagawarecki/369/head 2025-12-04T09:33:40.1665580Z * [new branch] gh/mikaylagawarecki/369/orig -> origin/gh/mikaylagawarecki/369/orig 2025-12-04T09:33:40.1667448Z * [new branch] gh/mikaylagawarecki/370/base -> origin/gh/mikaylagawarecki/370/base 2025-12-04T09:33:40.1668728Z * [new branch] gh/mikaylagawarecki/370/head -> origin/gh/mikaylagawarecki/370/head 2025-12-04T09:33:40.1670022Z * [new branch] gh/mikaylagawarecki/370/orig -> origin/gh/mikaylagawarecki/370/orig 2025-12-04T09:33:40.1671842Z * [new branch] gh/mikaylagawarecki/371/base -> origin/gh/mikaylagawarecki/371/base 2025-12-04T09:33:40.1673071Z * [new branch] gh/mikaylagawarecki/371/head -> origin/gh/mikaylagawarecki/371/head 2025-12-04T09:33:40.1674349Z * [new branch] gh/mikaylagawarecki/371/orig -> origin/gh/mikaylagawarecki/371/orig 2025-12-04T09:33:40.1676107Z * [new branch] gh/mikaylagawarecki/372/base -> origin/gh/mikaylagawarecki/372/base 2025-12-04T09:33:40.1677510Z * [new branch] gh/mikaylagawarecki/372/head -> origin/gh/mikaylagawarecki/372/head 2025-12-04T09:33:40.1678747Z * [new branch] gh/mikaylagawarecki/372/orig -> origin/gh/mikaylagawarecki/372/orig 2025-12-04T09:33:40.1680848Z * [new branch] gh/mikaylagawarecki/373/base -> origin/gh/mikaylagawarecki/373/base 2025-12-04T09:33:40.1682139Z * [new branch] gh/mikaylagawarecki/373/head -> origin/gh/mikaylagawarecki/373/head 2025-12-04T09:33:40.1683435Z * [new branch] gh/mikaylagawarecki/373/orig -> origin/gh/mikaylagawarecki/373/orig 2025-12-04T09:33:40.1685210Z * [new branch] gh/mikaylagawarecki/374/base -> origin/gh/mikaylagawarecki/374/base 2025-12-04T09:33:40.1686475Z * [new branch] gh/mikaylagawarecki/374/head -> origin/gh/mikaylagawarecki/374/head 2025-12-04T09:33:40.1687898Z * [new branch] gh/mikaylagawarecki/374/orig -> origin/gh/mikaylagawarecki/374/orig 2025-12-04T09:33:40.1689738Z * [new branch] gh/mikaylagawarecki/375/base -> origin/gh/mikaylagawarecki/375/base 2025-12-04T09:33:40.1691120Z * [new branch] gh/mikaylagawarecki/375/head -> origin/gh/mikaylagawarecki/375/head 2025-12-04T09:33:40.1692471Z * [new branch] gh/mikaylagawarecki/375/orig -> origin/gh/mikaylagawarecki/375/orig 2025-12-04T09:33:40.1694314Z * [new branch] gh/mikaylagawarecki/376/base -> origin/gh/mikaylagawarecki/376/base 2025-12-04T09:33:40.1695874Z * [new branch] gh/mikaylagawarecki/376/head -> origin/gh/mikaylagawarecki/376/head 2025-12-04T09:33:40.1697047Z * [new branch] gh/mikaylagawarecki/376/orig -> origin/gh/mikaylagawarecki/376/orig 2025-12-04T09:33:40.1698900Z * [new branch] gh/mikaylagawarecki/377/base -> origin/gh/mikaylagawarecki/377/base 2025-12-04T09:33:40.1700293Z * [new branch] gh/mikaylagawarecki/377/head -> origin/gh/mikaylagawarecki/377/head 2025-12-04T09:33:40.1701592Z * [new branch] gh/mikaylagawarecki/377/orig -> origin/gh/mikaylagawarecki/377/orig 2025-12-04T09:33:40.1703303Z * [new branch] gh/mikaylagawarecki/378/base -> origin/gh/mikaylagawarecki/378/base 2025-12-04T09:33:40.1704646Z * [new branch] gh/mikaylagawarecki/378/head -> origin/gh/mikaylagawarecki/378/head 2025-12-04T09:33:40.1705936Z * [new branch] gh/mikaylagawarecki/378/orig -> origin/gh/mikaylagawarecki/378/orig 2025-12-04T09:33:40.1707629Z * [new branch] gh/mikaylagawarecki/379/base -> origin/gh/mikaylagawarecki/379/base 2025-12-04T09:33:40.1708969Z * [new branch] gh/mikaylagawarecki/379/head -> origin/gh/mikaylagawarecki/379/head 2025-12-04T09:33:40.1710236Z * [new branch] gh/mikaylagawarecki/379/orig -> origin/gh/mikaylagawarecki/379/orig 2025-12-04T09:33:40.1711813Z * [new branch] gh/mikaylagawarecki/380/base -> origin/gh/mikaylagawarecki/380/base 2025-12-04T09:33:40.1713139Z * [new branch] gh/mikaylagawarecki/380/head -> origin/gh/mikaylagawarecki/380/head 2025-12-04T09:33:40.1714382Z * [new branch] gh/mikaylagawarecki/380/orig -> origin/gh/mikaylagawarecki/380/orig 2025-12-04T09:33:40.1716507Z * [new branch] gh/mikaylagawarecki/381/base -> origin/gh/mikaylagawarecki/381/base 2025-12-04T09:33:40.1717757Z * [new branch] gh/mikaylagawarecki/381/head -> origin/gh/mikaylagawarecki/381/head 2025-12-04T09:33:40.1719026Z * [new branch] gh/mikaylagawarecki/381/orig -> origin/gh/mikaylagawarecki/381/orig 2025-12-04T09:33:40.1720565Z * [new branch] gh/mikaylagawarecki/382/base -> origin/gh/mikaylagawarecki/382/base 2025-12-04T09:33:40.1721874Z * [new branch] gh/mikaylagawarecki/382/head -> origin/gh/mikaylagawarecki/382/head 2025-12-04T09:33:40.1723103Z * [new branch] gh/mikaylagawarecki/382/orig -> origin/gh/mikaylagawarecki/382/orig 2025-12-04T09:33:40.1724923Z * [new branch] gh/mikaylagawarecki/383/base -> origin/gh/mikaylagawarecki/383/base 2025-12-04T09:33:40.1726291Z * [new branch] gh/mikaylagawarecki/383/head -> origin/gh/mikaylagawarecki/383/head 2025-12-04T09:33:40.1727565Z * [new branch] gh/mikaylagawarecki/383/orig -> origin/gh/mikaylagawarecki/383/orig 2025-12-04T09:33:40.1729294Z * [new branch] gh/mikaylagawarecki/384/base -> origin/gh/mikaylagawarecki/384/base 2025-12-04T09:33:40.1730678Z * [new branch] gh/mikaylagawarecki/384/head -> origin/gh/mikaylagawarecki/384/head 2025-12-04T09:33:40.1731970Z * [new branch] gh/mikaylagawarecki/384/orig -> origin/gh/mikaylagawarecki/384/orig 2025-12-04T09:33:40.1733681Z * [new branch] gh/mikaylagawarecki/385/base -> origin/gh/mikaylagawarecki/385/base 2025-12-04T09:33:40.1734995Z * [new branch] gh/mikaylagawarecki/385/head -> origin/gh/mikaylagawarecki/385/head 2025-12-04T09:33:40.1736350Z * [new branch] gh/mikaylagawarecki/385/orig -> origin/gh/mikaylagawarecki/385/orig 2025-12-04T09:33:40.1738338Z * [new branch] gh/mikaylagawarecki/386/base -> origin/gh/mikaylagawarecki/386/base 2025-12-04T09:33:40.1739693Z * [new branch] gh/mikaylagawarecki/386/head -> origin/gh/mikaylagawarecki/386/head 2025-12-04T09:33:40.1740983Z * [new branch] gh/mikaylagawarecki/386/orig -> origin/gh/mikaylagawarecki/386/orig 2025-12-04T09:33:40.1742900Z * [new branch] gh/mikaylagawarecki/387/base -> origin/gh/mikaylagawarecki/387/base 2025-12-04T09:33:40.1744082Z * [new branch] gh/mikaylagawarecki/387/head -> origin/gh/mikaylagawarecki/387/head 2025-12-04T09:33:40.1745335Z * [new branch] gh/mikaylagawarecki/387/orig -> origin/gh/mikaylagawarecki/387/orig 2025-12-04T09:33:40.1746899Z * [new branch] gh/mikaylagawarecki/388/base -> origin/gh/mikaylagawarecki/388/base 2025-12-04T09:33:40.1748308Z * [new branch] gh/mikaylagawarecki/388/head -> origin/gh/mikaylagawarecki/388/head 2025-12-04T09:33:40.1749641Z * [new branch] gh/mikaylagawarecki/388/orig -> origin/gh/mikaylagawarecki/388/orig 2025-12-04T09:33:40.1751362Z * [new branch] gh/mikaylagawarecki/389/base -> origin/gh/mikaylagawarecki/389/base 2025-12-04T09:33:40.1752657Z * [new branch] gh/mikaylagawarecki/389/head -> origin/gh/mikaylagawarecki/389/head 2025-12-04T09:33:40.1753892Z * [new branch] gh/mikaylagawarecki/389/orig -> origin/gh/mikaylagawarecki/389/orig 2025-12-04T09:33:40.1755745Z * [new branch] gh/mikaylagawarecki/390/base -> origin/gh/mikaylagawarecki/390/base 2025-12-04T09:33:40.1756902Z * [new branch] gh/mikaylagawarecki/390/head -> origin/gh/mikaylagawarecki/390/head 2025-12-04T09:33:40.1758166Z * [new branch] gh/mikaylagawarecki/390/orig -> origin/gh/mikaylagawarecki/390/orig 2025-12-04T09:33:40.1760142Z * [new branch] gh/mikaylagawarecki/391/base -> origin/gh/mikaylagawarecki/391/base 2025-12-04T09:33:40.1761492Z * [new branch] gh/mikaylagawarecki/391/head -> origin/gh/mikaylagawarecki/391/head 2025-12-04T09:33:40.1762735Z * [new branch] gh/mikaylagawarecki/391/orig -> origin/gh/mikaylagawarecki/391/orig 2025-12-04T09:33:40.1764599Z * [new branch] gh/mikaylagawarecki/392/base -> origin/gh/mikaylagawarecki/392/base 2025-12-04T09:33:40.1765972Z * [new branch] gh/mikaylagawarecki/392/head -> origin/gh/mikaylagawarecki/392/head 2025-12-04T09:33:40.1767280Z * [new branch] gh/mikaylagawarecki/392/orig -> origin/gh/mikaylagawarecki/392/orig 2025-12-04T09:33:40.1769200Z * [new branch] gh/mlazos/41/base -> origin/gh/mlazos/41/base 2025-12-04T09:33:40.1770489Z * [new branch] gh/mlazos/41/head -> origin/gh/mlazos/41/head 2025-12-04T09:33:40.1771731Z * [new branch] gh/mlazos/41/orig -> origin/gh/mlazos/41/orig 2025-12-04T09:33:40.1773492Z * [new branch] gh/mlazos/42/base -> origin/gh/mlazos/42/base 2025-12-04T09:33:40.1774740Z * [new branch] gh/mlazos/42/head -> origin/gh/mlazos/42/head 2025-12-04T09:33:40.1776004Z * [new branch] gh/mlazos/42/orig -> origin/gh/mlazos/42/orig 2025-12-04T09:33:40.1777587Z * [new branch] gh/mlazos/43/base -> origin/gh/mlazos/43/base 2025-12-04T09:33:40.1779307Z * [new branch] gh/mlazos/43/head -> origin/gh/mlazos/43/head 2025-12-04T09:33:40.1780553Z * [new branch] gh/mlazos/43/orig -> origin/gh/mlazos/43/orig 2025-12-04T09:33:40.1782157Z * [new branch] gh/mlazos/44/base -> origin/gh/mlazos/44/base 2025-12-04T09:33:40.1783425Z * [new branch] gh/mlazos/44/head -> origin/gh/mlazos/44/head 2025-12-04T09:33:40.1784650Z * [new branch] gh/mlazos/44/orig -> origin/gh/mlazos/44/orig 2025-12-04T09:33:40.1786296Z * [new branch] gh/mlazos/47/base -> origin/gh/mlazos/47/base 2025-12-04T09:33:40.1787557Z * [new branch] gh/mlazos/47/head -> origin/gh/mlazos/47/head 2025-12-04T09:33:40.1788997Z * [new branch] gh/mlazos/47/orig -> origin/gh/mlazos/47/orig 2025-12-04T09:33:40.1790563Z * [new branch] gh/mlazos/48/base -> origin/gh/mlazos/48/base 2025-12-04T09:33:40.1791967Z * [new branch] gh/mlazos/48/head -> origin/gh/mlazos/48/head 2025-12-04T09:33:40.1793089Z * [new branch] gh/mlazos/48/orig -> origin/gh/mlazos/48/orig 2025-12-04T09:33:40.1794715Z * [new branch] gh/mlazos/49/base -> origin/gh/mlazos/49/base 2025-12-04T09:33:40.1798161Z * [new branch] gh/mlazos/49/head -> origin/gh/mlazos/49/head 2025-12-04T09:33:40.1798415Z * [new branch] gh/mlazos/49/orig -> origin/gh/mlazos/49/orig 2025-12-04T09:33:40.1799125Z * [new branch] gh/mlazos/50/base -> origin/gh/mlazos/50/base 2025-12-04T09:33:40.1800066Z * [new branch] gh/mlazos/50/head -> origin/gh/mlazos/50/head 2025-12-04T09:33:40.1801412Z * [new branch] gh/mlazos/50/orig -> origin/gh/mlazos/50/orig 2025-12-04T09:33:40.1803178Z * [new branch] gh/mlazos/51/base -> origin/gh/mlazos/51/base 2025-12-04T09:33:40.1804252Z * [new branch] gh/mlazos/51/head -> origin/gh/mlazos/51/head 2025-12-04T09:33:40.1805503Z * [new branch] gh/mlazos/51/orig -> origin/gh/mlazos/51/orig 2025-12-04T09:33:40.1807208Z * [new branch] gh/mlazos/52/base -> origin/gh/mlazos/52/base 2025-12-04T09:33:40.1808478Z * [new branch] gh/mlazos/52/head -> origin/gh/mlazos/52/head 2025-12-04T09:33:40.1809733Z * [new branch] gh/mlazos/52/orig -> origin/gh/mlazos/52/orig 2025-12-04T09:33:40.1811469Z * [new branch] gh/mlazos/53/base -> origin/gh/mlazos/53/base 2025-12-04T09:33:40.1812693Z * [new branch] gh/mlazos/53/head -> origin/gh/mlazos/53/head 2025-12-04T09:33:40.1813920Z * [new branch] gh/mlazos/53/orig -> origin/gh/mlazos/53/orig 2025-12-04T09:33:40.1815657Z * [new branch] gh/mlazos/54/base -> origin/gh/mlazos/54/base 2025-12-04T09:33:40.1816954Z * [new branch] gh/mlazos/54/head -> origin/gh/mlazos/54/head 2025-12-04T09:33:40.1818268Z * [new branch] gh/mlazos/54/orig -> origin/gh/mlazos/54/orig 2025-12-04T09:33:40.1820065Z * [new branch] gh/mlazos/55/base -> origin/gh/mlazos/55/base 2025-12-04T09:33:40.1821363Z * [new branch] gh/mlazos/55/head -> origin/gh/mlazos/55/head 2025-12-04T09:33:40.1822684Z * [new branch] gh/mlazos/55/orig -> origin/gh/mlazos/55/orig 2025-12-04T09:33:40.1824306Z * [new branch] gh/mlazos/56/base -> origin/gh/mlazos/56/base 2025-12-04T09:33:40.1825511Z * [new branch] gh/mlazos/56/head -> origin/gh/mlazos/56/head 2025-12-04T09:33:40.1826777Z * [new branch] gh/mlazos/56/orig -> origin/gh/mlazos/56/orig 2025-12-04T09:33:40.1828445Z * [new branch] gh/mlazos/57/base -> origin/gh/mlazos/57/base 2025-12-04T09:33:40.1829700Z * [new branch] gh/mlazos/57/head -> origin/gh/mlazos/57/head 2025-12-04T09:33:40.1830903Z * [new branch] gh/mlazos/57/orig -> origin/gh/mlazos/57/orig 2025-12-04T09:33:40.1833182Z * [new branch] gh/mlazos/58/base -> origin/gh/mlazos/58/base 2025-12-04T09:33:40.1834492Z * [new branch] gh/mlazos/58/head -> origin/gh/mlazos/58/head 2025-12-04T09:33:40.1835792Z * [new branch] gh/mlazos/58/orig -> origin/gh/mlazos/58/orig 2025-12-04T09:33:40.1837461Z * [new branch] gh/mlazos/59/base -> origin/gh/mlazos/59/base 2025-12-04T09:33:40.1839163Z * [new branch] gh/mlazos/59/head -> origin/gh/mlazos/59/head 2025-12-04T09:33:40.1840391Z * [new branch] gh/mlazos/59/orig -> origin/gh/mlazos/59/orig 2025-12-04T09:33:40.1842053Z * [new branch] gh/mlazos/60/base -> origin/gh/mlazos/60/base 2025-12-04T09:33:40.1843843Z * [new branch] gh/mlazos/60/head -> origin/gh/mlazos/60/head 2025-12-04T09:33:40.1845046Z * [new branch] gh/mlazos/60/orig -> origin/gh/mlazos/60/orig 2025-12-04T09:33:40.1847093Z * [new branch] gh/mlazos/61/base -> origin/gh/mlazos/61/base 2025-12-04T09:33:40.1848384Z * [new branch] gh/mlazos/61/head -> origin/gh/mlazos/61/head 2025-12-04T09:33:40.1850119Z * [new branch] gh/mlazos/61/orig -> origin/gh/mlazos/61/orig 2025-12-04T09:33:40.1851946Z * [new branch] gh/mlazos/62/base -> origin/gh/mlazos/62/base 2025-12-04T09:33:40.1853244Z * [new branch] gh/mlazos/62/head -> origin/gh/mlazos/62/head 2025-12-04T09:33:40.1854511Z * [new branch] gh/mlazos/62/orig -> origin/gh/mlazos/62/orig 2025-12-04T09:33:40.1856302Z * [new branch] gh/mlazos/63/base -> origin/gh/mlazos/63/base 2025-12-04T09:33:40.1857639Z * [new branch] gh/mlazos/63/head -> origin/gh/mlazos/63/head 2025-12-04T09:33:40.1859189Z * [new branch] gh/mlazos/63/orig -> origin/gh/mlazos/63/orig 2025-12-04T09:33:40.1860802Z * [new branch] gh/mlazos/64/base -> origin/gh/mlazos/64/base 2025-12-04T09:33:40.1862133Z * [new branch] gh/mlazos/64/head -> origin/gh/mlazos/64/head 2025-12-04T09:33:40.1863405Z * [new branch] gh/mlazos/64/orig -> origin/gh/mlazos/64/orig 2025-12-04T09:33:40.1865124Z * [new branch] gh/mlazos/65/base -> origin/gh/mlazos/65/base 2025-12-04T09:33:40.1866392Z * [new branch] gh/mlazos/65/head -> origin/gh/mlazos/65/head 2025-12-04T09:33:40.1867676Z * [new branch] gh/mlazos/65/orig -> origin/gh/mlazos/65/orig 2025-12-04T09:33:40.1869468Z * [new branch] gh/mlazos/66/base -> origin/gh/mlazos/66/base 2025-12-04T09:33:40.1870743Z * [new branch] gh/mlazos/66/head -> origin/gh/mlazos/66/head 2025-12-04T09:33:40.1872030Z * [new branch] gh/mlazos/66/orig -> origin/gh/mlazos/66/orig 2025-12-04T09:33:40.1873734Z * [new branch] gh/mlazos/67/base -> origin/gh/mlazos/67/base 2025-12-04T09:33:40.1874997Z * [new branch] gh/mlazos/67/head -> origin/gh/mlazos/67/head 2025-12-04T09:33:40.1876260Z * [new branch] gh/mlazos/67/orig -> origin/gh/mlazos/67/orig 2025-12-04T09:33:40.1877917Z * [new branch] gh/mlazos/68/base -> origin/gh/mlazos/68/base 2025-12-04T09:33:40.1879166Z * [new branch] gh/mlazos/68/head -> origin/gh/mlazos/68/head 2025-12-04T09:33:40.1880419Z * [new branch] gh/mlazos/68/orig -> origin/gh/mlazos/68/orig 2025-12-04T09:33:40.1882161Z * [new branch] gh/mlazos/69/base -> origin/gh/mlazos/69/base 2025-12-04T09:33:40.1883439Z * [new branch] gh/mlazos/69/head -> origin/gh/mlazos/69/head 2025-12-04T09:33:40.1884646Z * [new branch] gh/mlazos/69/orig -> origin/gh/mlazos/69/orig 2025-12-04T09:33:40.1886444Z * [new branch] gh/mlazos/70/base -> origin/gh/mlazos/70/base 2025-12-04T09:33:40.1887657Z * [new branch] gh/mlazos/70/head -> origin/gh/mlazos/70/head 2025-12-04T09:33:40.1889268Z * [new branch] gh/mlazos/70/orig -> origin/gh/mlazos/70/orig 2025-12-04T09:33:40.1891004Z * [new branch] gh/mlazos/71/base -> origin/gh/mlazos/71/base 2025-12-04T09:33:40.1892271Z * [new branch] gh/mlazos/71/head -> origin/gh/mlazos/71/head 2025-12-04T09:33:40.1893548Z * [new branch] gh/mlazos/71/orig -> origin/gh/mlazos/71/orig 2025-12-04T09:33:40.1895168Z * [new branch] gh/mlazos/72/base -> origin/gh/mlazos/72/base 2025-12-04T09:33:40.1896620Z * [new branch] gh/mlazos/72/head -> origin/gh/mlazos/72/head 2025-12-04T09:33:40.1897729Z * [new branch] gh/mlazos/72/orig -> origin/gh/mlazos/72/orig 2025-12-04T09:33:40.1899665Z * [new branch] gh/mlazos/73/base -> origin/gh/mlazos/73/base 2025-12-04T09:33:40.1900982Z * [new branch] gh/mlazos/73/head -> origin/gh/mlazos/73/head 2025-12-04T09:33:40.1902277Z * [new branch] gh/mlazos/73/orig -> origin/gh/mlazos/73/orig 2025-12-04T09:33:40.1904471Z * [new branch] gh/mrmiywj/1/base -> origin/gh/mrmiywj/1/base 2025-12-04T09:33:40.1905783Z * [new branch] gh/mrmiywj/1/head -> origin/gh/mrmiywj/1/head 2025-12-04T09:33:40.1907917Z * [new branch] gh/muchulee8/73/base -> origin/gh/muchulee8/73/base 2025-12-04T09:33:40.1909356Z * [new branch] gh/muchulee8/73/head -> origin/gh/muchulee8/73/head 2025-12-04T09:33:40.1910785Z * [new branch] gh/muchulee8/73/orig -> origin/gh/muchulee8/73/orig 2025-12-04T09:33:40.1912836Z * [new branch] gh/naveenthangudu/1/base -> origin/gh/naveenthangudu/1/base 2025-12-04T09:33:40.1914104Z * [new branch] gh/naveenthangudu/1/head -> origin/gh/naveenthangudu/1/head 2025-12-04T09:33:40.1915489Z * [new branch] gh/naveenthangudu/1/orig -> origin/gh/naveenthangudu/1/orig 2025-12-04T09:33:40.1917139Z * [new branch] gh/naveenthangudu/2/base -> origin/gh/naveenthangudu/2/base 2025-12-04T09:33:40.1918432Z * [new branch] gh/naveenthangudu/2/head -> origin/gh/naveenthangudu/2/head 2025-12-04T09:33:40.1919848Z * [new branch] gh/naveenthangudu/2/orig -> origin/gh/naveenthangudu/2/orig 2025-12-04T09:33:40.1921954Z * [new branch] gh/naveenthangudu/3/base -> origin/gh/naveenthangudu/3/base 2025-12-04T09:33:40.1923311Z * [new branch] gh/naveenthangudu/3/head -> origin/gh/naveenthangudu/3/head 2025-12-04T09:33:40.1924614Z * [new branch] gh/naveenthangudu/3/orig -> origin/gh/naveenthangudu/3/orig 2025-12-04T09:33:40.1926281Z * [new branch] gh/naveenthangudu/4/base -> origin/gh/naveenthangudu/4/base 2025-12-04T09:33:40.1927511Z * [new branch] gh/naveenthangudu/4/head -> origin/gh/naveenthangudu/4/head 2025-12-04T09:33:40.1929000Z * [new branch] gh/naveenthangudu/4/orig -> origin/gh/naveenthangudu/4/orig 2025-12-04T09:33:40.1930597Z * [new branch] gh/naveenthangudu/5/base -> origin/gh/naveenthangudu/5/base 2025-12-04T09:33:40.1931891Z * [new branch] gh/naveenthangudu/5/head -> origin/gh/naveenthangudu/5/head 2025-12-04T09:33:40.1933479Z * [new branch] gh/naveenthangudu/5/orig -> origin/gh/naveenthangudu/5/orig 2025-12-04T09:33:40.1935123Z * [new branch] gh/naveenthangudu/6/base -> origin/gh/naveenthangudu/6/base 2025-12-04T09:33:40.1936394Z * [new branch] gh/naveenthangudu/6/head -> origin/gh/naveenthangudu/6/head 2025-12-04T09:33:40.1937695Z * [new branch] gh/naveenthangudu/6/orig -> origin/gh/naveenthangudu/6/orig 2025-12-04T09:33:40.1939538Z * [new branch] gh/naveenthangudu/7/base -> origin/gh/naveenthangudu/7/base 2025-12-04T09:33:40.1940755Z * [new branch] gh/naveenthangudu/7/head -> origin/gh/naveenthangudu/7/head 2025-12-04T09:33:40.1941963Z * [new branch] gh/naveenthangudu/7/orig -> origin/gh/naveenthangudu/7/orig 2025-12-04T09:33:40.1943556Z * [new branch] gh/naveenthangudu/8/base -> origin/gh/naveenthangudu/8/base 2025-12-04T09:33:40.1944889Z * [new branch] gh/naveenthangudu/8/head -> origin/gh/naveenthangudu/8/head 2025-12-04T09:33:40.1946288Z * [new branch] gh/naveenthangudu/8/orig -> origin/gh/naveenthangudu/8/orig 2025-12-04T09:33:40.1948006Z * [new branch] gh/naveenthangudu/9/base -> origin/gh/naveenthangudu/9/base 2025-12-04T09:33:40.1949204Z * [new branch] gh/naveenthangudu/9/head -> origin/gh/naveenthangudu/9/head 2025-12-04T09:33:40.1950503Z * [new branch] gh/naveenthangudu/9/orig -> origin/gh/naveenthangudu/9/orig 2025-12-04T09:33:40.1952512Z * [new branch] gh/nikitaved/1/base -> origin/gh/nikitaved/1/base 2025-12-04T09:33:40.1953871Z * [new branch] gh/nikitaved/1/head -> origin/gh/nikitaved/1/head 2025-12-04T09:33:40.1955121Z * [new branch] gh/nikitaved/1/orig -> origin/gh/nikitaved/1/orig 2025-12-04T09:33:40.1956922Z * [new branch] gh/nikitaved/10/base -> origin/gh/nikitaved/10/base 2025-12-04T09:33:40.1958188Z * [new branch] gh/nikitaved/10/head -> origin/gh/nikitaved/10/head 2025-12-04T09:33:40.1959460Z * [new branch] gh/nikitaved/10/orig -> origin/gh/nikitaved/10/orig 2025-12-04T09:33:40.1961043Z * [new branch] gh/nikitaved/11/base -> origin/gh/nikitaved/11/base 2025-12-04T09:33:40.1962359Z * [new branch] gh/nikitaved/11/head -> origin/gh/nikitaved/11/head 2025-12-04T09:33:40.1963644Z * [new branch] gh/nikitaved/11/orig -> origin/gh/nikitaved/11/orig 2025-12-04T09:33:40.1965735Z * [new branch] gh/nikitaved/12/base -> origin/gh/nikitaved/12/base 2025-12-04T09:33:40.1967071Z * [new branch] gh/nikitaved/12/head -> origin/gh/nikitaved/12/head 2025-12-04T09:33:40.1968316Z * [new branch] gh/nikitaved/12/orig -> origin/gh/nikitaved/12/orig 2025-12-04T09:33:40.1970014Z * [new branch] gh/nikitaved/13/base -> origin/gh/nikitaved/13/base 2025-12-04T09:33:40.1971401Z * [new branch] gh/nikitaved/13/head -> origin/gh/nikitaved/13/head 2025-12-04T09:33:40.1972646Z * [new branch] gh/nikitaved/13/orig -> origin/gh/nikitaved/13/orig 2025-12-04T09:33:40.1974432Z * [new branch] gh/nikitaved/14/base -> origin/gh/nikitaved/14/base 2025-12-04T09:33:40.1975628Z * [new branch] gh/nikitaved/14/head -> origin/gh/nikitaved/14/head 2025-12-04T09:33:40.1976880Z * [new branch] gh/nikitaved/14/orig -> origin/gh/nikitaved/14/orig 2025-12-04T09:33:40.1978506Z * [new branch] gh/nikitaved/15/base -> origin/gh/nikitaved/15/base 2025-12-04T09:33:40.1980333Z * [new branch] gh/nikitaved/15/head -> origin/gh/nikitaved/15/head 2025-12-04T09:33:40.1981672Z * [new branch] gh/nikitaved/15/orig -> origin/gh/nikitaved/15/orig 2025-12-04T09:33:40.1983343Z * [new branch] gh/nikitaved/16/base -> origin/gh/nikitaved/16/base 2025-12-04T09:33:40.1984628Z * [new branch] gh/nikitaved/16/head -> origin/gh/nikitaved/16/head 2025-12-04T09:33:40.1985884Z * [new branch] gh/nikitaved/16/orig -> origin/gh/nikitaved/16/orig 2025-12-04T09:33:40.1987570Z * [new branch] gh/nikitaved/2/base -> origin/gh/nikitaved/2/base 2025-12-04T09:33:40.1992409Z * [new branch] gh/nikitaved/2/head -> origin/gh/nikitaved/2/head 2025-12-04T09:33:40.1993763Z * [new branch] gh/nikitaved/2/orig -> origin/gh/nikitaved/2/orig 2025-12-04T09:33:40.1995456Z * [new branch] gh/nikitaved/4/base -> origin/gh/nikitaved/4/base 2025-12-04T09:33:40.1996710Z * [new branch] gh/nikitaved/4/head -> origin/gh/nikitaved/4/head 2025-12-04T09:33:40.1998037Z * [new branch] gh/nikitaved/4/orig -> origin/gh/nikitaved/4/orig 2025-12-04T09:33:40.1999756Z * [new branch] gh/nikitaved/5/base -> origin/gh/nikitaved/5/base 2025-12-04T09:33:40.2001153Z * [new branch] gh/nikitaved/5/head -> origin/gh/nikitaved/5/head 2025-12-04T09:33:40.2002532Z * [new branch] gh/nikitaved/5/orig -> origin/gh/nikitaved/5/orig 2025-12-04T09:33:40.2004065Z * [new branch] gh/nikitaved/6/base -> origin/gh/nikitaved/6/base 2025-12-04T09:33:40.2005355Z * [new branch] gh/nikitaved/6/head -> origin/gh/nikitaved/6/head 2025-12-04T09:33:40.2006901Z * [new branch] gh/nikitaved/6/orig -> origin/gh/nikitaved/6/orig 2025-12-04T09:33:40.2008563Z * [new branch] gh/nikitaved/8/base -> origin/gh/nikitaved/8/base 2025-12-04T09:33:40.2009955Z * [new branch] gh/nikitaved/8/head -> origin/gh/nikitaved/8/head 2025-12-04T09:33:40.2011243Z * [new branch] gh/nikitaved/8/orig -> origin/gh/nikitaved/8/orig 2025-12-04T09:33:40.2012925Z * [new branch] gh/nikitaved/9/base -> origin/gh/nikitaved/9/base 2025-12-04T09:33:40.2014191Z * [new branch] gh/nikitaved/9/head -> origin/gh/nikitaved/9/head 2025-12-04T09:33:40.2015464Z * [new branch] gh/nikitaved/9/orig -> origin/gh/nikitaved/9/orig 2025-12-04T09:33:40.2017399Z * [new branch] gh/oulgen/10/base -> origin/gh/oulgen/10/base 2025-12-04T09:33:40.2018768Z * [new branch] gh/oulgen/10/head -> origin/gh/oulgen/10/head 2025-12-04T09:33:40.2020051Z * [new branch] gh/oulgen/10/orig -> origin/gh/oulgen/10/orig 2025-12-04T09:33:40.2021743Z * [new branch] gh/oulgen/11/base -> origin/gh/oulgen/11/base 2025-12-04T09:33:40.2022983Z * [new branch] gh/oulgen/11/head -> origin/gh/oulgen/11/head 2025-12-04T09:33:40.2024218Z * [new branch] gh/oulgen/11/orig -> origin/gh/oulgen/11/orig 2025-12-04T09:33:40.2025973Z * [new branch] gh/oulgen/12/base -> origin/gh/oulgen/12/base 2025-12-04T09:33:40.2027188Z * [new branch] gh/oulgen/12/head -> origin/gh/oulgen/12/head 2025-12-04T09:33:40.2028478Z * [new branch] gh/oulgen/12/orig -> origin/gh/oulgen/12/orig 2025-12-04T09:33:40.2030150Z * [new branch] gh/oulgen/13/base -> origin/gh/oulgen/13/base 2025-12-04T09:33:40.2031405Z * [new branch] gh/oulgen/13/head -> origin/gh/oulgen/13/head 2025-12-04T09:33:40.2032642Z * [new branch] gh/oulgen/13/orig -> origin/gh/oulgen/13/orig 2025-12-04T09:33:40.2034296Z * [new branch] gh/oulgen/14/base -> origin/gh/oulgen/14/base 2025-12-04T09:33:40.2035620Z * [new branch] gh/oulgen/14/head -> origin/gh/oulgen/14/head 2025-12-04T09:33:40.2036914Z * [new branch] gh/oulgen/14/orig -> origin/gh/oulgen/14/orig 2025-12-04T09:33:40.2038550Z * [new branch] gh/oulgen/15/base -> origin/gh/oulgen/15/base 2025-12-04T09:33:40.2039785Z * [new branch] gh/oulgen/15/head -> origin/gh/oulgen/15/head 2025-12-04T09:33:40.2041747Z * [new branch] gh/oulgen/15/orig -> origin/gh/oulgen/15/orig 2025-12-04T09:33:40.2043226Z * [new branch] gh/oulgen/16/base -> origin/gh/oulgen/16/base 2025-12-04T09:33:40.2044437Z * [new branch] gh/oulgen/16/head -> origin/gh/oulgen/16/head 2025-12-04T09:33:40.2045728Z * [new branch] gh/oulgen/16/orig -> origin/gh/oulgen/16/orig 2025-12-04T09:33:40.2047388Z * [new branch] gh/oulgen/17/base -> origin/gh/oulgen/17/base 2025-12-04T09:33:40.2048687Z * [new branch] gh/oulgen/17/head -> origin/gh/oulgen/17/head 2025-12-04T09:33:40.2049927Z * [new branch] gh/oulgen/17/orig -> origin/gh/oulgen/17/orig 2025-12-04T09:33:40.2051597Z * [new branch] gh/oulgen/18/base -> origin/gh/oulgen/18/base 2025-12-04T09:33:40.2052858Z * [new branch] gh/oulgen/18/head -> origin/gh/oulgen/18/head 2025-12-04T09:33:40.2054177Z * [new branch] gh/oulgen/18/orig -> origin/gh/oulgen/18/orig 2025-12-04T09:33:40.2055758Z * [new branch] gh/oulgen/19/base -> origin/gh/oulgen/19/base 2025-12-04T09:33:40.2057009Z * [new branch] gh/oulgen/19/head -> origin/gh/oulgen/19/head 2025-12-04T09:33:40.2058346Z * [new branch] gh/oulgen/19/orig -> origin/gh/oulgen/19/orig 2025-12-04T09:33:40.2060151Z * [new branch] gh/oulgen/20/base -> origin/gh/oulgen/20/base 2025-12-04T09:33:40.2062318Z * [new branch] gh/oulgen/20/head -> origin/gh/oulgen/20/head 2025-12-04T09:33:40.2063170Z * [new branch] gh/oulgen/20/orig -> origin/gh/oulgen/20/orig 2025-12-04T09:33:40.2064832Z * [new branch] gh/oulgen/21/base -> origin/gh/oulgen/21/base 2025-12-04T09:33:40.2066130Z * [new branch] gh/oulgen/21/head -> origin/gh/oulgen/21/head 2025-12-04T09:33:40.2067375Z * [new branch] gh/oulgen/21/orig -> origin/gh/oulgen/21/orig 2025-12-04T09:33:40.2068994Z * [new branch] gh/oulgen/22/base -> origin/gh/oulgen/22/base 2025-12-04T09:33:40.2070222Z * [new branch] gh/oulgen/22/head -> origin/gh/oulgen/22/head 2025-12-04T09:33:40.2071472Z * [new branch] gh/oulgen/22/orig -> origin/gh/oulgen/22/orig 2025-12-04T09:33:40.2073097Z * [new branch] gh/oulgen/23/base -> origin/gh/oulgen/23/base 2025-12-04T09:33:40.2074387Z * [new branch] gh/oulgen/23/head -> origin/gh/oulgen/23/head 2025-12-04T09:33:40.2075677Z * [new branch] gh/oulgen/23/orig -> origin/gh/oulgen/23/orig 2025-12-04T09:33:40.2077387Z * [new branch] gh/oulgen/24/base -> origin/gh/oulgen/24/base 2025-12-04T09:33:40.2078645Z * [new branch] gh/oulgen/24/head -> origin/gh/oulgen/24/head 2025-12-04T09:33:40.2079902Z * [new branch] gh/oulgen/24/orig -> origin/gh/oulgen/24/orig 2025-12-04T09:33:40.2081549Z * [new branch] gh/oulgen/25/base -> origin/gh/oulgen/25/base 2025-12-04T09:33:40.2082791Z * [new branch] gh/oulgen/25/head -> origin/gh/oulgen/25/head 2025-12-04T09:33:40.2084060Z * [new branch] gh/oulgen/25/orig -> origin/gh/oulgen/25/orig 2025-12-04T09:33:40.2086187Z * [new branch] gh/oulgen/26/base -> origin/gh/oulgen/26/base 2025-12-04T09:33:40.2087438Z * [new branch] gh/oulgen/26/head -> origin/gh/oulgen/26/head 2025-12-04T09:33:40.2088955Z * [new branch] gh/oulgen/26/orig -> origin/gh/oulgen/26/orig 2025-12-04T09:33:40.2090630Z * [new branch] gh/oulgen/4/base -> origin/gh/oulgen/4/base 2025-12-04T09:33:40.2091872Z * [new branch] gh/oulgen/4/head -> origin/gh/oulgen/4/head 2025-12-04T09:33:40.2093133Z * [new branch] gh/oulgen/4/orig -> origin/gh/oulgen/4/orig 2025-12-04T09:33:40.2095315Z * [new branch] gh/oulgen/7/base -> origin/gh/oulgen/7/base 2025-12-04T09:33:40.2096613Z * [new branch] gh/oulgen/7/head -> origin/gh/oulgen/7/head 2025-12-04T09:33:40.2097859Z * [new branch] gh/oulgen/7/orig -> origin/gh/oulgen/7/orig 2025-12-04T09:33:40.2099781Z * [new branch] gh/oulgen/8/base -> origin/gh/oulgen/8/base 2025-12-04T09:33:40.2100992Z * [new branch] gh/oulgen/8/head -> origin/gh/oulgen/8/head 2025-12-04T09:33:40.2102231Z * [new branch] gh/oulgen/8/orig -> origin/gh/oulgen/8/orig 2025-12-04T09:33:40.2103873Z * [new branch] gh/oulgen/9/base -> origin/gh/oulgen/9/base 2025-12-04T09:33:40.2105141Z * [new branch] gh/oulgen/9/head -> origin/gh/oulgen/9/head 2025-12-04T09:33:40.2107038Z * [new branch] gh/oulgen/9/orig -> origin/gh/oulgen/9/orig 2025-12-04T09:33:40.2108730Z * [new branch] gh/patvig/mtia-serialization -> origin/gh/patvig/mtia-serialization 2025-12-04T09:33:40.2110778Z * [new branch] gh/pearu/108/base -> origin/gh/pearu/108/base 2025-12-04T09:33:40.2112227Z * [new branch] gh/pearu/108/head -> origin/gh/pearu/108/head 2025-12-04T09:33:40.2113606Z * [new branch] gh/pearu/108/orig -> origin/gh/pearu/108/orig 2025-12-04T09:33:40.2115319Z * [new branch] gh/pearu/109/base -> origin/gh/pearu/109/base 2025-12-04T09:33:40.2116606Z * [new branch] gh/pearu/109/head -> origin/gh/pearu/109/head 2025-12-04T09:33:40.2117887Z * [new branch] gh/pearu/109/orig -> origin/gh/pearu/109/orig 2025-12-04T09:33:40.2119596Z * [new branch] gh/pearu/110/base -> origin/gh/pearu/110/base 2025-12-04T09:33:40.2120928Z * [new branch] gh/pearu/110/head -> origin/gh/pearu/110/head 2025-12-04T09:33:40.2122199Z * [new branch] gh/pearu/110/orig -> origin/gh/pearu/110/orig 2025-12-04T09:33:40.2123908Z * [new branch] gh/pearu/111/base -> origin/gh/pearu/111/base 2025-12-04T09:33:40.2125190Z * [new branch] gh/pearu/111/head -> origin/gh/pearu/111/head 2025-12-04T09:33:40.2126508Z * [new branch] gh/pearu/111/orig -> origin/gh/pearu/111/orig 2025-12-04T09:33:40.2128240Z * [new branch] gh/pearu/112/base -> origin/gh/pearu/112/base 2025-12-04T09:33:40.2129613Z * [new branch] gh/pearu/112/head -> origin/gh/pearu/112/head 2025-12-04T09:33:40.2130923Z * [new branch] gh/pearu/112/orig -> origin/gh/pearu/112/orig 2025-12-04T09:33:40.2132496Z * [new branch] gh/pearu/115/base -> origin/gh/pearu/115/base 2025-12-04T09:33:40.2133749Z * [new branch] gh/pearu/115/head -> origin/gh/pearu/115/head 2025-12-04T09:33:40.2135021Z * [new branch] gh/pearu/115/orig -> origin/gh/pearu/115/orig 2025-12-04T09:33:40.2136595Z * [new branch] gh/pearu/116/base -> origin/gh/pearu/116/base 2025-12-04T09:33:40.2137855Z * [new branch] gh/pearu/116/head -> origin/gh/pearu/116/head 2025-12-04T09:33:40.2139254Z * [new branch] gh/pearu/116/orig -> origin/gh/pearu/116/orig 2025-12-04T09:33:40.2140964Z * [new branch] gh/pearu/117/base -> origin/gh/pearu/117/base 2025-12-04T09:33:40.2142202Z * [new branch] gh/pearu/117/head -> origin/gh/pearu/117/head 2025-12-04T09:33:40.2143487Z * [new branch] gh/pearu/117/orig -> origin/gh/pearu/117/orig 2025-12-04T09:33:40.2145139Z * [new branch] gh/pearu/118/base -> origin/gh/pearu/118/base 2025-12-04T09:33:40.2146510Z * [new branch] gh/pearu/118/head -> origin/gh/pearu/118/head 2025-12-04T09:33:40.2147764Z * [new branch] gh/pearu/118/orig -> origin/gh/pearu/118/orig 2025-12-04T09:33:40.2149418Z * [new branch] gh/pearu/119/base -> origin/gh/pearu/119/base 2025-12-04T09:33:40.2150687Z * [new branch] gh/pearu/119/head -> origin/gh/pearu/119/head 2025-12-04T09:33:40.2151963Z * [new branch] gh/pearu/119/orig -> origin/gh/pearu/119/orig 2025-12-04T09:33:40.2153554Z * [new branch] gh/pearu/139/base -> origin/gh/pearu/139/base 2025-12-04T09:33:40.2154825Z * [new branch] gh/pearu/139/head -> origin/gh/pearu/139/head 2025-12-04T09:33:40.2156149Z * [new branch] gh/pearu/139/orig -> origin/gh/pearu/139/orig 2025-12-04T09:33:40.2157879Z * [new branch] gh/pearu/140/base -> origin/gh/pearu/140/base 2025-12-04T09:33:40.2159249Z * [new branch] gh/pearu/140/head -> origin/gh/pearu/140/head 2025-12-04T09:33:40.2160510Z * [new branch] gh/pearu/140/orig -> origin/gh/pearu/140/orig 2025-12-04T09:33:40.2162191Z * [new branch] gh/pearu/142/base -> origin/gh/pearu/142/base 2025-12-04T09:33:40.2163549Z * [new branch] gh/pearu/142/head -> origin/gh/pearu/142/head 2025-12-04T09:33:40.2164813Z * [new branch] gh/pearu/142/orig -> origin/gh/pearu/142/orig 2025-12-04T09:33:40.2166483Z * [new branch] gh/pearu/143/base -> origin/gh/pearu/143/base 2025-12-04T09:33:40.2167774Z * [new branch] gh/pearu/143/head -> origin/gh/pearu/143/head 2025-12-04T09:33:40.2169191Z * [new branch] gh/pearu/143/orig -> origin/gh/pearu/143/orig 2025-12-04T09:33:40.2170874Z * [new branch] gh/pearu/147/base -> origin/gh/pearu/147/base 2025-12-04T09:33:40.2172160Z * [new branch] gh/pearu/147/head -> origin/gh/pearu/147/head 2025-12-04T09:33:40.2173402Z * [new branch] gh/pearu/147/orig -> origin/gh/pearu/147/orig 2025-12-04T09:33:40.2175071Z * [new branch] gh/pearu/149/base -> origin/gh/pearu/149/base 2025-12-04T09:33:40.2176322Z * [new branch] gh/pearu/149/head -> origin/gh/pearu/149/head 2025-12-04T09:33:40.2177540Z * [new branch] gh/pearu/149/orig -> origin/gh/pearu/149/orig 2025-12-04T09:33:40.2179816Z * [new branch] gh/pearu/150/base -> origin/gh/pearu/150/base 2025-12-04T09:33:40.2181204Z * [new branch] gh/pearu/150/head -> origin/gh/pearu/150/head 2025-12-04T09:33:40.2182476Z * [new branch] gh/pearu/150/orig -> origin/gh/pearu/150/orig 2025-12-04T09:33:40.2184191Z * [new branch] gh/pearu/151/base -> origin/gh/pearu/151/base 2025-12-04T09:33:40.2185476Z * [new branch] gh/pearu/151/head -> origin/gh/pearu/151/head 2025-12-04T09:33:40.2186740Z * [new branch] gh/pearu/151/orig -> origin/gh/pearu/151/orig 2025-12-04T09:33:40.2188638Z * [new branch] gh/pearu/152/base -> origin/gh/pearu/152/base 2025-12-04T09:33:40.2190007Z * [new branch] gh/pearu/152/head -> origin/gh/pearu/152/head 2025-12-04T09:33:40.2191345Z * [new branch] gh/pearu/152/orig -> origin/gh/pearu/152/orig 2025-12-04T09:33:40.2193028Z * [new branch] gh/pearu/153/base -> origin/gh/pearu/153/base 2025-12-04T09:33:40.2194237Z * [new branch] gh/pearu/153/head -> origin/gh/pearu/153/head 2025-12-04T09:33:40.2195518Z * [new branch] gh/pearu/153/orig -> origin/gh/pearu/153/orig 2025-12-04T09:33:40.2197274Z * [new branch] gh/pearu/154/base -> origin/gh/pearu/154/base 2025-12-04T09:33:40.2198625Z * [new branch] gh/pearu/154/head -> origin/gh/pearu/154/head 2025-12-04T09:33:40.2199873Z * [new branch] gh/pearu/154/orig -> origin/gh/pearu/154/orig 2025-12-04T09:33:40.2201680Z * [new branch] gh/pearu/155/base -> origin/gh/pearu/155/base 2025-12-04T09:33:40.2203014Z * [new branch] gh/pearu/155/head -> origin/gh/pearu/155/head 2025-12-04T09:33:40.2204354Z * [new branch] gh/pearu/155/orig -> origin/gh/pearu/155/orig 2025-12-04T09:33:40.2206531Z * [new branch] gh/pearu/156/base -> origin/gh/pearu/156/base 2025-12-04T09:33:40.2207834Z * [new branch] gh/pearu/156/head -> origin/gh/pearu/156/head 2025-12-04T09:33:40.2209091Z * [new branch] gh/pearu/156/orig -> origin/gh/pearu/156/orig 2025-12-04T09:33:40.2211311Z * [new branch] gh/pearu/56/base -> origin/gh/pearu/56/base 2025-12-04T09:33:40.2212867Z * [new branch] gh/pearu/56/head -> origin/gh/pearu/56/head 2025-12-04T09:33:40.2214084Z * [new branch] gh/pearu/56/orig -> origin/gh/pearu/56/orig 2025-12-04T09:33:40.2216020Z * [new branch] gh/pearu/97/base -> origin/gh/pearu/97/base 2025-12-04T09:33:40.2217422Z * [new branch] gh/pearu/97/head -> origin/gh/pearu/97/head 2025-12-04T09:33:40.2218814Z * [new branch] gh/pearu/97/orig -> origin/gh/pearu/97/orig 2025-12-04T09:33:40.2220820Z * [new branch] gh/pianpwk/21/base -> origin/gh/pianpwk/21/base 2025-12-04T09:33:40.2222097Z * [new branch] gh/pianpwk/21/head -> origin/gh/pianpwk/21/head 2025-12-04T09:33:40.2223915Z * [new branch] gh/pianpwk/28/base -> origin/gh/pianpwk/28/base 2025-12-04T09:33:40.2225116Z * [new branch] gh/pianpwk/28/head -> origin/gh/pianpwk/28/head 2025-12-04T09:33:40.2226442Z * [new branch] gh/pianpwk/28/orig -> origin/gh/pianpwk/28/orig 2025-12-04T09:33:40.2228111Z * [new branch] gh/pianpwk/29/base -> origin/gh/pianpwk/29/base 2025-12-04T09:33:40.2229491Z * [new branch] gh/pianpwk/29/head -> origin/gh/pianpwk/29/head 2025-12-04T09:33:40.2230944Z * [new branch] gh/pianpwk/29/orig -> origin/gh/pianpwk/29/orig 2025-12-04T09:33:40.2232839Z * [new branch] gh/pianpwk/30/base -> origin/gh/pianpwk/30/base 2025-12-04T09:33:40.2234213Z * [new branch] gh/pianpwk/30/head -> origin/gh/pianpwk/30/head 2025-12-04T09:33:40.2235547Z * [new branch] gh/pianpwk/30/orig -> origin/gh/pianpwk/30/orig 2025-12-04T09:33:40.2237314Z * [new branch] gh/pianpwk/31/base -> origin/gh/pianpwk/31/base 2025-12-04T09:33:40.2238559Z * [new branch] gh/pianpwk/31/head -> origin/gh/pianpwk/31/head 2025-12-04T09:33:40.2239820Z * [new branch] gh/pianpwk/31/orig -> origin/gh/pianpwk/31/orig 2025-12-04T09:33:40.2241333Z * [new branch] gh/pianpwk/32/base -> origin/gh/pianpwk/32/base 2025-12-04T09:33:40.2242590Z * [new branch] gh/pianpwk/32/head -> origin/gh/pianpwk/32/head 2025-12-04T09:33:40.2243928Z * [new branch] gh/pianpwk/32/orig -> origin/gh/pianpwk/32/orig 2025-12-04T09:33:40.2245421Z * [new branch] gh/pianpwk/33/base -> origin/gh/pianpwk/33/base 2025-12-04T09:33:40.2246677Z * [new branch] gh/pianpwk/33/head -> origin/gh/pianpwk/33/head 2025-12-04T09:33:40.2247910Z * [new branch] gh/pianpwk/33/orig -> origin/gh/pianpwk/33/orig 2025-12-04T09:33:40.2249917Z * [new branch] gh/pianpwk/34/base -> origin/gh/pianpwk/34/base 2025-12-04T09:33:40.2251638Z * [new branch] gh/pianpwk/34/head -> origin/gh/pianpwk/34/head 2025-12-04T09:33:40.2253170Z * [new branch] gh/pianpwk/34/orig -> origin/gh/pianpwk/34/orig 2025-12-04T09:33:40.2254836Z * [new branch] gh/pianpwk/35/base -> origin/gh/pianpwk/35/base 2025-12-04T09:33:40.2256280Z * [new branch] gh/pianpwk/35/head -> origin/gh/pianpwk/35/head 2025-12-04T09:33:40.2257674Z * [new branch] gh/pianpwk/35/orig -> origin/gh/pianpwk/35/orig 2025-12-04T09:33:40.2259808Z * [new branch] gh/rec/141/base -> origin/gh/rec/141/base 2025-12-04T09:33:40.2261063Z * [new branch] gh/rec/141/head -> origin/gh/rec/141/head 2025-12-04T09:33:40.2262695Z * [new branch] gh/rec/153/base -> origin/gh/rec/153/base 2025-12-04T09:33:40.2263932Z * [new branch] gh/rec/153/head -> origin/gh/rec/153/head 2025-12-04T09:33:40.2265196Z * [new branch] gh/rec/153/orig -> origin/gh/rec/153/orig 2025-12-04T09:33:40.2266927Z * [new branch] gh/rec/154/base -> origin/gh/rec/154/base 2025-12-04T09:33:40.2268207Z * [new branch] gh/rec/154/head -> origin/gh/rec/154/head 2025-12-04T09:33:40.2269436Z * [new branch] gh/rec/154/orig -> origin/gh/rec/154/orig 2025-12-04T09:33:40.2271545Z * [new branch] gh/rec/164/base -> origin/gh/rec/164/base 2025-12-04T09:33:40.2272773Z * [new branch] gh/rec/164/head -> origin/gh/rec/164/head 2025-12-04T09:33:40.2274073Z * [new branch] gh/rec/164/orig -> origin/gh/rec/164/orig 2025-12-04T09:33:40.2275795Z * [new branch] gh/rec/166/base -> origin/gh/rec/166/base 2025-12-04T09:33:40.2277015Z * [new branch] gh/rec/166/head -> origin/gh/rec/166/head 2025-12-04T09:33:40.2278290Z * [new branch] gh/rec/166/orig -> origin/gh/rec/166/orig 2025-12-04T09:33:40.2279963Z * [new branch] gh/rec/167/base -> origin/gh/rec/167/base 2025-12-04T09:33:40.2281210Z * [new branch] gh/rec/167/head -> origin/gh/rec/167/head 2025-12-04T09:33:40.2282487Z * [new branch] gh/rec/167/orig -> origin/gh/rec/167/orig 2025-12-04T09:33:40.2284129Z * [new branch] gh/rec/168/base -> origin/gh/rec/168/base 2025-12-04T09:33:40.2285601Z * [new branch] gh/rec/168/head -> origin/gh/rec/168/head 2025-12-04T09:33:40.2286855Z * [new branch] gh/rec/168/orig -> origin/gh/rec/168/orig 2025-12-04T09:33:40.2288676Z * [new branch] gh/rec/169/base -> origin/gh/rec/169/base 2025-12-04T09:33:40.2289958Z * [new branch] gh/rec/169/head -> origin/gh/rec/169/head 2025-12-04T09:33:40.2291190Z * [new branch] gh/rec/169/orig -> origin/gh/rec/169/orig 2025-12-04T09:33:40.2292888Z * [new branch] gh/rec/170/base -> origin/gh/rec/170/base 2025-12-04T09:33:40.2294136Z * [new branch] gh/rec/170/head -> origin/gh/rec/170/head 2025-12-04T09:33:40.2295409Z * [new branch] gh/rec/170/orig -> origin/gh/rec/170/orig 2025-12-04T09:33:40.2297066Z * [new branch] gh/rec/171/base -> origin/gh/rec/171/base 2025-12-04T09:33:40.2298416Z * [new branch] gh/rec/171/head -> origin/gh/rec/171/head 2025-12-04T09:33:40.2299838Z * [new branch] gh/rec/171/orig -> origin/gh/rec/171/orig 2025-12-04T09:33:40.2301387Z * [new branch] gh/rec/172/base -> origin/gh/rec/172/base 2025-12-04T09:33:40.2302744Z * [new branch] gh/rec/172/head -> origin/gh/rec/172/head 2025-12-04T09:33:40.2303973Z * [new branch] gh/rec/172/orig -> origin/gh/rec/172/orig 2025-12-04T09:33:40.2305710Z * [new branch] gh/rec/173/base -> origin/gh/rec/173/base 2025-12-04T09:33:40.2307001Z * [new branch] gh/rec/173/head -> origin/gh/rec/173/head 2025-12-04T09:33:40.2308284Z * [new branch] gh/rec/173/orig -> origin/gh/rec/173/orig 2025-12-04T09:33:40.2309948Z * [new branch] gh/rec/174/base -> origin/gh/rec/174/base 2025-12-04T09:33:40.2311128Z * [new branch] gh/rec/174/head -> origin/gh/rec/174/head 2025-12-04T09:33:40.2312441Z * [new branch] gh/rec/174/orig -> origin/gh/rec/174/orig 2025-12-04T09:33:40.2314064Z * [new branch] gh/rec/175/base -> origin/gh/rec/175/base 2025-12-04T09:33:40.2315346Z * [new branch] gh/rec/175/head -> origin/gh/rec/175/head 2025-12-04T09:33:40.2316554Z * [new branch] gh/rec/175/orig -> origin/gh/rec/175/orig 2025-12-04T09:33:40.2318316Z * [new branch] gh/rec/176/base -> origin/gh/rec/176/base 2025-12-04T09:33:40.2319567Z * [new branch] gh/rec/176/head -> origin/gh/rec/176/head 2025-12-04T09:33:40.2321242Z * [new branch] gh/rec/176/orig -> origin/gh/rec/176/orig 2025-12-04T09:33:40.2322945Z * [new branch] gh/rec/177/base -> origin/gh/rec/177/base 2025-12-04T09:33:40.2324180Z * [new branch] gh/rec/177/head -> origin/gh/rec/177/head 2025-12-04T09:33:40.2325467Z * [new branch] gh/rec/177/orig -> origin/gh/rec/177/orig 2025-12-04T09:33:40.2327590Z * [new branch] gh/robert-hardwick/3/base -> origin/gh/robert-hardwick/3/base 2025-12-04T09:33:40.2328876Z * [new branch] gh/robert-hardwick/3/head -> origin/gh/robert-hardwick/3/head 2025-12-04T09:33:40.2330170Z * [new branch] gh/robert-hardwick/3/orig -> origin/gh/robert-hardwick/3/orig 2025-12-04T09:33:40.2331838Z * [new branch] gh/robert-hardwick/4/base -> origin/gh/robert-hardwick/4/base 2025-12-04T09:33:40.2333148Z * [new branch] gh/robert-hardwick/4/head -> origin/gh/robert-hardwick/4/head 2025-12-04T09:33:40.2334448Z * [new branch] gh/robert-hardwick/4/orig -> origin/gh/robert-hardwick/4/orig 2025-12-04T09:33:40.2336220Z * [new branch] gh/robert-hardwick/5/base -> origin/gh/robert-hardwick/5/base 2025-12-04T09:33:40.2337459Z * [new branch] gh/robert-hardwick/5/head -> origin/gh/robert-hardwick/5/head 2025-12-04T09:33:40.2338971Z * [new branch] gh/robert-hardwick/5/orig -> origin/gh/robert-hardwick/5/orig 2025-12-04T09:33:40.2340659Z * [new branch] gh/robert-hardwick/6/base -> origin/gh/robert-hardwick/6/base 2025-12-04T09:33:40.2341904Z * [new branch] gh/robert-hardwick/6/head -> origin/gh/robert-hardwick/6/head 2025-12-04T09:33:40.2343199Z * [new branch] gh/robert-hardwick/6/orig -> origin/gh/robert-hardwick/6/orig 2025-12-04T09:33:40.2345313Z * [new branch] gh/robert-hardwick/7/base -> origin/gh/robert-hardwick/7/base 2025-12-04T09:33:40.2346593Z * [new branch] gh/robert-hardwick/7/head -> origin/gh/robert-hardwick/7/head 2025-12-04T09:33:40.2347849Z * [new branch] gh/robert-hardwick/7/orig -> origin/gh/robert-hardwick/7/orig 2025-12-04T09:33:40.2349521Z * [new branch] gh/robert-hardwick/8/base -> origin/gh/robert-hardwick/8/base 2025-12-04T09:33:40.2350768Z * [new branch] gh/robert-hardwick/8/head -> origin/gh/robert-hardwick/8/head 2025-12-04T09:33:40.2352024Z * [new branch] gh/robert-hardwick/8/orig -> origin/gh/robert-hardwick/8/orig 2025-12-04T09:33:40.2353912Z * [new branch] gh/robert-hardwick/9/base -> origin/gh/robert-hardwick/9/base 2025-12-04T09:33:40.2355164Z * [new branch] gh/robert-hardwick/9/head -> origin/gh/robert-hardwick/9/head 2025-12-04T09:33:40.2356469Z * [new branch] gh/robert-hardwick/9/orig -> origin/gh/robert-hardwick/9/orig 2025-12-04T09:33:40.2358459Z * [new branch] gh/rtimpe/1/base -> origin/gh/rtimpe/1/base 2025-12-04T09:33:40.2359742Z * [new branch] gh/rtimpe/1/head -> origin/gh/rtimpe/1/head 2025-12-04T09:33:40.2361354Z * [new branch] gh/rtimpe/2/base -> origin/gh/rtimpe/2/base 2025-12-04T09:33:40.2362566Z * [new branch] gh/rtimpe/2/head -> origin/gh/rtimpe/2/head 2025-12-04T09:33:40.2364343Z * [new branch] gh/rtimpe/22/base -> origin/gh/rtimpe/22/base 2025-12-04T09:33:40.2365586Z * [new branch] gh/rtimpe/22/head -> origin/gh/rtimpe/22/head 2025-12-04T09:33:40.2366851Z * [new branch] gh/rtimpe/22/orig -> origin/gh/rtimpe/22/orig 2025-12-04T09:33:40.2368381Z * [new branch] gh/rtimpe/23/base -> origin/gh/rtimpe/23/base 2025-12-04T09:33:40.2369885Z * [new branch] gh/rtimpe/23/head -> origin/gh/rtimpe/23/head 2025-12-04T09:33:40.2371100Z * [new branch] gh/rtimpe/23/orig -> origin/gh/rtimpe/23/orig 2025-12-04T09:33:40.2372745Z * [new branch] gh/rtimpe/24/base -> origin/gh/rtimpe/24/base 2025-12-04T09:33:40.2373991Z * [new branch] gh/rtimpe/24/head -> origin/gh/rtimpe/24/head 2025-12-04T09:33:40.2375209Z * [new branch] gh/rtimpe/24/orig -> origin/gh/rtimpe/24/orig 2025-12-04T09:33:40.2376839Z * [new branch] gh/rtimpe/25/base -> origin/gh/rtimpe/25/base 2025-12-04T09:33:40.2378126Z * [new branch] gh/rtimpe/25/head -> origin/gh/rtimpe/25/head 2025-12-04T09:33:40.2379536Z * [new branch] gh/rtimpe/25/orig -> origin/gh/rtimpe/25/orig 2025-12-04T09:33:40.2381183Z * [new branch] gh/rtimpe/26/base -> origin/gh/rtimpe/26/base 2025-12-04T09:33:40.2382427Z * [new branch] gh/rtimpe/26/head -> origin/gh/rtimpe/26/head 2025-12-04T09:33:40.2383733Z * [new branch] gh/rtimpe/26/orig -> origin/gh/rtimpe/26/orig 2025-12-04T09:33:40.2385751Z * [new branch] gh/rtimpe/27/base -> origin/gh/rtimpe/27/base 2025-12-04T09:33:40.2387170Z * [new branch] gh/rtimpe/27/head -> origin/gh/rtimpe/27/head 2025-12-04T09:33:40.2391199Z * [new branch] gh/rtimpe/27/orig -> origin/gh/rtimpe/27/orig 2025-12-04T09:33:40.2393004Z * [new branch] gh/rtimpe/28/base -> origin/gh/rtimpe/28/base 2025-12-04T09:33:40.2394243Z * [new branch] gh/rtimpe/28/head -> origin/gh/rtimpe/28/head 2025-12-04T09:33:40.2395559Z * [new branch] gh/rtimpe/28/orig -> origin/gh/rtimpe/28/orig 2025-12-04T09:33:40.2397254Z * [new branch] gh/rtimpe/29/base -> origin/gh/rtimpe/29/base 2025-12-04T09:33:40.2398552Z * [new branch] gh/rtimpe/29/head -> origin/gh/rtimpe/29/head 2025-12-04T09:33:40.2400358Z * [new branch] gh/rtimpe/29/orig -> origin/gh/rtimpe/29/orig 2025-12-04T09:33:40.2401963Z * [new branch] gh/rtimpe/3/base -> origin/gh/rtimpe/3/base 2025-12-04T09:33:40.2403141Z * [new branch] gh/rtimpe/3/head -> origin/gh/rtimpe/3/head 2025-12-04T09:33:40.2404812Z * [new branch] gh/rtimpe/30/base -> origin/gh/rtimpe/30/base 2025-12-04T09:33:40.2406040Z * [new branch] gh/rtimpe/30/head -> origin/gh/rtimpe/30/head 2025-12-04T09:33:40.2407487Z * [new branch] gh/rtimpe/30/orig -> origin/gh/rtimpe/30/orig 2025-12-04T09:33:40.2409119Z * [new branch] gh/rtimpe/31/base -> origin/gh/rtimpe/31/base 2025-12-04T09:33:40.2410357Z * [new branch] gh/rtimpe/31/head -> origin/gh/rtimpe/31/head 2025-12-04T09:33:40.2411734Z * [new branch] gh/rtimpe/31/orig -> origin/gh/rtimpe/31/orig 2025-12-04T09:33:40.2413388Z * [new branch] gh/rtimpe/32/base -> origin/gh/rtimpe/32/base 2025-12-04T09:33:40.2414625Z * [new branch] gh/rtimpe/32/head -> origin/gh/rtimpe/32/head 2025-12-04T09:33:40.2415936Z * [new branch] gh/rtimpe/32/orig -> origin/gh/rtimpe/32/orig 2025-12-04T09:33:40.2417632Z * [new branch] gh/rtimpe/33/base -> origin/gh/rtimpe/33/base 2025-12-04T09:33:40.2419064Z * [new branch] gh/rtimpe/33/head -> origin/gh/rtimpe/33/head 2025-12-04T09:33:40.2420362Z * [new branch] gh/rtimpe/33/orig -> origin/gh/rtimpe/33/orig 2025-12-04T09:33:40.2421950Z * [new branch] gh/rtimpe/34/base -> origin/gh/rtimpe/34/base 2025-12-04T09:33:40.2423195Z * [new branch] gh/rtimpe/34/head -> origin/gh/rtimpe/34/head 2025-12-04T09:33:40.2424667Z * [new branch] gh/rtimpe/34/orig -> origin/gh/rtimpe/34/orig 2025-12-04T09:33:40.2426321Z * [new branch] gh/rtimpe/35/base -> origin/gh/rtimpe/35/base 2025-12-04T09:33:40.2427609Z * [new branch] gh/rtimpe/35/head -> origin/gh/rtimpe/35/head 2025-12-04T09:33:40.2428840Z * [new branch] gh/rtimpe/35/orig -> origin/gh/rtimpe/35/orig 2025-12-04T09:33:40.2430490Z * [new branch] gh/rtimpe/4/base -> origin/gh/rtimpe/4/base 2025-12-04T09:33:40.2431790Z * [new branch] gh/rtimpe/4/head -> origin/gh/rtimpe/4/head 2025-12-04T09:33:40.2433905Z * [new branch] gh/ruisizhang123/1/base -> origin/gh/ruisizhang123/1/base 2025-12-04T09:33:40.2435637Z * [new branch] gh/ruisizhang123/1/head -> origin/gh/ruisizhang123/1/head 2025-12-04T09:33:40.2436952Z * [new branch] gh/ruisizhang123/1/orig -> origin/gh/ruisizhang123/1/orig 2025-12-04T09:33:40.2438621Z * [new branch] gh/ruisizhang123/4/base -> origin/gh/ruisizhang123/4/base 2025-12-04T09:33:40.2439914Z * [new branch] gh/ruisizhang123/4/head -> origin/gh/ruisizhang123/4/head 2025-12-04T09:33:40.2441728Z * [new branch] gh/ruisizhang123/4/orig -> origin/gh/ruisizhang123/4/orig 2025-12-04T09:33:40.2443422Z * [new branch] gh/ruisizhang123/5/base -> origin/gh/ruisizhang123/5/base 2025-12-04T09:33:40.2444666Z * [new branch] gh/ruisizhang123/5/head -> origin/gh/ruisizhang123/5/head 2025-12-04T09:33:40.2445933Z * [new branch] gh/ruisizhang123/5/orig -> origin/gh/ruisizhang123/5/orig 2025-12-04T09:33:40.2447594Z * [new branch] gh/ruisizhang123/6/base -> origin/gh/ruisizhang123/6/base 2025-12-04T09:33:40.2448988Z * [new branch] gh/ruisizhang123/6/head -> origin/gh/ruisizhang123/6/head 2025-12-04T09:33:40.2450246Z * [new branch] gh/ruisizhang123/6/orig -> origin/gh/ruisizhang123/6/orig 2025-12-04T09:33:40.2451939Z * [new branch] gh/ruisizhang123/7/base -> origin/gh/ruisizhang123/7/base 2025-12-04T09:33:40.2453203Z * [new branch] gh/ruisizhang123/7/head -> origin/gh/ruisizhang123/7/head 2025-12-04T09:33:40.2454450Z * [new branch] gh/ruisizhang123/7/orig -> origin/gh/ruisizhang123/7/orig 2025-12-04T09:33:40.2456143Z * [new branch] gh/ruisizhang123/8/base -> origin/gh/ruisizhang123/8/base 2025-12-04T09:33:40.2457567Z * [new branch] gh/ruisizhang123/8/head -> origin/gh/ruisizhang123/8/head 2025-12-04T09:33:40.2459073Z * [new branch] gh/ruisizhang123/8/orig -> origin/gh/ruisizhang123/8/orig 2025-12-04T09:33:40.2460751Z * [new branch] gh/ruisizhang123/9/base -> origin/gh/ruisizhang123/9/base 2025-12-04T09:33:40.2462039Z * [new branch] gh/ruisizhang123/9/head -> origin/gh/ruisizhang123/9/head 2025-12-04T09:33:40.2463288Z * [new branch] gh/ruisizhang123/9/orig -> origin/gh/ruisizhang123/9/orig 2025-12-04T09:33:40.2465373Z * [new branch] gh/seemethere/52/base -> origin/gh/seemethere/52/base 2025-12-04T09:33:40.2466678Z * [new branch] gh/seemethere/52/head -> origin/gh/seemethere/52/head 2025-12-04T09:33:40.2467990Z * [new branch] gh/seemethere/52/orig -> origin/gh/seemethere/52/orig 2025-12-04T09:33:40.2469684Z * [new branch] gh/seemethere/53/base -> origin/gh/seemethere/53/base 2025-12-04T09:33:40.2470905Z * [new branch] gh/seemethere/53/head -> origin/gh/seemethere/53/head 2025-12-04T09:33:40.2472188Z * [new branch] gh/seemethere/53/orig -> origin/gh/seemethere/53/orig 2025-12-04T09:33:40.2473945Z * [new branch] gh/seemethere/54/base -> origin/gh/seemethere/54/base 2025-12-04T09:33:40.2475750Z * [new branch] gh/seemethere/54/head -> origin/gh/seemethere/54/head 2025-12-04T09:33:40.2477096Z * [new branch] gh/seemethere/54/orig -> origin/gh/seemethere/54/orig 2025-12-04T09:33:40.2478595Z * [new branch] gh/seemethere/55/base -> origin/gh/seemethere/55/base 2025-12-04T09:33:40.2479770Z * [new branch] gh/seemethere/55/head -> origin/gh/seemethere/55/head 2025-12-04T09:33:40.2481125Z * [new branch] gh/seemethere/55/orig -> origin/gh/seemethere/55/orig 2025-12-04T09:33:40.2482681Z * [new branch] gh/seemethere/59/base -> origin/gh/seemethere/59/base 2025-12-04T09:33:40.2483946Z * [new branch] gh/seemethere/59/head -> origin/gh/seemethere/59/head 2025-12-04T09:33:40.2485174Z * [new branch] gh/seemethere/59/orig -> origin/gh/seemethere/59/orig 2025-12-04T09:33:40.2486794Z * [new branch] gh/seemethere/62/base -> origin/gh/seemethere/62/base 2025-12-04T09:33:40.2488145Z * [new branch] gh/seemethere/62/head -> origin/gh/seemethere/62/head 2025-12-04T09:33:40.2489538Z * [new branch] gh/seemethere/62/orig -> origin/gh/seemethere/62/orig 2025-12-04T09:33:40.2491192Z * [new branch] gh/seemethere/63/base -> origin/gh/seemethere/63/base 2025-12-04T09:33:40.2492562Z * [new branch] gh/seemethere/63/head -> origin/gh/seemethere/63/head 2025-12-04T09:33:40.2493861Z * [new branch] gh/seemethere/63/orig -> origin/gh/seemethere/63/orig 2025-12-04T09:33:40.2495526Z * [new branch] gh/seemethere/71/base -> origin/gh/seemethere/71/base 2025-12-04T09:33:40.2496767Z * [new branch] gh/seemethere/71/head -> origin/gh/seemethere/71/head 2025-12-04T09:33:40.2498073Z * [new branch] gh/seemethere/71/orig -> origin/gh/seemethere/71/orig 2025-12-04T09:33:40.2499992Z * [new branch] gh/seemethere/72/base -> origin/gh/seemethere/72/base 2025-12-04T09:33:40.2501318Z * [new branch] gh/seemethere/72/head -> origin/gh/seemethere/72/head 2025-12-04T09:33:40.2502549Z * [new branch] gh/seemethere/72/orig -> origin/gh/seemethere/72/orig 2025-12-04T09:33:40.2504219Z * [new branch] gh/seemethere/73/base -> origin/gh/seemethere/73/base 2025-12-04T09:33:40.2505465Z * [new branch] gh/seemethere/73/head -> origin/gh/seemethere/73/head 2025-12-04T09:33:40.2506760Z * [new branch] gh/seemethere/73/orig -> origin/gh/seemethere/73/orig 2025-12-04T09:33:40.2508459Z * [new branch] gh/seemethere/74/base -> origin/gh/seemethere/74/base 2025-12-04T09:33:40.2509835Z * [new branch] gh/seemethere/74/head -> origin/gh/seemethere/74/head 2025-12-04T09:33:40.2511184Z * [new branch] gh/seemethere/74/orig -> origin/gh/seemethere/74/orig 2025-12-04T09:33:40.2512902Z * [new branch] gh/seemethere/75/base -> origin/gh/seemethere/75/base 2025-12-04T09:33:40.2514126Z * [new branch] gh/seemethere/75/head -> origin/gh/seemethere/75/head 2025-12-04T09:33:40.2515431Z * [new branch] gh/seemethere/75/orig -> origin/gh/seemethere/75/orig 2025-12-04T09:33:40.2517093Z * [new branch] gh/seemethere/76/base -> origin/gh/seemethere/76/base 2025-12-04T09:33:40.2518401Z * [new branch] gh/seemethere/76/head -> origin/gh/seemethere/76/head 2025-12-04T09:33:40.2519678Z * [new branch] gh/seemethere/76/orig -> origin/gh/seemethere/76/orig 2025-12-04T09:33:40.2522015Z * [new branch] gh/shunting314/145/base -> origin/gh/shunting314/145/base 2025-12-04T09:33:40.2523429Z * [new branch] gh/shunting314/145/head -> origin/gh/shunting314/145/head 2025-12-04T09:33:40.2524775Z * [new branch] gh/shunting314/145/orig -> origin/gh/shunting314/145/orig 2025-12-04T09:33:40.2526925Z * [new branch] gh/shunting314/176/base -> origin/gh/shunting314/176/base 2025-12-04T09:33:40.2528444Z * [new branch] gh/shunting314/176/head -> origin/gh/shunting314/176/head 2025-12-04T09:33:40.2529706Z * [new branch] gh/shunting314/176/orig -> origin/gh/shunting314/176/orig 2025-12-04T09:33:40.2531535Z * [new branch] gh/shunting314/249/base -> origin/gh/shunting314/249/base 2025-12-04T09:33:40.2532905Z * [new branch] gh/shunting314/249/head -> origin/gh/shunting314/249/head 2025-12-04T09:33:40.2534290Z * [new branch] gh/shunting314/249/orig -> origin/gh/shunting314/249/orig 2025-12-04T09:33:40.2536060Z * [new branch] gh/shunting314/253/base -> origin/gh/shunting314/253/base 2025-12-04T09:33:40.2537304Z * [new branch] gh/shunting314/253/head -> origin/gh/shunting314/253/head 2025-12-04T09:33:40.2538614Z * [new branch] gh/shunting314/253/orig -> origin/gh/shunting314/253/orig 2025-12-04T09:33:40.2540454Z * [new branch] gh/shunting314/256/base -> origin/gh/shunting314/256/base 2025-12-04T09:33:40.2541691Z * [new branch] gh/shunting314/256/head -> origin/gh/shunting314/256/head 2025-12-04T09:33:40.2542979Z * [new branch] gh/shunting314/256/orig -> origin/gh/shunting314/256/orig 2025-12-04T09:33:40.2545132Z * [new branch] gh/shunting314/257/base -> origin/gh/shunting314/257/base 2025-12-04T09:33:40.2546458Z * [new branch] gh/shunting314/257/head -> origin/gh/shunting314/257/head 2025-12-04T09:33:40.2547700Z * [new branch] gh/shunting314/257/orig -> origin/gh/shunting314/257/orig 2025-12-04T09:33:40.2549621Z * [new branch] gh/shunting314/258/base -> origin/gh/shunting314/258/base 2025-12-04T09:33:40.2551301Z * [new branch] gh/shunting314/258/head -> origin/gh/shunting314/258/head 2025-12-04T09:33:40.2552626Z * [new branch] gh/shunting314/258/orig -> origin/gh/shunting314/258/orig 2025-12-04T09:33:40.2554174Z * [new branch] gh/shunting314/259/base -> origin/gh/shunting314/259/base 2025-12-04T09:33:40.2555445Z * [new branch] gh/shunting314/259/head -> origin/gh/shunting314/259/head 2025-12-04T09:33:40.2556751Z * [new branch] gh/shunting314/259/orig -> origin/gh/shunting314/259/orig 2025-12-04T09:33:40.2558468Z * [new branch] gh/shunting314/260/base -> origin/gh/shunting314/260/base 2025-12-04T09:33:40.2559920Z * [new branch] gh/shunting314/260/head -> origin/gh/shunting314/260/head 2025-12-04T09:33:40.2561201Z * [new branch] gh/shunting314/260/orig -> origin/gh/shunting314/260/orig 2025-12-04T09:33:40.2563119Z * [new branch] gh/shunting314/261/base -> origin/gh/shunting314/261/base 2025-12-04T09:33:40.2564529Z * [new branch] gh/shunting314/261/head -> origin/gh/shunting314/261/head 2025-12-04T09:33:40.2565800Z * [new branch] gh/shunting314/261/orig -> origin/gh/shunting314/261/orig 2025-12-04T09:33:40.2567557Z * [new branch] gh/shunting314/262/base -> origin/gh/shunting314/262/base 2025-12-04T09:33:40.2568896Z * [new branch] gh/shunting314/262/head -> origin/gh/shunting314/262/head 2025-12-04T09:33:40.2570187Z * [new branch] gh/shunting314/262/orig -> origin/gh/shunting314/262/orig 2025-12-04T09:33:40.2572011Z * [new branch] gh/shunting314/263/base -> origin/gh/shunting314/263/base 2025-12-04T09:33:40.2573369Z * [new branch] gh/shunting314/263/head -> origin/gh/shunting314/263/head 2025-12-04T09:33:40.2574705Z * [new branch] gh/shunting314/263/orig -> origin/gh/shunting314/263/orig 2025-12-04T09:33:40.2576401Z * [new branch] gh/shunting314/264/base -> origin/gh/shunting314/264/base 2025-12-04T09:33:40.2577839Z * [new branch] gh/shunting314/264/head -> origin/gh/shunting314/264/head 2025-12-04T09:33:40.2579091Z * [new branch] gh/shunting314/264/orig -> origin/gh/shunting314/264/orig 2025-12-04T09:33:40.2581029Z * [new branch] gh/shunting314/265/base -> origin/gh/shunting314/265/base 2025-12-04T09:33:40.2582243Z * [new branch] gh/shunting314/265/head -> origin/gh/shunting314/265/head 2025-12-04T09:33:40.2583553Z * [new branch] gh/shunting314/265/orig -> origin/gh/shunting314/265/orig 2025-12-04T09:33:40.2585237Z * [new branch] gh/shunting314/266/base -> origin/gh/shunting314/266/base 2025-12-04T09:33:40.2586655Z * [new branch] gh/shunting314/266/head -> origin/gh/shunting314/266/head 2025-12-04T09:33:40.2588045Z * [new branch] gh/shunting314/266/orig -> origin/gh/shunting314/266/orig 2025-12-04T09:33:40.2590230Z * [new branch] gh/shunting314/267/base -> origin/gh/shunting314/267/base 2025-12-04T09:33:40.2591647Z * [new branch] gh/shunting314/267/head -> origin/gh/shunting314/267/head 2025-12-04T09:33:40.2592985Z * [new branch] gh/shunting314/267/orig -> origin/gh/shunting314/267/orig 2025-12-04T09:33:40.2595184Z * [new branch] gh/shunting314/268/base -> origin/gh/shunting314/268/base 2025-12-04T09:33:40.2596611Z * [new branch] gh/shunting314/268/head -> origin/gh/shunting314/268/head 2025-12-04T09:33:40.2597866Z * [new branch] gh/shunting314/268/orig -> origin/gh/shunting314/268/orig 2025-12-04T09:33:40.2600233Z * [new branch] gh/shunting314/269/base -> origin/gh/shunting314/269/base 2025-12-04T09:33:40.2601504Z * [new branch] gh/shunting314/269/head -> origin/gh/shunting314/269/head 2025-12-04T09:33:40.2602726Z * [new branch] gh/shunting314/269/orig -> origin/gh/shunting314/269/orig 2025-12-04T09:33:40.2604762Z * [new branch] gh/silverguo/1/base -> origin/gh/silverguo/1/base 2025-12-04T09:33:40.2606052Z * [new branch] gh/silverguo/1/head -> origin/gh/silverguo/1/head 2025-12-04T09:33:40.2607585Z * [new branch] gh/silverguo/2/base -> origin/gh/silverguo/2/base 2025-12-04T09:33:40.2608729Z * [new branch] gh/silverguo/2/head -> origin/gh/silverguo/2/head 2025-12-04T09:33:40.2610241Z * [new branch] gh/silverguo/3/base -> origin/gh/silverguo/3/base 2025-12-04T09:33:40.2611479Z * [new branch] gh/silverguo/3/head -> origin/gh/silverguo/3/head 2025-12-04T09:33:40.2612985Z * [new branch] gh/silverguo/4/base -> origin/gh/silverguo/4/base 2025-12-04T09:33:40.2614296Z * [new branch] gh/silverguo/4/head -> origin/gh/silverguo/4/head 2025-12-04T09:33:40.2616375Z * [new branch] gh/slayton58/39/base -> origin/gh/slayton58/39/base 2025-12-04T09:33:40.2617674Z * [new branch] gh/slayton58/39/head -> origin/gh/slayton58/39/head 2025-12-04T09:33:40.2619085Z * [new branch] gh/slayton58/39/orig -> origin/gh/slayton58/39/orig 2025-12-04T09:33:40.2620809Z * [new branch] gh/slayton58/42/base -> origin/gh/slayton58/42/base 2025-12-04T09:33:40.2622135Z * [new branch] gh/slayton58/42/head -> origin/gh/slayton58/42/head 2025-12-04T09:33:40.2623511Z * [new branch] gh/slayton58/42/orig -> origin/gh/slayton58/42/orig 2025-12-04T09:33:40.2625222Z * [new branch] gh/slayton58/43/base -> origin/gh/slayton58/43/base 2025-12-04T09:33:40.2626517Z * [new branch] gh/slayton58/43/head -> origin/gh/slayton58/43/head 2025-12-04T09:33:40.2627852Z * [new branch] gh/slayton58/43/orig -> origin/gh/slayton58/43/orig 2025-12-04T09:33:40.2630185Z * [new branch] gh/slayton58/44/base -> origin/gh/slayton58/44/base 2025-12-04T09:33:40.2631567Z * [new branch] gh/slayton58/44/head -> origin/gh/slayton58/44/head 2025-12-04T09:33:40.2632832Z * [new branch] gh/slayton58/44/orig -> origin/gh/slayton58/44/orig 2025-12-04T09:33:40.2634456Z * [new branch] gh/slayton58/45/base -> origin/gh/slayton58/45/base 2025-12-04T09:33:40.2635745Z * [new branch] gh/slayton58/45/head -> origin/gh/slayton58/45/head 2025-12-04T09:33:40.2637084Z * [new branch] gh/slayton58/45/orig -> origin/gh/slayton58/45/orig 2025-12-04T09:33:40.2639213Z * [new branch] gh/slayton58/46/base -> origin/gh/slayton58/46/base 2025-12-04T09:33:40.2640607Z * [new branch] gh/slayton58/46/head -> origin/gh/slayton58/46/head 2025-12-04T09:33:40.2641919Z * [new branch] gh/slayton58/46/orig -> origin/gh/slayton58/46/orig 2025-12-04T09:33:40.2643612Z * [new branch] gh/slayton58/6/base -> origin/gh/slayton58/6/base 2025-12-04T09:33:40.2644936Z * [new branch] gh/slayton58/6/head -> origin/gh/slayton58/6/head 2025-12-04T09:33:40.2646497Z * [new branch] gh/slayton58/7/base -> origin/gh/slayton58/7/base 2025-12-04T09:33:40.2647654Z * [new branch] gh/slayton58/7/head -> origin/gh/slayton58/7/head 2025-12-04T09:33:40.2650076Z * [new branch] gh/soulitzer/269/base -> origin/gh/soulitzer/269/base 2025-12-04T09:33:40.2651231Z * [new branch] gh/soulitzer/269/head -> origin/gh/soulitzer/269/head 2025-12-04T09:33:40.2652540Z * [new branch] gh/soulitzer/269/orig -> origin/gh/soulitzer/269/orig 2025-12-04T09:33:40.2654364Z * [new branch] gh/soulitzer/276/base -> origin/gh/soulitzer/276/base 2025-12-04T09:33:40.2655718Z * [new branch] gh/soulitzer/276/head -> origin/gh/soulitzer/276/head 2025-12-04T09:33:40.2656995Z * [new branch] gh/soulitzer/276/orig -> origin/gh/soulitzer/276/orig 2025-12-04T09:33:40.2659197Z * [new branch] gh/soulitzer/287/base -> origin/gh/soulitzer/287/base 2025-12-04T09:33:40.2660444Z * [new branch] gh/soulitzer/287/head -> origin/gh/soulitzer/287/head 2025-12-04T09:33:40.2661751Z * [new branch] gh/soulitzer/287/orig -> origin/gh/soulitzer/287/orig 2025-12-04T09:33:40.2663586Z * [new branch] gh/soulitzer/296/base -> origin/gh/soulitzer/296/base 2025-12-04T09:33:40.2665048Z * [new branch] gh/soulitzer/296/head -> origin/gh/soulitzer/296/head 2025-12-04T09:33:40.2666343Z * [new branch] gh/soulitzer/296/orig -> origin/gh/soulitzer/296/orig 2025-12-04T09:33:40.2668151Z * [new branch] gh/soulitzer/299/base -> origin/gh/soulitzer/299/base 2025-12-04T09:33:40.2669524Z * [new branch] gh/soulitzer/299/head -> origin/gh/soulitzer/299/head 2025-12-04T09:33:40.2670844Z * [new branch] gh/soulitzer/299/orig -> origin/gh/soulitzer/299/orig 2025-12-04T09:33:40.2672564Z * [new branch] gh/soulitzer/300/base -> origin/gh/soulitzer/300/base 2025-12-04T09:33:40.2673860Z * [new branch] gh/soulitzer/300/head -> origin/gh/soulitzer/300/head 2025-12-04T09:33:40.2675121Z * [new branch] gh/soulitzer/300/orig -> origin/gh/soulitzer/300/orig 2025-12-04T09:33:40.2676928Z * [new branch] gh/soulitzer/301/base -> origin/gh/soulitzer/301/base 2025-12-04T09:33:40.2678278Z * [new branch] gh/soulitzer/301/head -> origin/gh/soulitzer/301/head 2025-12-04T09:33:40.2679535Z * [new branch] gh/soulitzer/301/orig -> origin/gh/soulitzer/301/orig 2025-12-04T09:33:40.2681240Z * [new branch] gh/soulitzer/313/base -> origin/gh/soulitzer/313/base 2025-12-04T09:33:40.2682475Z * [new branch] gh/soulitzer/313/head -> origin/gh/soulitzer/313/head 2025-12-04T09:33:40.2683823Z * [new branch] gh/soulitzer/313/orig -> origin/gh/soulitzer/313/orig 2025-12-04T09:33:40.2685594Z * [new branch] gh/soulitzer/319/base -> origin/gh/soulitzer/319/base 2025-12-04T09:33:40.2687082Z * [new branch] gh/soulitzer/319/head -> origin/gh/soulitzer/319/head 2025-12-04T09:33:40.2688547Z * [new branch] gh/soulitzer/319/orig -> origin/gh/soulitzer/319/orig 2025-12-04T09:33:40.2690407Z * [new branch] gh/soulitzer/320/base -> origin/gh/soulitzer/320/base 2025-12-04T09:33:40.2691660Z * [new branch] gh/soulitzer/320/head -> origin/gh/soulitzer/320/head 2025-12-04T09:33:40.2692829Z * [new branch] gh/soulitzer/320/orig -> origin/gh/soulitzer/320/orig 2025-12-04T09:33:40.2694655Z * [new branch] gh/soulitzer/336/base -> origin/gh/soulitzer/336/base 2025-12-04T09:33:40.2695880Z * [new branch] gh/soulitzer/336/head -> origin/gh/soulitzer/336/head 2025-12-04T09:33:40.2697141Z * [new branch] gh/soulitzer/336/orig -> origin/gh/soulitzer/336/orig 2025-12-04T09:33:40.2699021Z * [new branch] gh/soulitzer/347/base -> origin/gh/soulitzer/347/base 2025-12-04T09:33:40.2700192Z * [new branch] gh/soulitzer/347/head -> origin/gh/soulitzer/347/head 2025-12-04T09:33:40.2701479Z * [new branch] gh/soulitzer/347/orig -> origin/gh/soulitzer/347/orig 2025-12-04T09:33:40.2703476Z * [new branch] gh/soulitzer/349/base -> origin/gh/soulitzer/349/base 2025-12-04T09:33:40.2704874Z * [new branch] gh/soulitzer/349/head -> origin/gh/soulitzer/349/head 2025-12-04T09:33:40.2706206Z * [new branch] gh/soulitzer/349/orig -> origin/gh/soulitzer/349/orig 2025-12-04T09:33:40.2707806Z * [new branch] gh/soulitzer/350/base -> origin/gh/soulitzer/350/base 2025-12-04T09:33:40.2708997Z * [new branch] gh/soulitzer/350/head -> origin/gh/soulitzer/350/head 2025-12-04T09:33:40.2710227Z * [new branch] gh/soulitzer/350/orig -> origin/gh/soulitzer/350/orig 2025-12-04T09:33:40.2712026Z * [new branch] gh/soulitzer/351/base -> origin/gh/soulitzer/351/base 2025-12-04T09:33:40.2713326Z * [new branch] gh/soulitzer/351/head -> origin/gh/soulitzer/351/head 2025-12-04T09:33:40.2714581Z * [new branch] gh/soulitzer/351/orig -> origin/gh/soulitzer/351/orig 2025-12-04T09:33:40.2716264Z * [new branch] gh/soulitzer/353/base -> origin/gh/soulitzer/353/base 2025-12-04T09:33:40.2717641Z * [new branch] gh/soulitzer/353/head -> origin/gh/soulitzer/353/head 2025-12-04T09:33:40.2718969Z * [new branch] gh/soulitzer/353/orig -> origin/gh/soulitzer/353/orig 2025-12-04T09:33:40.2721349Z * [new branch] gh/soulitzer/358/base -> origin/gh/soulitzer/358/base 2025-12-04T09:33:40.2722773Z * [new branch] gh/soulitzer/358/head -> origin/gh/soulitzer/358/head 2025-12-04T09:33:40.2724072Z * [new branch] gh/soulitzer/358/orig -> origin/gh/soulitzer/358/orig 2025-12-04T09:33:40.2726919Z * [new branch] gh/soulitzer/359/base -> origin/gh/soulitzer/359/base 2025-12-04T09:33:40.2728236Z * [new branch] gh/soulitzer/359/head -> origin/gh/soulitzer/359/head 2025-12-04T09:33:40.2729544Z * [new branch] gh/soulitzer/359/orig -> origin/gh/soulitzer/359/orig 2025-12-04T09:33:40.2731325Z * [new branch] gh/soulitzer/374/base -> origin/gh/soulitzer/374/base 2025-12-04T09:33:40.2732575Z * [new branch] gh/soulitzer/374/head -> origin/gh/soulitzer/374/head 2025-12-04T09:33:40.2733881Z * [new branch] gh/soulitzer/374/orig -> origin/gh/soulitzer/374/orig 2025-12-04T09:33:40.2735566Z * [new branch] gh/soulitzer/375/base -> origin/gh/soulitzer/375/base 2025-12-04T09:33:40.2736879Z * [new branch] gh/soulitzer/375/head -> origin/gh/soulitzer/375/head 2025-12-04T09:33:40.2738560Z * [new branch] gh/soulitzer/375/orig -> origin/gh/soulitzer/375/orig 2025-12-04T09:33:40.2740478Z * [new branch] gh/soulitzer/380/base -> origin/gh/soulitzer/380/base 2025-12-04T09:33:40.2741805Z * [new branch] gh/soulitzer/380/head -> origin/gh/soulitzer/380/head 2025-12-04T09:33:40.2743076Z * [new branch] gh/soulitzer/380/orig -> origin/gh/soulitzer/380/orig 2025-12-04T09:33:40.2744777Z * [new branch] gh/soulitzer/385/base -> origin/gh/soulitzer/385/base 2025-12-04T09:33:40.2746124Z * [new branch] gh/soulitzer/385/head -> origin/gh/soulitzer/385/head 2025-12-04T09:33:40.2747416Z * [new branch] gh/soulitzer/385/orig -> origin/gh/soulitzer/385/orig 2025-12-04T09:33:40.2749167Z * [new branch] gh/soulitzer/386/base -> origin/gh/soulitzer/386/base 2025-12-04T09:33:40.2750448Z * [new branch] gh/soulitzer/386/head -> origin/gh/soulitzer/386/head 2025-12-04T09:33:40.2751784Z * [new branch] gh/soulitzer/386/orig -> origin/gh/soulitzer/386/orig 2025-12-04T09:33:40.2753462Z * [new branch] gh/soulitzer/387/base -> origin/gh/soulitzer/387/base 2025-12-04T09:33:40.2754722Z * [new branch] gh/soulitzer/387/head -> origin/gh/soulitzer/387/head 2025-12-04T09:33:40.2755983Z * [new branch] gh/soulitzer/387/orig -> origin/gh/soulitzer/387/orig 2025-12-04T09:33:40.2757820Z * [new branch] gh/soulitzer/388/base -> origin/gh/soulitzer/388/base 2025-12-04T09:33:40.2759119Z * [new branch] gh/soulitzer/388/head -> origin/gh/soulitzer/388/head 2025-12-04T09:33:40.2760360Z * [new branch] gh/soulitzer/388/orig -> origin/gh/soulitzer/388/orig 2025-12-04T09:33:40.2762068Z * [new branch] gh/soulitzer/389/base -> origin/gh/soulitzer/389/base 2025-12-04T09:33:40.2763303Z * [new branch] gh/soulitzer/389/head -> origin/gh/soulitzer/389/head 2025-12-04T09:33:40.2764579Z * [new branch] gh/soulitzer/389/orig -> origin/gh/soulitzer/389/orig 2025-12-04T09:33:40.2766300Z * [new branch] gh/soulitzer/390/base -> origin/gh/soulitzer/390/base 2025-12-04T09:33:40.2767503Z * [new branch] gh/soulitzer/390/head -> origin/gh/soulitzer/390/head 2025-12-04T09:33:40.2768813Z * [new branch] gh/soulitzer/390/orig -> origin/gh/soulitzer/390/orig 2025-12-04T09:33:40.2770492Z * [new branch] gh/soulitzer/391/base -> origin/gh/soulitzer/391/base 2025-12-04T09:33:40.2771771Z * [new branch] gh/soulitzer/391/head -> origin/gh/soulitzer/391/head 2025-12-04T09:33:40.2772984Z * [new branch] gh/soulitzer/391/orig -> origin/gh/soulitzer/391/orig 2025-12-04T09:33:40.2774798Z * [new branch] gh/soulitzer/392/base -> origin/gh/soulitzer/392/base 2025-12-04T09:33:40.2776037Z * [new branch] gh/soulitzer/392/head -> origin/gh/soulitzer/392/head 2025-12-04T09:33:40.2777324Z * [new branch] gh/soulitzer/392/orig -> origin/gh/soulitzer/392/orig 2025-12-04T09:33:40.2779499Z * [new branch] gh/swolchok/728/next -> origin/gh/swolchok/728/next 2025-12-04T09:33:40.2781551Z * [new branch] gh/swolchok/819/base -> origin/gh/swolchok/819/base 2025-12-04T09:33:40.2782745Z * [new branch] gh/swolchok/819/head -> origin/gh/swolchok/819/head 2025-12-04T09:33:40.2784070Z * [new branch] gh/swolchok/819/orig -> origin/gh/swolchok/819/orig 2025-12-04T09:33:40.2785688Z * [new branch] gh/swolchok/824/base -> origin/gh/swolchok/824/base 2025-12-04T09:33:40.2787121Z * [new branch] gh/swolchok/824/head -> origin/gh/swolchok/824/head 2025-12-04T09:33:40.2788437Z * [new branch] gh/swolchok/824/orig -> origin/gh/swolchok/824/orig 2025-12-04T09:33:40.2790300Z * [new branch] gh/swolchok/829/base -> origin/gh/swolchok/829/base 2025-12-04T09:33:40.2791466Z * [new branch] gh/swolchok/829/head -> origin/gh/swolchok/829/head 2025-12-04T09:33:40.2793217Z * [new branch] gh/swolchok/829/orig -> origin/gh/swolchok/829/orig 2025-12-04T09:33:40.2794960Z * [new branch] gh/swolchok/839/base -> origin/gh/swolchok/839/base 2025-12-04T09:33:40.2796158Z * [new branch] gh/swolchok/839/head -> origin/gh/swolchok/839/head 2025-12-04T09:33:40.2797445Z * [new branch] gh/swolchok/839/orig -> origin/gh/swolchok/839/orig 2025-12-04T09:33:40.2799106Z * [new branch] gh/swolchok/841/base -> origin/gh/swolchok/841/base 2025-12-04T09:33:40.2800428Z * [new branch] gh/swolchok/841/head -> origin/gh/swolchok/841/head 2025-12-04T09:33:40.2802183Z * [new branch] gh/swolchok/841/orig -> origin/gh/swolchok/841/orig 2025-12-04T09:33:40.2803878Z * [new branch] gh/swolchok/842/base -> origin/gh/swolchok/842/base 2025-12-04T09:33:40.2805156Z * [new branch] gh/swolchok/842/head -> origin/gh/swolchok/842/head 2025-12-04T09:33:40.2806392Z * [new branch] gh/swolchok/842/orig -> origin/gh/swolchok/842/orig 2025-12-04T09:33:40.2808153Z * [new branch] gh/swolchok/845/base -> origin/gh/swolchok/845/base 2025-12-04T09:33:40.2809445Z * [new branch] gh/swolchok/845/head -> origin/gh/swolchok/845/head 2025-12-04T09:33:40.2810797Z * [new branch] gh/swolchok/845/orig -> origin/gh/swolchok/845/orig 2025-12-04T09:33:40.2812517Z * [new branch] gh/swolchok/848/base -> origin/gh/swolchok/848/base 2025-12-04T09:33:40.2813891Z * [new branch] gh/swolchok/848/head -> origin/gh/swolchok/848/head 2025-12-04T09:33:40.2815200Z * [new branch] gh/swolchok/848/orig -> origin/gh/swolchok/848/orig 2025-12-04T09:33:40.2816872Z * [new branch] gh/swolchok/856/base -> origin/gh/swolchok/856/base 2025-12-04T09:33:40.2818201Z * [new branch] gh/swolchok/856/head -> origin/gh/swolchok/856/head 2025-12-04T09:33:40.2819537Z * [new branch] gh/swolchok/856/orig -> origin/gh/swolchok/856/orig 2025-12-04T09:33:40.2821351Z * [new branch] gh/swolchok/860/base -> origin/gh/swolchok/860/base 2025-12-04T09:33:40.2822623Z * [new branch] gh/swolchok/860/head -> origin/gh/swolchok/860/head 2025-12-04T09:33:40.2823856Z * [new branch] gh/swolchok/860/orig -> origin/gh/swolchok/860/orig 2025-12-04T09:33:40.2825906Z * [new branch] gh/swolchok/861/base -> origin/gh/swolchok/861/base 2025-12-04T09:33:40.2827257Z * [new branch] gh/swolchok/861/head -> origin/gh/swolchok/861/head 2025-12-04T09:33:40.2828621Z * [new branch] gh/swolchok/861/orig -> origin/gh/swolchok/861/orig 2025-12-04T09:33:40.2830361Z * [new branch] gh/swolchok/862/base -> origin/gh/swolchok/862/base 2025-12-04T09:33:40.2831585Z * [new branch] gh/swolchok/862/head -> origin/gh/swolchok/862/head 2025-12-04T09:33:40.2832857Z * [new branch] gh/swolchok/862/orig -> origin/gh/swolchok/862/orig 2025-12-04T09:33:40.2834723Z * [new branch] gh/swolchok/863/base -> origin/gh/swolchok/863/base 2025-12-04T09:33:40.2835945Z * [new branch] gh/swolchok/863/head -> origin/gh/swolchok/863/head 2025-12-04T09:33:40.2837354Z * [new branch] gh/swolchok/863/orig -> origin/gh/swolchok/863/orig 2025-12-04T09:33:40.2839676Z * [new branch] gh/swolchok/864/base -> origin/gh/swolchok/864/base 2025-12-04T09:33:40.2840887Z * [new branch] gh/swolchok/864/head -> origin/gh/swolchok/864/head 2025-12-04T09:33:40.2842218Z * [new branch] gh/swolchok/864/orig -> origin/gh/swolchok/864/orig 2025-12-04T09:33:40.2843958Z * [new branch] gh/swolchok/865/base -> origin/gh/swolchok/865/base 2025-12-04T09:33:40.2845424Z * [new branch] gh/swolchok/865/head -> origin/gh/swolchok/865/head 2025-12-04T09:33:40.2846778Z * [new branch] gh/swolchok/865/orig -> origin/gh/swolchok/865/orig 2025-12-04T09:33:40.2849042Z * [new branch] gh/swolchok/866/base -> origin/gh/swolchok/866/base 2025-12-04T09:33:40.2850425Z * [new branch] gh/swolchok/866/head -> origin/gh/swolchok/866/head 2025-12-04T09:33:40.2851684Z * [new branch] gh/swolchok/866/orig -> origin/gh/swolchok/866/orig 2025-12-04T09:33:40.2853351Z * [new branch] gh/swolchok/867/base -> origin/gh/swolchok/867/base 2025-12-04T09:33:40.2854684Z * [new branch] gh/swolchok/867/head -> origin/gh/swolchok/867/head 2025-12-04T09:33:40.2855927Z * [new branch] gh/swolchok/867/orig -> origin/gh/swolchok/867/orig 2025-12-04T09:33:40.2857644Z * [new branch] gh/swolchok/868/base -> origin/gh/swolchok/868/base 2025-12-04T09:33:40.2859023Z * [new branch] gh/swolchok/868/head -> origin/gh/swolchok/868/head 2025-12-04T09:33:40.2860339Z * [new branch] gh/swolchok/868/orig -> origin/gh/swolchok/868/orig 2025-12-04T09:33:40.2862166Z * [new branch] gh/swolchok/869/base -> origin/gh/swolchok/869/base 2025-12-04T09:33:40.2863496Z * [new branch] gh/swolchok/869/head -> origin/gh/swolchok/869/head 2025-12-04T09:33:40.2864823Z * [new branch] gh/swolchok/869/orig -> origin/gh/swolchok/869/orig 2025-12-04T09:33:40.2867105Z * [new branch] gh/swolchok/870/base -> origin/gh/swolchok/870/base 2025-12-04T09:33:40.2868333Z * [new branch] gh/swolchok/870/head -> origin/gh/swolchok/870/head 2025-12-04T09:33:40.2869647Z * [new branch] gh/swolchok/870/orig -> origin/gh/swolchok/870/orig 2025-12-04T09:33:40.2871921Z * [new branch] gh/swolchok/871/base -> origin/gh/swolchok/871/base 2025-12-04T09:33:40.2873254Z * [new branch] gh/swolchok/871/head -> origin/gh/swolchok/871/head 2025-12-04T09:33:40.2874638Z * [new branch] gh/swolchok/871/orig -> origin/gh/swolchok/871/orig 2025-12-04T09:33:40.2876847Z * [new branch] gh/teja-rao/4/base -> origin/gh/teja-rao/4/base 2025-12-04T09:33:40.2878141Z * [new branch] gh/teja-rao/4/head -> origin/gh/teja-rao/4/head 2025-12-04T09:33:40.2879437Z * [new branch] gh/teja-rao/4/orig -> origin/gh/teja-rao/4/orig 2025-12-04T09:33:40.2881532Z * [new branch] gh/tianyu-l/2/base -> origin/gh/tianyu-l/2/base 2025-12-04T09:33:40.2882853Z * [new branch] gh/tianyu-l/2/head -> origin/gh/tianyu-l/2/head 2025-12-04T09:33:40.2884090Z * [new branch] gh/tianyu-l/2/orig -> origin/gh/tianyu-l/2/orig 2025-12-04T09:33:40.2885751Z * [new branch] gh/tianyu-l/3/base -> origin/gh/tianyu-l/3/base 2025-12-04T09:33:40.2887038Z * [new branch] gh/tianyu-l/3/orig -> origin/gh/tianyu-l/3/orig 2025-12-04T09:33:40.2890189Z * [new branch] gh/tianyu-l/4/base -> origin/gh/tianyu-l/4/base 2025-12-04T09:33:40.2891503Z * [new branch] gh/tianyu-l/4/head -> origin/gh/tianyu-l/4/head 2025-12-04T09:33:40.2892745Z * [new branch] gh/tianyu-l/4/orig -> origin/gh/tianyu-l/4/orig 2025-12-04T09:33:40.2895355Z * [new branch] gh/tugsbayasgalan/10/base -> origin/gh/tugsbayasgalan/10/base 2025-12-04T09:33:40.2896655Z * [new branch] gh/tugsbayasgalan/10/head -> origin/gh/tugsbayasgalan/10/head 2025-12-04T09:33:40.2898040Z * [new branch] gh/tugsbayasgalan/10/orig -> origin/gh/tugsbayasgalan/10/orig 2025-12-04T09:33:40.2899847Z * [new branch] gh/tugsbayasgalan/13/base -> origin/gh/tugsbayasgalan/13/base 2025-12-04T09:33:40.2901120Z * [new branch] gh/tugsbayasgalan/13/head -> origin/gh/tugsbayasgalan/13/head 2025-12-04T09:33:40.2902404Z * [new branch] gh/tugsbayasgalan/13/orig -> origin/gh/tugsbayasgalan/13/orig 2025-12-04T09:33:40.2904221Z * [new branch] gh/tugsbayasgalan/17/base -> origin/gh/tugsbayasgalan/17/base 2025-12-04T09:33:40.2905470Z * [new branch] gh/tugsbayasgalan/17/head -> origin/gh/tugsbayasgalan/17/head 2025-12-04T09:33:40.2906766Z * [new branch] gh/tugsbayasgalan/17/orig -> origin/gh/tugsbayasgalan/17/orig 2025-12-04T09:33:40.2908529Z * [new branch] gh/tugsbayasgalan/2/base -> origin/gh/tugsbayasgalan/2/base 2025-12-04T09:33:40.2909887Z * [new branch] gh/tugsbayasgalan/2/head -> origin/gh/tugsbayasgalan/2/head 2025-12-04T09:33:40.2911018Z * [new branch] gh/tugsbayasgalan/2/orig -> origin/gh/tugsbayasgalan/2/orig 2025-12-04T09:33:40.2913066Z * [new branch] gh/tugsbayasgalan/28/base -> origin/gh/tugsbayasgalan/28/base 2025-12-04T09:33:40.2914374Z * [new branch] gh/tugsbayasgalan/28/head -> origin/gh/tugsbayasgalan/28/head 2025-12-04T09:33:40.2915758Z * [new branch] gh/tugsbayasgalan/28/orig -> origin/gh/tugsbayasgalan/28/orig 2025-12-04T09:33:40.2917415Z * [new branch] gh/tugsbayasgalan/32/base -> origin/gh/tugsbayasgalan/32/base 2025-12-04T09:33:40.2919362Z * [new branch] gh/tugsbayasgalan/32/head -> origin/gh/tugsbayasgalan/32/head 2025-12-04T09:33:40.2920636Z * [new branch] gh/tugsbayasgalan/32/orig -> origin/gh/tugsbayasgalan/32/orig 2025-12-04T09:33:40.2922515Z * [new branch] gh/tugsbayasgalan/35/base -> origin/gh/tugsbayasgalan/35/base 2025-12-04T09:33:40.2924276Z * [new branch] gh/tugsbayasgalan/35/head -> origin/gh/tugsbayasgalan/35/head 2025-12-04T09:33:40.2925577Z * [new branch] gh/tugsbayasgalan/35/orig -> origin/gh/tugsbayasgalan/35/orig 2025-12-04T09:33:40.2927279Z * [new branch] gh/tugsbayasgalan/36/base -> origin/gh/tugsbayasgalan/36/base 2025-12-04T09:33:40.2928599Z * [new branch] gh/tugsbayasgalan/36/head -> origin/gh/tugsbayasgalan/36/head 2025-12-04T09:33:40.2929860Z * [new branch] gh/tugsbayasgalan/36/orig -> origin/gh/tugsbayasgalan/36/orig 2025-12-04T09:33:40.2931559Z * [new branch] gh/tugsbayasgalan/37/base -> origin/gh/tugsbayasgalan/37/base 2025-12-04T09:33:40.2932819Z * [new branch] gh/tugsbayasgalan/37/head -> origin/gh/tugsbayasgalan/37/head 2025-12-04T09:33:40.2934224Z * [new branch] gh/tugsbayasgalan/37/orig -> origin/gh/tugsbayasgalan/37/orig 2025-12-04T09:33:40.2935853Z * [new branch] gh/tugsbayasgalan/43/base -> origin/gh/tugsbayasgalan/43/base 2025-12-04T09:33:40.2937105Z * [new branch] gh/tugsbayasgalan/43/head -> origin/gh/tugsbayasgalan/43/head 2025-12-04T09:33:40.2938412Z * [new branch] gh/tugsbayasgalan/43/orig -> origin/gh/tugsbayasgalan/43/orig 2025-12-04T09:33:40.2940000Z * [new branch] gh/tugsbayasgalan/48/base -> origin/gh/tugsbayasgalan/48/base 2025-12-04T09:33:40.2941327Z * [new branch] gh/tugsbayasgalan/48/head -> origin/gh/tugsbayasgalan/48/head 2025-12-04T09:33:40.2942566Z * [new branch] gh/tugsbayasgalan/48/orig -> origin/gh/tugsbayasgalan/48/orig 2025-12-04T09:33:40.2944278Z * [new branch] gh/tugsbayasgalan/51/base -> origin/gh/tugsbayasgalan/51/base 2025-12-04T09:33:40.2945577Z * [new branch] gh/tugsbayasgalan/51/head -> origin/gh/tugsbayasgalan/51/head 2025-12-04T09:33:40.2946751Z * [new branch] gh/tugsbayasgalan/51/orig -> origin/gh/tugsbayasgalan/51/orig 2025-12-04T09:33:40.2948259Z * [new branch] gh/tugsbayasgalan/52/base -> origin/gh/tugsbayasgalan/52/base 2025-12-04T09:33:40.2949549Z * [new branch] gh/tugsbayasgalan/52/head -> origin/gh/tugsbayasgalan/52/head 2025-12-04T09:33:40.2950913Z * [new branch] gh/tugsbayasgalan/52/orig -> origin/gh/tugsbayasgalan/52/orig 2025-12-04T09:33:40.2952662Z * [new branch] gh/tugsbayasgalan/53/base -> origin/gh/tugsbayasgalan/53/base 2025-12-04T09:33:40.2953925Z * [new branch] gh/tugsbayasgalan/53/head -> origin/gh/tugsbayasgalan/53/head 2025-12-04T09:33:40.2955734Z * [new branch] gh/tugsbayasgalan/53/orig -> origin/gh/tugsbayasgalan/53/orig 2025-12-04T09:33:40.2957577Z * [new branch] gh/tugsbayasgalan/55/base -> origin/gh/tugsbayasgalan/55/base 2025-12-04T09:33:40.2959002Z * [new branch] gh/tugsbayasgalan/55/head -> origin/gh/tugsbayasgalan/55/head 2025-12-04T09:33:40.2960778Z * [new branch] gh/tugsbayasgalan/55/orig -> origin/gh/tugsbayasgalan/55/orig 2025-12-04T09:33:40.2962674Z * [new branch] gh/tugsbayasgalan/59/base -> origin/gh/tugsbayasgalan/59/base 2025-12-04T09:33:40.2964049Z * [new branch] gh/tugsbayasgalan/59/head -> origin/gh/tugsbayasgalan/59/head 2025-12-04T09:33:40.2965385Z * [new branch] gh/tugsbayasgalan/59/orig -> origin/gh/tugsbayasgalan/59/orig 2025-12-04T09:33:40.2967014Z * [new branch] gh/tugsbayasgalan/6/base -> origin/gh/tugsbayasgalan/6/base 2025-12-04T09:33:40.2968230Z * [new branch] gh/tugsbayasgalan/6/head -> origin/gh/tugsbayasgalan/6/head 2025-12-04T09:33:40.2969584Z * [new branch] gh/tugsbayasgalan/6/orig -> origin/gh/tugsbayasgalan/6/orig 2025-12-04T09:33:40.2971144Z * [new branch] gh/tugsbayasgalan/60/base -> origin/gh/tugsbayasgalan/60/base 2025-12-04T09:33:40.2972418Z * [new branch] gh/tugsbayasgalan/60/head -> origin/gh/tugsbayasgalan/60/head 2025-12-04T09:33:40.2973670Z * [new branch] gh/tugsbayasgalan/60/orig -> origin/gh/tugsbayasgalan/60/orig 2025-12-04T09:33:40.2975900Z * [new branch] gh/tugsbayasgalan/61/base -> origin/gh/tugsbayasgalan/61/base 2025-12-04T09:33:40.2977089Z * [new branch] gh/tugsbayasgalan/61/head -> origin/gh/tugsbayasgalan/61/head 2025-12-04T09:33:40.2978415Z * [new branch] gh/tugsbayasgalan/61/orig -> origin/gh/tugsbayasgalan/61/orig 2025-12-04T09:33:40.2980347Z * [new branch] gh/tugsbayasgalan/63/base -> origin/gh/tugsbayasgalan/63/base 2025-12-04T09:33:40.2981590Z * [new branch] gh/tugsbayasgalan/63/head -> origin/gh/tugsbayasgalan/63/head 2025-12-04T09:33:40.2982949Z * [new branch] gh/tugsbayasgalan/63/orig -> origin/gh/tugsbayasgalan/63/orig 2025-12-04T09:33:40.2984627Z * [new branch] gh/tugsbayasgalan/67/base -> origin/gh/tugsbayasgalan/67/base 2025-12-04T09:33:40.2985911Z * [new branch] gh/tugsbayasgalan/67/head -> origin/gh/tugsbayasgalan/67/head 2025-12-04T09:33:40.2987228Z * [new branch] gh/tugsbayasgalan/67/orig -> origin/gh/tugsbayasgalan/67/orig 2025-12-04T09:33:40.2989298Z * [new branch] gh/tugsbayasgalan/68/base -> origin/gh/tugsbayasgalan/68/base 2025-12-04T09:33:40.2990607Z * [new branch] gh/tugsbayasgalan/68/head -> origin/gh/tugsbayasgalan/68/head 2025-12-04T09:33:40.2991896Z * [new branch] gh/tugsbayasgalan/68/orig -> origin/gh/tugsbayasgalan/68/orig 2025-12-04T09:33:40.2993632Z * [new branch] gh/tugsbayasgalan/7/base -> origin/gh/tugsbayasgalan/7/base 2025-12-04T09:33:40.2994890Z * [new branch] gh/tugsbayasgalan/7/head -> origin/gh/tugsbayasgalan/7/head 2025-12-04T09:33:40.2996274Z * [new branch] gh/tugsbayasgalan/7/orig -> origin/gh/tugsbayasgalan/7/orig 2025-12-04T09:33:40.2998308Z * [new branch] gh/tugsbayasgalan/70/base -> origin/gh/tugsbayasgalan/70/base 2025-12-04T09:33:40.2999726Z * [new branch] gh/tugsbayasgalan/70/head -> origin/gh/tugsbayasgalan/70/head 2025-12-04T09:33:40.3001047Z * [new branch] gh/tugsbayasgalan/70/orig -> origin/gh/tugsbayasgalan/70/orig 2025-12-04T09:33:40.3003054Z * [new branch] gh/tugsbayasgalan/71/base -> origin/gh/tugsbayasgalan/71/base 2025-12-04T09:33:40.3004475Z * [new branch] gh/tugsbayasgalan/71/head -> origin/gh/tugsbayasgalan/71/head 2025-12-04T09:33:40.3005889Z * [new branch] gh/tugsbayasgalan/71/orig -> origin/gh/tugsbayasgalan/71/orig 2025-12-04T09:33:40.3007853Z * [new branch] gh/tugsbayasgalan/72/base -> origin/gh/tugsbayasgalan/72/base 2025-12-04T09:33:40.3009110Z * [new branch] gh/tugsbayasgalan/72/head -> origin/gh/tugsbayasgalan/72/head 2025-12-04T09:33:40.3010381Z * [new branch] gh/tugsbayasgalan/72/orig -> origin/gh/tugsbayasgalan/72/orig 2025-12-04T09:33:40.3012169Z * [new branch] gh/tugsbayasgalan/73/base -> origin/gh/tugsbayasgalan/73/base 2025-12-04T09:33:40.3013630Z * [new branch] gh/tugsbayasgalan/73/head -> origin/gh/tugsbayasgalan/73/head 2025-12-04T09:33:40.3014903Z * [new branch] gh/tugsbayasgalan/73/orig -> origin/gh/tugsbayasgalan/73/orig 2025-12-04T09:33:40.3016834Z * [new branch] gh/tugsbayasgalan/74/base -> origin/gh/tugsbayasgalan/74/base 2025-12-04T09:33:40.3018211Z * [new branch] gh/tugsbayasgalan/74/head -> origin/gh/tugsbayasgalan/74/head 2025-12-04T09:33:40.3019612Z * [new branch] gh/tugsbayasgalan/74/orig -> origin/gh/tugsbayasgalan/74/orig 2025-12-04T09:33:40.3021396Z * [new branch] gh/tugsbayasgalan/75/base -> origin/gh/tugsbayasgalan/75/base 2025-12-04T09:33:40.3022657Z * [new branch] gh/tugsbayasgalan/75/head -> origin/gh/tugsbayasgalan/75/head 2025-12-04T09:33:40.3024047Z * [new branch] gh/tugsbayasgalan/75/orig -> origin/gh/tugsbayasgalan/75/orig 2025-12-04T09:33:40.3025652Z * [new branch] gh/tugsbayasgalan/76/base -> origin/gh/tugsbayasgalan/76/base 2025-12-04T09:33:40.3026946Z * [new branch] gh/tugsbayasgalan/76/head -> origin/gh/tugsbayasgalan/76/head 2025-12-04T09:33:40.3028220Z * [new branch] gh/tugsbayasgalan/76/orig -> origin/gh/tugsbayasgalan/76/orig 2025-12-04T09:33:40.3030154Z * [new branch] gh/tugsbayasgalan/77/base -> origin/gh/tugsbayasgalan/77/base 2025-12-04T09:33:40.3031404Z * [new branch] gh/tugsbayasgalan/77/head -> origin/gh/tugsbayasgalan/77/head 2025-12-04T09:33:40.3032617Z * [new branch] gh/tugsbayasgalan/77/orig -> origin/gh/tugsbayasgalan/77/orig 2025-12-04T09:33:40.3034403Z * [new branch] gh/tugsbayasgalan/78/base -> origin/gh/tugsbayasgalan/78/base 2025-12-04T09:33:40.3035866Z * [new branch] gh/tugsbayasgalan/78/head -> origin/gh/tugsbayasgalan/78/head 2025-12-04T09:33:40.3037195Z * [new branch] gh/tugsbayasgalan/78/orig -> origin/gh/tugsbayasgalan/78/orig 2025-12-04T09:33:40.3038977Z * [new branch] gh/tugsbayasgalan/79/base -> origin/gh/tugsbayasgalan/79/base 2025-12-04T09:33:40.3040229Z * [new branch] gh/tugsbayasgalan/79/head -> origin/gh/tugsbayasgalan/79/head 2025-12-04T09:33:40.3041635Z * [new branch] gh/tugsbayasgalan/79/orig -> origin/gh/tugsbayasgalan/79/orig 2025-12-04T09:33:40.3043354Z * [new branch] gh/tugsbayasgalan/8/base -> origin/gh/tugsbayasgalan/8/base 2025-12-04T09:33:40.3044575Z * [new branch] gh/tugsbayasgalan/8/head -> origin/gh/tugsbayasgalan/8/head 2025-12-04T09:33:40.3045978Z * [new branch] gh/tugsbayasgalan/8/orig -> origin/gh/tugsbayasgalan/8/orig 2025-12-04T09:33:40.3047506Z * [new branch] gh/tugsbayasgalan/80/base -> origin/gh/tugsbayasgalan/80/base 2025-12-04T09:33:40.3048739Z * [new branch] gh/tugsbayasgalan/80/head -> origin/gh/tugsbayasgalan/80/head 2025-12-04T09:33:40.3050002Z * [new branch] gh/tugsbayasgalan/80/orig -> origin/gh/tugsbayasgalan/80/orig 2025-12-04T09:33:40.3051724Z * [new branch] gh/tugsbayasgalan/81/base -> origin/gh/tugsbayasgalan/81/base 2025-12-04T09:33:40.3052916Z * [new branch] gh/tugsbayasgalan/81/head -> origin/gh/tugsbayasgalan/81/head 2025-12-04T09:33:40.3054175Z * [new branch] gh/tugsbayasgalan/81/orig -> origin/gh/tugsbayasgalan/81/orig 2025-12-04T09:33:40.3056647Z * [new branch] gh/tugsbayasgalan/82/base -> origin/gh/tugsbayasgalan/82/base 2025-12-04T09:33:40.3058040Z * [new branch] gh/tugsbayasgalan/82/head -> origin/gh/tugsbayasgalan/82/head 2025-12-04T09:33:40.3059626Z * [new branch] gh/tugsbayasgalan/82/orig -> origin/gh/tugsbayasgalan/82/orig 2025-12-04T09:33:40.3061230Z * [new branch] gh/tugsbayasgalan/83/base -> origin/gh/tugsbayasgalan/83/base 2025-12-04T09:33:40.3062502Z * [new branch] gh/tugsbayasgalan/83/head -> origin/gh/tugsbayasgalan/83/head 2025-12-04T09:33:40.3063790Z * [new branch] gh/tugsbayasgalan/83/orig -> origin/gh/tugsbayasgalan/83/orig 2025-12-04T09:33:40.3065390Z * [new branch] gh/tugsbayasgalan/84/base -> origin/gh/tugsbayasgalan/84/base 2025-12-04T09:33:40.3066704Z * [new branch] gh/tugsbayasgalan/84/head -> origin/gh/tugsbayasgalan/84/head 2025-12-04T09:33:40.3067940Z * [new branch] gh/tugsbayasgalan/84/orig -> origin/gh/tugsbayasgalan/84/orig 2025-12-04T09:33:40.3069538Z * [new branch] gh/tugsbayasgalan/85/base -> origin/gh/tugsbayasgalan/85/base 2025-12-04T09:33:40.3070779Z * [new branch] gh/tugsbayasgalan/85/head -> origin/gh/tugsbayasgalan/85/head 2025-12-04T09:33:40.3072042Z * [new branch] gh/tugsbayasgalan/85/orig -> origin/gh/tugsbayasgalan/85/orig 2025-12-04T09:33:40.3073796Z * [new branch] gh/tugsbayasgalan/86/base -> origin/gh/tugsbayasgalan/86/base 2025-12-04T09:33:40.3075141Z * [new branch] gh/tugsbayasgalan/86/head -> origin/gh/tugsbayasgalan/86/head 2025-12-04T09:33:40.3076723Z * [new branch] gh/tugsbayasgalan/86/orig -> origin/gh/tugsbayasgalan/86/orig 2025-12-04T09:33:40.3079266Z * [new branch] gh/tugsbayasgalan/87/base -> origin/gh/tugsbayasgalan/87/base 2025-12-04T09:33:40.3081030Z * [new branch] gh/tugsbayasgalan/87/head -> origin/gh/tugsbayasgalan/87/head 2025-12-04T09:33:40.3082322Z * [new branch] gh/tugsbayasgalan/87/orig -> origin/gh/tugsbayasgalan/87/orig 2025-12-04T09:33:40.3084120Z * [new branch] gh/tugsbayasgalan/88/base -> origin/gh/tugsbayasgalan/88/base 2025-12-04T09:33:40.3085350Z * [new branch] gh/tugsbayasgalan/88/head -> origin/gh/tugsbayasgalan/88/head 2025-12-04T09:33:40.3086654Z * [new branch] gh/tugsbayasgalan/88/orig -> origin/gh/tugsbayasgalan/88/orig 2025-12-04T09:33:40.3088625Z * [new branch] gh/tugsbayasgalan/89/base -> origin/gh/tugsbayasgalan/89/base 2025-12-04T09:33:40.3089933Z * [new branch] gh/tugsbayasgalan/89/head -> origin/gh/tugsbayasgalan/89/head 2025-12-04T09:33:40.3091215Z * [new branch] gh/tugsbayasgalan/89/orig -> origin/gh/tugsbayasgalan/89/orig 2025-12-04T09:33:40.3092948Z * [new branch] gh/tugsbayasgalan/9/base -> origin/gh/tugsbayasgalan/9/base 2025-12-04T09:33:40.3094061Z * [new branch] gh/tugsbayasgalan/9/head -> origin/gh/tugsbayasgalan/9/head 2025-12-04T09:33:40.3095448Z * [new branch] gh/tugsbayasgalan/9/orig -> origin/gh/tugsbayasgalan/9/orig 2025-12-04T09:33:40.3098106Z * [new branch] gh/tugsbayasgalan/90/base -> origin/gh/tugsbayasgalan/90/base 2025-12-04T09:33:40.3099425Z * [new branch] gh/tugsbayasgalan/90/head -> origin/gh/tugsbayasgalan/90/head 2025-12-04T09:33:40.3100721Z * [new branch] gh/tugsbayasgalan/90/orig -> origin/gh/tugsbayasgalan/90/orig 2025-12-04T09:33:40.3102687Z * [new branch] gh/tugsbayasgalan/91/base -> origin/gh/tugsbayasgalan/91/base 2025-12-04T09:33:40.3103934Z * [new branch] gh/tugsbayasgalan/91/head -> origin/gh/tugsbayasgalan/91/head 2025-12-04T09:33:40.3105120Z * [new branch] gh/tugsbayasgalan/91/orig -> origin/gh/tugsbayasgalan/91/orig 2025-12-04T09:33:40.3106958Z * [new branch] gh/tugsbayasgalan/92/base -> origin/gh/tugsbayasgalan/92/base 2025-12-04T09:33:40.3108256Z * [new branch] gh/tugsbayasgalan/92/head -> origin/gh/tugsbayasgalan/92/head 2025-12-04T09:33:40.3109570Z * [new branch] gh/tugsbayasgalan/92/orig -> origin/gh/tugsbayasgalan/92/orig 2025-12-04T09:33:40.3111409Z * [new branch] gh/tugsbayasgalan/93/base -> origin/gh/tugsbayasgalan/93/base 2025-12-04T09:33:40.3112794Z * [new branch] gh/tugsbayasgalan/93/head -> origin/gh/tugsbayasgalan/93/head 2025-12-04T09:33:40.3114209Z * [new branch] gh/tugsbayasgalan/93/orig -> origin/gh/tugsbayasgalan/93/orig 2025-12-04T09:33:40.3116311Z * [new branch] gh/v0i0/14/base -> origin/gh/v0i0/14/base 2025-12-04T09:33:40.3117506Z * [new branch] gh/v0i0/14/head -> origin/gh/v0i0/14/head 2025-12-04T09:33:40.3118737Z * [new branch] gh/v0i0/14/orig -> origin/gh/v0i0/14/orig 2025-12-04T09:33:40.3120282Z * [new branch] gh/v0i0/15/base -> origin/gh/v0i0/15/base 2025-12-04T09:33:40.3121676Z * [new branch] gh/v0i0/15/head -> origin/gh/v0i0/15/head 2025-12-04T09:33:40.3122918Z * [new branch] gh/v0i0/15/orig -> origin/gh/v0i0/15/orig 2025-12-04T09:33:40.3124632Z * [new branch] gh/v0i0/16/base -> origin/gh/v0i0/16/base 2025-12-04T09:33:40.3125877Z * [new branch] gh/v0i0/16/head -> origin/gh/v0i0/16/head 2025-12-04T09:33:40.3127149Z * [new branch] gh/v0i0/16/orig -> origin/gh/v0i0/16/orig 2025-12-04T09:33:40.3128759Z * [new branch] gh/v0i0/17/base -> origin/gh/v0i0/17/base 2025-12-04T09:33:40.3130134Z * [new branch] gh/v0i0/17/head -> origin/gh/v0i0/17/head 2025-12-04T09:33:40.3131407Z * [new branch] gh/v0i0/17/orig -> origin/gh/v0i0/17/orig 2025-12-04T09:33:40.3133179Z * [new branch] gh/v0i0/18/base -> origin/gh/v0i0/18/base 2025-12-04T09:33:40.3134485Z * [new branch] gh/v0i0/18/head -> origin/gh/v0i0/18/head 2025-12-04T09:33:40.3135761Z * [new branch] gh/v0i0/18/orig -> origin/gh/v0i0/18/orig 2025-12-04T09:33:40.3137462Z * [new branch] gh/v0i0/19/base -> origin/gh/v0i0/19/base 2025-12-04T09:33:40.3138778Z * [new branch] gh/v0i0/19/head -> origin/gh/v0i0/19/head 2025-12-04T09:33:40.3140111Z * [new branch] gh/v0i0/19/orig -> origin/gh/v0i0/19/orig 2025-12-04T09:33:40.3142225Z * [new branch] gh/vishal9-team/1/base -> origin/gh/vishal9-team/1/base 2025-12-04T09:33:40.3143531Z * [new branch] gh/vishal9-team/1/head -> origin/gh/vishal9-team/1/head 2025-12-04T09:33:40.3145041Z * [new branch] gh/vishal9-team/2/base -> origin/gh/vishal9-team/2/base 2025-12-04T09:33:40.3146420Z * [new branch] gh/vishal9-team/2/head -> origin/gh/vishal9-team/2/head 2025-12-04T09:33:40.3147825Z * [new branch] gh/vishal9-team/2/orig -> origin/gh/vishal9-team/2/orig 2025-12-04T09:33:40.3149614Z * [new branch] gh/vishal9-team/3/base -> origin/gh/vishal9-team/3/base 2025-12-04T09:33:40.3150928Z * [new branch] gh/vishal9-team/3/head -> origin/gh/vishal9-team/3/head 2025-12-04T09:33:40.3152206Z * [new branch] gh/vishal9-team/3/orig -> origin/gh/vishal9-team/3/orig 2025-12-04T09:33:40.3153737Z * [new branch] gh/vishal9-team/4/base -> origin/gh/vishal9-team/4/base 2025-12-04T09:33:40.3155046Z * [new branch] gh/vishal9-team/4/head -> origin/gh/vishal9-team/4/head 2025-12-04T09:33:40.3156293Z * [new branch] gh/vishal9-team/4/orig -> origin/gh/vishal9-team/4/orig 2025-12-04T09:33:40.3158221Z * [new branch] gh/vkuzo/1/next -> origin/gh/vkuzo/1/next 2025-12-04T09:33:40.3160317Z * [new branch] gh/vkuzo/2/next -> origin/gh/vkuzo/2/next 2025-12-04T09:33:40.3162538Z * [new branch] gh/vkuzo/3/next -> origin/gh/vkuzo/3/next 2025-12-04T09:33:40.3164593Z * [new branch] gh/wconstab/424/base -> origin/gh/wconstab/424/base 2025-12-04T09:33:40.3165959Z * [new branch] gh/wconstab/424/head -> origin/gh/wconstab/424/head 2025-12-04T09:33:40.3167278Z * [new branch] gh/wconstab/424/orig -> origin/gh/wconstab/424/orig 2025-12-04T09:33:40.3168987Z * [new branch] gh/wconstab/435/base -> origin/gh/wconstab/435/base 2025-12-04T09:33:40.3170325Z * [new branch] gh/wconstab/435/head -> origin/gh/wconstab/435/head 2025-12-04T09:33:40.3171570Z * [new branch] gh/wconstab/435/orig -> origin/gh/wconstab/435/orig 2025-12-04T09:33:40.3173261Z * [new branch] gh/wconstab/444/base -> origin/gh/wconstab/444/base 2025-12-04T09:33:40.3174634Z * [new branch] gh/wconstab/444/head -> origin/gh/wconstab/444/head 2025-12-04T09:33:40.3175930Z * [new branch] gh/wconstab/444/orig -> origin/gh/wconstab/444/orig 2025-12-04T09:33:40.3177668Z * [new branch] gh/wconstab/447/base -> origin/gh/wconstab/447/base 2025-12-04T09:33:40.3179594Z * [new branch] gh/wconstab/447/head -> origin/gh/wconstab/447/head 2025-12-04T09:33:40.3180439Z * [new branch] gh/wconstab/447/orig -> origin/gh/wconstab/447/orig 2025-12-04T09:33:40.3182258Z * [new branch] gh/wconstab/448/base -> origin/gh/wconstab/448/base 2025-12-04T09:33:40.3183606Z * [new branch] gh/wconstab/448/head -> origin/gh/wconstab/448/head 2025-12-04T09:33:40.3184808Z * [new branch] gh/wconstab/448/orig -> origin/gh/wconstab/448/orig 2025-12-04T09:33:40.3186372Z * [new branch] gh/wconstab/449/base -> origin/gh/wconstab/449/base 2025-12-04T09:33:40.3187639Z * [new branch] gh/wconstab/449/head -> origin/gh/wconstab/449/head 2025-12-04T09:33:40.3189101Z * [new branch] gh/wconstab/449/orig -> origin/gh/wconstab/449/orig 2025-12-04T09:33:40.3190637Z * [new branch] gh/wconstab/450/base -> origin/gh/wconstab/450/base 2025-12-04T09:33:40.3192032Z * [new branch] gh/wconstab/450/head -> origin/gh/wconstab/450/head 2025-12-04T09:33:40.3193312Z * [new branch] gh/wconstab/450/orig -> origin/gh/wconstab/450/orig 2025-12-04T09:33:40.3194804Z * [new branch] gh/wconstab/451/base -> origin/gh/wconstab/451/base 2025-12-04T09:33:40.3196279Z * [new branch] gh/wconstab/451/head -> origin/gh/wconstab/451/head 2025-12-04T09:33:40.3197626Z * [new branch] gh/wconstab/451/orig -> origin/gh/wconstab/451/orig 2025-12-04T09:33:40.3199369Z * [new branch] gh/wconstab/452/base -> origin/gh/wconstab/452/base 2025-12-04T09:33:40.3200609Z * [new branch] gh/wconstab/452/head -> origin/gh/wconstab/452/head 2025-12-04T09:33:40.3202057Z * [new branch] gh/wconstab/452/orig -> origin/gh/wconstab/452/orig 2025-12-04T09:33:40.3203496Z * [new branch] gh/wconstab/453/base -> origin/gh/wconstab/453/base 2025-12-04T09:33:40.3204817Z * [new branch] gh/wconstab/453/head -> origin/gh/wconstab/453/head 2025-12-04T09:33:40.3206168Z * [new branch] gh/wconstab/453/orig -> origin/gh/wconstab/453/orig 2025-12-04T09:33:40.3208265Z * [new branch] gh/wconstab/454/base -> origin/gh/wconstab/454/base 2025-12-04T09:33:40.3209595Z * [new branch] gh/wconstab/454/head -> origin/gh/wconstab/454/head 2025-12-04T09:33:40.3210869Z * [new branch] gh/wconstab/454/orig -> origin/gh/wconstab/454/orig 2025-12-04T09:33:40.3212576Z * [new branch] gh/wconstab/455/base -> origin/gh/wconstab/455/base 2025-12-04T09:33:40.3213981Z * [new branch] gh/wconstab/455/head -> origin/gh/wconstab/455/head 2025-12-04T09:33:40.3215265Z * [new branch] gh/wconstab/455/orig -> origin/gh/wconstab/455/orig 2025-12-04T09:33:40.3217212Z * [new branch] gh/wconstab/456/base -> origin/gh/wconstab/456/base 2025-12-04T09:33:40.3218895Z * [new branch] gh/wconstab/456/head -> origin/gh/wconstab/456/head 2025-12-04T09:33:40.3220297Z * [new branch] gh/wconstab/456/orig -> origin/gh/wconstab/456/orig 2025-12-04T09:33:40.3223421Z * [new branch] gh/wconstab/457/base -> origin/gh/wconstab/457/base 2025-12-04T09:33:40.3224216Z * [new branch] gh/wconstab/457/head -> origin/gh/wconstab/457/head 2025-12-04T09:33:40.3224842Z * [new branch] gh/wconstab/457/orig -> origin/gh/wconstab/457/orig 2025-12-04T09:33:40.3226676Z * [new branch] gh/wconstab/458/base -> origin/gh/wconstab/458/base 2025-12-04T09:33:40.3227982Z * [new branch] gh/wconstab/458/head -> origin/gh/wconstab/458/head 2025-12-04T09:33:40.3229202Z * [new branch] gh/wconstab/458/orig -> origin/gh/wconstab/458/orig 2025-12-04T09:33:40.3231199Z * [new branch] gh/wconstab/459/base -> origin/gh/wconstab/459/base 2025-12-04T09:33:40.3232716Z * [new branch] gh/wconstab/459/head -> origin/gh/wconstab/459/head 2025-12-04T09:33:40.3233888Z * [new branch] gh/wconstab/459/orig -> origin/gh/wconstab/459/orig 2025-12-04T09:33:40.3236188Z * [new branch] gh/wconstab/460/base -> origin/gh/wconstab/460/base 2025-12-04T09:33:40.3237779Z * [new branch] gh/wconstab/460/head -> origin/gh/wconstab/460/head 2025-12-04T09:33:40.3239272Z * [new branch] gh/wconstab/460/orig -> origin/gh/wconstab/460/orig 2025-12-04T09:33:40.3241102Z * [new branch] gh/wconstab/461/base -> origin/gh/wconstab/461/base 2025-12-04T09:33:40.3242375Z * [new branch] gh/wconstab/461/head -> origin/gh/wconstab/461/head 2025-12-04T09:33:40.3243748Z * [new branch] gh/wconstab/461/orig -> origin/gh/wconstab/461/orig 2025-12-04T09:33:40.3245343Z * [new branch] gh/wconstab/462/base -> origin/gh/wconstab/462/base 2025-12-04T09:33:40.3246717Z * [new branch] gh/wconstab/462/head -> origin/gh/wconstab/462/head 2025-12-04T09:33:40.3248065Z * [new branch] gh/wconstab/462/orig -> origin/gh/wconstab/462/orig 2025-12-04T09:33:40.3249906Z * [new branch] gh/wconstab/463/base -> origin/gh/wconstab/463/base 2025-12-04T09:33:40.3251354Z * [new branch] gh/wconstab/463/head -> origin/gh/wconstab/463/head 2025-12-04T09:33:40.3252671Z * [new branch] gh/wconstab/463/orig -> origin/gh/wconstab/463/orig 2025-12-04T09:33:40.3254418Z * [new branch] gh/wconstab/464/base -> origin/gh/wconstab/464/base 2025-12-04T09:33:40.3255853Z * [new branch] gh/wconstab/464/head -> origin/gh/wconstab/464/head 2025-12-04T09:33:40.3257067Z * [new branch] gh/wconstab/464/orig -> origin/gh/wconstab/464/orig 2025-12-04T09:33:40.3258811Z * [new branch] gh/wconstab/465/base -> origin/gh/wconstab/465/base 2025-12-04T09:33:40.3260199Z * [new branch] gh/wconstab/465/head -> origin/gh/wconstab/465/head 2025-12-04T09:33:40.3261400Z * [new branch] gh/wconstab/465/orig -> origin/gh/wconstab/465/orig 2025-12-04T09:33:40.3263222Z * [new branch] gh/wconstab/466/base -> origin/gh/wconstab/466/base 2025-12-04T09:33:40.3264401Z * [new branch] gh/wconstab/466/head -> origin/gh/wconstab/466/head 2025-12-04T09:33:40.3265581Z * [new branch] gh/wconstab/466/orig -> origin/gh/wconstab/466/orig 2025-12-04T09:33:40.3267715Z * [new branch] gh/wconstab/467/base -> origin/gh/wconstab/467/base 2025-12-04T09:33:40.3269192Z * [new branch] gh/wconstab/467/head -> origin/gh/wconstab/467/head 2025-12-04T09:33:40.3270523Z * [new branch] gh/wconstab/467/orig -> origin/gh/wconstab/467/orig 2025-12-04T09:33:40.3272099Z * [new branch] gh/wconstab/468/base -> origin/gh/wconstab/468/base 2025-12-04T09:33:40.3273392Z * [new branch] gh/wconstab/468/head -> origin/gh/wconstab/468/head 2025-12-04T09:33:40.3274621Z * [new branch] gh/wconstab/468/orig -> origin/gh/wconstab/468/orig 2025-12-04T09:33:40.3276773Z * [new branch] gh/weifengpy/39/base -> origin/gh/weifengpy/39/base 2025-12-04T09:33:40.3277991Z * [new branch] gh/weifengpy/39/head -> origin/gh/weifengpy/39/head 2025-12-04T09:33:40.3279407Z * [new branch] gh/weifengpy/39/orig -> origin/gh/weifengpy/39/orig 2025-12-04T09:33:40.3281291Z * [new branch] gh/weifengpy/40/base -> origin/gh/weifengpy/40/base 2025-12-04T09:33:40.3282555Z * [new branch] gh/weifengpy/40/head -> origin/gh/weifengpy/40/head 2025-12-04T09:33:40.3283886Z * [new branch] gh/weifengpy/40/orig -> origin/gh/weifengpy/40/orig 2025-12-04T09:33:40.3285796Z * [new branch] gh/weifengpy/41/base -> origin/gh/weifengpy/41/base 2025-12-04T09:33:40.3287174Z * [new branch] gh/weifengpy/41/head -> origin/gh/weifengpy/41/head 2025-12-04T09:33:40.3291547Z * [new branch] gh/weifengpy/41/orig -> origin/gh/weifengpy/41/orig 2025-12-04T09:33:40.3293827Z * [new branch] gh/williamwen42/250/base -> origin/gh/williamwen42/250/base 2025-12-04T09:33:40.3295162Z * [new branch] gh/williamwen42/250/head -> origin/gh/williamwen42/250/head 2025-12-04T09:33:40.3296433Z * [new branch] gh/williamwen42/250/orig -> origin/gh/williamwen42/250/orig 2025-12-04T09:33:40.3298771Z * [new branch] gh/williamwen42/279/base -> origin/gh/williamwen42/279/base 2025-12-04T09:33:40.3300280Z * [new branch] gh/williamwen42/279/head -> origin/gh/williamwen42/279/head 2025-12-04T09:33:40.3301559Z * [new branch] gh/williamwen42/279/orig -> origin/gh/williamwen42/279/orig 2025-12-04T09:33:40.3303325Z * [new branch] gh/williamwen42/282/base -> origin/gh/williamwen42/282/base 2025-12-04T09:33:40.3304580Z * [new branch] gh/williamwen42/282/head -> origin/gh/williamwen42/282/head 2025-12-04T09:33:40.3305899Z * [new branch] gh/williamwen42/282/orig -> origin/gh/williamwen42/282/orig 2025-12-04T09:33:40.3307746Z * [new branch] gh/williamwen42/287/base -> origin/gh/williamwen42/287/base 2025-12-04T09:33:40.3309047Z * [new branch] gh/williamwen42/287/head -> origin/gh/williamwen42/287/head 2025-12-04T09:33:40.3310377Z * [new branch] gh/williamwen42/287/orig -> origin/gh/williamwen42/287/orig 2025-12-04T09:33:40.3312241Z * [new branch] gh/williamwen42/288/base -> origin/gh/williamwen42/288/base 2025-12-04T09:33:40.3313433Z * [new branch] gh/williamwen42/288/head -> origin/gh/williamwen42/288/head 2025-12-04T09:33:40.3314690Z * [new branch] gh/williamwen42/288/orig -> origin/gh/williamwen42/288/orig 2025-12-04T09:33:40.3316686Z * [new branch] gh/williamwen42/296/base -> origin/gh/williamwen42/296/base 2025-12-04T09:33:40.3318081Z * [new branch] gh/williamwen42/296/head -> origin/gh/williamwen42/296/head 2025-12-04T09:33:40.3319366Z * [new branch] gh/williamwen42/296/orig -> origin/gh/williamwen42/296/orig 2025-12-04T09:33:40.3321014Z * [new branch] gh/williamwen42/297/base -> origin/gh/williamwen42/297/base 2025-12-04T09:33:40.3322340Z * [new branch] gh/williamwen42/297/head -> origin/gh/williamwen42/297/head 2025-12-04T09:33:40.3323702Z * [new branch] gh/williamwen42/297/orig -> origin/gh/williamwen42/297/orig 2025-12-04T09:33:40.3325532Z * [new branch] gh/williamwen42/306/base -> origin/gh/williamwen42/306/base 2025-12-04T09:33:40.3327319Z * [new branch] gh/williamwen42/306/head -> origin/gh/williamwen42/306/head 2025-12-04T09:33:40.3328644Z * [new branch] gh/williamwen42/306/orig -> origin/gh/williamwen42/306/orig 2025-12-04T09:33:40.3330418Z * [new branch] gh/williamwen42/309/base -> origin/gh/williamwen42/309/base 2025-12-04T09:33:40.3331736Z * [new branch] gh/williamwen42/309/head -> origin/gh/williamwen42/309/head 2025-12-04T09:33:40.3333019Z * [new branch] gh/williamwen42/309/orig -> origin/gh/williamwen42/309/orig 2025-12-04T09:33:40.3334692Z * [new branch] gh/williamwen42/310/base -> origin/gh/williamwen42/310/base 2025-12-04T09:33:40.3336009Z * [new branch] gh/williamwen42/310/head -> origin/gh/williamwen42/310/head 2025-12-04T09:33:40.3337375Z * [new branch] gh/williamwen42/310/orig -> origin/gh/williamwen42/310/orig 2025-12-04T09:33:40.3340676Z * [new branch] gh/williamwen42/311/base -> origin/gh/williamwen42/311/base 2025-12-04T09:33:40.3341975Z * [new branch] gh/williamwen42/311/head -> origin/gh/williamwen42/311/head 2025-12-04T09:33:40.3343238Z * [new branch] gh/williamwen42/311/orig -> origin/gh/williamwen42/311/orig 2025-12-04T09:33:40.3344870Z * [new branch] gh/williamwen42/319/base -> origin/gh/williamwen42/319/base 2025-12-04T09:33:40.3346161Z * [new branch] gh/williamwen42/319/head -> origin/gh/williamwen42/319/head 2025-12-04T09:33:40.3347434Z * [new branch] gh/williamwen42/319/orig -> origin/gh/williamwen42/319/orig 2025-12-04T09:33:40.3349221Z * [new branch] gh/williamwen42/325/base -> origin/gh/williamwen42/325/base 2025-12-04T09:33:40.3350610Z * [new branch] gh/williamwen42/325/head -> origin/gh/williamwen42/325/head 2025-12-04T09:33:40.3351887Z * [new branch] gh/williamwen42/325/orig -> origin/gh/williamwen42/325/orig 2025-12-04T09:33:40.3353564Z * [new branch] gh/williamwen42/326/base -> origin/gh/williamwen42/326/base 2025-12-04T09:33:40.3354914Z * [new branch] gh/williamwen42/326/head -> origin/gh/williamwen42/326/head 2025-12-04T09:33:40.3356249Z * [new branch] gh/williamwen42/326/orig -> origin/gh/williamwen42/326/orig 2025-12-04T09:33:40.3358002Z * [new branch] gh/williamwen42/327/base -> origin/gh/williamwen42/327/base 2025-12-04T09:33:40.3359348Z * [new branch] gh/williamwen42/327/head -> origin/gh/williamwen42/327/head 2025-12-04T09:33:40.3360566Z * [new branch] gh/williamwen42/327/orig -> origin/gh/williamwen42/327/orig 2025-12-04T09:33:40.3362469Z * [new branch] gh/williamwen42/328/base -> origin/gh/williamwen42/328/base 2025-12-04T09:33:40.3363974Z * [new branch] gh/williamwen42/328/head -> origin/gh/williamwen42/328/head 2025-12-04T09:33:40.3365204Z * [new branch] gh/williamwen42/328/orig -> origin/gh/williamwen42/328/orig 2025-12-04T09:33:40.3367304Z * [new branch] gh/williamwen42/329/base -> origin/gh/williamwen42/329/base 2025-12-04T09:33:40.3368674Z * [new branch] gh/williamwen42/329/head -> origin/gh/williamwen42/329/head 2025-12-04T09:33:40.3370066Z * [new branch] gh/williamwen42/329/orig -> origin/gh/williamwen42/329/orig 2025-12-04T09:33:40.3371861Z * [new branch] gh/williamwen42/330/base -> origin/gh/williamwen42/330/base 2025-12-04T09:33:40.3373211Z * [new branch] gh/williamwen42/330/head -> origin/gh/williamwen42/330/head 2025-12-04T09:33:40.3374480Z * [new branch] gh/williamwen42/330/orig -> origin/gh/williamwen42/330/orig 2025-12-04T09:33:40.3376159Z * [new branch] gh/williamwen42/331/base -> origin/gh/williamwen42/331/base 2025-12-04T09:33:40.3377458Z * [new branch] gh/williamwen42/331/head -> origin/gh/williamwen42/331/head 2025-12-04T09:33:40.3378837Z * [new branch] gh/williamwen42/331/orig -> origin/gh/williamwen42/331/orig 2025-12-04T09:33:40.3380589Z * [new branch] gh/williamwen42/332/base -> origin/gh/williamwen42/332/base 2025-12-04T09:33:40.3381856Z * [new branch] gh/williamwen42/332/head -> origin/gh/williamwen42/332/head 2025-12-04T09:33:40.3383406Z * [new branch] gh/williamwen42/332/orig -> origin/gh/williamwen42/332/orig 2025-12-04T09:33:40.3385424Z * [new branch] gh/williamwen42/333/base -> origin/gh/williamwen42/333/base 2025-12-04T09:33:40.3386695Z * [new branch] gh/williamwen42/333/head -> origin/gh/williamwen42/333/head 2025-12-04T09:33:40.3388173Z * [new branch] gh/williamwen42/333/orig -> origin/gh/williamwen42/333/orig 2025-12-04T09:33:40.3389921Z * [new branch] gh/williamwen42/334/base -> origin/gh/williamwen42/334/base 2025-12-04T09:33:40.3391190Z * [new branch] gh/williamwen42/334/head -> origin/gh/williamwen42/334/head 2025-12-04T09:33:40.3392503Z * [new branch] gh/williamwen42/334/orig -> origin/gh/williamwen42/334/orig 2025-12-04T09:33:40.3398277Z * [new branch] gh/williamwen42/335/base -> origin/gh/williamwen42/335/base 2025-12-04T09:33:40.3399640Z * [new branch] gh/williamwen42/335/head -> origin/gh/williamwen42/335/head 2025-12-04T09:33:40.3400922Z * [new branch] gh/williamwen42/335/orig -> origin/gh/williamwen42/335/orig 2025-12-04T09:33:40.3402869Z * [new branch] gh/williamwen42/336/base -> origin/gh/williamwen42/336/base 2025-12-04T09:33:40.3404060Z * [new branch] gh/williamwen42/336/head -> origin/gh/williamwen42/336/head 2025-12-04T09:33:40.3405312Z * [new branch] gh/williamwen42/336/orig -> origin/gh/williamwen42/336/orig 2025-12-04T09:33:40.3407042Z * [new branch] gh/williamwen42/337/base -> origin/gh/williamwen42/337/base 2025-12-04T09:33:40.3408336Z * [new branch] gh/williamwen42/337/head -> origin/gh/williamwen42/337/head 2025-12-04T09:33:40.3409627Z * [new branch] gh/williamwen42/337/orig -> origin/gh/williamwen42/337/orig 2025-12-04T09:33:40.3411520Z * [new branch] gh/williamwen42/338/base -> origin/gh/williamwen42/338/base 2025-12-04T09:33:40.3412842Z * [new branch] gh/williamwen42/338/head -> origin/gh/williamwen42/338/head 2025-12-04T09:33:40.3414057Z * [new branch] gh/williamwen42/338/orig -> origin/gh/williamwen42/338/orig 2025-12-04T09:33:40.3415795Z * [new branch] gh/williamwen42/339/base -> origin/gh/williamwen42/339/base 2025-12-04T09:33:40.3417162Z * [new branch] gh/williamwen42/339/head -> origin/gh/williamwen42/339/head 2025-12-04T09:33:40.3418399Z * [new branch] gh/williamwen42/339/orig -> origin/gh/williamwen42/339/orig 2025-12-04T09:33:40.3420432Z * [new branch] gh/williamwen42/340/base -> origin/gh/williamwen42/340/base 2025-12-04T09:33:40.3421624Z * [new branch] gh/williamwen42/340/head -> origin/gh/williamwen42/340/head 2025-12-04T09:33:40.3422851Z * [new branch] gh/williamwen42/340/orig -> origin/gh/williamwen42/340/orig 2025-12-04T09:33:40.3424699Z * [new branch] gh/williamwen42/341/base -> origin/gh/williamwen42/341/base 2025-12-04T09:33:40.3426059Z * [new branch] gh/williamwen42/341/head -> origin/gh/williamwen42/341/head 2025-12-04T09:33:40.3427359Z * [new branch] gh/williamwen42/341/orig -> origin/gh/williamwen42/341/orig 2025-12-04T09:33:40.3429059Z * [new branch] gh/williamwen42/342/base -> origin/gh/williamwen42/342/base 2025-12-04T09:33:40.3430372Z * [new branch] gh/williamwen42/342/head -> origin/gh/williamwen42/342/head 2025-12-04T09:33:40.3438692Z * [new branch] gh/williamwen42/342/orig -> origin/gh/williamwen42/342/orig 2025-12-04T09:33:40.3439125Z * [new branch] gh/williamwen42/343/base -> origin/gh/williamwen42/343/base 2025-12-04T09:33:40.3439588Z * [new branch] gh/williamwen42/343/head -> origin/gh/williamwen42/343/head 2025-12-04T09:33:40.3439869Z * [new branch] gh/williamwen42/343/orig -> origin/gh/williamwen42/343/orig 2025-12-04T09:33:40.3440164Z * [new branch] gh/williamwen42/344/base -> origin/gh/williamwen42/344/base 2025-12-04T09:33:40.3440449Z * [new branch] gh/williamwen42/344/head -> origin/gh/williamwen42/344/head 2025-12-04T09:33:40.3440736Z * [new branch] gh/williamwen42/344/orig -> origin/gh/williamwen42/344/orig 2025-12-04T09:33:40.3442362Z * [new branch] gh/williamwen42/345/base -> origin/gh/williamwen42/345/base 2025-12-04T09:33:40.3443591Z * [new branch] gh/williamwen42/345/head -> origin/gh/williamwen42/345/head 2025-12-04T09:33:40.3444892Z * [new branch] gh/williamwen42/345/orig -> origin/gh/williamwen42/345/orig 2025-12-04T09:33:40.3446614Z * [new branch] gh/williamwen42/346/base -> origin/gh/williamwen42/346/base 2025-12-04T09:33:40.3447964Z * [new branch] gh/williamwen42/346/head -> origin/gh/williamwen42/346/head 2025-12-04T09:33:40.3449279Z * [new branch] gh/williamwen42/346/orig -> origin/gh/williamwen42/346/orig 2025-12-04T09:33:40.3451092Z * [new branch] gh/williamwen42/347/base -> origin/gh/williamwen42/347/base 2025-12-04T09:33:40.3452789Z * [new branch] gh/williamwen42/347/head -> origin/gh/williamwen42/347/head 2025-12-04T09:33:40.3454081Z * [new branch] gh/williamwen42/347/orig -> origin/gh/williamwen42/347/orig 2025-12-04T09:33:40.3455865Z * [new branch] gh/williamwen42/348/base -> origin/gh/williamwen42/348/base 2025-12-04T09:33:40.3457141Z * [new branch] gh/williamwen42/348/head -> origin/gh/williamwen42/348/head 2025-12-04T09:33:40.3458479Z * [new branch] gh/williamwen42/348/orig -> origin/gh/williamwen42/348/orig 2025-12-04T09:33:40.3460614Z * [new branch] gh/williamwen42/349/base -> origin/gh/williamwen42/349/base 2025-12-04T09:33:40.3462373Z * [new branch] gh/williamwen42/349/head -> origin/gh/williamwen42/349/head 2025-12-04T09:33:40.3463667Z * [new branch] gh/williamwen42/349/orig -> origin/gh/williamwen42/349/orig 2025-12-04T09:33:40.3465462Z * [new branch] gh/williamwen42/350/base -> origin/gh/williamwen42/350/base 2025-12-04T09:33:40.3466737Z * [new branch] gh/williamwen42/350/head -> origin/gh/williamwen42/350/head 2025-12-04T09:33:40.3468110Z * [new branch] gh/williamwen42/350/orig -> origin/gh/williamwen42/350/orig 2025-12-04T09:33:40.3469836Z * [new branch] gh/williamwen42/351/base -> origin/gh/williamwen42/351/base 2025-12-04T09:33:40.3471243Z * [new branch] gh/williamwen42/351/head -> origin/gh/williamwen42/351/head 2025-12-04T09:33:40.3472600Z * [new branch] gh/williamwen42/351/orig -> origin/gh/williamwen42/351/orig 2025-12-04T09:33:40.3474426Z * [new branch] gh/williamwen42/352/base -> origin/gh/williamwen42/352/base 2025-12-04T09:33:40.3475736Z * [new branch] gh/williamwen42/352/head -> origin/gh/williamwen42/352/head 2025-12-04T09:33:40.3477694Z * [new branch] gh/williamwen42/352/orig -> origin/gh/williamwen42/352/orig 2025-12-04T09:33:40.3479380Z * [new branch] gh/williamwen42/353/base -> origin/gh/williamwen42/353/base 2025-12-04T09:33:40.3480703Z * [new branch] gh/williamwen42/353/head -> origin/gh/williamwen42/353/head 2025-12-04T09:33:40.3482013Z * [new branch] gh/williamwen42/353/orig -> origin/gh/williamwen42/353/orig 2025-12-04T09:33:40.3483687Z * [new branch] gh/williamwen42/354/base -> origin/gh/williamwen42/354/base 2025-12-04T09:33:40.3485088Z * [new branch] gh/williamwen42/354/head -> origin/gh/williamwen42/354/head 2025-12-04T09:33:40.3486354Z * [new branch] gh/williamwen42/354/orig -> origin/gh/williamwen42/354/orig 2025-12-04T09:33:40.3488220Z * [new branch] gh/williamwen42/355/base -> origin/gh/williamwen42/355/base 2025-12-04T09:33:40.3489586Z * [new branch] gh/williamwen42/355/head -> origin/gh/williamwen42/355/head 2025-12-04T09:33:40.3490871Z * [new branch] gh/williamwen42/355/orig -> origin/gh/williamwen42/355/orig 2025-12-04T09:33:40.3492767Z * [new branch] gh/williamwen42/356/base -> origin/gh/williamwen42/356/base 2025-12-04T09:33:40.3494049Z * [new branch] gh/williamwen42/356/head -> origin/gh/williamwen42/356/head 2025-12-04T09:33:40.3495281Z * [new branch] gh/williamwen42/356/orig -> origin/gh/williamwen42/356/orig 2025-12-04T09:33:40.3497022Z * [new branch] gh/williamwen42/357/base -> origin/gh/williamwen42/357/base 2025-12-04T09:33:40.3498474Z * [new branch] gh/williamwen42/357/head -> origin/gh/williamwen42/357/head 2025-12-04T09:33:40.3499878Z * [new branch] gh/williamwen42/357/orig -> origin/gh/williamwen42/357/orig 2025-12-04T09:33:40.3501632Z * [new branch] gh/williamwen42/358/base -> origin/gh/williamwen42/358/base 2025-12-04T09:33:40.3502881Z * [new branch] gh/williamwen42/358/head -> origin/gh/williamwen42/358/head 2025-12-04T09:33:40.3504287Z * [new branch] gh/williamwen42/358/orig -> origin/gh/williamwen42/358/orig 2025-12-04T09:33:40.3506242Z * [new branch] gh/xmfan/169/base -> origin/gh/xmfan/169/base 2025-12-04T09:33:40.3507537Z * [new branch] gh/xmfan/169/head -> origin/gh/xmfan/169/head 2025-12-04T09:33:40.3509213Z * [new branch] gh/xmfan/170/base -> origin/gh/xmfan/170/base 2025-12-04T09:33:40.3510504Z * [new branch] gh/xmfan/170/head -> origin/gh/xmfan/170/head 2025-12-04T09:33:40.3512094Z * [new branch] gh/xmfan/274/base -> origin/gh/xmfan/274/base 2025-12-04T09:33:40.3513340Z * [new branch] gh/xmfan/274/head -> origin/gh/xmfan/274/head 2025-12-04T09:33:40.3514615Z * [new branch] gh/xmfan/274/orig -> origin/gh/xmfan/274/orig 2025-12-04T09:33:40.3516239Z * [new branch] gh/xmfan/277/base -> origin/gh/xmfan/277/base 2025-12-04T09:33:40.3517471Z * [new branch] gh/xmfan/277/head -> origin/gh/xmfan/277/head 2025-12-04T09:33:40.3518718Z * [new branch] gh/xmfan/277/orig -> origin/gh/xmfan/277/orig 2025-12-04T09:33:40.3520530Z * [new branch] gh/xmfan/301/base -> origin/gh/xmfan/301/base 2025-12-04T09:33:40.3521637Z * [new branch] gh/xmfan/301/head -> origin/gh/xmfan/301/head 2025-12-04T09:33:40.3522855Z * [new branch] gh/xmfan/301/orig -> origin/gh/xmfan/301/orig 2025-12-04T09:33:40.3524621Z * [new branch] gh/xmfan/304/base -> origin/gh/xmfan/304/base 2025-12-04T09:33:40.3525875Z * [new branch] gh/xmfan/304/head -> origin/gh/xmfan/304/head 2025-12-04T09:33:40.3527127Z * [new branch] gh/xmfan/304/orig -> origin/gh/xmfan/304/orig 2025-12-04T09:33:40.3529225Z * [new branch] gh/xmfan/309/base -> origin/gh/xmfan/309/base 2025-12-04T09:33:40.3530476Z * [new branch] gh/xmfan/309/head -> origin/gh/xmfan/309/head 2025-12-04T09:33:40.3531767Z * [new branch] gh/xmfan/309/orig -> origin/gh/xmfan/309/orig 2025-12-04T09:33:40.3533402Z * [new branch] gh/xmfan/310/base -> origin/gh/xmfan/310/base 2025-12-04T09:33:40.3534645Z * [new branch] gh/xmfan/310/head -> origin/gh/xmfan/310/head 2025-12-04T09:33:40.3535920Z * [new branch] gh/xmfan/310/orig -> origin/gh/xmfan/310/orig 2025-12-04T09:33:40.3537629Z * [new branch] gh/xmfan/311/base -> origin/gh/xmfan/311/base 2025-12-04T09:33:40.3538994Z * [new branch] gh/xmfan/311/head -> origin/gh/xmfan/311/head 2025-12-04T09:33:40.3540383Z * [new branch] gh/xmfan/311/orig -> origin/gh/xmfan/311/orig 2025-12-04T09:33:40.3542075Z * [new branch] gh/xmfan/312/base -> origin/gh/xmfan/312/base 2025-12-04T09:33:40.3543338Z * [new branch] gh/xmfan/312/head -> origin/gh/xmfan/312/head 2025-12-04T09:33:40.3544619Z * [new branch] gh/xmfan/312/orig -> origin/gh/xmfan/312/orig 2025-12-04T09:33:40.3546288Z * [new branch] gh/xmfan/313/base -> origin/gh/xmfan/313/base 2025-12-04T09:33:40.3547516Z * [new branch] gh/xmfan/313/head -> origin/gh/xmfan/313/head 2025-12-04T09:33:40.3548757Z * [new branch] gh/xmfan/313/orig -> origin/gh/xmfan/313/orig 2025-12-04T09:33:40.3550743Z * [new branch] gh/xuanzhang816/27/base -> origin/gh/xuanzhang816/27/base 2025-12-04T09:33:40.3552014Z * [new branch] gh/xuanzhang816/27/head -> origin/gh/xuanzhang816/27/head 2025-12-04T09:33:40.3553268Z * [new branch] gh/xuanzhang816/27/orig -> origin/gh/xuanzhang816/27/orig 2025-12-04T09:33:40.3555046Z * [new branch] gh/xuanzhang816/32/base -> origin/gh/xuanzhang816/32/base 2025-12-04T09:33:40.3556303Z * [new branch] gh/xuanzhang816/32/head -> origin/gh/xuanzhang816/32/head 2025-12-04T09:33:40.3557574Z * [new branch] gh/xuanzhang816/32/orig -> origin/gh/xuanzhang816/32/orig 2025-12-04T09:33:40.3559372Z * [new branch] gh/xuanzhang816/33/base -> origin/gh/xuanzhang816/33/base 2025-12-04T09:33:40.3560626Z * [new branch] gh/xuanzhang816/33/head -> origin/gh/xuanzhang816/33/head 2025-12-04T09:33:40.3561906Z * [new branch] gh/xuanzhang816/33/orig -> origin/gh/xuanzhang816/33/orig 2025-12-04T09:33:40.3563955Z * [new branch] gh/xuanzhang816/34/base -> origin/gh/xuanzhang816/34/base 2025-12-04T09:33:40.3565266Z * [new branch] gh/xuanzhang816/34/head -> origin/gh/xuanzhang816/34/head 2025-12-04T09:33:40.3566503Z * [new branch] gh/xuanzhang816/34/orig -> origin/gh/xuanzhang816/34/orig 2025-12-04T09:33:40.3568423Z * [new branch] gh/xuanzhang816/35/base -> origin/gh/xuanzhang816/35/base 2025-12-04T09:33:40.3569707Z * [new branch] gh/xuanzhang816/35/head -> origin/gh/xuanzhang816/35/head 2025-12-04T09:33:40.3571031Z * [new branch] gh/xuanzhang816/35/orig -> origin/gh/xuanzhang816/35/orig 2025-12-04T09:33:40.3573031Z * [new branch] gh/yanbing-j/11/base -> origin/gh/yanbing-j/11/base 2025-12-04T09:33:40.3574252Z * [new branch] gh/yanbing-j/11/head -> origin/gh/yanbing-j/11/head 2025-12-04T09:33:40.3575621Z * [new branch] gh/yanbing-j/11/orig -> origin/gh/yanbing-j/11/orig 2025-12-04T09:33:40.3577299Z * [new branch] gh/yanbing-j/12/base -> origin/gh/yanbing-j/12/base 2025-12-04T09:33:40.3578703Z * [new branch] gh/yanbing-j/12/head -> origin/gh/yanbing-j/12/head 2025-12-04T09:33:40.3579963Z * [new branch] gh/yanbing-j/12/orig -> origin/gh/yanbing-j/12/orig 2025-12-04T09:33:40.3581664Z * [new branch] gh/yanbing-j/13/base -> origin/gh/yanbing-j/13/base 2025-12-04T09:33:40.3582919Z * [new branch] gh/yanbing-j/13/head -> origin/gh/yanbing-j/13/head 2025-12-04T09:33:40.3584189Z * [new branch] gh/yanbing-j/13/orig -> origin/gh/yanbing-j/13/orig 2025-12-04T09:33:40.3586364Z * [new branch] gh/yanbing-j/14/base -> origin/gh/yanbing-j/14/base 2025-12-04T09:33:40.3587635Z * [new branch] gh/yanbing-j/14/head -> origin/gh/yanbing-j/14/head 2025-12-04T09:33:40.3589123Z * [new branch] gh/yanbing-j/14/orig -> origin/gh/yanbing-j/14/orig 2025-12-04T09:33:40.3590738Z * [new branch] gh/yanbing-j/15/base -> origin/gh/yanbing-j/15/base 2025-12-04T09:33:40.3591949Z * [new branch] gh/yanbing-j/15/head -> origin/gh/yanbing-j/15/head 2025-12-04T09:33:40.3593310Z * [new branch] gh/yanbing-j/15/orig -> origin/gh/yanbing-j/15/orig 2025-12-04T09:33:40.3594859Z * [new branch] gh/yanbing-j/18/base -> origin/gh/yanbing-j/18/base 2025-12-04T09:33:40.3596130Z * [new branch] gh/yanbing-j/18/head -> origin/gh/yanbing-j/18/head 2025-12-04T09:33:40.3597356Z * [new branch] gh/yanbing-j/18/orig -> origin/gh/yanbing-j/18/orig 2025-12-04T09:33:40.3598990Z * [new branch] gh/yanbing-j/19/base -> origin/gh/yanbing-j/19/base 2025-12-04T09:33:40.3600699Z * [new branch] gh/yanbing-j/19/head -> origin/gh/yanbing-j/19/head 2025-12-04T09:33:40.3601944Z * [new branch] gh/yanbing-j/19/orig -> origin/gh/yanbing-j/19/orig 2025-12-04T09:33:40.3603682Z * [new branch] gh/yanbing-j/20/base -> origin/gh/yanbing-j/20/base 2025-12-04T09:33:40.3604916Z * [new branch] gh/yanbing-j/20/head -> origin/gh/yanbing-j/20/head 2025-12-04T09:33:40.3606163Z * [new branch] gh/yanbing-j/20/orig -> origin/gh/yanbing-j/20/orig 2025-12-04T09:33:40.3607823Z * [new branch] gh/yanbing-j/21/base -> origin/gh/yanbing-j/21/base 2025-12-04T09:33:40.3609122Z * [new branch] gh/yanbing-j/21/head -> origin/gh/yanbing-j/21/head 2025-12-04T09:33:40.3610929Z * [new branch] gh/yanbing-j/22/base -> origin/gh/yanbing-j/22/base 2025-12-04T09:33:40.3612171Z * [new branch] gh/yanbing-j/22/head -> origin/gh/yanbing-j/22/head 2025-12-04T09:33:40.3613762Z * [new branch] gh/yanbing-j/22/orig -> origin/gh/yanbing-j/22/orig 2025-12-04T09:33:40.3615463Z * [new branch] gh/yanbing-j/23/base -> origin/gh/yanbing-j/23/base 2025-12-04T09:33:40.3616733Z * [new branch] gh/yanbing-j/23/head -> origin/gh/yanbing-j/23/head 2025-12-04T09:33:40.3618051Z * [new branch] gh/yanbing-j/23/orig -> origin/gh/yanbing-j/23/orig 2025-12-04T09:33:40.3619926Z * [new branch] gh/yanbing-j/24/base -> origin/gh/yanbing-j/24/base 2025-12-04T09:33:40.3621179Z * [new branch] gh/yanbing-j/24/head -> origin/gh/yanbing-j/24/head 2025-12-04T09:33:40.3622548Z * [new branch] gh/yanbing-j/24/orig -> origin/gh/yanbing-j/24/orig 2025-12-04T09:33:40.3624147Z * [new branch] gh/yanbing-j/25/base -> origin/gh/yanbing-j/25/base 2025-12-04T09:33:40.3625339Z * [new branch] gh/yanbing-j/25/head -> origin/gh/yanbing-j/25/head 2025-12-04T09:33:40.3626586Z * [new branch] gh/yanbing-j/25/orig -> origin/gh/yanbing-j/25/orig 2025-12-04T09:33:40.3628423Z * [new branch] gh/yanbing-j/26/base -> origin/gh/yanbing-j/26/base 2025-12-04T09:33:40.3629680Z * [new branch] gh/yanbing-j/26/head -> origin/gh/yanbing-j/26/head 2025-12-04T09:33:40.3630936Z * [new branch] gh/yanbing-j/26/orig -> origin/gh/yanbing-j/26/orig 2025-12-04T09:33:40.3633058Z * [new branch] gh/yang-yu-hang/1/base -> origin/gh/yang-yu-hang/1/base 2025-12-04T09:33:40.3634542Z * [new branch] gh/yang-yu-hang/1/head -> origin/gh/yang-yu-hang/1/head 2025-12-04T09:33:40.3636013Z * [new branch] gh/yang-yu-hang/1/orig -> origin/gh/yang-yu-hang/1/orig 2025-12-04T09:33:40.3637668Z * [new branch] gh/yang-yu-hang/2/base -> origin/gh/yang-yu-hang/2/base 2025-12-04T09:33:40.3639236Z * [new branch] gh/yang-yu-hang/2/head -> origin/gh/yang-yu-hang/2/head 2025-12-04T09:33:40.3640786Z * [new branch] gh/yang-yu-hang/2/orig -> origin/gh/yang-yu-hang/2/orig 2025-12-04T09:33:40.3642517Z * [new branch] gh/yang-yu-hang/3/base -> origin/gh/yang-yu-hang/3/base 2025-12-04T09:33:40.3643777Z * [new branch] gh/yang-yu-hang/3/head -> origin/gh/yang-yu-hang/3/head 2025-12-04T09:33:40.3645221Z * [new branch] gh/yang-yu-hang/3/orig -> origin/gh/yang-yu-hang/3/orig 2025-12-04T09:33:40.3647201Z * [new branch] gh/yangw-dev/12/base -> origin/gh/yangw-dev/12/base 2025-12-04T09:33:40.3648487Z * [new branch] gh/yangw-dev/12/head -> origin/gh/yangw-dev/12/head 2025-12-04T09:33:40.3649723Z * [new branch] gh/yangw-dev/12/orig -> origin/gh/yangw-dev/12/orig 2025-12-04T09:33:40.3651417Z * [new branch] gh/yangw-dev/13/base -> origin/gh/yangw-dev/13/base 2025-12-04T09:33:40.3652782Z * [new branch] gh/yangw-dev/13/head -> origin/gh/yangw-dev/13/head 2025-12-04T09:33:40.3654083Z * [new branch] gh/yangw-dev/13/orig -> origin/gh/yangw-dev/13/orig 2025-12-04T09:33:40.3655698Z * [new branch] gh/yangw-dev/14/base -> origin/gh/yangw-dev/14/base 2025-12-04T09:33:40.3656914Z * [new branch] gh/yangw-dev/14/head -> origin/gh/yangw-dev/14/head 2025-12-04T09:33:40.3658235Z * [new branch] gh/yangw-dev/14/orig -> origin/gh/yangw-dev/14/orig 2025-12-04T09:33:40.3660037Z * [new branch] gh/yangw-dev/15/base -> origin/gh/yangw-dev/15/base 2025-12-04T09:33:40.3661383Z * [new branch] gh/yangw-dev/15/head -> origin/gh/yangw-dev/15/head 2025-12-04T09:33:40.3662661Z * [new branch] gh/yangw-dev/15/orig -> origin/gh/yangw-dev/15/orig 2025-12-04T09:33:40.3664278Z * [new branch] gh/yangw-dev/19/base -> origin/gh/yangw-dev/19/base 2025-12-04T09:33:40.3665525Z * [new branch] gh/yangw-dev/19/head -> origin/gh/yangw-dev/19/head 2025-12-04T09:33:40.3666757Z * [new branch] gh/yangw-dev/19/orig -> origin/gh/yangw-dev/19/orig 2025-12-04T09:33:40.3668479Z * [new branch] gh/yangw-dev/26/base -> origin/gh/yangw-dev/26/base 2025-12-04T09:33:40.3669796Z * [new branch] gh/yangw-dev/26/head -> origin/gh/yangw-dev/26/head 2025-12-04T09:33:40.3671030Z * [new branch] gh/yangw-dev/26/orig -> origin/gh/yangw-dev/26/orig 2025-12-04T09:33:40.3672670Z * [new branch] gh/yangw-dev/27/base -> origin/gh/yangw-dev/27/base 2025-12-04T09:33:40.3673989Z * [new branch] gh/yangw-dev/27/head -> origin/gh/yangw-dev/27/head 2025-12-04T09:33:40.3675189Z * [new branch] gh/yangw-dev/27/orig -> origin/gh/yangw-dev/27/orig 2025-12-04T09:33:40.3677347Z * [new branch] gh/ydwu4/292/base -> origin/gh/ydwu4/292/base 2025-12-04T09:33:40.3678553Z * [new branch] gh/ydwu4/292/head -> origin/gh/ydwu4/292/head 2025-12-04T09:33:40.3679784Z * [new branch] gh/ydwu4/292/orig -> origin/gh/ydwu4/292/orig 2025-12-04T09:33:40.3681490Z * [new branch] gh/ydwu4/294/base -> origin/gh/ydwu4/294/base 2025-12-04T09:33:40.3682759Z * [new branch] gh/ydwu4/294/head -> origin/gh/ydwu4/294/head 2025-12-04T09:33:40.3684036Z * [new branch] gh/ydwu4/294/orig -> origin/gh/ydwu4/294/orig 2025-12-04T09:33:40.3685943Z * [new branch] gh/ydwu4/295/base -> origin/gh/ydwu4/295/base 2025-12-04T09:33:40.3687284Z * [new branch] gh/ydwu4/295/head -> origin/gh/ydwu4/295/head 2025-12-04T09:33:40.3689212Z * [new branch] gh/ydwu4/295/orig -> origin/gh/ydwu4/295/orig 2025-12-04T09:33:40.3690852Z * [new branch] gh/ydwu4/296/base -> origin/gh/ydwu4/296/base 2025-12-04T09:33:40.3692047Z * [new branch] gh/ydwu4/296/head -> origin/gh/ydwu4/296/head 2025-12-04T09:33:40.3693760Z * [new branch] gh/ydwu4/296/orig -> origin/gh/ydwu4/296/orig 2025-12-04T09:33:40.3695572Z * [new branch] gh/ydwu4/306/base -> origin/gh/ydwu4/306/base 2025-12-04T09:33:40.3696899Z * [new branch] gh/ydwu4/306/head -> origin/gh/ydwu4/306/head 2025-12-04T09:33:40.3698323Z * [new branch] gh/ydwu4/306/orig -> origin/gh/ydwu4/306/orig 2025-12-04T09:33:40.3700000Z * [new branch] gh/ydwu4/312/base -> origin/gh/ydwu4/312/base 2025-12-04T09:33:40.3701328Z * [new branch] gh/ydwu4/312/head -> origin/gh/ydwu4/312/head 2025-12-04T09:33:40.3702596Z * [new branch] gh/ydwu4/312/orig -> origin/gh/ydwu4/312/orig 2025-12-04T09:33:40.3704240Z * [new branch] gh/ydwu4/322/base -> origin/gh/ydwu4/322/base 2025-12-04T09:33:40.3705451Z * [new branch] gh/ydwu4/322/head -> origin/gh/ydwu4/322/head 2025-12-04T09:33:40.3706684Z * [new branch] gh/ydwu4/322/orig -> origin/gh/ydwu4/322/orig 2025-12-04T09:33:40.3708355Z * [new branch] gh/ydwu4/327/base -> origin/gh/ydwu4/327/base 2025-12-04T09:33:40.3709629Z * [new branch] gh/ydwu4/327/head -> origin/gh/ydwu4/327/head 2025-12-04T09:33:40.3710926Z * [new branch] gh/ydwu4/327/orig -> origin/gh/ydwu4/327/orig 2025-12-04T09:33:40.3712897Z * [new branch] gh/ydwu4/328/base -> origin/gh/ydwu4/328/base 2025-12-04T09:33:40.3714119Z * [new branch] gh/ydwu4/328/head -> origin/gh/ydwu4/328/head 2025-12-04T09:33:40.3715295Z * [new branch] gh/ydwu4/328/orig -> origin/gh/ydwu4/328/orig 2025-12-04T09:33:40.3716833Z * [new branch] gh/ydwu4/329/base -> origin/gh/ydwu4/329/base 2025-12-04T09:33:40.3718093Z * [new branch] gh/ydwu4/329/head -> origin/gh/ydwu4/329/head 2025-12-04T09:33:40.3719317Z * [new branch] gh/ydwu4/329/orig -> origin/gh/ydwu4/329/orig 2025-12-04T09:33:40.3721121Z * [new branch] gh/ydwu4/330/base -> origin/gh/ydwu4/330/base 2025-12-04T09:33:40.3722321Z * [new branch] gh/ydwu4/330/head -> origin/gh/ydwu4/330/head 2025-12-04T09:33:40.3723594Z * [new branch] gh/ydwu4/330/orig -> origin/gh/ydwu4/330/orig 2025-12-04T09:33:40.3725111Z * [new branch] gh/ydwu4/331/base -> origin/gh/ydwu4/331/base 2025-12-04T09:33:40.3726926Z * [new branch] gh/ydwu4/331/head -> origin/gh/ydwu4/331/head 2025-12-04T09:33:40.3728200Z * [new branch] gh/ydwu4/331/orig -> origin/gh/ydwu4/331/orig 2025-12-04T09:33:40.3729731Z * [new branch] gh/ydwu4/332/base -> origin/gh/ydwu4/332/base 2025-12-04T09:33:40.3731023Z * [new branch] gh/ydwu4/332/head -> origin/gh/ydwu4/332/head 2025-12-04T09:33:40.3732709Z * [new branch] gh/ydwu4/332/orig -> origin/gh/ydwu4/332/orig 2025-12-04T09:33:40.3734251Z * [new branch] gh/ydwu4/333/base -> origin/gh/ydwu4/333/base 2025-12-04T09:33:40.3735476Z * [new branch] gh/ydwu4/333/head -> origin/gh/ydwu4/333/head 2025-12-04T09:33:40.3736702Z * [new branch] gh/ydwu4/333/orig -> origin/gh/ydwu4/333/orig 2025-12-04T09:33:40.3738314Z * [new branch] gh/ydwu4/334/base -> origin/gh/ydwu4/334/base 2025-12-04T09:33:40.3739717Z * [new branch] gh/ydwu4/334/head -> origin/gh/ydwu4/334/head 2025-12-04T09:33:40.3740989Z * [new branch] gh/ydwu4/334/orig -> origin/gh/ydwu4/334/orig 2025-12-04T09:33:40.3742498Z * [new branch] gh/ydwu4/335/base -> origin/gh/ydwu4/335/base 2025-12-04T09:33:40.3743750Z * [new branch] gh/ydwu4/335/head -> origin/gh/ydwu4/335/head 2025-12-04T09:33:40.3745015Z * [new branch] gh/ydwu4/335/orig -> origin/gh/ydwu4/335/orig 2025-12-04T09:33:40.3747126Z * [new branch] gh/ydwu4/337/base -> origin/gh/ydwu4/337/base 2025-12-04T09:33:40.3748404Z * [new branch] gh/ydwu4/337/head -> origin/gh/ydwu4/337/head 2025-12-04T09:33:40.3749670Z * [new branch] gh/ydwu4/337/orig -> origin/gh/ydwu4/337/orig 2025-12-04T09:33:40.3751407Z * [new branch] gh/ydwu4/339/base -> origin/gh/ydwu4/339/base 2025-12-04T09:33:40.3752682Z * [new branch] gh/ydwu4/339/head -> origin/gh/ydwu4/339/head 2025-12-04T09:33:40.3753952Z * [new branch] gh/ydwu4/339/orig -> origin/gh/ydwu4/339/orig 2025-12-04T09:33:40.3756093Z * [new branch] gh/yf225/133/base -> origin/gh/yf225/133/base 2025-12-04T09:33:40.3757340Z * [new branch] gh/yf225/133/head -> origin/gh/yf225/133/head 2025-12-04T09:33:40.3759040Z * [new branch] gh/yf225/93/base -> origin/gh/yf225/93/base 2025-12-04T09:33:40.3760263Z * [new branch] gh/yf225/93/head -> origin/gh/yf225/93/head 2025-12-04T09:33:40.3763014Z * [new branch] gh/yifuwang/152/base -> origin/gh/yifuwang/152/base 2025-12-04T09:33:40.3764683Z * [new branch] gh/yifuwang/152/head -> origin/gh/yifuwang/152/head 2025-12-04T09:33:40.3765991Z * [new branch] gh/yifuwang/152/orig -> origin/gh/yifuwang/152/orig 2025-12-04T09:33:40.3767689Z * [new branch] gh/yifuwang/195/base -> origin/gh/yifuwang/195/base 2025-12-04T09:33:40.3768965Z * [new branch] gh/yifuwang/195/head -> origin/gh/yifuwang/195/head 2025-12-04T09:33:40.3770291Z * [new branch] gh/yifuwang/195/orig -> origin/gh/yifuwang/195/orig 2025-12-04T09:33:40.3772311Z * [new branch] gh/yiming0416/1/base -> origin/gh/yiming0416/1/base 2025-12-04T09:33:40.3773607Z * [new branch] gh/yiming0416/1/head -> origin/gh/yiming0416/1/head 2025-12-04T09:33:40.3775119Z * [new branch] gh/yiming0416/2/base -> origin/gh/yiming0416/2/base 2025-12-04T09:33:40.3776292Z * [new branch] gh/yiming0416/2/head -> origin/gh/yiming0416/2/head 2025-12-04T09:33:40.3778527Z * [new branch] gh/yushangdi/1/base -> origin/gh/yushangdi/1/base 2025-12-04T09:33:40.3779971Z * [new branch] gh/yushangdi/1/head -> origin/gh/yushangdi/1/head 2025-12-04T09:33:40.3781602Z * [new branch] gh/yushangdi/10/base -> origin/gh/yushangdi/10/base 2025-12-04T09:33:40.3782868Z * [new branch] gh/yushangdi/10/head -> origin/gh/yushangdi/10/head 2025-12-04T09:33:40.3784115Z * [new branch] gh/yushangdi/10/orig -> origin/gh/yushangdi/10/orig 2025-12-04T09:33:40.3785797Z * [new branch] gh/yushangdi/11/base -> origin/gh/yushangdi/11/base 2025-12-04T09:33:40.3787043Z * [new branch] gh/yushangdi/11/head -> origin/gh/yushangdi/11/head 2025-12-04T09:33:40.3791518Z * [new branch] gh/yushangdi/11/orig -> origin/gh/yushangdi/11/orig 2025-12-04T09:33:40.3793191Z * [new branch] gh/yushangdi/2/base -> origin/gh/yushangdi/2/base 2025-12-04T09:33:40.3794431Z * [new branch] gh/yushangdi/2/head -> origin/gh/yushangdi/2/head 2025-12-04T09:33:40.3796179Z * [new branch] gh/yushangdi/7/base -> origin/gh/yushangdi/7/base 2025-12-04T09:33:40.3797451Z * [new branch] gh/yushangdi/7/head -> origin/gh/yushangdi/7/head 2025-12-04T09:33:40.3798858Z * [new branch] gh/yushangdi/7/orig -> origin/gh/yushangdi/7/orig 2025-12-04T09:33:40.3800925Z * [new branch] gh/yushangdi/8/base -> origin/gh/yushangdi/8/base 2025-12-04T09:33:40.3802370Z * [new branch] gh/yushangdi/8/head -> origin/gh/yushangdi/8/head 2025-12-04T09:33:40.3803654Z * [new branch] gh/yushangdi/8/orig -> origin/gh/yushangdi/8/orig 2025-12-04T09:33:40.3805192Z * [new branch] gh/yushangdi/9/base -> origin/gh/yushangdi/9/base 2025-12-04T09:33:40.3806458Z * [new branch] gh/yushangdi/9/head -> origin/gh/yushangdi/9/head 2025-12-04T09:33:40.3807737Z * [new branch] gh/yushangdi/9/orig -> origin/gh/yushangdi/9/orig 2025-12-04T09:33:40.3809989Z * [new branch] gh/zklaus/19/base -> origin/gh/zklaus/19/base 2025-12-04T09:33:40.3811278Z * [new branch] gh/zklaus/19/head -> origin/gh/zklaus/19/head 2025-12-04T09:33:40.3812548Z * [new branch] gh/zklaus/19/orig -> origin/gh/zklaus/19/orig 2025-12-04T09:33:40.3814271Z * [new branch] gh/zklaus/20/base -> origin/gh/zklaus/20/base 2025-12-04T09:33:40.3815650Z * [new branch] gh/zklaus/20/head -> origin/gh/zklaus/20/head 2025-12-04T09:33:40.3816930Z * [new branch] gh/zklaus/20/orig -> origin/gh/zklaus/20/orig 2025-12-04T09:33:40.3818710Z * [new branch] gh/zklaus/21/base -> origin/gh/zklaus/21/base 2025-12-04T09:33:40.3820279Z * [new branch] gh/zklaus/21/head -> origin/gh/zklaus/21/head 2025-12-04T09:33:40.3821542Z * [new branch] gh/zklaus/21/orig -> origin/gh/zklaus/21/orig 2025-12-04T09:33:40.3823217Z * [new branch] gh/zklaus/22/base -> origin/gh/zklaus/22/base 2025-12-04T09:33:40.3824488Z * [new branch] gh/zklaus/22/head -> origin/gh/zklaus/22/head 2025-12-04T09:33:40.3825764Z * [new branch] gh/zklaus/22/orig -> origin/gh/zklaus/22/orig 2025-12-04T09:33:40.3827488Z * [new branch] gh/zklaus/23/base -> origin/gh/zklaus/23/base 2025-12-04T09:33:40.3828743Z * [new branch] gh/zklaus/23/head -> origin/gh/zklaus/23/head 2025-12-04T09:33:40.3830024Z * [new branch] gh/zklaus/23/orig -> origin/gh/zklaus/23/orig 2025-12-04T09:33:40.3831600Z * [new branch] gh/zklaus/24/base -> origin/gh/zklaus/24/base 2025-12-04T09:33:40.3832915Z * [new branch] gh/zklaus/24/head -> origin/gh/zklaus/24/head 2025-12-04T09:33:40.3834167Z * [new branch] gh/zklaus/24/orig -> origin/gh/zklaus/24/orig 2025-12-04T09:33:40.3836468Z * [new branch] gh/zou3519/1197/base -> origin/gh/zou3519/1197/base 2025-12-04T09:33:40.3837598Z * [new branch] gh/zou3519/1197/head -> origin/gh/zou3519/1197/head 2025-12-04T09:33:40.3838834Z * [new branch] gh/zou3519/1197/orig -> origin/gh/zou3519/1197/orig 2025-12-04T09:33:40.3840950Z * [new branch] gh/zou3519/1199/base -> origin/gh/zou3519/1199/base 2025-12-04T09:33:40.3842233Z * [new branch] gh/zou3519/1199/head -> origin/gh/zou3519/1199/head 2025-12-04T09:33:40.3843784Z * [new branch] gh/zou3519/1199/orig -> origin/gh/zou3519/1199/orig 2025-12-04T09:33:40.3845460Z * [new branch] gh/zou3519/1200/base -> origin/gh/zou3519/1200/base 2025-12-04T09:33:40.3846712Z * [new branch] gh/zou3519/1200/head -> origin/gh/zou3519/1200/head 2025-12-04T09:33:40.3848030Z * [new branch] gh/zou3519/1200/orig -> origin/gh/zou3519/1200/orig 2025-12-04T09:33:40.3849884Z * [new branch] gh/zou3519/1201/base -> origin/gh/zou3519/1201/base 2025-12-04T09:33:40.3851094Z * [new branch] gh/zou3519/1201/head -> origin/gh/zou3519/1201/head 2025-12-04T09:33:40.3852343Z * [new branch] gh/zou3519/1201/orig -> origin/gh/zou3519/1201/orig 2025-12-04T09:33:40.3854336Z * [new branch] gh/zou3519/1202/base -> origin/gh/zou3519/1202/base 2025-12-04T09:33:40.3855577Z * [new branch] gh/zou3519/1202/head -> origin/gh/zou3519/1202/head 2025-12-04T09:33:40.3856837Z * [new branch] gh/zou3519/1202/orig -> origin/gh/zou3519/1202/orig 2025-12-04T09:33:40.3859027Z * [new branch] gh/zpcore/1/base -> origin/gh/zpcore/1/base 2025-12-04T09:33:40.3860294Z * [new branch] gh/zpcore/1/head -> origin/gh/zpcore/1/head 2025-12-04T09:33:40.3862040Z * [new branch] gh/zpcore/11/base -> origin/gh/zpcore/11/base 2025-12-04T09:33:40.3863334Z * [new branch] gh/zpcore/11/head -> origin/gh/zpcore/11/head 2025-12-04T09:33:40.3864652Z * [new branch] gh/zpcore/11/orig -> origin/gh/zpcore/11/orig 2025-12-04T09:33:40.3866884Z * [new branch] gh/zpcore/12/base -> origin/gh/zpcore/12/base 2025-12-04T09:33:40.3868152Z * [new branch] gh/zpcore/12/head -> origin/gh/zpcore/12/head 2025-12-04T09:33:40.3869451Z * [new branch] gh/zpcore/12/orig -> origin/gh/zpcore/12/orig 2025-12-04T09:33:40.3871253Z * [new branch] gh/zpcore/13/base -> origin/gh/zpcore/13/base 2025-12-04T09:33:40.3872471Z * [new branch] gh/zpcore/13/head -> origin/gh/zpcore/13/head 2025-12-04T09:33:40.3873680Z * [new branch] gh/zpcore/13/orig -> origin/gh/zpcore/13/orig 2025-12-04T09:33:40.3875401Z * [new branch] gh/zpcore/14/base -> origin/gh/zpcore/14/base 2025-12-04T09:33:40.3876682Z * [new branch] gh/zpcore/14/head -> origin/gh/zpcore/14/head 2025-12-04T09:33:40.3877912Z * [new branch] gh/zpcore/14/orig -> origin/gh/zpcore/14/orig 2025-12-04T09:33:40.3879857Z * [new branch] gh/zpcore/15/base -> origin/gh/zpcore/15/base 2025-12-04T09:33:40.3881181Z * [new branch] gh/zpcore/15/head -> origin/gh/zpcore/15/head 2025-12-04T09:33:40.3882467Z * [new branch] gh/zpcore/15/orig -> origin/gh/zpcore/15/orig 2025-12-04T09:33:40.3884249Z * [new branch] gh/zpcore/2/base -> origin/gh/zpcore/2/base 2025-12-04T09:33:40.3885518Z * [new branch] gh/zpcore/2/head -> origin/gh/zpcore/2/head 2025-12-04T09:33:40.3887887Z * [new branch] gh/zpcore/21/base -> origin/gh/zpcore/21/base 2025-12-04T09:33:40.3889454Z * [new branch] gh/zpcore/21/head -> origin/gh/zpcore/21/head 2025-12-04T09:33:40.3890727Z * [new branch] gh/zpcore/21/orig -> origin/gh/zpcore/21/orig 2025-12-04T09:33:40.3892589Z * [new branch] gh/zpcore/22/base -> origin/gh/zpcore/22/base 2025-12-04T09:33:40.3893829Z * [new branch] gh/zpcore/22/head -> origin/gh/zpcore/22/head 2025-12-04T09:33:40.3895120Z * [new branch] gh/zpcore/22/orig -> origin/gh/zpcore/22/orig 2025-12-04T09:33:40.3897005Z * [new branch] gh/zpcore/23/base -> origin/gh/zpcore/23/base 2025-12-04T09:33:40.3898413Z * [new branch] gh/zpcore/23/head -> origin/gh/zpcore/23/head 2025-12-04T09:33:40.3899726Z * [new branch] gh/zpcore/23/orig -> origin/gh/zpcore/23/orig 2025-12-04T09:33:40.3901255Z * [new branch] gh/zpcore/24/base -> origin/gh/zpcore/24/base 2025-12-04T09:33:40.3902673Z * [new branch] gh/zpcore/24/head -> origin/gh/zpcore/24/head 2025-12-04T09:33:40.3903957Z * [new branch] gh/zpcore/24/orig -> origin/gh/zpcore/24/orig 2025-12-04T09:33:40.3905832Z * [new branch] gh/zpcore/25/base -> origin/gh/zpcore/25/base 2025-12-04T09:33:40.3907589Z * [new branch] gh/zpcore/25/head -> origin/gh/zpcore/25/head 2025-12-04T09:33:40.3908928Z * [new branch] gh/zpcore/25/orig -> origin/gh/zpcore/25/orig 2025-12-04T09:33:40.3910669Z * [new branch] gh/zpcore/26/base -> origin/gh/zpcore/26/base 2025-12-04T09:33:40.3912050Z * [new branch] gh/zpcore/26/head -> origin/gh/zpcore/26/head 2025-12-04T09:33:40.3913311Z * [new branch] gh/zpcore/26/orig -> origin/gh/zpcore/26/orig 2025-12-04T09:33:40.3915100Z * [new branch] gh/zpcore/27/base -> origin/gh/zpcore/27/base 2025-12-04T09:33:40.3916331Z * [new branch] gh/zpcore/27/head -> origin/gh/zpcore/27/head 2025-12-04T09:33:40.3917604Z * [new branch] gh/zpcore/27/orig -> origin/gh/zpcore/27/orig 2025-12-04T09:33:40.3919988Z * [new branch] gh/zpcore/28/base -> origin/gh/zpcore/28/base 2025-12-04T09:33:40.3921849Z * [new branch] gh/zpcore/28/head -> origin/gh/zpcore/28/head 2025-12-04T09:33:40.3923142Z * [new branch] gh/zpcore/28/orig -> origin/gh/zpcore/28/orig 2025-12-04T09:33:40.3924645Z * [new branch] gh/zpcore/3/base -> origin/gh/zpcore/3/base 2025-12-04T09:33:40.3925821Z * [new branch] gh/zpcore/3/head -> origin/gh/zpcore/3/head 2025-12-04T09:33:40.3927373Z * [new branch] gh/zpcore/4/base -> origin/gh/zpcore/4/base 2025-12-04T09:33:40.3928571Z * [new branch] gh/zpcore/4/head -> origin/gh/zpcore/4/head 2025-12-04T09:33:40.3930098Z * [new branch] gh/zpcore/5/base -> origin/gh/zpcore/5/base 2025-12-04T09:33:40.3931310Z * [new branch] gh/zpcore/5/head -> origin/gh/zpcore/5/head 2025-12-04T09:33:40.3932827Z * [new branch] gh/zpcore/6/base -> origin/gh/zpcore/6/base 2025-12-04T09:33:40.3934007Z * [new branch] gh/zpcore/6/head -> origin/gh/zpcore/6/head 2025-12-04T09:33:40.3935973Z * [new branch] gh/zpcore/7/base -> origin/gh/zpcore/7/base 2025-12-04T09:33:40.3937278Z * [new branch] gh/zpcore/7/head -> origin/gh/zpcore/7/head 2025-12-04T09:33:40.3938984Z * [new branch] gh/zpcore/8/base -> origin/gh/zpcore/8/base 2025-12-04T09:33:40.3940240Z * [new branch] gh/zpcore/8/head -> origin/gh/zpcore/8/head 2025-12-04T09:33:40.3941641Z * [new branch] google-main -> origin/google-main 2025-12-04T09:33:40.3943958Z * [new branch] guangyey/external_stream -> origin/guangyey/external_stream 2025-12-04T09:33:40.3945110Z * [new branch] guangyey/test_2025 -> origin/guangyey/test_2025 2025-12-04T09:33:40.3947092Z * [new branch] guilhermeleobas/cherry-pick-55d87d9dfd9 -> origin/guilhermeleobas/cherry-pick-55d87d9dfd9 2025-12-04T09:33:40.3949169Z * [new branch] hameerabbasi/complex_tensor_subclass -> origin/hameerabbasi/complex_tensor_subclass 2025-12-04T09:33:40.3950614Z * [new branch] hameerabbasi/fix-ctensor-gradcheck-tests -> origin/hameerabbasi/fix-ctensor-gradcheck-tests 2025-12-04T09:33:40.3951650Z * [new branch] hameerabbasi/gradcheck-allclose -> origin/hameerabbasi/gradcheck-allclose 2025-12-04T09:33:40.3952886Z * [new branch] hc_baseline -> origin/hc_baseline 2025-12-04T09:33:40.3954254Z * [new branch] hhh_rand -> origin/hhh_rand 2025-12-04T09:33:40.3955957Z * [new branch] huba/f1 -> origin/huba/f1 2025-12-04T09:33:40.3957949Z * [new branch] increase-timeout-linux-jammy-cuda12_8-py3_10-gcc11-test -> origin/increase-timeout-linux-jammy-cuda12_8-py3_10-gcc11-test 2025-12-04T09:33:40.3958750Z * [new branch] inlining -> origin/inlining 2025-12-04T09:33:40.3961009Z * [new branch] inlining-ezyang -> origin/inlining-ezyang 2025-12-04T09:33:40.3962469Z * [new branch] install-torchao-0.13.0 -> origin/install-torchao-0.13.0 2025-12-04T09:33:40.3964093Z * [new branch] instrument-trunk-pull-linux-with-job-test-filters -> origin/instrument-trunk-pull-linux-with-job-test-filters 2025-12-04T09:33:40.3964973Z * [new branch] invoke-subgraph -> origin/invoke-subgraph 2025-12-04T09:33:40.3966385Z * [new branch] issue#58739 -> origin/issue#58739 2025-12-04T09:33:40.3967828Z * [new branch] jainapurva-patch-1 -> origin/jainapurva-patch-1 2025-12-04T09:33:40.3969448Z * [new branch] jathu/o3 -> origin/jathu/o3 2025-12-04T09:33:40.3970628Z * [new branch] jathu/sve -> origin/jathu/sve 2025-12-04T09:33:40.3972625Z * [new branch] jcaip/test-cusparselt-version-0.6.2 -> origin/jcaip/test-cusparselt-version-0.6.2 2025-12-04T09:33:40.3973877Z * [new branch] jcaip/update-cusparselt-0.6.2 -> origin/jcaip/update-cusparselt-0.6.2 2025-12-04T09:33:40.3975640Z * [new branch] jiannanWang/memorysnapshot_filter -> origin/jiannanWang/memorysnapshot_filter 2025-12-04T09:33:40.3976896Z * [new branch] jiannanWang/profilerstepwarning -> origin/jiannanWang/profilerstepwarning 2025-12-04T09:33:40.3978347Z * [new branch] jithunnair-amd-patch-1 -> origin/jithunnair-amd-patch-1 2025-12-04T09:33:40.3979836Z * [new branch] jithunnair-amd-patch-10 -> origin/jithunnair-amd-patch-10 2025-12-04T09:33:40.3981166Z * [new branch] jithunnair-amd-patch-2 -> origin/jithunnair-amd-patch-2 2025-12-04T09:33:40.3982519Z * [new branch] jithunnair-amd-patch-3 -> origin/jithunnair-amd-patch-3 2025-12-04T09:33:40.3983925Z * [new branch] jithunnair-amd-patch-4 -> origin/jithunnair-amd-patch-4 2025-12-04T09:33:40.3985231Z * [new branch] jithunnair-amd-patch-5 -> origin/jithunnair-amd-patch-5 2025-12-04T09:33:40.3986562Z * [new branch] jithunnair-amd-patch-6 -> origin/jithunnair-amd-patch-6 2025-12-04T09:33:40.3987995Z * [new branch] jithunnair-amd-patch-7 -> origin/jithunnair-amd-patch-7 2025-12-04T09:33:40.3989452Z * [new branch] jithunnair-amd-patch-8 -> origin/jithunnair-amd-patch-8 2025-12-04T09:33:40.3990767Z * [new branch] jithunnair-amd-patch-9 -> origin/jithunnair-amd-patch-9 2025-12-04T09:33:40.3992587Z * [new branch] justinchu/native-qdq -> origin/justinchu/native-qdq 2025-12-04T09:33:40.3994469Z * [new branch] kainan666/xlf_debug -> origin/kainan666/xlf_debug 2025-12-04T09:33:40.3995630Z * [new branch] kainan_test -> origin/kainan_test 2025-12-04T09:33:40.3996961Z * [new branch] larryliu0820-patch-1 -> origin/larryliu0820-patch-1 2025-12-04T09:33:40.3998767Z * [new branch] leslie/test_group_gemm_epilogues -> origin/leslie/test_group_gemm_epilogues 2025-12-04T09:33:40.4000440Z * [new branch] lessw2020/fix_cutlass_cache_error -> origin/lessw2020/fix_cutlass_cache_error 2025-12-04T09:33:40.4002033Z * [new branch] liaoxuan/shm_all_reduce -> origin/liaoxuan/shm_all_reduce 2025-12-04T09:33:40.4003340Z * [new branch] liaoxuan/test_fa_disable_softmax -> origin/liaoxuan/test_fa_disable_softmax 2025-12-04T09:33:40.4004551Z * [new branch] liaoxuan/test_int8_sdpa -> origin/liaoxuan/test_int8_sdpa 2025-12-04T09:33:40.4005796Z * [new branch] llama4-stable -> origin/llama4-stable 2025-12-04T09:33:40.4007857Z * [new branch] lts/release/1.8 -> origin/lts/release/1.8 2025-12-04T09:33:40.4009683Z * [new branch] lucaskabela/#94773 -> origin/lucaskabela/#94773 2025-12-04T09:33:40.4010958Z * [new branch] lucaskabela/fix_164876 -> origin/lucaskabela/fix_164876 2025-12-04T09:33:40.4012148Z * [new branch] lucaskabela/flop_counter -> origin/lucaskabela/flop_counter 2025-12-04T09:33:40.4013398Z * [new branch] lucaskabela/func_under_decomp -> origin/lucaskabela/func_under_decomp 2025-12-04T09:33:40.4014616Z * [new branch] lucaskabela/functional_in_dynamo -> origin/lucaskabela/functional_in_dynamo 2025-12-04T09:33:40.4015924Z * [new branch] lucaskabela/install_params_as_graph_attr -> origin/lucaskabela/install_params_as_graph_attr 2025-12-04T09:33:40.4017432Z * [new branch] lucaskabela/parameters_as_graph_attr -> origin/lucaskabela/parameters_as_graph_attr 2025-12-04T09:33:40.4019366Z * [new branch] lucaskabela/remove_aot_dispatcher_metadata -> origin/lucaskabela/remove_aot_dispatcher_metadata 2025-12-04T09:33:40.4020508Z * [new branch] lucaskabela/rnn_decomp -> origin/lucaskabela/rnn_decomp 2025-12-04T09:33:40.4021785Z * [new branch] lucaskabela/typing_backends -> origin/lucaskabela/typing_backends 2025-12-04T09:33:40.4023083Z * [new branch] lucaskabela/typing_ctx_manager -> origin/lucaskabela/typing_ctx_manager 2025-12-04T09:33:40.4024333Z * [new branch] lucaskabela/typing_nn_module -> origin/lucaskabela/typing_nn_module 2025-12-04T09:33:40.4025579Z * [new branch] lucaskabela/typing_user_defined -> origin/lucaskabela/typing_user_defined 2025-12-04T09:33:40.4026820Z * [new branch] lucaskabela/typing_variables -> origin/lucaskabela/typing_variables 2025-12-04T09:33:40.4028143Z * [new branch] lucaskabela/typing_variables_dicts -> origin/lucaskabela/typing_variables_dicts 2025-12-04T09:33:40.4029936Z * [new branch] lucaskabela/typing_variables_functions -> origin/lucaskabela/typing_variables_functions 2025-12-04T09:33:40.4031226Z * [new branch] lucaskabela/typing_variables_lists -> origin/lucaskabela/typing_variables_lists 2025-12-04T09:33:40.4032921Z * [new branch] lw/torch_box_by_ref -> origin/lw/torch_box_by_ref 2025-12-04T09:33:40.4034238Z * [new branch] main -> origin/main 2025-12-04T09:33:40.4035750Z * [new branch] malfet-patch-1 -> origin/malfet-patch-1 2025-12-04T09:33:40.4037199Z * [new branch] malfet-patch-2 -> origin/malfet-patch-2 2025-12-04T09:33:40.4038623Z * [new branch] malfet-patch-3 -> origin/malfet-patch-3 2025-12-04T09:33:40.4040128Z * [new branch] malfet-patch-4 -> origin/malfet-patch-4 2025-12-04T09:33:40.4041440Z * [new branch] malfet-patch-5 -> origin/malfet-patch-5 2025-12-04T09:33:40.4042786Z * [new branch] malfet-patch-6 -> origin/malfet-patch-6 2025-12-04T09:33:40.4044230Z * [new branch] malfet-patch-7 -> origin/malfet-patch-7 2025-12-04T09:33:40.4045690Z * [new branch] malfet-patch-8 -> origin/malfet-patch-8 2025-12-04T09:33:40.4047912Z * [new branch] malfet/add-3.14-ci -> origin/malfet/add-3.14-ci 2025-12-04T09:33:40.4049491Z * [new branch] malfet/be-do-not-make-typos-in-build-artifacts -> origin/malfet/be-do-not-make-typos-in-build-artifacts 2025-12-04T09:33:40.4050677Z * [new branch] malfet/be-move-more-settings-to-checkout-pytorch -> origin/malfet/be-move-more-settings-to-checkout-pytorch 2025-12-04T09:33:40.4052314Z * [new branch] malfet/be-remove-misisng-neon-headers -> origin/malfet/be-remove-misisng-neon-headers 2025-12-04T09:33:40.4053704Z * [new branch] malfet/mps-implement-col2im -> origin/malfet/mps-implement-col2im 2025-12-04T09:33:40.4055529Z * [new branch] manuel/aoti_metal_shimify-thread_safe -> origin/manuel/aoti_metal_shimify-thread_safe 2025-12-04T09:33:40.4056571Z * [new branch] manuel/inductor_link_openmp -> origin/manuel/inductor_link_openmp 2025-12-04T09:33:40.4058408Z * [new branch] masnesral/metaconda -> origin/masnesral/metaconda 2025-12-04T09:33:40.4059885Z * [new branch] mem_profiler_flaky_fix -> origin/mem_profiler_flaky_fix 2025-12-04T09:33:40.4061226Z * [new branch] mem_profiler_stack_trace -> origin/mem_profiler_stack_trace 2025-12-04T09:33:40.4062526Z * [new branch] memory_profiler_stack -> origin/memory_profiler_stack 2025-12-04T09:33:40.4063887Z * [new branch] metascroy-patch-1 -> origin/metascroy-patch-1 2025-12-04T09:33:40.4065163Z * [new branch] mingw_posix -> origin/mingw_posix 2025-12-04T09:33:40.4066956Z * [new branch] mlazos/S429861-debug -> origin/mlazos/S429861-debug 2025-12-04T09:33:40.4068119Z * [new branch] mlazos/aa -> origin/mlazos/aa 2025-12-04T09:33:40.4069344Z * [new branch] mlazos/acts -> origin/mlazos/acts 2025-12-04T09:33:40.4070918Z * [new branch] mlazos/arg-renames -> origin/mlazos/arg-renames 2025-12-04T09:33:40.4072129Z * [new branch] mlazos/bad-cudagraphs -> origin/mlazos/bad-cudagraphs 2025-12-04T09:33:40.4073901Z * [new branch] mlazos/baseline-graph-breaks -> origin/mlazos/baseline-graph-breaks 2025-12-04T09:33:40.4075060Z * [new branch] mlazos/beta-tensor -> origin/mlazos/beta-tensor 2025-12-04T09:33:40.4076282Z * [new branch] mlazos/buffers -> origin/mlazos/buffers 2025-12-04T09:33:40.4077373Z * [new branch] mlazos/buffers2 -> origin/mlazos/buffers2 2025-12-04T09:33:40.4078843Z * [new branch] mlazos/buffers3 -> origin/mlazos/buffers3 2025-12-04T09:33:40.4080316Z * [new branch] mlazos/bwd -> origin/mlazos/bwd 2025-12-04T09:33:40.4081452Z * [new branch] mlazos/combo-test -> origin/mlazos/combo-test 2025-12-04T09:33:40.4082744Z * [new branch] mlazos/ctx-cleanup -> origin/mlazos/ctx-cleanup 2025-12-04T09:33:40.4083981Z * [new branch] mlazos/cuda-cmd-log -> origin/mlazos/cuda-cmd-log 2025-12-04T09:33:40.4085460Z * [new branch] mlazos/cudagraph-tests -> origin/mlazos/cudagraph-tests 2025-12-04T09:33:40.4086826Z * [new branch] mlazos/cudagraphs-measurement -> origin/mlazos/cudagraphs-measurement 2025-12-04T09:33:40.4088224Z * [new branch] mlazos/cutlass-test -> origin/mlazos/cutlass-test 2025-12-04T09:33:40.4089748Z * [new branch] mlazos/cutlass-topo-bug -> origin/mlazos/cutlass-topo-bug 2025-12-04T09:33:40.4090922Z * [new branch] mlazos/dataclass-proxy -> origin/mlazos/dataclass-proxy 2025-12-04T09:33:40.4092232Z * [new branch] mlazos/dc-attrs -> origin/mlazos/dc-attrs 2025-12-04T09:33:40.4093511Z * [new branch] mlazos/dc-helion -> origin/mlazos/dc-helion 2025-12-04T09:33:40.4094790Z * [new branch] mlazos/dict-fix -> origin/mlazos/dict-fix 2025-12-04T09:33:40.4096086Z * [new branch] mlazos/disable-tf -> origin/mlazos/disable-tf 2025-12-04T09:33:40.4097351Z * [new branch] mlazos/dupe-fix -> origin/mlazos/dupe-fix 2025-12-04T09:33:40.4098765Z * [new branch] mlazos/dyn-batch -> origin/mlazos/dyn-batch 2025-12-04T09:33:40.4100016Z * [new branch] mlazos/evt -> origin/mlazos/evt 2025-12-04T09:33:40.4101343Z * [new branch] mlazos/extract-examples -> origin/mlazos/extract-examples 2025-12-04T09:33:40.4102553Z * [new branch] mlazos/foreach-op -> origin/mlazos/foreach-op 2025-12-04T09:33:40.4103794Z * [new branch] mlazos/fp8 -> origin/mlazos/fp8 2025-12-04T09:33:40.4105080Z * [new branch] mlazos/fp8-bias -> origin/mlazos/fp8-bias 2025-12-04T09:33:40.4106436Z * [new branch] mlazos/fp8-bias-fusion -> origin/mlazos/fp8-bias-fusion 2025-12-04T09:33:40.4107665Z * [new branch] mlazos/fp8-fixes -> origin/mlazos/fp8-fixes 2025-12-04T09:33:40.4108911Z * [new branch] mlazos/freezing -> origin/mlazos/freezing 2025-12-04T09:33:40.4110191Z * [new branch] mlazos/h-comp -> origin/mlazos/h-comp 2025-12-04T09:33:40.4111493Z * [new branch] mlazos/h-comp2 -> origin/mlazos/h-comp2 2025-12-04T09:33:40.4112890Z * [new branch] mlazos/hash-hop -> origin/mlazos/hash-hop 2025-12-04T09:33:40.4114211Z * [new branch] mlazos/hc -> origin/mlazos/hc 2025-12-04T09:33:40.4115527Z * [new branch] mlazos/hc-cycles -> origin/mlazos/hc-cycles 2025-12-04T09:33:40.4116838Z * [new branch] mlazos/hc-fixes -> origin/mlazos/hc-fixes 2025-12-04T09:33:40.4118074Z * [new branch] mlazos/hc-fixes3 -> origin/mlazos/hc-fixes3 2025-12-04T09:33:40.4119328Z * [new branch] mlazos/hc-fixes4 -> origin/mlazos/hc-fixes4 2025-12-04T09:33:40.4120569Z * [new branch] mlazos/hc-hf -> origin/mlazos/hc-hf 2025-12-04T09:33:40.4121848Z * [new branch] mlazos/hc-mut -> origin/mlazos/hc-mut 2025-12-04T09:33:40.4123126Z * [new branch] mlazos/hc10 -> origin/mlazos/hc10 2025-12-04T09:33:40.4124353Z * [new branch] mlazos/hc11 -> origin/mlazos/hc11 2025-12-04T09:33:40.4125647Z * [new branch] mlazos/hc12 -> origin/mlazos/hc12 2025-12-04T09:33:40.4126856Z * [new branch] mlazos/hc13 -> origin/mlazos/hc13 2025-12-04T09:33:40.4128068Z * [new branch] mlazos/hc14 -> origin/mlazos/hc14 2025-12-04T09:33:40.4129319Z * [new branch] mlazos/hc15 -> origin/mlazos/hc15 2025-12-04T09:33:40.4130556Z * [new branch] mlazos/hc2 -> origin/mlazos/hc2 2025-12-04T09:33:40.4131817Z * [new branch] mlazos/hc4 -> origin/mlazos/hc4 2025-12-04T09:33:40.4133158Z * [new branch] mlazos/hc5 -> origin/mlazos/hc5 2025-12-04T09:33:40.4134404Z * [new branch] mlazos/hc6 -> origin/mlazos/hc6 2025-12-04T09:33:40.4135697Z * [new branch] mlazos/hc7 -> origin/mlazos/hc7 2025-12-04T09:33:40.4136948Z * [new branch] mlazos/hc8 -> origin/mlazos/hc8 2025-12-04T09:33:40.4138073Z * [new branch] mlazos/hc9 -> origin/mlazos/hc9 2025-12-04T09:33:40.4139533Z * [new branch] mlazos/hc_baseline2 -> origin/mlazos/hc_baseline2 2025-12-04T09:33:40.4140696Z * [new branch] mlazos/inductor-streams -> origin/mlazos/inductor-streams 2025-12-04T09:33:40.4141724Z * [new branch] mlazos/main -> origin/mlazos/main 2025-12-04T09:33:40.4143007Z * [new branch] mlazos/mcg2 -> origin/mlazos/mcg2 2025-12-04T09:33:40.4144776Z * [new branch] mlazos/meta-guards -> origin/mlazos/meta-guards 2025-12-04T09:33:40.4146629Z * [new branch] mlazos/mlazos/foreach-map-adam -> origin/mlazos/mlazos/foreach-map-adam 2025-12-04T09:33:40.4147957Z * [new branch] mlazos/mlazos/tf-mode-backup -> origin/mlazos/mlazos/tf-mode-backup 2025-12-04T09:33:40.4149136Z * [new branch] mlazos/mod-fix -> origin/mlazos/mod-fix 2025-12-04T09:33:40.4150481Z * [new branch] mlazos/mode-fix -> origin/mlazos/mode-fix 2025-12-04T09:33:40.4151770Z * [new branch] mlazos/offsets -> origin/mlazos/offsets 2025-12-04T09:33:40.4153080Z * [new branch] mlazos/overguarding -> origin/mlazos/overguarding 2025-12-04T09:33:40.4154311Z * [new branch] mlazos/proxy-ctors -> origin/mlazos/proxy-ctors 2025-12-04T09:33:40.4155577Z * [new branch] mlazos/quant-fix -> origin/mlazos/quant-fix 2025-12-04T09:33:40.4156878Z * [new branch] mlazos/resnet-fix -> origin/mlazos/resnet-fix 2025-12-04T09:33:40.4158169Z * [new branch] mlazos/rm-buf-names -> origin/mlazos/rm-buf-names 2025-12-04T09:33:40.4159889Z * [new branch] mlazos/rm-code -> origin/mlazos/rm-code 2025-12-04T09:33:40.4161185Z * [new branch] mlazos/rm-spam -> origin/mlazos/rm-spam 2025-12-04T09:33:40.4162517Z * [new branch] mlazos/rtp -> origin/mlazos/rtp 2025-12-04T09:33:40.4163861Z * [new branch] mlazos/static-idx-dbg -> origin/mlazos/static-idx-dbg 2025-12-04T09:33:40.4165151Z * [new branch] mlazos/static-inputs-log -> origin/mlazos/static-inputs-log 2025-12-04T09:33:40.4166269Z * [new branch] mlazos/stests -> origin/mlazos/stests 2025-12-04T09:33:40.4167558Z * [new branch] mlazos/stream-ops -> origin/mlazos/stream-ops 2025-12-04T09:33:40.4168780Z * [new branch] mlazos/td-fix2 -> origin/mlazos/td-fix2 2025-12-04T09:33:40.4170161Z * [new branch] mlazos/tensor-hasattr2 -> origin/mlazos/tensor-hasattr2 2025-12-04T09:33:40.4171360Z * [new branch] mlazos/test -> origin/mlazos/test 2025-12-04T09:33:40.4172683Z * [new branch] mlazos/tf-mode -> origin/mlazos/tf-mode 2025-12-04T09:33:40.4174062Z * [new branch] mlazos/tf-mode-backup2 -> origin/mlazos/tf-mode-backup2 2025-12-04T09:33:40.4175329Z * [new branch] mlazos/tf-mode-reland -> origin/mlazos/tf-mode-reland 2025-12-04T09:33:40.4176683Z * [new branch] mlazos/tf-mode-reland2 -> origin/mlazos/tf-mode-reland2 2025-12-04T09:33:40.4177971Z * [new branch] mlazos/tf-mode-reland3 -> origin/mlazos/tf-mode-reland3 2025-12-04T09:33:40.4179417Z * [new branch] mlazos/triton-no-epi -> origin/mlazos/triton-no-epi 2025-12-04T09:33:40.4180684Z * [new branch] mlazos/tune-proto -> origin/mlazos/tune-proto 2025-12-04T09:33:40.4181938Z * [new branch] mlazos/tuple-fixes -> origin/mlazos/tuple-fixes 2025-12-04T09:33:40.4183328Z * [new branch] mlazos/tuple-fixes2 -> origin/mlazos/tuple-fixes2 2025-12-04T09:33:40.4184530Z * [new branch] mlazos/tuple-handling -> origin/mlazos/tuple-handling 2025-12-04T09:33:40.4185840Z * [new branch] mlazos/user-stream-base -> origin/mlazos/user-stream-base 2025-12-04T09:33:40.4187104Z * [new branch] mlazos/user-streams -> origin/mlazos/user-streams 2025-12-04T09:33:40.4190781Z * [new branch] mlazos/user-streams-backup -> origin/mlazos/user-streams-backup 2025-12-04T09:33:40.4192219Z * [new branch] mlazos/user-streams-backup2 -> origin/mlazos/user-streams-backup2 2025-12-04T09:33:40.4193511Z * [new branch] mlazos/vary-beta -> origin/mlazos/vary-beta 2025-12-04T09:33:40.4194780Z * [new branch] mlazos/vary-beta2 -> origin/mlazos/vary-beta2 2025-12-04T09:33:40.4196083Z * [new branch] mlazos/weird-perf1 -> origin/mlazos/weird-perf1 2025-12-04T09:33:40.4197545Z * [new branch] mm_out_dtype_compile -> origin/mm_out_dtype_compile 2025-12-04T09:33:40.4198859Z * [new branch] module-shim -> origin/module-shim 2025-12-04T09:33:40.4200237Z * [new branch] move_config -> origin/move_config 2025-12-04T09:33:40.4201993Z * [new branch] msaroufim/reduce -> origin/msaroufim/reduce 2025-12-04T09:33:40.4203626Z * [new branch] mtia/basic-cmake -> origin/mtia/basic-cmake 2025-12-04T09:33:40.4205396Z * [new branch] mwizak/fix-triton-block-shape -> origin/mwizak/fix-triton-block-shape 2025-12-04T09:33:40.4206698Z * [new branch] my_varlen_backup -> origin/my_varlen_backup 2025-12-04T09:33:40.4208054Z * [new branch] nativert_num_outputs -> origin/nativert_num_outputs 2025-12-04T09:33:40.4209338Z * [new branch] new-codegen -> origin/new-codegen 2025-12-04T09:33:40.4210721Z * [new branch] newtest-base -> origin/newtest-base 2025-12-04T09:33:40.4212404Z * [new branch] ngimel/addmm_dtype -> origin/ngimel/addmm_dtype 2025-12-04T09:33:40.4213608Z * [new branch] ngimel/div_inv -> origin/ngimel/div_inv 2025-12-04T09:33:40.4214908Z * [new branch] ngimel/error_index_list -> origin/ngimel/error_index_list 2025-12-04T09:33:40.4216160Z * [new branch] ngimel/gather_grid -> origin/ngimel/gather_grid 2025-12-04T09:33:40.4217416Z * [new branch] ngimel/gather_grid_release -> origin/ngimel/gather_grid_release 2025-12-04T09:33:40.4218588Z * [new branch] ngimel/gg_new -> origin/ngimel/gg_new 2025-12-04T09:33:40.4219890Z * [new branch] ngimel/hostalloc -> origin/ngimel/hostalloc 2025-12-04T09:33:40.4221101Z * [new branch] ngimel/storage_id -> origin/ngimel/storage_id 2025-12-04T09:33:40.4222462Z * [new branch] nightly -> origin/nightly 2025-12-04T09:33:40.4224377Z * [new branch] nikitaved/addmm_1_rowcol_lt_path_check -> origin/nikitaved/addmm_1_rowcol_lt_path_check 2025-12-04T09:33:40.4226099Z * [new branch] nikitaved/addmm_epilogue_fusions_2d_bias -> origin/nikitaved/addmm_epilogue_fusions_2d_bias 2025-12-04T09:33:40.4227353Z * [new branch] nikitaved/addmm_epilogue_fusions_inductor -> origin/nikitaved/addmm_epilogue_fusions_inductor 2025-12-04T09:33:40.4228959Z * [new branch] nikitaved/addmm_epilogue_fusions_scratch -> origin/nikitaved/addmm_epilogue_fusions_scratch 2025-12-04T09:33:40.4230403Z * [new branch] nikitaved/grad_addmm_epilogue_fusions -> origin/nikitaved/grad_addmm_epilogue_fusions 2025-12-04T09:33:40.4231992Z * [new branch] nikitaved/simpler_can_use_32bit_index -> origin/nikitaved/simpler_can_use_32bit_index 2025-12-04T09:33:40.4233265Z * [new branch] nikitaved/test -> origin/nikitaved/test 2025-12-04T09:33:40.4235198Z * [new branch] nmacchioni-perf-test-async-autotune -> origin/nmacchioni-perf-test-async-autotune 2025-12-04T09:33:40.4236521Z * [new branch] no_distributed_log_spew -> origin/no_distributed_log_spew 2025-12-04T09:33:40.4237836Z * [new branch] nofun-hack -> origin/nofun-hack 2025-12-04T09:33:40.4239170Z * [new branch] norm_bench -> origin/norm_bench 2025-12-04T09:33:40.4240962Z * [new branch] nullplay/fuse_matmul -> origin/nullplay/fuse_matmul 2025-12-04T09:33:40.4242274Z * [new branch] nullplay_fuse_matmul -> origin/nullplay_fuse_matmul 2025-12-04T09:33:40.4243597Z * [new branch] optimizer_test -> origin/optimizer_test 2025-12-04T09:33:40.4245692Z * [new branch] orig/release/1.10 -> origin/orig/release/1.10 2025-12-04T09:33:40.4247080Z * [new branch] orig/release/1.11 -> origin/orig/release/1.11 2025-12-04T09:33:40.4248402Z * [new branch] orig/release/1.12 -> origin/orig/release/1.12 2025-12-04T09:33:40.4249952Z * [new branch] orig/release/1.13 -> origin/orig/release/1.13 2025-12-04T09:33:40.4251304Z * [new branch] orig/release/1.6 -> origin/orig/release/1.6 2025-12-04T09:33:40.4252806Z * [new branch] orig/release/1.7 -> origin/orig/release/1.7 2025-12-04T09:33:40.4254091Z * [new branch] orig/release/1.8 -> origin/orig/release/1.8 2025-12-04T09:33:40.4255562Z * [new branch] orig/release/1.9 -> origin/orig/release/1.9 2025-12-04T09:33:40.4256831Z * [new branch] orig/release/2.0 -> origin/orig/release/2.0 2025-12-04T09:33:40.4258127Z * [new branch] orig/release/2.1 -> origin/orig/release/2.1 2025-12-04T09:33:40.4259514Z * [new branch] orig/release/2.2 -> origin/orig/release/2.2 2025-12-04T09:33:40.4260822Z * [new branch] orig/release/2.3 -> origin/orig/release/2.3 2025-12-04T09:33:40.4262046Z * [new branch] orig/release/2.4 -> origin/orig/release/2.4 2025-12-04T09:33:40.4263268Z * [new branch] orig/release/2.5 -> origin/orig/release/2.5 2025-12-04T09:33:40.4264535Z * [new branch] orig/release/2.6 -> origin/orig/release/2.6 2025-12-04T09:33:40.4266103Z * [new branch] orig/release/2.7 -> origin/orig/release/2.7 2025-12-04T09:33:40.4267836Z * [new branch] orig/release/2.8 -> origin/orig/release/2.8 2025-12-04T09:33:40.4269067Z * [new branch] orig/release/2.9 -> origin/orig/release/2.9 2025-12-04T09:33:40.4271753Z * [new branch] origin/gh/fxdawnn/1/base -> origin/origin/gh/fxdawnn/1/base 2025-12-04T09:33:40.4273022Z * [new branch] origin/gh/fxdawnn/1/orig -> origin/origin/gh/fxdawnn/1/orig 2025-12-04T09:33:40.4275121Z * [new branch] origin/gh/zpcore/14/orig -> origin/origin/gh/zpcore/14/orig 2025-12-04T09:33:40.4276529Z * [new branch] oulgen-patch-1 -> origin/oulgen-patch-1 2025-12-04T09:33:40.4277962Z * [new branch] oulgen-patch-2 -> origin/oulgen-patch-2 2025-12-04T09:33:40.4279350Z * [new branch] oulgen-patch-3 -> origin/oulgen-patch-3 2025-12-04T09:33:40.4280803Z * [new branch] oulgen-patch-4 -> origin/oulgen-patch-4 2025-12-04T09:33:40.4282388Z * [new branch] padded-tensor -> origin/padded-tensor 2025-12-04T09:33:40.4283763Z * [new branch] pca2 -> origin/pca2 2025-12-04T09:33:40.4285245Z * [new branch] per_channel_backup -> origin/per_channel_backup 2025-12-04T09:33:40.4286633Z * [new branch] perf_ops -> origin/perf_ops 2025-12-04T09:33:40.4288054Z * [new branch] perf_ops_2_9 -> origin/perf_ops_2_9 2025-12-04T09:33:40.4289607Z * [new branch] pianpwk-patch-1 -> origin/pianpwk-patch-1 2025-12-04T09:33:40.4291382Z * [new branch] pianpwk/__draft_debug_mode -> origin/pianpwk/__draft_debug_mode 2025-12-04T09:33:40.4292800Z * [new branch] pianpwk/_debug_mode_for_triton_draft -> origin/pianpwk/_debug_mode_for_triton_draft 2025-12-04T09:33:40.4294054Z * [new branch] pianpwk/_debug_nn_module_compile -> origin/pianpwk/_debug_nn_module_compile 2025-12-04T09:33:40.4295245Z * [new branch] pianpwk/_draft_triton_11_3 -> origin/pianpwk/_draft_triton_11_3 2025-12-04T09:33:40.4296468Z * [new branch] pianpwk/_manual_bucket_draft -> origin/pianpwk/_manual_bucket_draft 2025-12-04T09:33:40.4297995Z * [new branch] pianpwk/_profile_w_dispatch_keys -> origin/pianpwk/_profile_w_dispatch_keys 2025-12-04T09:33:40.4299775Z * [new branch] pianpwk/_super_draft_debug_mode -> origin/pianpwk/_super_draft_debug_mode 2025-12-04T09:33:40.4301186Z * [new branch] pianpwk/_unbacked_local_shard_size -> origin/pianpwk/_unbacked_local_shard_size 2025-12-04T09:33:40.4302393Z * [new branch] pianpwk/anomaly_tb -> origin/pianpwk/anomaly_tb 2025-12-04T09:33:40.4303639Z * [new branch] pianpwk/auto_fx_annotate -> origin/pianpwk/auto_fx_annotate 2025-12-04T09:33:40.4304989Z * [new branch] pianpwk/backed_size_oblivious_export -> origin/pianpwk/backed_size_oblivious_export 2025-12-04T09:33:40.4306118Z * [new branch] pianpwk/bert_dynamic_perf -> origin/pianpwk/bert_dynamic_perf 2025-12-04T09:33:40.4307546Z * [new branch] pianpwk/debug_fwd_stack_traces -> origin/pianpwk/debug_fwd_stack_traces 2025-12-04T09:33:40.4308942Z * [new branch] pianpwk/debug_hash_tensor -> origin/pianpwk/debug_hash_tensor 2025-12-04T09:33:40.4310164Z * [new branch] pianpwk/debug_mode_annotate -> origin/pianpwk/debug_mode_annotate 2025-12-04T09:33:40.4311340Z * [new branch] pianpwk/debug_mode_defaults -> origin/pianpwk/debug_mode_defaults 2025-12-04T09:33:40.4312616Z * [new branch] pianpwk/debug_mode_hacks -> origin/pianpwk/debug_mode_hacks 2025-12-04T09:33:40.4314101Z * [new branch] pianpwk/debug_mode_opcall_refactor -> origin/pianpwk/debug_mode_opcall_refactor 2025-12-04T09:33:40.4315226Z * [new branch] pianpwk/debug_mode_show_ids -> origin/pianpwk/debug_mode_show_ids 2025-12-04T09:33:40.4316505Z * [new branch] pianpwk/debug_mode_triton -> origin/pianpwk/debug_mode_triton 2025-12-04T09:33:40.4317869Z * [new branch] pianpwk/debug_show_stack_trace -> origin/pianpwk/debug_show_stack_trace 2025-12-04T09:33:40.4319262Z * [new branch] pianpwk/debug_wait_on_collective -> origin/pianpwk/debug_wait_on_collective 2025-12-04T09:33:40.4320556Z * [new branch] pianpwk/debugmode_compile_tf -> origin/pianpwk/debugmode_compile_tf 2025-12-04T09:33:40.4321959Z * [new branch] pianpwk/dispatch_key_debugging_for_debug -> origin/pianpwk/dispatch_key_debugging_for_debug 2025-12-04T09:33:40.4323176Z * [new branch] pianpwk/draft_debug_mode_tfcompile -> origin/pianpwk/draft_debug_mode_tfcompile 2025-12-04T09:33:40.4324857Z * [new branch] pianpwk/draft_multikernel_nn -> origin/pianpwk/draft_multikernel_nn 2025-12-04T09:33:40.4326261Z * [new branch] pianpwk/draft_multikernel_status_10_5 -> origin/pianpwk/draft_multikernel_status_10_5 2025-12-04T09:33:40.4327669Z * [new branch] pianpwk/dtensor_custom_chunk -> origin/pianpwk/dtensor_custom_chunk 2025-12-04T09:33:40.4329096Z * [new branch] pianpwk/dtensor_unbacked_keypath -> origin/pianpwk/dtensor_unbacked_keypath 2025-12-04T09:33:40.4330443Z * [new branch] pianpwk/event_list_tree -> origin/pianpwk/event_list_tree 2025-12-04T09:33:40.4331708Z * [new branch] pianpwk/false_numel_refs -> origin/pianpwk/false_numel_refs 2025-12-04T09:33:40.4332996Z * [new branch] pianpwk/maybe_guard_rel -> origin/pianpwk/maybe_guard_rel 2025-12-04T09:33:40.4334340Z * [new branch] pianpwk/multikernel_hints_draft -> origin/pianpwk/multikernel_hints_draft 2025-12-04T09:33:40.4335848Z * [new branch] pianpwk/no_size_oblivious_slice_scat -> origin/pianpwk/no_size_oblivious_slice_scat 2025-12-04T09:33:40.4337081Z * [new branch] pianpwk/oblivious_reshape_view_better -> origin/pianpwk/oblivious_reshape_view_better 2025-12-04T09:33:40.4338296Z * [new branch] pianpwk/pre_forward_hook -> origin/pianpwk/pre_forward_hook 2025-12-04T09:33:40.4339707Z * [new branch] pianpwk/skip_python_keys_alternate -> origin/pianpwk/skip_python_keys_alternate 2025-12-04T09:33:40.4340974Z * [new branch] pianpwk/skip_python_keys_in_guards -> origin/pianpwk/skip_python_keys_in_guards 2025-12-04T09:33:40.4342137Z * [new branch] pianpwk/sym_tokens_draft -> origin/pianpwk/sym_tokens_draft 2025-12-04T09:33:40.4343394Z * [new branch] pianpwk/symint_one_hot -> origin/pianpwk/symint_one_hot 2025-12-04T09:33:40.4344934Z * [new branch] pianpwk/test_pointwise_guard_or_false -> origin/pianpwk/test_pointwise_guard_or_false 2025-12-04T09:33:40.4346106Z * [new branch] pianpwk/totally_draft_sym_wrap -> origin/pianpwk/totally_draft_sym_wrap 2025-12-04T09:33:40.4347324Z * [new branch] pianpwk/try_dumb_stuff -> origin/pianpwk/try_dumb_stuff 2025-12-04T09:33:40.4348595Z * [new branch] pianpwk/try_dumb_stuff_2 -> origin/pianpwk/try_dumb_stuff_2 2025-12-04T09:33:40.4349903Z * [new branch] pianpwk/unbacked_dtensor_mm -> origin/pianpwk/unbacked_dtensor_mm 2025-12-04T09:33:40.4351181Z * [new branch] pianpwk/unbacked_tracing_12_2 -> origin/pianpwk/unbacked_tracing_12_2 2025-12-04T09:33:40.4352343Z * [new branch] pianpwk/user_symints -> origin/pianpwk/user_symints 2025-12-04T09:33:40.4353609Z * [new branch] pianpwk/wan21_reshape -> origin/pianpwk/wan21_reshape 2025-12-04T09:33:40.4355499Z * [new branch] piz/fix_partial_backward_1112 -> origin/piz/fix_partial_backward_1112 2025-12-04T09:33:40.4356677Z * [new branch] piz/prop_cache_clean -> origin/piz/prop_cache_clean 2025-12-04T09:33:40.4358004Z * [new branch] pool-separate -> origin/pool-separate 2025-12-04T09:33:40.4359335Z * [new branch] pr-156087 -> origin/pr-156087 2025-12-04T09:33:40.4361122Z * [new branch] pr/131860 -> origin/pr/131860 2025-12-04T09:33:40.4362480Z * [new branch] predispatch_to -> origin/predispatch_to 2025-12-04T09:33:40.4363819Z * [new branch] protect-c17 -> origin/protect-c17 2025-12-04T09:33:40.4365716Z * [new branch] pt-opt-cuda3 -> origin/pt-opt-cuda3 2025-12-04T09:33:40.4368063Z * [new branch] python_compiled_autograd -> origin/python_compiled_autograd 2025-12-04T09:33:40.4370043Z * [new branch] q1l1/fix_device_moved_constant_type_unknown -> origin/q1l1/fix_device_moved_constant_type_unknown 2025-12-04T09:33:40.4371323Z * [new branch] q1l1/fix_wrong_default_type_for_kernel_call_args -> origin/q1l1/fix_wrong_default_type_for_kernel_call_args 2025-12-04T09:33:40.4373400Z * [new branch] qchip/export-D54134695 -> origin/qchip/export-D54134695 2025-12-04T09:33:40.4374909Z * [new branch] quote-pytest_cache -> origin/quote-pytest_cache 2025-12-04T09:33:40.4376728Z * [new branch] reland-accgrad-stream-warn -> origin/reland-accgrad-stream-warn 2025-12-04T09:33:40.4378603Z * [new branch] release/1.10 -> origin/release/1.10 2025-12-04T09:33:40.4379936Z * [new branch] release/1.11 -> origin/release/1.11 2025-12-04T09:33:40.4381296Z * [new branch] release/1.12 -> origin/release/1.12 2025-12-04T09:33:40.4382578Z * [new branch] release/1.13 -> origin/release/1.13 2025-12-04T09:33:40.4383841Z * [new branch] release/1.4 -> origin/release/1.4 2025-12-04T09:33:40.4384887Z * [new branch] release/1.4.1 -> origin/release/1.4.1 2025-12-04T09:33:40.4386177Z * [new branch] release/1.5 -> origin/release/1.5 2025-12-04T09:33:40.4387536Z * [new branch] release/1.6 -> origin/release/1.6 2025-12-04T09:33:40.4389485Z * [new branch] release/1.7 -> origin/release/1.7 2025-12-04T09:33:40.4390968Z * [new branch] release/1.8 -> origin/release/1.8 2025-12-04T09:33:40.4392166Z * [new branch] release/1.9 -> origin/release/1.9 2025-12-04T09:33:40.4393476Z * [new branch] release/2.0 -> origin/release/2.0 2025-12-04T09:33:40.4394855Z * [new branch] release/2.1 -> origin/release/2.1 2025-12-04T09:33:40.4396299Z * [new branch] release/2.2 -> origin/release/2.2 2025-12-04T09:33:40.4397938Z * [new branch] release/2.3 -> origin/release/2.3 2025-12-04T09:33:40.4399752Z * [new branch] release/2.4 -> origin/release/2.4 2025-12-04T09:33:40.4401473Z * [new branch] release/2.5 -> origin/release/2.5 2025-12-04T09:33:40.4402889Z * [new branch] release/2.6 -> origin/release/2.6 2025-12-04T09:33:40.4406283Z * [new branch] release/2.7 -> origin/release/2.7 2025-12-04T09:33:40.4406499Z * [new branch] release/2.8 -> origin/release/2.8 2025-12-04T09:33:40.4407266Z * [new branch] release/2.9 -> origin/release/2.9 2025-12-04T09:33:40.4408506Z * [new branch] release_notes -> origin/release_notes 2025-12-04T09:33:40.4409938Z * [new branch] remove_pyinterpreter -> origin/remove_pyinterpreter 2025-12-04T09:33:40.4411591Z * [new branch] replace-pytorch-labs-20250812-195836 -> origin/replace-pytorch-labs-20250812-195836 2025-12-04T09:33:40.4412684Z * [new branch] replace-pytorch-labs-20250812-200248 -> origin/replace-pytorch-labs-20250812-200248 2025-12-04T09:33:40.4413946Z * [new branch] replace-pytorch-labs-20250812-200324 -> origin/replace-pytorch-labs-20250812-200324 2025-12-04T09:33:40.4415247Z * [new branch] replace-pytorch-labs-20250812-204020 -> origin/replace-pytorch-labs-20250812-204020 2025-12-04T09:33:40.4417831Z * [new branch] revert-131069-gh/krzysztofjordan/1/head -> origin/revert-131069-gh/krzysztofjordan/1/head 2025-12-04T09:33:40.4420344Z * [new branch] revert-131469-gh/andrewor14/51/head -> origin/revert-131469-gh/andrewor14/51/head 2025-12-04T09:33:40.4422813Z * [new branch] revert-152361-gh/fadara01/1/head -> origin/revert-152361-gh/fadara01/1/head 2025-12-04T09:33:40.4425166Z * [new branch] revert-156870-gh/skarjala/3/head -> origin/revert-156870-gh/skarjala/3/head 2025-12-04T09:33:40.4426852Z * [new branch] revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ -> origin/revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ 2025-12-04T09:33:40.4428029Z * [new branch] revert-hoo-invoke-subgraph -> origin/revert-hoo-invoke-subgraph 2025-12-04T09:33:40.4429350Z * [new branch] revert_always_build_distributed -> origin/revert_always_build_distributed 2025-12-04T09:33:40.4430717Z * [new branch] rms_norm_patch -> origin/rms_norm_patch 2025-12-04T09:33:40.4432602Z * [new branch] ruisi/fix_all_to_all_estimation -> origin/ruisi/fix_all_to_all_estimation 2025-12-04T09:33:40.4433755Z * [new branch] ruisi/fix_comm_estimation -> origin/ruisi/fix_comm_estimation 2025-12-04T09:33:40.4435049Z * [new branch] ruisi/fix_dynamic_shape_estimation -> origin/ruisi/fix_dynamic_shape_estimation 2025-12-04T09:33:40.4436224Z * [new branch] ruisi/fix_llama3_autobucketing -> origin/ruisi/fix_llama3_autobucketing 2025-12-04T09:33:40.4437742Z * [new branch] ruisi/fix_manual_bucketing_ep_pass -> origin/ruisi/fix_manual_bucketing_ep_pass 2025-12-04T09:33:40.4439262Z * [new branch] ruisi/manual_bucket_pass -> origin/ruisi/manual_bucket_pass 2025-12-04T09:33:40.4441312Z * [new branch] ryanguo99/cleanup-dynamo-expected-failures -> origin/ryanguo99/cleanup-dynamo-expected-failures 2025-12-04T09:33:40.4442243Z * [new branch] ryanguo99/fix-closure-var -> origin/ryanguo99/fix-closure-var 2025-12-04T09:33:40.4444061Z * [new branch] rzou/faketensor_bench -> origin/rzou/faketensor_bench 2025-12-04T09:33:40.4445264Z * [new branch] rzou/njt -> origin/rzou/njt 2025-12-04T09:33:40.4447001Z * [new branch] rzou/pca -> origin/rzou/pca 2025-12-04T09:33:40.4448244Z * [new branch] rzou/realprop -> origin/rzou/realprop 2025-12-04T09:33:40.4449741Z * [new branch] samplevllm -> origin/samplevllm 2025-12-04T09:33:40.4452020Z * [new branch] sanchitintel/weird_thing_with_test_cpu_select_algorithm -> origin/sanchitintel/weird_thing_with_test_cpu_select_algorithm 2025-12-04T09:33:40.4453205Z * [new branch] sapling-pr-archive-SS-JIA -> origin/sapling-pr-archive-SS-JIA 2025-12-04T09:33:40.4454749Z * [new branch] sapling-pr-archive-tushar00jain -> origin/sapling-pr-archive-tushar00jain 2025-12-04T09:33:40.4455976Z * [new branch] save -> origin/save 2025-12-04T09:33:40.4457301Z * [new branch] scaled_mm -> origin/scaled_mm 2025-12-04T09:33:40.4458771Z * [new branch] scan_attempt -> origin/scan_attempt 2025-12-04T09:33:40.4460492Z * [new branch] sdym/2.5.1 -> origin/sdym/2.5.1 2025-12-04T09:33:40.4461907Z * [new branch] sekyondaMeta-dynamoconfig-fix -> origin/sekyondaMeta-dynamoconfig-fix 2025-12-04T09:33:40.4463546Z * [new branch] shengf/fx-xform-perf -> origin/shengf/fx-xform-perf 2025-12-04T09:33:40.4465061Z * [new branch] shoumikhin-patch-1 -> origin/shoumikhin-patch-1 2025-12-04T09:33:40.4466410Z * [new branch] solve-accuracy-fix -> origin/solve-accuracy-fix 2025-12-04T09:33:40.4467756Z * [new branch] some_rocm_inductor_skips -> origin/some_rocm_inductor_skips 2025-12-04T09:33:40.4469588Z * [new branch] soulitzer/stash-tls-ac -> origin/soulitzer/stash-tls-ac 2025-12-04T09:33:40.4471062Z * [new branch] sparse-mm-bf16-support -> origin/sparse-mm-bf16-support 2025-12-04T09:33:40.4472381Z * [new branch] starterTaskUpdate -> origin/starterTaskUpdate 2025-12-04T09:33:40.4473733Z * [new branch] suo -> origin/suo 2025-12-04T09:33:40.4475073Z * [new branch] sve-poc -> origin/sve-poc 2025-12-04T09:33:40.4476540Z * [new branch] switch-bn -> origin/switch-bn 2025-12-04T09:33:40.4477903Z * [new branch] sy_annotation_in_autograd_hop -> origin/sy_annotation_in_autograd_hop 2025-12-04T09:33:40.4479205Z * [new branch] sy_aot_eager_record -> origin/sy_aot_eager_record 2025-12-04T09:33:40.4480573Z * [new branch] sy_custom_bucketing -> origin/sy_custom_bucketing 2025-12-04T09:33:40.4482050Z * [new branch] sy_debug_mode_test -> origin/sy_debug_mode_test 2025-12-04T09:33:40.4483338Z * [new branch] sy_deserialize -> origin/sy_deserialize 2025-12-04T09:33:40.4484664Z * [new branch] sy_dump_gm_code -> origin/sy_dump_gm_code 2025-12-04T09:33:40.4485988Z * [new branch] sy_exp -> origin/sy_exp 2025-12-04T09:33:40.4487388Z * [new branch] sy_export_annotation -> origin/sy_export_annotation 2025-12-04T09:33:40.4488899Z * [new branch] sy_invoke_subgraph -> origin/sy_invoke_subgraph 2025-12-04T09:33:40.4490310Z * [new branch] sy_kernel_bw_name -> origin/sy_kernel_bw_name 2025-12-04T09:33:40.4491633Z * [new branch] sy_multi_arch -> origin/sy_multi_arch 2025-12-04T09:33:40.4493000Z * [new branch] sy_nn_module_stack -> origin/sy_nn_module_stack 2025-12-04T09:33:40.4494436Z * [new branch] sy_original_dtensor -> origin/sy_original_dtensor 2025-12-04T09:33:40.4495742Z * [new branch] sy_profiler_cia -> origin/sy_profiler_cia 2025-12-04T09:33:40.4497041Z * [new branch] symm_mem_sync -> origin/symm_mem_sync 2025-12-04T09:33:40.4498561Z * [new branch] sympy-bottleneck-repro -> origin/sympy-bottleneck-repro 2025-12-04T09:33:40.4499992Z * [new branch] tensordict_integration -> origin/tensordict_integration 2025-12-04T09:33:40.4501470Z * [new branch] test-move-conda-builds -> origin/test-move-conda-builds 2025-12-04T09:33:40.4502763Z * [new branch] test-old -> origin/test-old 2025-12-04T09:33:40.4504735Z * [new branch] test/bmm_heur -> origin/test/bmm_heur 2025-12-04T09:33:40.4506555Z * [new branch] tianren/customOp_autotune_fix -> origin/tianren/customOp_autotune_fix 2025-12-04T09:33:40.4507891Z * [new branch] tianren/customOp_enable_max_autotune -> origin/tianren/customOp_enable_max_autotune 2025-12-04T09:33:40.4509062Z * [new branch] tianren/customOp_fusion -> origin/tianren/customOp_fusion 2025-12-04T09:33:40.4511017Z * [new branch] tianren/customop_collectiveop_benchmark -> origin/tianren/customop_collectiveop_benchmark 2025-12-04T09:33:40.4512580Z * [new branch] tianren/customop_collectiveop_benchmark_fix -> origin/tianren/customop_collectiveop_benchmark_fix 2025-12-04T09:33:40.4514101Z * [new branch] tianren/customop_dynamic_config -> origin/tianren/customop_dynamic_config 2025-12-04T09:33:40.4515377Z * [new branch] tianren/dynamic_range_input -> origin/tianren/dynamic_range_input 2025-12-04T09:33:40.4516724Z * [new branch] tianren/dynamic_range_input_fix -> origin/tianren/dynamic_range_input_fix 2025-12-04T09:33:40.4517992Z * [new branch] tianren/dynamic_range_input_merge -> origin/tianren/dynamic_range_input_merge 2025-12-04T09:33:40.4519284Z * [new branch] tianren/flex_paged_attn_fix_temp -> origin/tianren/flex_paged_attn_fix_temp 2025-12-04T09:33:40.4520602Z * [new branch] tianren/fx_codegen_dump -> origin/tianren/fx_codegen_dump 2025-12-04T09:33:40.4521837Z * [new branch] tianren/symmetric_memory -> origin/tianren/symmetric_memory 2025-12-04T09:33:40.4523027Z * [new branch] tianren/test -> origin/tianren/test 2025-12-04T09:33:40.4524461Z * [new branch] tidy_performance_cyy -> origin/tidy_performance_cyy 2025-12-04T09:33:40.4525769Z * [new branch] tmp -> origin/tmp 2025-12-04T09:33:40.4527136Z * [new branch] torchtitan_ep -> origin/torchtitan_ep 2025-12-04T09:33:40.4528557Z * [new branch] torchtitan_integration -> origin/torchtitan_integration 2025-12-04T09:33:40.4529989Z * [new branch] trace_fsdp_torchtune_lora -> origin/trace_fsdp_torchtune_lora 2025-12-04T09:33:40.4531236Z * [new branch] traceable_fsdp_unit_tests -> origin/traceable_fsdp_unit_tests 2025-12-04T09:33:40.4532672Z * [new branch] tree_loop_vec_base -> origin/tree_loop_vec_base 2025-12-04T09:33:40.4534054Z * [new branch] triton_kernel -> origin/triton_kernel 2025-12-04T09:33:40.4535461Z * [new branch] tt_pkg_1908 -> origin/tt_pkg_1908 2025-12-04T09:33:40.4536822Z * [new branch] type_dec -> origin/type_dec 2025-12-04T09:33:40.4538350Z * [new branch] udate-sphinx-dependancies -> origin/udate-sphinx-dependancies 2025-12-04T09:33:40.4540352Z * [new branch] update-audio-commit-hash/17630256502-1803-1 -> origin/update-audio-commit-hash/17630256502-1803-1 2025-12-04T09:33:40.4541598Z * [new branch] update-audio-commit-hash/19087141161-1916-1 -> origin/update-audio-commit-hash/19087141161-1916-1 2025-12-04T09:33:40.4542915Z * [new branch] update-audio-commit-hash/19250643381-1929-1 -> origin/update-audio-commit-hash/19250643381-1929-1 2025-12-04T09:33:40.4544016Z * [new branch] update-audio-commit-hash/19397724337-1935-1 -> origin/update-audio-commit-hash/19397724337-1935-1 2025-12-04T09:33:40.4545243Z * [new branch] update-audio-commit-hash/19555670148-1941-1 -> origin/update-audio-commit-hash/19555670148-1941-1 2025-12-04T09:33:40.4546818Z * [new branch] update-audio-commit-hash/19750627930-1946-1 -> origin/update-audio-commit-hash/19750627930-1946-1 2025-12-04T09:33:40.4548542Z * [new branch] update-triton-commit-hash/13663274526-1487-2 -> origin/update-triton-commit-hash/13663274526-1487-2 2025-12-04T09:33:40.4550243Z * [new branch] update-vision-commit-hash/19087141161-1916-1 -> origin/update-vision-commit-hash/19087141161-1916-1 2025-12-04T09:33:40.4551593Z * [new branch] update-vision-commit-hash/19184897099-1925-1 -> origin/update-vision-commit-hash/19184897099-1925-1 2025-12-04T09:33:40.4552641Z * [new branch] update-vision-commit-hash/19250643381-1929-1 -> origin/update-vision-commit-hash/19250643381-1929-1 2025-12-04T09:33:40.4554010Z * [new branch] update-vision-commit-hash/19381328640-1934-1 -> origin/update-vision-commit-hash/19381328640-1934-1 2025-12-04T09:33:40.4555158Z * [new branch] update-vision-commit-hash/19485237164-1938-1 -> origin/update-vision-commit-hash/19485237164-1938-1 2025-12-04T09:33:40.4557031Z * [new branch] update-vllm-commit-hash/18451675449-1879-1 -> origin/update-vllm-commit-hash/18451675449-1879-1 2025-12-04T09:33:40.4558373Z * [new branch] update-vllm-dockerfile -> origin/update-vllm-dockerfile 2025-12-04T09:33:40.4560159Z * [new branch] update-xla-commit-hash/19224287370-211-1 -> origin/update-xla-commit-hash/19224287370-211-1 2025-12-04T09:33:40.4561381Z * [new branch] update-xla-commit-hash/19422028566-212-1 -> origin/update-xla-commit-hash/19422028566-212-1 2025-12-04T09:33:40.4562613Z * [new branch] update-xla-commit-hash/19626841311-213-1 -> origin/update-xla-commit-hash/19626841311-213-1 2025-12-04T09:33:40.4564446Z * [new branch] update_docs_torch_multinomial_issue#125388 -> origin/update_docs_torch_multinomial_issue#125388 2025-12-04T09:33:40.4565645Z * [new branch] update_operator_readme -> origin/update_operator_readme 2025-12-04T09:33:40.4567489Z * [new branch] update_slow_tests_1722488736 -> origin/update_slow_tests_1722488736 2025-12-04T09:33:40.4568923Z * [new branch] update_slow_tests_1722879173 -> origin/update_slow_tests_1722879173 2025-12-04T09:33:40.4570275Z * [new branch] update_slow_tests_1762155677 -> origin/update_slow_tests_1762155677 2025-12-04T09:33:40.4571835Z * [new branch] update_slow_tests_1763365283 -> origin/update_slow_tests_1763365283 2025-12-04T09:33:40.4573117Z * [new branch] update_submodule_FBGEMM -> origin/update_submodule_FBGEMM 2025-12-04T09:33:40.4574490Z * [new branch] update_submodule_kineto -> origin/update_submodule_kineto 2025-12-04T09:33:40.4575853Z * [new branch] update_submodule_tensorpipe -> origin/update_submodule_tensorpipe 2025-12-04T09:33:40.4577196Z * [new branch] upload-tests-for-autorevert -> origin/upload-tests-for-autorevert 2025-12-04T09:33:40.4578710Z * [new branch] v0.1.2 -> origin/v0.1.2 2025-12-04T09:33:40.4580240Z * [new branch] v1.0.1 -> origin/v1.0.1 2025-12-04T09:33:40.4581668Z * [new branch] v1.0.3 -> origin/v1.0.3 2025-12-04T09:33:40.4583267Z * [new branch] v1.1.0 -> origin/v1.1.0 2025-12-04T09:33:40.4584729Z * [new branch] v1.2.0 -> origin/v1.2.0 2025-12-04T09:33:40.4586148Z * [new branch] v1.3.0 -> origin/v1.3.0 2025-12-04T09:33:40.4587547Z * [new branch] v1.3.1 -> origin/v1.3.1 2025-12-04T09:33:40.4592422Z * [new branch] validate_fn -> origin/validate_fn 2025-12-04T09:33:40.4593980Z * [new branch] validations_2.6 -> origin/validations_2.6 2025-12-04T09:33:40.4595498Z * [new branch] validations_2.8 -> origin/validations_2.8 2025-12-04T09:33:40.4596870Z * [new branch] varlen-api -> origin/varlen-api 2025-12-04T09:33:40.4598365Z * [new branch] varlen-api-backup -> origin/varlen-api-backup 2025-12-04T09:33:40.4599667Z * [new branch] varlen_batch_invariance -> origin/varlen_batch_invariance 2025-12-04T09:33:40.4601241Z * [new branch] viable/strict -> origin/viable/strict 2025-12-04T09:33:40.4603173Z * [new branch] vishal9-team/dtensor_parallelism_toy -> origin/vishal9-team/dtensor_parallelism_toy 2025-12-04T09:33:40.4604341Z * [new branch] vllmbuildci -> origin/vllmbuildci 2025-12-04T09:33:40.4605772Z * [new branch] vllmpin -> origin/vllmpin 2025-12-04T09:33:40.4607339Z * [new branch] vscode-recommend-pyrefly -> origin/vscode-recommend-pyrefly 2025-12-04T09:33:40.4608697Z * [new branch] wdvr-patch-1 -> origin/wdvr-patch-1 2025-12-04T09:33:40.4610421Z * [new branch] wdvr/iss_145259 -> origin/wdvr/iss_145259 2025-12-04T09:33:40.4612158Z * [new branch] whc/pei -> origin/whc/pei 2025-12-04T09:33:40.4613399Z * [new branch] whc/pp_fix -> origin/whc/pp_fix 2025-12-04T09:33:40.4614749Z * [new branch] whc/sharding -> origin/whc/sharding 2025-12-04T09:33:40.4616049Z * [new branch] whc/sharding2 -> origin/whc/sharding2 2025-12-04T09:33:40.4617232Z * [new branch] whc/uneven -> origin/whc/uneven 2025-12-04T09:33:40.4619005Z * [new branch] whc/uneven-merge -> origin/whc/uneven-merge 2025-12-04T09:33:40.4620393Z * [new branch] win_warnings -> origin/win_warnings 2025-12-04T09:33:40.4621811Z * [new branch] windows_libtorch_free -> origin/windows_libtorch_free 2025-12-04T09:33:40.4623128Z * [new branch] xmfan-war -> origin/xmfan-war 2025-12-04T09:33:40.4624849Z * [new branch] xmfan/ca_0516 -> origin/xmfan/ca_0516 2025-12-04T09:33:40.4626558Z * [new branch] xmfan/ca_1051b93192 -> origin/xmfan/ca_1051b93192 2025-12-04T09:33:40.4628093Z * [new branch] xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 -> origin/xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 2025-12-04T09:33:40.4628815Z * [new branch] xmfan/ca_5a2be192d1 -> origin/xmfan/ca_5a2be192d1 2025-12-04T09:33:40.4630161Z * [new branch] xmfan/ca_9d59b516e9 -> origin/xmfan/ca_9d59b516e9 2025-12-04T09:33:40.4631283Z * [new branch] xmfan/ca_apr8 -> origin/xmfan/ca_apr8 2025-12-04T09:33:40.4632470Z * [new branch] xmfan/ca_base -> origin/xmfan/ca_base 2025-12-04T09:33:40.4633952Z * [new branch] xmfan/ca_dynamic -> origin/xmfan/ca_dynamic 2025-12-04T09:33:40.4635525Z * [new branch] xmfan/ca_fix_dyn -> origin/xmfan/ca_fix_dyn 2025-12-04T09:33:40.4636920Z * [new branch] xmfan/ca_fix_lowering -> origin/xmfan/ca_fix_lowering 2025-12-04T09:33:40.4638179Z * [new branch] xmfan/ca_fix_polyfills -> origin/xmfan/ca_fix_polyfills 2025-12-04T09:33:40.4639278Z * [new branch] xmfan/ca_jan3 -> origin/xmfan/ca_jan3 2025-12-04T09:33:40.4640603Z * [new branch] xmfan/ca_jun18 -> origin/xmfan/ca_jun18 2025-12-04T09:33:40.4641839Z * [new branch] xmfan/ca_jun24 -> origin/xmfan/ca_jun24 2025-12-04T09:33:40.4643067Z * [new branch] xmfan/ca_nested -> origin/xmfan/ca_nested 2025-12-04T09:33:40.4644388Z * [new branch] xmfan/ca_overhead -> origin/xmfan/ca_overhead 2025-12-04T09:33:40.4645745Z * [new branch] xmfan/ca_overhead_0eba7e5451 -> origin/xmfan/ca_overhead_0eba7e5451 2025-12-04T09:33:40.4646865Z * [new branch] xmfan/cacu_jun18 -> origin/xmfan/cacu_jun18 2025-12-04T09:33:40.4648112Z * [new branch] xmfan/cacu_jun19 -> origin/xmfan/cacu_jun19 2025-12-04T09:33:40.4649341Z * [new branch] xmfan/cacu_jun4 -> origin/xmfan/cacu_jun4 2025-12-04T09:33:40.4650765Z * [new branch] xmfan/disable_duck_shape -> origin/xmfan/disable_duck_shape 2025-12-04T09:33:40.4651953Z * [new branch] xmfan/fca_cpp_node_passthrough -> origin/xmfan/fca_cpp_node_passthrough 2025-12-04T09:33:40.4653475Z * [new branch] xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 2025-12-04T09:33:40.4654667Z * [new branch] xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 2025-12-04T09:33:40.4655720Z * [new branch] xmfan/single_step -> origin/xmfan/single_step 2025-12-04T09:33:40.4657043Z * [new branch] xmfan/sth_0829 -> origin/xmfan/sth_0829 2025-12-04T09:33:40.4658417Z * [new branch] xmfan/test -> origin/xmfan/test 2025-12-04T09:33:40.4660335Z * [new branch] yguo/debug-0226-constexpr -> origin/yguo/debug-0226-constexpr 2025-12-04T09:33:40.4661549Z * [new branch] yguo/new_latest_changes -> origin/yguo/new_latest_changes 2025-12-04T09:33:40.4662857Z * [new branch] yguo/patch_constexpr_changes -> origin/yguo/patch_constexpr_changes 2025-12-04T09:33:40.4664478Z * [new branch] yiming/bootcamp -> origin/yiming/bootcamp 2025-12-04T09:33:40.4665788Z * [new branch] yiming/run_with_start_end_rng_hop -> origin/yiming/run_with_start_end_rng_hop 2025-12-04T09:33:40.4667049Z * [new branch] yolo-llama3 -> origin/yolo-llama3 2025-12-04T09:33:40.4668748Z * [new branch] zainr/canary-test -> origin/zainr/canary-test 2025-12-04T09:33:40.4670172Z * [new branch] zainr/cleanup-gh-runners -> origin/zainr/cleanup-gh-runners 2025-12-04T09:33:40.4671329Z * [new branch] zainr/pull-migration-c -> origin/zainr/pull-migration-c 2025-12-04T09:33:40.4672520Z * [new branch] zainr/test2 -> origin/zainr/test2 2025-12-04T09:33:40.4674067Z * [new branch] zasdfgbnm-patch-3 -> origin/zasdfgbnm-patch-3 2025-12-04T09:33:40.4675406Z * [new branch] zb2p -> origin/zb2p 2025-12-04T09:33:40.4676831Z * [new branch] zeros-and-scatter-part2 -> origin/zeros-and-scatter-part2 2025-12-04T09:33:40.4678828Z * [new branch] zhxchen17/ci/vllm_lora_oom -> origin/zhxchen17/ci/vllm_lora_oom 2025-12-04T09:33:40.4680154Z * [new branch] zhxchen17/ci/vllm_multimodal_oom -> origin/zhxchen17/ci/vllm_multimodal_oom 2025-12-04T09:33:40.4681285Z * [new branch] zhxchen17/ci/vllm_pin -> origin/zhxchen17/ci/vllm_pin 2025-12-04T09:33:40.4683211Z * [new branch] zhxchen17/dynamo/unsafe_drop_all_guards -> origin/zhxchen17/dynamo/unsafe_drop_all_guards 2025-12-04T09:33:40.4684766Z * [new branch] zhxchen17/export/call_override -> origin/zhxchen17/export/call_override 2025-12-04T09:33:40.4685953Z * [new branch] zhxchen17/export/codemod1 -> origin/zhxchen17/export/codemod1 2025-12-04T09:33:40.4687271Z * [new branch] zhxchen17/export/ctx_return -> origin/zhxchen17/export/ctx_return 2025-12-04T09:33:40.4688880Z * [new branch] zhxchen17/export/disable_side_effect_warn -> origin/zhxchen17/export/disable_side_effect_warn 2025-12-04T09:33:40.4689944Z * [new branch] zhxchen17/export/pytree_check -> origin/zhxchen17/export/pytree_check 2025-12-04T09:33:40.4691773Z * [new branch] zhxchen17/precompile/aoti -> origin/zhxchen17/precompile/aoti 2025-12-04T09:33:40.4693074Z * [new branch] zhxchen17/precompile/globals -> origin/zhxchen17/precompile/globals 2025-12-04T09:33:40.4694429Z * [new branch] zhxchen17/precompile/inductor_guards -> origin/zhxchen17/precompile/inductor_guards 2025-12-04T09:33:40.4695850Z * [new branch] zhxchen17/scratch/0 -> origin/zhxchen17/scratch/0 2025-12-04T09:33:40.4697296Z * [new branch] zhxchen17/torch_export_api_update -> origin/zhxchen17/torch_export_api_update 2025-12-04T09:33:40.4699096Z * [new branch] zhxhcen17/moodycamel -> origin/zhxhcen17/moodycamel 2025-12-04T09:33:40.4701000Z * [new branch] zxiiro/build-times -> origin/zxiiro/build-times 2025-12-04T09:33:40.4702793Z * [new branch] zxiiro/c7i.2xlarge -> origin/zxiiro/c7i.2xlarge 2025-12-04T09:33:40.4704135Z * [new branch] zxiiro/c7i.2xlarge.h100 -> origin/zxiiro/c7i.2xlarge.h100 2025-12-04T09:33:40.4705433Z * [new branch] zxiiro/main -> origin/zxiiro/main 2025-12-04T09:33:40.4706724Z * [new branch] zxiiro/risc64 -> origin/zxiiro/risc64 2025-12-04T09:33:40.4708060Z * [new branch] zxiiro/test-multicloud-arc -> origin/zxiiro/test-multicloud-arc 2025-12-04T09:33:40.4709315Z * [new tag] bc2caa7fdf006894eff7af936babde69ab5a40f8-huydhn-debug -> bc2caa7fdf006894eff7af936babde69ab5a40f8-huydhn-debug 2025-12-04T09:33:40.4710448Z * [new tag] ci/binaries/77164 -> ci/binaries/77164 2025-12-04T09:33:40.4711532Z * [new tag] ciflow/b200/115316 -> ciflow/b200/115316 2025-12-04T09:33:40.4712283Z * [new tag] ciflow/b200/160685 -> ciflow/b200/160685 2025-12-04T09:33:40.4713103Z * [new tag] ciflow/b200/161607 -> ciflow/b200/161607 2025-12-04T09:33:40.4713924Z * [new tag] ciflow/b200/161938 -> ciflow/b200/161938 2025-12-04T09:33:40.4714975Z * [new tag] ciflow/b200/167207 -> ciflow/b200/167207 2025-12-04T09:33:40.4715683Z * [new tag] ciflow/b200/167989 -> ciflow/b200/167989 2025-12-04T09:33:40.4716718Z * [new tag] ciflow/b200/168096 -> ciflow/b200/168096 2025-12-04T09:33:40.4717566Z * [new tag] ciflow/b200/168175 -> ciflow/b200/168175 2025-12-04T09:33:40.4718513Z * [new tag] ciflow/b200/168195 -> ciflow/b200/168195 2025-12-04T09:33:40.4719363Z * [new tag] ciflow/b200/169200 -> ciflow/b200/169200 2025-12-04T09:33:40.4720398Z * [new tag] ciflow/b200/169216 -> ciflow/b200/169216 2025-12-04T09:33:40.4721689Z * [new tag] ciflow/b200/169380 -> ciflow/b200/169380 2025-12-04T09:33:40.4722946Z * [new tag] ciflow/b200/169412 -> ciflow/b200/169412 2025-12-04T09:33:40.4723992Z * [new tag] ciflow/b200/169470 -> ciflow/b200/169470 2025-12-04T09:33:40.4724777Z * [new tag] ciflow/b200/169471 -> ciflow/b200/169471 2025-12-04T09:33:40.4725651Z * [new tag] ciflow/b200/169472 -> ciflow/b200/169472 2025-12-04T09:33:40.4726874Z * [new tag] ciflow/b200/169514 -> ciflow/b200/169514 2025-12-04T09:33:40.4727638Z * [new tag] ciflow/b200/169517 -> ciflow/b200/169517 2025-12-04T09:33:40.4728907Z * [new tag] ciflow/binaries/165922 -> ciflow/binaries/165922 2025-12-04T09:33:40.4729743Z * [new tag] ciflow/binaries/169510 -> ciflow/binaries/169510 2025-12-04T09:33:40.4731081Z * [new tag] ciflow/binaries_wheel/157994 -> ciflow/binaries_wheel/157994 2025-12-04T09:33:40.4731850Z * [new tag] ciflow/binaries_wheel/166829 -> ciflow/binaries_wheel/166829 2025-12-04T09:33:40.4732673Z * [new tag] ciflow/binaries_wheel/167972 -> ciflow/binaries_wheel/167972 2025-12-04T09:33:40.4733793Z * [new tag] ciflow/binaries_wheel/167981 -> ciflow/binaries_wheel/167981 2025-12-04T09:33:40.4734595Z * [new tag] ciflow/dynamo/167695 -> ciflow/dynamo/167695 2025-12-04T09:33:40.4735689Z * [new tag] ciflow/dynamo/168096 -> ciflow/dynamo/168096 2025-12-04T09:33:40.4736494Z * [new tag] ciflow/dynamo/169525 -> ciflow/dynamo/169525 2025-12-04T09:33:40.4737710Z * [new tag] ciflow/h100-cutlass-backend/161938 -> ciflow/h100-cutlass-backend/161938 2025-12-04T09:33:40.4738500Z * [new tag] ciflow/h100-cutlass-backend/161940 -> ciflow/h100-cutlass-backend/161940 2025-12-04T09:33:40.4739751Z * [new tag] ciflow/h100-distributed/168923 -> ciflow/h100-distributed/168923 2025-12-04T09:33:40.4740581Z * [new tag] ciflow/h100-symm-mem/167552 -> ciflow/h100-symm-mem/167552 2025-12-04T09:33:40.4741376Z * [new tag] ciflow/h100-symm-mem/168129 -> ciflow/h100-symm-mem/168129 2025-12-04T09:33:40.4742226Z * [new tag] ciflow/h100-symm-mem/168917 -> ciflow/h100-symm-mem/168917 2025-12-04T09:33:40.4743439Z * [new tag] ciflow/h100-symm-mem/169156 -> ciflow/h100-symm-mem/169156 2025-12-04T09:33:40.4744155Z * [new tag] ciflow/h100-symm-mem/169200 -> ciflow/h100-symm-mem/169200 2025-12-04T09:33:40.4744983Z * [new tag] ciflow/h100-symm-mem/169216 -> ciflow/h100-symm-mem/169216 2025-12-04T09:33:40.4745763Z * [new tag] ciflow/h100-symm-mem/169338 -> ciflow/h100-symm-mem/169338 2025-12-04T09:33:40.4747325Z * [new tag] ciflow/h100-symm-mem/169355 -> ciflow/h100-symm-mem/169355 2025-12-04T09:33:40.4748042Z * [new tag] ciflow/h100-symm-mem/169543 -> ciflow/h100-symm-mem/169543 2025-12-04T09:33:40.4749128Z * [new tag] ciflow/h100/115316 -> ciflow/h100/115316 2025-12-04T09:33:40.4750027Z * [new tag] ciflow/h100/160685 -> ciflow/h100/160685 2025-12-04T09:33:40.4750643Z * [new tag] ciflow/h100/160729 -> ciflow/h100/160729 2025-12-04T09:33:40.4751454Z * [new tag] ciflow/h100/161607 -> ciflow/h100/161607 2025-12-04T09:33:40.4752267Z * [new tag] ciflow/h100/161938 -> ciflow/h100/161938 2025-12-04T09:33:40.4753165Z * [new tag] ciflow/h100/167207 -> ciflow/h100/167207 2025-12-04T09:33:40.4753876Z * [new tag] ciflow/h100/167989 -> ciflow/h100/167989 2025-12-04T09:33:40.4754678Z * [new tag] ciflow/h100/168096 -> ciflow/h100/168096 2025-12-04T09:33:40.4755852Z * [new tag] ciflow/h100/168175 -> ciflow/h100/168175 2025-12-04T09:33:40.4756380Z * [new tag] ciflow/h100/168195 -> ciflow/h100/168195 2025-12-04T09:33:40.4757166Z * [new tag] ciflow/h100/168980 -> ciflow/h100/168980 2025-12-04T09:33:40.4758359Z * [new tag] ciflow/h100/169200 -> ciflow/h100/169200 2025-12-04T09:33:40.4759482Z * [new tag] ciflow/h100/169216 -> ciflow/h100/169216 2025-12-04T09:33:40.4760450Z * [new tag] ciflow/h100/169380 -> ciflow/h100/169380 2025-12-04T09:33:40.4761434Z * [new tag] ciflow/h100/169412 -> ciflow/h100/169412 2025-12-04T09:33:40.4762193Z * [new tag] ciflow/h100/169470 -> ciflow/h100/169470 2025-12-04T09:33:40.4763037Z * [new tag] ciflow/h100/169471 -> ciflow/h100/169471 2025-12-04T09:33:40.4763857Z * [new tag] ciflow/h100/169472 -> ciflow/h100/169472 2025-12-04T09:33:40.4764693Z * [new tag] ciflow/h100/169514 -> ciflow/h100/169514 2025-12-04T09:33:40.4766266Z * [new tag] ciflow/inductor-cu126/168096 -> ciflow/inductor-cu126/168096 2025-12-04T09:33:40.4767681Z * [new tag] ciflow/inductor-micro-benchmark-cpu-x86/168096 -> ciflow/inductor-micro-benchmark-cpu-x86/168096 2025-12-04T09:33:40.4769006Z * [new tag] ciflow/inductor-micro-benchmark/166165 -> ciflow/inductor-micro-benchmark/166165 2025-12-04T09:33:40.4769767Z * [new tag] ciflow/inductor-micro-benchmark/168096 -> ciflow/inductor-micro-benchmark/168096 2025-12-04T09:33:40.4770764Z * [new tag] ciflow/inductor-perf-compare/168096 -> ciflow/inductor-perf-compare/168096 2025-12-04T09:33:40.4772252Z * [new tag] ciflow/inductor-perf-test-nightly-rocm-mi300/168073 -> ciflow/inductor-perf-test-nightly-rocm-mi300/168073 2025-12-04T09:33:40.4773005Z * [new tag] ciflow/inductor-perf-test-nightly-rocm-mi300/168096 -> ciflow/inductor-perf-test-nightly-rocm-mi300/168096 2025-12-04T09:33:40.4773964Z * [new tag] ciflow/inductor-perf-test-nightly-rocm-mi300/169024 -> ciflow/inductor-perf-test-nightly-rocm-mi300/169024 2025-12-04T09:33:40.4774947Z * [new tag] ciflow/inductor-perf-test-nightly-rocm-mi355/169024 -> ciflow/inductor-perf-test-nightly-rocm-mi355/169024 2025-12-04T09:33:40.4775815Z * [new tag] ciflow/inductor-perf-test-nightly/168096 -> ciflow/inductor-perf-test-nightly/168096 2025-12-04T09:33:40.4776685Z * [new tag] ciflow/inductor-periodic/168096 -> ciflow/inductor-periodic/168096 2025-12-04T09:33:40.4777511Z * [new tag] ciflow/inductor-periodic/169024 -> ciflow/inductor-periodic/169024 2025-12-04T09:33:40.4778687Z * [new tag] ciflow/inductor-periodic/169425 -> ciflow/inductor-periodic/169425 2025-12-04T09:33:40.4779888Z * [new tag] ciflow/inductor-rocm-mi200/165545 -> ciflow/inductor-rocm-mi200/165545 2025-12-04T09:33:40.4780764Z * [new tag] ciflow/inductor-rocm-mi200/165997 -> ciflow/inductor-rocm-mi200/165997 2025-12-04T09:33:40.4781604Z * [new tag] ciflow/inductor-rocm-mi200/168096 -> ciflow/inductor-rocm-mi200/168096 2025-12-04T09:33:40.4782566Z * [new tag] ciflow/inductor-rocm-mi200/169063 -> ciflow/inductor-rocm-mi200/169063 2025-12-04T09:33:40.4783355Z * [new tag] ciflow/inductor-rocm-mi200/169425 -> ciflow/inductor-rocm-mi200/169425 2025-12-04T09:33:40.4784632Z * [new tag] ciflow/inductor-rocm-mi300/165545 -> ciflow/inductor-rocm-mi300/165545 2025-12-04T09:33:40.4785240Z * [new tag] ciflow/inductor-rocm-mi300/168096 -> ciflow/inductor-rocm-mi300/168096 2025-12-04T09:33:40.4786114Z * [new tag] ciflow/inductor-rocm-mi300/169063 -> ciflow/inductor-rocm-mi300/169063 2025-12-04T09:33:40.4786895Z * [new tag] ciflow/inductor-rocm-mi300/169425 -> ciflow/inductor-rocm-mi300/169425 2025-12-04T09:33:40.4788199Z * [new tag] ciflow/inductor-rocm/162052 -> ciflow/inductor-rocm/162052 2025-12-04T09:33:40.4788997Z * [new tag] ciflow/inductor-rocm/168971 -> ciflow/inductor-rocm/168971 2025-12-04T09:33:40.4790156Z * [new tag] ciflow/inductor-windows/168096 -> ciflow/inductor-windows/168096 2025-12-04T09:33:40.4790991Z * [new tag] ciflow/inductor/144542 -> ciflow/inductor/144542 2025-12-04T09:33:40.4791794Z * [new tag] ciflow/inductor/146506 -> ciflow/inductor/146506 2025-12-04T09:33:40.4792639Z * [new tag] ciflow/inductor/147990 -> ciflow/inductor/147990 2025-12-04T09:33:40.4793682Z * [new tag] ciflow/inductor/148294 -> ciflow/inductor/148294 2025-12-04T09:33:40.4794380Z * [new tag] ciflow/inductor/148492 -> ciflow/inductor/148492 2025-12-04T09:33:40.4795234Z * [new tag] ciflow/inductor/157149 -> ciflow/inductor/157149 2025-12-04T09:33:40.4796048Z * [new tag] ciflow/inductor/157994 -> ciflow/inductor/157994 2025-12-04T09:33:40.4796843Z * [new tag] ciflow/inductor/160685 -> ciflow/inductor/160685 2025-12-04T09:33:40.4797648Z * [new tag] ciflow/inductor/160686 -> ciflow/inductor/160686 2025-12-04T09:33:40.4798469Z * [new tag] ciflow/inductor/160687 -> ciflow/inductor/160687 2025-12-04T09:33:40.4799292Z * [new tag] ciflow/inductor/160688 -> ciflow/inductor/160688 2025-12-04T09:33:40.4800515Z * [new tag] ciflow/inductor/160706 -> ciflow/inductor/160706 2025-12-04T09:33:40.4801686Z * [new tag] ciflow/inductor/160729 -> ciflow/inductor/160729 2025-12-04T09:33:40.4802707Z * [new tag] ciflow/inductor/161938 -> ciflow/inductor/161938 2025-12-04T09:33:40.4803475Z * [new tag] ciflow/inductor/161939 -> ciflow/inductor/161939 2025-12-04T09:33:40.4804308Z * [new tag] ciflow/inductor/161940 -> ciflow/inductor/161940 2025-12-04T09:33:40.4805183Z * [new tag] ciflow/inductor/162052 -> ciflow/inductor/162052 2025-12-04T09:33:40.4806031Z * [new tag] ciflow/inductor/162275 -> ciflow/inductor/162275 2025-12-04T09:33:40.4806912Z * [new tag] ciflow/inductor/162795 -> ciflow/inductor/162795 2025-12-04T09:33:40.4808153Z * [new tag] ciflow/inductor/163245 -> ciflow/inductor/163245 2025-12-04T09:33:40.4808928Z * [new tag] ciflow/inductor/163335 -> ciflow/inductor/163335 2025-12-04T09:33:40.4810001Z * [new tag] ciflow/inductor/163503 -> ciflow/inductor/163503 2025-12-04T09:33:40.4810753Z * [new tag] ciflow/inductor/163942 -> ciflow/inductor/163942 2025-12-04T09:33:40.4811935Z * [new tag] ciflow/inductor/165270 -> ciflow/inductor/165270 2025-12-04T09:33:40.4812756Z * [new tag] ciflow/inductor/165274 -> ciflow/inductor/165274 2025-12-04T09:33:40.4813584Z * [new tag] ciflow/inductor/165322 -> ciflow/inductor/165322 2025-12-04T09:33:40.4814428Z * [new tag] ciflow/inductor/165597 -> ciflow/inductor/165597 2025-12-04T09:33:40.4815290Z * [new tag] ciflow/inductor/166063 -> ciflow/inductor/166063 2025-12-04T09:33:40.4816126Z * [new tag] ciflow/inductor/166075 -> ciflow/inductor/166075 2025-12-04T09:33:40.4817111Z * [new tag] ciflow/inductor/166165 -> ciflow/inductor/166165 2025-12-04T09:33:40.4818287Z * [new tag] ciflow/inductor/166254 -> ciflow/inductor/166254 2025-12-04T09:33:40.4819135Z * [new tag] ciflow/inductor/166483 -> ciflow/inductor/166483 2025-12-04T09:33:40.4820002Z * [new tag] ciflow/inductor/166494 -> ciflow/inductor/166494 2025-12-04T09:33:40.4820847Z * [new tag] ciflow/inductor/166545 -> ciflow/inductor/166545 2025-12-04T09:33:40.4821717Z * [new tag] ciflow/inductor/166788 -> ciflow/inductor/166788 2025-12-04T09:33:40.4822793Z * [new tag] ciflow/inductor/166846 -> ciflow/inductor/166846 2025-12-04T09:33:40.4823557Z * [new tag] ciflow/inductor/167300 -> ciflow/inductor/167300 2025-12-04T09:33:40.4824466Z * [new tag] ciflow/inductor/167407 -> ciflow/inductor/167407 2025-12-04T09:33:40.4825474Z * [new tag] ciflow/inductor/167536 -> ciflow/inductor/167536 2025-12-04T09:33:40.4826279Z * [new tag] ciflow/inductor/167552 -> ciflow/inductor/167552 2025-12-04T09:33:40.4827147Z * [new tag] ciflow/inductor/167555 -> ciflow/inductor/167555 2025-12-04T09:33:40.4828258Z * [new tag] ciflow/inductor/167583 -> ciflow/inductor/167583 2025-12-04T09:33:40.4829551Z * [new tag] ciflow/inductor/167599 -> ciflow/inductor/167599 2025-12-04T09:33:40.4830326Z * [new tag] ciflow/inductor/167647 -> ciflow/inductor/167647 2025-12-04T09:33:40.4831187Z * [new tag] ciflow/inductor/167677 -> ciflow/inductor/167677 2025-12-04T09:33:40.4832021Z * [new tag] ciflow/inductor/167680 -> ciflow/inductor/167680 2025-12-04T09:33:40.4832928Z * [new tag] ciflow/inductor/167695 -> ciflow/inductor/167695 2025-12-04T09:33:40.4833755Z * [new tag] ciflow/inductor/167742 -> ciflow/inductor/167742 2025-12-04T09:33:40.4834638Z * [new tag] ciflow/inductor/167768 -> ciflow/inductor/167768 2025-12-04T09:33:40.4835828Z * [new tag] ciflow/inductor/167773 -> ciflow/inductor/167773 2025-12-04T09:33:40.4836670Z * [new tag] ciflow/inductor/167781 -> ciflow/inductor/167781 2025-12-04T09:33:40.4837520Z * [new tag] ciflow/inductor/167880 -> ciflow/inductor/167880 2025-12-04T09:33:40.4838367Z * [new tag] ciflow/inductor/167887 -> ciflow/inductor/167887 2025-12-04T09:33:40.4839440Z * [new tag] ciflow/inductor/167972 -> ciflow/inductor/167972 2025-12-04T09:33:40.4840208Z * [new tag] ciflow/inductor/167989 -> ciflow/inductor/167989 2025-12-04T09:33:40.4841046Z * [new tag] ciflow/inductor/168002 -> ciflow/inductor/168002 2025-12-04T09:33:40.4841922Z * [new tag] ciflow/inductor/168050 -> ciflow/inductor/168050 2025-12-04T09:33:40.4842780Z * [new tag] ciflow/inductor/168051 -> ciflow/inductor/168051 2025-12-04T09:33:40.4843666Z * [new tag] ciflow/inductor/168052 -> ciflow/inductor/168052 2025-12-04T09:33:40.4844482Z * [new tag] ciflow/inductor/168073 -> ciflow/inductor/168073 2025-12-04T09:33:40.4845358Z * [new tag] ciflow/inductor/168096 -> ciflow/inductor/168096 2025-12-04T09:33:40.4846212Z * [new tag] ciflow/inductor/168114 -> ciflow/inductor/168114 2025-12-04T09:33:40.4847108Z * [new tag] ciflow/inductor/168115 -> ciflow/inductor/168115 2025-12-04T09:33:40.4847926Z * [new tag] ciflow/inductor/168127 -> ciflow/inductor/168127 2025-12-04T09:33:40.4848776Z * [new tag] ciflow/inductor/168129 -> ciflow/inductor/168129 2025-12-04T09:33:40.4849654Z * [new tag] ciflow/inductor/168157 -> ciflow/inductor/168157 2025-12-04T09:33:40.4850591Z * [new tag] ciflow/inductor/168175 -> ciflow/inductor/168175 2025-12-04T09:33:40.4851375Z * [new tag] ciflow/inductor/168185 -> ciflow/inductor/168185 2025-12-04T09:33:40.4852246Z * [new tag] ciflow/inductor/168195 -> ciflow/inductor/168195 2025-12-04T09:33:40.4853090Z * [new tag] ciflow/inductor/168209 -> ciflow/inductor/168209 2025-12-04T09:33:40.4853947Z * [new tag] ciflow/inductor/168266 -> ciflow/inductor/168266 2025-12-04T09:33:40.4854982Z * [new tag] ciflow/inductor/168316 -> ciflow/inductor/168316 2025-12-04T09:33:40.4855815Z * [new tag] ciflow/inductor/168326 -> ciflow/inductor/168326 2025-12-04T09:33:40.4856664Z * [new tag] ciflow/inductor/168368 -> ciflow/inductor/168368 2025-12-04T09:33:40.4857521Z * [new tag] ciflow/inductor/168894 -> ciflow/inductor/168894 2025-12-04T09:33:40.4858494Z * [new tag] ciflow/inductor/168934 -> ciflow/inductor/168934 2025-12-04T09:33:40.4859481Z * [new tag] ciflow/inductor/168939 -> ciflow/inductor/168939 2025-12-04T09:33:40.4860288Z * [new tag] ciflow/inductor/168946 -> ciflow/inductor/168946 2025-12-04T09:33:40.4861129Z * [new tag] ciflow/inductor/168950 -> ciflow/inductor/168950 2025-12-04T09:33:40.4861998Z * [new tag] ciflow/inductor/168951 -> ciflow/inductor/168951 2025-12-04T09:33:40.4862860Z * [new tag] ciflow/inductor/168952 -> ciflow/inductor/168952 2025-12-04T09:33:40.4863742Z * [new tag] ciflow/inductor/168955 -> ciflow/inductor/168955 2025-12-04T09:33:40.4864573Z * [new tag] ciflow/inductor/168971 -> ciflow/inductor/168971 2025-12-04T09:33:40.4865442Z * [new tag] ciflow/inductor/168979 -> ciflow/inductor/168979 2025-12-04T09:33:40.4866508Z * [new tag] ciflow/inductor/168980 -> ciflow/inductor/168980 2025-12-04T09:33:40.4867588Z * [new tag] ciflow/inductor/168983 -> ciflow/inductor/168983 2025-12-04T09:33:40.4868372Z * [new tag] ciflow/inductor/169006 -> ciflow/inductor/169006 2025-12-04T09:33:40.4869260Z * [new tag] ciflow/inductor/169023 -> ciflow/inductor/169023 2025-12-04T09:33:40.4870108Z * [new tag] ciflow/inductor/169024 -> ciflow/inductor/169024 2025-12-04T09:33:40.4870986Z * [new tag] ciflow/inductor/169025 -> ciflow/inductor/169025 2025-12-04T09:33:40.4871847Z * [new tag] ciflow/inductor/169066 -> ciflow/inductor/169066 2025-12-04T09:33:40.4872713Z * [new tag] ciflow/inductor/169091 -> ciflow/inductor/169091 2025-12-04T09:33:40.4873566Z * [new tag] ciflow/inductor/169102 -> ciflow/inductor/169102 2025-12-04T09:33:40.4874447Z * [new tag] ciflow/inductor/169103 -> ciflow/inductor/169103 2025-12-04T09:33:40.4875283Z * [new tag] ciflow/inductor/169121 -> ciflow/inductor/169121 2025-12-04T09:33:40.4876142Z * [new tag] ciflow/inductor/169134 -> ciflow/inductor/169134 2025-12-04T09:33:40.4877020Z * [new tag] ciflow/inductor/169135 -> ciflow/inductor/169135 2025-12-04T09:33:40.4877878Z * [new tag] ciflow/inductor/169141 -> ciflow/inductor/169141 2025-12-04T09:33:40.4878742Z * [new tag] ciflow/inductor/169151 -> ciflow/inductor/169151 2025-12-04T09:33:40.4879841Z * [new tag] ciflow/inductor/169161 -> ciflow/inductor/169161 2025-12-04T09:33:40.4880600Z * [new tag] ciflow/inductor/169167 -> ciflow/inductor/169167 2025-12-04T09:33:40.4881736Z * [new tag] ciflow/inductor/169177 -> ciflow/inductor/169177 2025-12-04T09:33:40.4882688Z * [new tag] ciflow/inductor/169185 -> ciflow/inductor/169185 2025-12-04T09:33:40.4883511Z * [new tag] ciflow/inductor/169196 -> ciflow/inductor/169196 2025-12-04T09:33:40.4884385Z * [new tag] ciflow/inductor/169200 -> ciflow/inductor/169200 2025-12-04T09:33:40.4885286Z * [new tag] ciflow/inductor/169204 -> ciflow/inductor/169204 2025-12-04T09:33:40.4886677Z * [new tag] ciflow/inductor/169216 -> ciflow/inductor/169216 2025-12-04T09:33:40.4887501Z * [new tag] ciflow/inductor/169219 -> ciflow/inductor/169219 2025-12-04T09:33:40.4888619Z * [new tag] ciflow/inductor/169220 -> ciflow/inductor/169220 2025-12-04T09:33:40.4889677Z * [new tag] ciflow/inductor/169230 -> ciflow/inductor/169230 2025-12-04T09:33:40.4890428Z * [new tag] ciflow/inductor/169242 -> ciflow/inductor/169242 2025-12-04T09:33:40.4891284Z * [new tag] ciflow/inductor/169245 -> ciflow/inductor/169245 2025-12-04T09:33:40.4892401Z * [new tag] ciflow/inductor/169260 -> ciflow/inductor/169260 2025-12-04T09:33:40.4893165Z * [new tag] ciflow/inductor/169282 -> ciflow/inductor/169282 2025-12-04T09:33:40.4894024Z * [new tag] ciflow/inductor/169286 -> ciflow/inductor/169286 2025-12-04T09:33:40.4894898Z * [new tag] ciflow/inductor/169299 -> ciflow/inductor/169299 2025-12-04T09:33:40.4896057Z * [new tag] ciflow/inductor/169304 -> ciflow/inductor/169304 2025-12-04T09:33:40.4897560Z * [new tag] ciflow/inductor/169305 -> ciflow/inductor/169305 2025-12-04T09:33:40.4898348Z * [new tag] ciflow/inductor/169308 -> ciflow/inductor/169308 2025-12-04T09:33:40.4899407Z * [new tag] ciflow/inductor/169319 -> ciflow/inductor/169319 2025-12-04T09:33:40.4900137Z * [new tag] ciflow/inductor/169326 -> ciflow/inductor/169326 2025-12-04T09:33:40.4901024Z * [new tag] ciflow/inductor/169332 -> ciflow/inductor/169332 2025-12-04T09:33:40.4901868Z * [new tag] ciflow/inductor/169333 -> ciflow/inductor/169333 2025-12-04T09:33:40.4903049Z * [new tag] ciflow/inductor/169336 -> ciflow/inductor/169336 2025-12-04T09:33:40.4903853Z * [new tag] ciflow/inductor/169340 -> ciflow/inductor/169340 2025-12-04T09:33:40.4904965Z * [new tag] ciflow/inductor/169341 -> ciflow/inductor/169341 2025-12-04T09:33:40.4905744Z * [new tag] ciflow/inductor/169343 -> ciflow/inductor/169343 2025-12-04T09:33:40.4906517Z * [new tag] ciflow/inductor/169346 -> ciflow/inductor/169346 2025-12-04T09:33:40.4907643Z * [new tag] ciflow/inductor/169348 -> ciflow/inductor/169348 2025-12-04T09:33:40.4908666Z * [new tag] ciflow/inductor/169350 -> ciflow/inductor/169350 2025-12-04T09:33:40.4909444Z * [new tag] ciflow/inductor/169355 -> ciflow/inductor/169355 2025-12-04T09:33:40.4910302Z * [new tag] ciflow/inductor/169370 -> ciflow/inductor/169370 2025-12-04T09:33:40.4911579Z * [new tag] ciflow/inductor/169375 -> ciflow/inductor/169375 2025-12-04T09:33:40.4912363Z * [new tag] ciflow/inductor/169389 -> ciflow/inductor/169389 2025-12-04T09:33:40.4913211Z * [new tag] ciflow/inductor/169391 -> ciflow/inductor/169391 2025-12-04T09:33:40.4914090Z * [new tag] ciflow/inductor/169393 -> ciflow/inductor/169393 2025-12-04T09:33:40.4914938Z * [new tag] ciflow/inductor/169399 -> ciflow/inductor/169399 2025-12-04T09:33:40.4916218Z * [new tag] ciflow/inductor/169400 -> ciflow/inductor/169400 2025-12-04T09:33:40.4916976Z * [new tag] ciflow/inductor/169415 -> ciflow/inductor/169415 2025-12-04T09:33:40.4918151Z * [new tag] ciflow/inductor/169417 -> ciflow/inductor/169417 2025-12-04T09:33:40.4918814Z * [new tag] ciflow/inductor/169418 -> ciflow/inductor/169418 2025-12-04T09:33:40.4919999Z * [new tag] ciflow/inductor/169430 -> ciflow/inductor/169430 2025-12-04T09:33:40.4920842Z * [new tag] ciflow/inductor/169432 -> ciflow/inductor/169432 2025-12-04T09:33:40.4921698Z * [new tag] ciflow/inductor/169436 -> ciflow/inductor/169436 2025-12-04T09:33:40.4922827Z * [new tag] ciflow/inductor/169437 -> ciflow/inductor/169437 2025-12-04T09:33:40.4923615Z * [new tag] ciflow/inductor/169438 -> ciflow/inductor/169438 2025-12-04T09:33:40.4924492Z * [new tag] ciflow/inductor/169441 -> ciflow/inductor/169441 2025-12-04T09:33:40.4925390Z * [new tag] ciflow/inductor/169446 -> ciflow/inductor/169446 2025-12-04T09:33:40.4926467Z * [new tag] ciflow/inductor/169447 -> ciflow/inductor/169447 2025-12-04T09:33:40.4927477Z * [new tag] ciflow/inductor/169452 -> ciflow/inductor/169452 2025-12-04T09:33:40.4928469Z * [new tag] ciflow/inductor/169455 -> ciflow/inductor/169455 2025-12-04T09:33:40.4929276Z * [new tag] ciflow/inductor/169459 -> ciflow/inductor/169459 2025-12-04T09:33:40.4930365Z * [new tag] ciflow/inductor/169463 -> ciflow/inductor/169463 2025-12-04T09:33:40.4931384Z * [new tag] ciflow/inductor/169476 -> ciflow/inductor/169476 2025-12-04T09:33:40.4932167Z * [new tag] ciflow/inductor/169485 -> ciflow/inductor/169485 2025-12-04T09:33:40.4933016Z * [new tag] ciflow/inductor/169493 -> ciflow/inductor/169493 2025-12-04T09:33:40.4933893Z * [new tag] ciflow/inductor/169496 -> ciflow/inductor/169496 2025-12-04T09:33:40.4934749Z * [new tag] ciflow/inductor/169497 -> ciflow/inductor/169497 2025-12-04T09:33:40.4935624Z * [new tag] ciflow/inductor/169503 -> ciflow/inductor/169503 2025-12-04T09:33:40.4936491Z * [new tag] ciflow/inductor/169504 -> ciflow/inductor/169504 2025-12-04T09:33:40.4937744Z * [new tag] ciflow/inductor/169505 -> ciflow/inductor/169505 2025-12-04T09:33:40.4939265Z * [new tag] ciflow/inductor/169508 -> ciflow/inductor/169508 2025-12-04T09:33:40.4940030Z * [new tag] ciflow/inductor/169509 -> ciflow/inductor/169509 2025-12-04T09:33:40.4941032Z * [new tag] ciflow/inductor/169513 -> ciflow/inductor/169513 2025-12-04T09:33:40.4941812Z * [new tag] ciflow/inductor/169514 -> ciflow/inductor/169514 2025-12-04T09:33:40.4942664Z * [new tag] ciflow/inductor/169515 -> ciflow/inductor/169515 2025-12-04T09:33:40.4943543Z * [new tag] ciflow/inductor/169517 -> ciflow/inductor/169517 2025-12-04T09:33:40.4944412Z * [new tag] ciflow/inductor/169519 -> ciflow/inductor/169519 2025-12-04T09:33:40.4945298Z * [new tag] ciflow/inductor/169520 -> ciflow/inductor/169520 2025-12-04T09:33:40.4946151Z * [new tag] ciflow/inductor/169521 -> ciflow/inductor/169521 2025-12-04T09:33:40.4947562Z * [new tag] ciflow/inductor/169524 -> ciflow/inductor/169524 2025-12-04T09:33:40.4948348Z * [new tag] ciflow/inductor/169527 -> ciflow/inductor/169527 2025-12-04T09:33:40.4949240Z * [new tag] ciflow/inductor/169528 -> ciflow/inductor/169528 2025-12-04T09:33:40.4950338Z * [new tag] ciflow/inductor/169532 -> ciflow/inductor/169532 2025-12-04T09:33:40.4951099Z * [new tag] ciflow/inductor/169535 -> ciflow/inductor/169535 2025-12-04T09:33:40.4951981Z * [new tag] ciflow/inductor/169536 -> ciflow/inductor/169536 2025-12-04T09:33:40.4953171Z * [new tag] ciflow/inductor/169547 -> ciflow/inductor/169547 2025-12-04T09:33:40.4953761Z * [new tag] ciflow/inductor/169548 -> ciflow/inductor/169548 2025-12-04T09:33:40.4954601Z * [new tag] ciflow/inductor/169549 -> ciflow/inductor/169549 2025-12-04T09:33:40.4955463Z * [new tag] ciflow/inductor/169551 -> ciflow/inductor/169551 2025-12-04T09:33:40.4956541Z * [new tag] ciflow/inductor/169552 -> ciflow/inductor/169552 2025-12-04T09:33:40.4957329Z * [new tag] ciflow/inductor/169553 -> ciflow/inductor/169553 2025-12-04T09:33:40.4958210Z * [new tag] ciflow/inductor/169557 -> ciflow/inductor/169557 2025-12-04T09:33:40.4959426Z * [new tag] ciflow/inductor/3b9a386 -> ciflow/inductor/3b9a386 2025-12-04T09:33:40.4960569Z * [new tag] ciflow/inductor/3d4b92b -> ciflow/inductor/3d4b92b 2025-12-04T09:33:40.4961601Z * [new tag] ciflow/inductor/d224ac7 -> ciflow/inductor/d224ac7 2025-12-04T09:33:40.4962627Z * [new tag] ciflow/linux-aarch64/157994 -> ciflow/linux-aarch64/157994 2025-12-04T09:33:40.4963416Z * [new tag] ciflow/linux-aarch64/166075 -> ciflow/linux-aarch64/166075 2025-12-04T09:33:40.4964238Z * [new tag] ciflow/linux-aarch64/166876 -> ciflow/linux-aarch64/166876 2025-12-04T09:33:40.4965033Z * [new tag] ciflow/linux-aarch64/167981 -> ciflow/linux-aarch64/167981 2025-12-04T09:33:40.4966060Z * [new tag] ciflow/mps/166254 -> ciflow/mps/166254 2025-12-04T09:33:40.4966777Z * [new tag] ciflow/mps/169017 -> ciflow/mps/169017 2025-12-04T09:33:40.4967931Z * [new tag] ciflow/mps/169372 -> ciflow/mps/169372 2025-12-04T09:33:40.4968684Z * [new tag] ciflow/mps/169478 -> ciflow/mps/169478 2025-12-04T09:33:40.4969713Z * [new tag] ciflow/op-benchmark/157994 -> ciflow/op-benchmark/157994 2025-12-04T09:33:40.4970447Z * [new tag] ciflow/op-benchmark/166075 -> ciflow/op-benchmark/166075 2025-12-04T09:33:40.4971291Z * [new tag] ciflow/op-benchmark/169544 -> ciflow/op-benchmark/169544 2025-12-04T09:33:40.4972406Z * [new tag] ciflow/periodic-rocm-mi200/165997 -> ciflow/periodic-rocm-mi200/165997 2025-12-04T09:33:40.4973287Z * [new tag] ciflow/periodic-rocm-mi200/166517 -> ciflow/periodic-rocm-mi200/166517 2025-12-04T09:33:40.4974107Z * [new tag] ciflow/periodic-rocm-mi200/169063 -> ciflow/periodic-rocm-mi200/169063 2025-12-04T09:33:40.4974978Z * [new tag] ciflow/periodic-rocm-mi200/169425 -> ciflow/periodic-rocm-mi200/169425 2025-12-04T09:33:40.4976104Z * [new tag] ciflow/periodic-rocm-mi300/166517 -> ciflow/periodic-rocm-mi300/166517 2025-12-04T09:33:40.4976805Z * [new tag] ciflow/periodic-rocm-mi300/169063 -> ciflow/periodic-rocm-mi300/169063 2025-12-04T09:33:40.4977659Z * [new tag] ciflow/periodic-rocm-mi300/169425 -> ciflow/periodic-rocm-mi300/169425 2025-12-04T09:33:40.4979073Z * [new tag] ciflow/periodic/054a2fd -> ciflow/periodic/054a2fd 2025-12-04T09:33:40.4979796Z * [new tag] ciflow/periodic/167207 -> ciflow/periodic/167207 2025-12-04T09:33:40.4980877Z * [new tag] ciflow/periodic/167978 -> ciflow/periodic/167978 2025-12-04T09:33:40.4981717Z * [new tag] ciflow/periodic/168096 -> ciflow/periodic/168096 2025-12-04T09:33:40.4982508Z * [new tag] ciflow/periodic/169286 -> ciflow/periodic/169286 2025-12-04T09:33:40.4983709Z * [new tag] ciflow/periodic/2a6d37d -> ciflow/periodic/2a6d37d 2025-12-04T09:33:40.4984604Z * [new tag] ciflow/periodic/317eeb8 -> ciflow/periodic/317eeb8 2025-12-04T09:33:40.4985798Z * [new tag] ciflow/periodic/3c32 -> ciflow/periodic/3c32 2025-12-04T09:33:40.4986657Z * [new tag] ciflow/periodic/3e98831 -> ciflow/periodic/3e98831 2025-12-04T09:33:40.4988621Z * [new tag] ciflow/periodic/7c648509a7470ace9fb2bae960dd4790f7e943e9 -> ciflow/periodic/7c648509a7470ace9fb2bae960dd4790f7e943e9 2025-12-04T09:33:40.4992283Z * [new tag] ciflow/periodic/94512-point -> ciflow/periodic/94512-point 2025-12-04T09:33:40.4993636Z * [new tag] ciflow/periodic/csl/test87519 -> ciflow/periodic/csl/test87519 2025-12-04T09:33:40.4994575Z * [new tag] ciflow/periodic/csltest88275 -> ciflow/periodic/csltest88275 2025-12-04T09:33:40.4995730Z * [new tag] ciflow/periodic/csltest88761 -> ciflow/periodic/csltest88761 2025-12-04T09:33:40.4996878Z * [new tag] ciflow/periodic/release_1.12 -> ciflow/periodic/release_1.12 2025-12-04T09:33:40.4998101Z * [new tag] ciflow/periodic/release_1.12.0 -> ciflow/periodic/release_1.12.0 2025-12-04T09:33:40.4999240Z * [new tag] ciflow/periodic/sha-ec5b83 -> ciflow/periodic/sha-ec5b83 2025-12-04T09:33:40.5000146Z * [new tag] ciflow/pull/167207 -> ciflow/pull/167207 2025-12-04T09:33:40.5001523Z * [new tag] ciflow/quantization-periodic/169207 -> ciflow/quantization-periodic/169207 2025-12-04T09:33:40.5002302Z * [new tag] ciflow/rocm-mi200/165545 -> ciflow/rocm-mi200/165545 2025-12-04T09:33:40.5003114Z * [new tag] ciflow/rocm-mi200/165997 -> ciflow/rocm-mi200/165997 2025-12-04T09:33:40.5003926Z * [new tag] ciflow/rocm-mi200/168096 -> ciflow/rocm-mi200/168096 2025-12-04T09:33:40.5005051Z * [new tag] ciflow/rocm-mi200/168275 -> ciflow/rocm-mi200/168275 2025-12-04T09:33:40.5005771Z * [new tag] ciflow/rocm-mi200/169063 -> ciflow/rocm-mi200/169063 2025-12-04T09:33:40.5006869Z * [new tag] ciflow/rocm-mi200/169356 -> ciflow/rocm-mi200/169356 2025-12-04T09:33:40.5007545Z * [new tag] ciflow/rocm-mi200/169425 -> ciflow/rocm-mi200/169425 2025-12-04T09:33:40.5008649Z * [new tag] ciflow/rocm-mi300/165545 -> ciflow/rocm-mi300/165545 2025-12-04T09:33:40.5009690Z * [new tag] ciflow/rocm-mi300/167157 -> ciflow/rocm-mi300/167157 2025-12-04T09:33:40.5010368Z * [new tag] ciflow/rocm-mi300/168096 -> ciflow/rocm-mi300/168096 2025-12-04T09:33:40.5011216Z * [new tag] ciflow/rocm-mi300/169063 -> ciflow/rocm-mi300/169063 2025-12-04T09:33:40.5012111Z * [new tag] ciflow/rocm-mi300/169425 -> ciflow/rocm-mi300/169425 2025-12-04T09:33:40.5013185Z * [new tag] ciflow/rocm-mi355/167157 -> ciflow/rocm-mi355/167157 2025-12-04T09:33:40.5013947Z * [new tag] ciflow/rocm-mi355/168275 -> ciflow/rocm-mi355/168275 2025-12-04T09:33:40.5015008Z * [new tag] ciflow/rocm-mi355/169425 -> ciflow/rocm-mi355/169425 2025-12-04T09:33:40.5015914Z * [new tag] ciflow/rocm-navi31/168275 -> ciflow/rocm-navi31/168275 2025-12-04T09:33:40.5016707Z * [new tag] ciflow/rocm-navi31/169425 -> ciflow/rocm-navi31/169425 2025-12-04T09:33:40.5017781Z * [new tag] ciflow/rocm/115316 -> ciflow/rocm/115316 2025-12-04T09:33:40.5018583Z * [new tag] ciflow/rocm/148492 -> ciflow/rocm/148492 2025-12-04T09:33:40.5019445Z * [new tag] ciflow/rocm/160685 -> ciflow/rocm/160685 2025-12-04T09:33:40.5020283Z * [new tag] ciflow/rocm/161607 -> ciflow/rocm/161607 2025-12-04T09:33:40.5021076Z * [new tag] ciflow/rocm/162052 -> ciflow/rocm/162052 2025-12-04T09:33:40.5021896Z * [new tag] ciflow/rocm/165997 -> ciflow/rocm/165997 2025-12-04T09:33:40.5022799Z * [new tag] ciflow/rocm/166165 -> ciflow/rocm/166165 2025-12-04T09:33:40.5023519Z * [new tag] ciflow/rocm/166517 -> ciflow/rocm/166517 2025-12-04T09:33:40.5024309Z * [new tag] ciflow/rocm/167207 -> ciflow/rocm/167207 2025-12-04T09:33:40.5025127Z * [new tag] ciflow/rocm/167536 -> ciflow/rocm/167536 2025-12-04T09:33:40.5025945Z * [new tag] ciflow/rocm/167781 -> ciflow/rocm/167781 2025-12-04T09:33:40.5027157Z * [new tag] ciflow/rocm/167989 -> ciflow/rocm/167989 2025-12-04T09:33:40.5028279Z * [new tag] ciflow/rocm/168073 -> ciflow/rocm/168073 2025-12-04T09:33:40.5029161Z * [new tag] ciflow/rocm/168195 -> ciflow/rocm/168195 2025-12-04T09:33:40.5030142Z * [new tag] ciflow/rocm/168939 -> ciflow/rocm/168939 2025-12-04T09:33:40.5030908Z * [new tag] ciflow/rocm/168971 -> ciflow/rocm/168971 2025-12-04T09:33:40.5031767Z * [new tag] ciflow/rocm/169024 -> ciflow/rocm/169024 2025-12-04T09:33:40.5032627Z * [new tag] ciflow/rocm/169200 -> ciflow/rocm/169200 2025-12-04T09:33:40.5033602Z * [new tag] ciflow/rocm/169216 -> ciflow/rocm/169216 2025-12-04T09:33:40.5034398Z * [new tag] ciflow/rocm/169312 -> ciflow/rocm/169312 2025-12-04T09:33:40.5035220Z * [new tag] ciflow/rocm/169380 -> ciflow/rocm/169380 2025-12-04T09:33:40.5036085Z * [new tag] ciflow/rocm/169427 -> ciflow/rocm/169427 2025-12-04T09:33:40.5036946Z * [new tag] ciflow/rocm/169455 -> ciflow/rocm/169455 2025-12-04T09:33:40.5037987Z * [new tag] ciflow/rocm/169470 -> ciflow/rocm/169470 2025-12-04T09:33:40.5038759Z * [new tag] ciflow/rocm/169471 -> ciflow/rocm/169471 2025-12-04T09:33:40.5039602Z * [new tag] ciflow/rocm/169472 -> ciflow/rocm/169472 2025-12-04T09:33:40.5040453Z * [new tag] ciflow/rocm/169514 -> ciflow/rocm/169514 2025-12-04T09:33:40.5041749Z * [new tag] ciflow/slow/01c7106 -> ciflow/slow/01c7106 2025-12-04T09:33:40.5042629Z * [new tag] ciflow/slow/0577043 -> ciflow/slow/0577043 2025-12-04T09:33:40.5044249Z * [new tag] ciflow/slow/0d5b74da0cab798fbfdb9caa53fad816999c8386-sdym -> ciflow/slow/0d5b74da0cab798fbfdb9caa53fad816999c8386-sdym 2025-12-04T09:33:40.5044770Z * [new tag] ciflow/slow/0e81104 -> ciflow/slow/0e81104 2025-12-04T09:33:40.5045639Z * [new tag] ciflow/slow/167207 -> ciflow/slow/167207 2025-12-04T09:33:40.5046455Z * [new tag] ciflow/slow/168050 -> ciflow/slow/168050 2025-12-04T09:33:40.5047574Z * [new tag] ciflow/slow/1732077 -> ciflow/slow/1732077 2025-12-04T09:33:40.5048662Z * [new tag] ciflow/slow/187eb7c -> ciflow/slow/187eb7c 2025-12-04T09:33:40.5049969Z * [new tag] ciflow/slow/1faef89 -> ciflow/slow/1faef89 2025-12-04T09:33:40.5051308Z * [new tag] ciflow/slow/3920ec1 -> ciflow/slow/3920ec1 2025-12-04T09:33:40.5052531Z * [new tag] ciflow/slow/3b7c6b2 -> ciflow/slow/3b7c6b2 2025-12-04T09:33:40.5053604Z * [new tag] ciflow/slow/59a3759 -> ciflow/slow/59a3759 2025-12-04T09:33:40.5054592Z * [new tag] ciflow/slow/70ef0bb -> ciflow/slow/70ef0bb 2025-12-04T09:33:40.5055630Z * [new tag] ciflow/slow/788ff06 -> ciflow/slow/788ff06 2025-12-04T09:33:40.5057215Z * [new tag] ciflow/slow/8751002215790a3a88750faa8f4366933e296693-sdym -> ciflow/slow/8751002215790a3a88750faa8f4366933e296693-sdym 2025-12-04T09:33:40.5057790Z * [new tag] ciflow/slow/9d85864 -> ciflow/slow/9d85864 2025-12-04T09:33:40.5059153Z * [new tag] ciflow/slow/9ffad5b -> ciflow/slow/9ffad5b 2025-12-04T09:33:40.5060007Z * [new tag] ciflow/slow/a206e8b -> ciflow/slow/a206e8b 2025-12-04T09:33:40.5061163Z * [new tag] ciflow/slow/a837609 -> ciflow/slow/a837609 2025-12-04T09:33:40.5062255Z * [new tag] ciflow/slow/af841f3 -> ciflow/slow/af841f3 2025-12-04T09:33:40.5063861Z * [new tag] ciflow/slow/da3aba1e46157c4df504b067477cdf2b3c96b194-sdym -> ciflow/slow/da3aba1e46157c4df504b067477cdf2b3c96b194-sdym 2025-12-04T09:33:40.5064442Z * [new tag] ciflow/torchbench/168175 -> ciflow/torchbench/168175 2025-12-04T09:33:40.5065525Z * [new tag] ciflow/trunk/148492 -> ciflow/trunk/148492 2025-12-04T09:33:40.5066313Z * [new tag] ciflow/trunk/157149 -> ciflow/trunk/157149 2025-12-04T09:33:40.5067102Z * [new tag] ciflow/trunk/157994 -> ciflow/trunk/157994 2025-12-04T09:33:40.5068101Z * [new tag] ciflow/trunk/159718 -> ciflow/trunk/159718 2025-12-04T09:33:40.5068836Z * [new tag] ciflow/trunk/160685 -> ciflow/trunk/160685 2025-12-04T09:33:40.5069660Z * [new tag] ciflow/trunk/160729 -> ciflow/trunk/160729 2025-12-04T09:33:40.5070459Z * [new tag] ciflow/trunk/162275 -> ciflow/trunk/162275 2025-12-04T09:33:40.5071286Z * [new tag] ciflow/trunk/162795 -> ciflow/trunk/162795 2025-12-04T09:33:40.5072139Z * [new tag] ciflow/trunk/163245 -> ciflow/trunk/163245 2025-12-04T09:33:40.5072958Z * [new tag] ciflow/trunk/163942 -> ciflow/trunk/163942 2025-12-04T09:33:40.5073747Z * [new tag] ciflow/trunk/165274 -> ciflow/trunk/165274 2025-12-04T09:33:40.5075145Z * [new tag] ciflow/trunk/165483 -> ciflow/trunk/165483 2025-12-04T09:33:40.5076275Z * [new tag] ciflow/trunk/165728 -> ciflow/trunk/165728 2025-12-04T09:33:40.5077344Z * [new tag] ciflow/trunk/165922 -> ciflow/trunk/165922 2025-12-04T09:33:40.5078071Z * [new tag] ciflow/trunk/166075 -> ciflow/trunk/166075 2025-12-04T09:33:40.5078938Z * [new tag] ciflow/trunk/166165 -> ciflow/trunk/166165 2025-12-04T09:33:40.5079777Z * [new tag] ciflow/trunk/166829 -> ciflow/trunk/166829 2025-12-04T09:33:40.5080989Z * [new tag] ciflow/trunk/166843 -> ciflow/trunk/166843 2025-12-04T09:33:40.5081714Z * [new tag] ciflow/trunk/166876 -> ciflow/trunk/166876 2025-12-04T09:33:40.5082717Z * [new tag] ciflow/trunk/167207 -> ciflow/trunk/167207 2025-12-04T09:33:40.5083465Z * [new tag] ciflow/trunk/167536 -> ciflow/trunk/167536 2025-12-04T09:33:40.5084476Z * [new tag] ciflow/trunk/167552 -> ciflow/trunk/167552 2025-12-04T09:33:40.5085856Z * [new tag] ciflow/trunk/167555 -> ciflow/trunk/167555 2025-12-04T09:33:40.5086644Z * [new tag] ciflow/trunk/167599 -> ciflow/trunk/167599 2025-12-04T09:33:40.5087634Z * [new tag] ciflow/trunk/167659 -> ciflow/trunk/167659 2025-12-04T09:33:40.5088857Z * [new tag] ciflow/trunk/167672 -> ciflow/trunk/167672 2025-12-04T09:33:40.5089658Z * [new tag] ciflow/trunk/167742 -> ciflow/trunk/167742 2025-12-04T09:33:40.5090533Z * [new tag] ciflow/trunk/167781 -> ciflow/trunk/167781 2025-12-04T09:33:40.5091698Z * [new tag] ciflow/trunk/167837 -> ciflow/trunk/167837 2025-12-04T09:33:40.5092484Z * [new tag] ciflow/trunk/167887 -> ciflow/trunk/167887 2025-12-04T09:33:40.5093338Z * [new tag] ciflow/trunk/167978 -> ciflow/trunk/167978 2025-12-04T09:33:40.5094510Z * [new tag] ciflow/trunk/168050 -> ciflow/trunk/168050 2025-12-04T09:33:40.5095057Z * [new tag] ciflow/trunk/168051 -> ciflow/trunk/168051 2025-12-04T09:33:40.5096037Z * [new tag] ciflow/trunk/168096 -> ciflow/trunk/168096 2025-12-04T09:33:40.5096821Z * [new tag] ciflow/trunk/168127 -> ciflow/trunk/168127 2025-12-04T09:33:40.5097916Z * [new tag] ciflow/trunk/168157 -> ciflow/trunk/168157 2025-12-04T09:33:40.5098786Z * [new tag] ciflow/trunk/168175 -> ciflow/trunk/168175 2025-12-04T09:33:40.5099649Z * [new tag] ciflow/trunk/168209 -> ciflow/trunk/168209 2025-12-04T09:33:40.5100779Z * [new tag] ciflow/trunk/168213 -> ciflow/trunk/168213 2025-12-04T09:33:40.5101778Z * [new tag] ciflow/trunk/168226 -> ciflow/trunk/168226 2025-12-04T09:33:40.5102610Z * [new tag] ciflow/trunk/168262 -> ciflow/trunk/168262 2025-12-04T09:33:40.5103488Z * [new tag] ciflow/trunk/168275 -> ciflow/trunk/168275 2025-12-04T09:33:40.5104604Z * [new tag] ciflow/trunk/168328 -> ciflow/trunk/168328 2025-12-04T09:33:40.5105397Z * [new tag] ciflow/trunk/168368 -> ciflow/trunk/168368 2025-12-04T09:33:40.5106263Z * [new tag] ciflow/trunk/168917 -> ciflow/trunk/168917 2025-12-04T09:33:40.5107140Z * [new tag] ciflow/trunk/168933 -> ciflow/trunk/168933 2025-12-04T09:33:40.5108278Z * [new tag] ciflow/trunk/168941 -> ciflow/trunk/168941 2025-12-04T09:33:40.5109039Z * [new tag] ciflow/trunk/168955 -> ciflow/trunk/168955 2025-12-04T09:33:40.5110023Z * [new tag] ciflow/trunk/168980 -> ciflow/trunk/168980 2025-12-04T09:33:40.5111094Z * [new tag] ciflow/trunk/169004 -> ciflow/trunk/169004 2025-12-04T09:33:40.5111866Z * [new tag] ciflow/trunk/169006 -> ciflow/trunk/169006 2025-12-04T09:33:40.5112741Z * [new tag] ciflow/trunk/169023 -> ciflow/trunk/169023 2025-12-04T09:33:40.5113595Z * [new tag] ciflow/trunk/169025 -> ciflow/trunk/169025 2025-12-04T09:33:40.5114488Z * [new tag] ciflow/trunk/169048 -> ciflow/trunk/169048 2025-12-04T09:33:40.5115343Z * [new tag] ciflow/trunk/169066 -> ciflow/trunk/169066 2025-12-04T09:33:40.5116206Z * [new tag] ciflow/trunk/169091 -> ciflow/trunk/169091 2025-12-04T09:33:40.5117062Z * [new tag] ciflow/trunk/169102 -> ciflow/trunk/169102 2025-12-04T09:33:40.5118172Z * [new tag] ciflow/trunk/169103 -> ciflow/trunk/169103 2025-12-04T09:33:40.5118959Z * [new tag] ciflow/trunk/169125 -> ciflow/trunk/169125 2025-12-04T09:33:40.5120056Z * [new tag] ciflow/trunk/169139 -> ciflow/trunk/169139 2025-12-04T09:33:40.5121133Z * [new tag] ciflow/trunk/169148 -> ciflow/trunk/169148 2025-12-04T09:33:40.5121892Z * [new tag] ciflow/trunk/169151 -> ciflow/trunk/169151 2025-12-04T09:33:40.5122918Z * [new tag] ciflow/trunk/169156 -> ciflow/trunk/169156 2025-12-04T09:33:40.5123974Z * [new tag] ciflow/trunk/169176 -> ciflow/trunk/169176 2025-12-04T09:33:40.5124743Z * [new tag] ciflow/trunk/169204 -> ciflow/trunk/169204 2025-12-04T09:33:40.5125760Z * [new tag] ciflow/trunk/169207 -> ciflow/trunk/169207 2025-12-04T09:33:40.5126495Z * [new tag] ciflow/trunk/169211 -> ciflow/trunk/169211 2025-12-04T09:33:40.5127830Z * [new tag] ciflow/trunk/169231 -> ciflow/trunk/169231 2025-12-04T09:33:40.5128683Z * [new tag] ciflow/trunk/169260 -> ciflow/trunk/169260 2025-12-04T09:33:40.5129853Z * [new tag] ciflow/trunk/169271 -> ciflow/trunk/169271 2025-12-04T09:33:40.5130647Z * [new tag] ciflow/trunk/169280 -> ciflow/trunk/169280 2025-12-04T09:33:40.5131517Z * [new tag] ciflow/trunk/169281 -> ciflow/trunk/169281 2025-12-04T09:33:40.5132396Z * [new tag] ciflow/trunk/169286 -> ciflow/trunk/169286 2025-12-04T09:33:40.5133539Z * [new tag] ciflow/trunk/169293 -> ciflow/trunk/169293 2025-12-04T09:33:40.5134311Z * [new tag] ciflow/trunk/169296 -> ciflow/trunk/169296 2025-12-04T09:33:40.5135184Z * [new tag] ciflow/trunk/169304 -> ciflow/trunk/169304 2025-12-04T09:33:40.5136044Z * [new tag] ciflow/trunk/169305 -> ciflow/trunk/169305 2025-12-04T09:33:40.5137040Z * [new tag] ciflow/trunk/169312 -> ciflow/trunk/169312 2025-12-04T09:33:40.5138351Z * [new tag] ciflow/trunk/169328 -> ciflow/trunk/169328 2025-12-04T09:33:40.5139367Z * [new tag] ciflow/trunk/169343 -> ciflow/trunk/169343 2025-12-04T09:33:40.5140108Z * [new tag] ciflow/trunk/169355 -> ciflow/trunk/169355 2025-12-04T09:33:40.5140978Z * [new tag] ciflow/trunk/169370 -> ciflow/trunk/169370 2025-12-04T09:33:40.5142133Z * [new tag] ciflow/trunk/169379 -> ciflow/trunk/169379 2025-12-04T09:33:40.5142890Z * [new tag] ciflow/trunk/169380 -> ciflow/trunk/169380 2025-12-04T09:33:40.5143763Z * [new tag] ciflow/trunk/169385 -> ciflow/trunk/169385 2025-12-04T09:33:40.5144615Z * [new tag] ciflow/trunk/169387 -> ciflow/trunk/169387 2025-12-04T09:33:40.5146298Z * [new tag] ciflow/trunk/169410 -> ciflow/trunk/169410 2025-12-04T09:33:40.5147074Z * [new tag] ciflow/trunk/169412 -> ciflow/trunk/169412 2025-12-04T09:33:40.5147926Z * [new tag] ciflow/trunk/169418 -> ciflow/trunk/169418 2025-12-04T09:33:40.5148927Z * [new tag] ciflow/trunk/169423 -> ciflow/trunk/169423 2025-12-04T09:33:40.5149650Z * [new tag] ciflow/trunk/169427 -> ciflow/trunk/169427 2025-12-04T09:33:40.5150539Z * [new tag] ciflow/trunk/169430 -> ciflow/trunk/169430 2025-12-04T09:33:40.5151547Z * [new tag] ciflow/trunk/169437 -> ciflow/trunk/169437 2025-12-04T09:33:40.5152297Z * [new tag] ciflow/trunk/169442 -> ciflow/trunk/169442 2025-12-04T09:33:40.5153173Z * [new tag] ciflow/trunk/169452 -> ciflow/trunk/169452 2025-12-04T09:33:40.5154044Z * [new tag] ciflow/trunk/169454 -> ciflow/trunk/169454 2025-12-04T09:33:40.5154936Z * [new tag] ciflow/trunk/169459 -> ciflow/trunk/169459 2025-12-04T09:33:40.5156034Z * [new tag] ciflow/trunk/169474 -> ciflow/trunk/169474 2025-12-04T09:33:40.5156850Z * [new tag] ciflow/trunk/169475 -> ciflow/trunk/169475 2025-12-04T09:33:40.5157926Z * [new tag] ciflow/trunk/169476 -> ciflow/trunk/169476 2025-12-04T09:33:40.5158948Z * [new tag] ciflow/trunk/169487 -> ciflow/trunk/169487 2025-12-04T09:33:40.5159754Z * [new tag] ciflow/trunk/169497 -> ciflow/trunk/169497 2025-12-04T09:33:40.5160619Z * [new tag] ciflow/trunk/169503 -> ciflow/trunk/169503 2025-12-04T09:33:40.5161489Z * [new tag] ciflow/trunk/169505 -> ciflow/trunk/169505 2025-12-04T09:33:40.5162365Z * [new tag] ciflow/trunk/169507 -> ciflow/trunk/169507 2025-12-04T09:33:40.5163225Z * [new tag] ciflow/trunk/169514 -> ciflow/trunk/169514 2025-12-04T09:33:40.5164278Z * [new tag] ciflow/trunk/169517 -> ciflow/trunk/169517 2025-12-04T09:33:40.5164985Z * [new tag] ciflow/trunk/169519 -> ciflow/trunk/169519 2025-12-04T09:33:40.5165938Z * [new tag] ciflow/trunk/169528 -> ciflow/trunk/169528 2025-12-04T09:33:40.5166718Z * [new tag] ciflow/trunk/169541 -> ciflow/trunk/169541 2025-12-04T09:33:40.5167899Z * [new tag] ciflow/trunk/169555 -> ciflow/trunk/169555 2025-12-04T09:33:40.5169147Z * [new tag] ciflow/unstable/123 -> ciflow/unstable/123 2025-12-04T09:33:40.5170197Z * [new tag] ciflow/vllm/165270 -> ciflow/vllm/165270 2025-12-04T09:33:40.5170833Z * [new tag] ciflow/vllm/165274 -> ciflow/vllm/165274 2025-12-04T09:33:40.5171690Z * [new tag] ciflow/vllm/166494 -> ciflow/vllm/166494 2025-12-04T09:33:40.5172513Z * [new tag] ciflow/vllm/169219 -> ciflow/vllm/169219 2025-12-04T09:33:40.5173327Z * [new tag] ciflow/vllm/169220 -> ciflow/vllm/169220 2025-12-04T09:33:40.5174397Z * [new tag] ciflow/xpu/157994 -> ciflow/xpu/157994 2025-12-04T09:33:40.5175103Z * [new tag] ciflow/xpu/159718 -> ciflow/xpu/159718 2025-12-04T09:33:40.5175922Z * [new tag] ciflow/xpu/161940 -> ciflow/xpu/161940 2025-12-04T09:33:40.5177032Z * [new tag] ciflow/xpu/163251 -> ciflow/xpu/163251 2025-12-04T09:33:40.5177682Z * [new tag] ciflow/xpu/166829 -> ciflow/xpu/166829 2025-12-04T09:33:40.5178620Z * [new tag] ciflow/xpu/166843 -> ciflow/xpu/166843 2025-12-04T09:33:40.5179450Z * [new tag] ciflow/xpu/167972 -> ciflow/xpu/167972 2025-12-04T09:33:40.5180303Z * [new tag] ciflow/xpu/167981 -> ciflow/xpu/167981 2025-12-04T09:33:40.5181074Z * [new tag] ciflow/xpu/168213 -> ciflow/xpu/168213 2025-12-04T09:33:40.5181883Z * [new tag] ciflow/xpu/168262 -> ciflow/xpu/168262 2025-12-04T09:33:40.5182710Z * [new tag] ciflow/xpu/168328 -> ciflow/xpu/168328 2025-12-04T09:33:40.5184027Z * [new tag] ciflow/xpu/168950 -> ciflow/xpu/168950 2025-12-04T09:33:40.5185283Z * [new tag] ciflow/xpu/169039 -> ciflow/xpu/169039 2025-12-04T09:33:40.5186291Z * [new tag] ciflow/xpu/169200 -> ciflow/xpu/169200 2025-12-04T09:33:40.5187078Z * [new tag] ciflow/xpu/169203 -> ciflow/xpu/169203 2025-12-04T09:33:40.5188065Z * [new tag] ciflow/xpu/169230 -> ciflow/xpu/169230 2025-12-04T09:33:40.5189093Z * [new tag] ciflow/xpu/169231 -> ciflow/xpu/169231 2025-12-04T09:33:40.5189977Z * [new tag] ciflow/xpu/169241 -> ciflow/xpu/169241 2025-12-04T09:33:40.5191058Z * [new tag] ciflow/xpu/169280 -> ciflow/xpu/169280 2025-12-04T09:33:40.5192269Z * [new tag] ciflow/xpu/169296 -> ciflow/xpu/169296 2025-12-04T09:33:40.5193352Z * [new tag] ciflow/xpu/169353 -> ciflow/xpu/169353 2025-12-04T09:33:40.5193972Z * [new tag] ciflow/xpu/169410 -> ciflow/xpu/169410 2025-12-04T09:33:40.5194948Z * [new tag] ciflow/xpu/169442 -> ciflow/xpu/169442 2025-12-04T09:33:40.5195791Z * [new tag] ciflow/xpu/169555 -> ciflow/xpu/169555 2025-12-04T09:33:40.5196842Z * [new tag] cslpull75 -> cslpull75 2025-12-04T09:33:40.5197671Z * [new tag] cslpull76 -> cslpull76 2025-12-04T09:33:40.5198699Z * [new tag] cslpull77 -> cslpull77 2025-12-04T09:33:40.5199760Z * [new tag] cslpull78 -> cslpull78 2025-12-04T09:33:40.5200949Z * [new tag] cslpull79 -> cslpull79 2025-12-04T09:33:40.5202170Z * [new tag] cslpull80 -> cslpull80 2025-12-04T09:33:40.5203155Z * [new tag] cslpull81 -> cslpull81 2025-12-04T09:33:40.5204206Z * [new tag] cslpull82 -> cslpull82 2025-12-04T09:33:40.5205047Z * [new tag] cslpull83 -> cslpull83 2025-12-04T09:33:40.5206143Z * [new tag] cslpull84 -> cslpull84 2025-12-04T09:33:40.5206986Z * [new tag] cslpull85 -> cslpull85 2025-12-04T09:33:40.5208123Z * [new tag] cslpull86 -> cslpull86 2025-12-04T09:33:40.5209089Z * [new tag] cslpull87 -> cslpull87 2025-12-04T09:33:40.5210122Z * [new tag] cslpull88 -> cslpull88 2025-12-04T09:33:40.5210966Z * [new tag] cslpull89 -> cslpull89 2025-12-04T09:33:40.5211794Z * [new tag] cslpull90 -> cslpull90 2025-12-04T09:33:40.5213297Z * [new tag] cslpull91 -> cslpull91 2025-12-04T09:33:40.5214223Z * [new tag] cslpull92 -> cslpull92 2025-12-04T09:33:40.5215359Z * [new tag] flight_5 -> flight_5 2025-12-04T09:33:40.5216688Z * [new tag] flight_5.1 -> flight_5.1 2025-12-04T09:33:40.5217548Z * [new tag] flight_5.2 -> flight_5.2 2025-12-04T09:33:40.5218778Z * [new tag] flight_5.3 -> flight_5.3 2025-12-04T09:33:40.5219836Z * [new tag] forpull1 -> forpull1 2025-12-04T09:33:40.5221020Z * [new tag] malfet/tag-2ef5611 -> malfet/tag-2ef5611 2025-12-04T09:33:40.5222051Z * [new tag] malfet/tag-317b1a0 -> malfet/tag-317b1a0 2025-12-04T09:33:40.5222914Z * [new tag] malfet/tag-ec6f767 -> malfet/tag-ec6f767 2025-12-04T09:33:40.5224076Z * [new tag] nightly-binary -> nightly-binary 2025-12-04T09:33:40.5225253Z * [new tag] sqzhang_flight4_plus -> sqzhang_flight4_plus 2025-12-04T09:33:40.5226380Z * [new tag] sqzhang_flight_3 -> sqzhang_flight_3 2025-12-04T09:33:40.5227751Z * [new tag] trunk/02d8bd6974cf84b721680d773dbdb1b6f40ce272 -> trunk/02d8bd6974cf84b721680d773dbdb1b6f40ce272 2025-12-04T09:33:40.5228653Z * [new tag] trunk/066997fb38ade71e00d78e9d572e380b5f02bd3e -> trunk/066997fb38ade71e00d78e9d572e380b5f02bd3e 2025-12-04T09:33:40.5230060Z * [new tag] trunk/076e7b19fa1d481ad778d06d2b49ba57d3ce8c88 -> trunk/076e7b19fa1d481ad778d06d2b49ba57d3ce8c88 2025-12-04T09:33:40.5231131Z * [new tag] trunk/07dcc0b83db3211653a38565a24e15acdba75654 -> trunk/07dcc0b83db3211653a38565a24e15acdba75654 2025-12-04T09:33:40.5232270Z * [new tag] trunk/082e96b68dfcd16cab7cfafc4d3d055767dab3eb -> trunk/082e96b68dfcd16cab7cfafc4d3d055767dab3eb 2025-12-04T09:33:40.5233243Z * [new tag] trunk/088048f2fea28ff7d450f65c72419ca45780d30b -> trunk/088048f2fea28ff7d450f65c72419ca45780d30b 2025-12-04T09:33:40.5234272Z * [new tag] trunk/09076941a95c76f4d9ad189d064dfd8baa39e672 -> trunk/09076941a95c76f4d9ad189d064dfd8baa39e672 2025-12-04T09:33:40.5235257Z * [new tag] trunk/0b80a4c62b94402844bf221791c096b0035c6d75 -> trunk/0b80a4c62b94402844bf221791c096b0035c6d75 2025-12-04T09:33:40.5236573Z * [new tag] trunk/0bbbdf1750567a980634ad907a325357ba8ba8f2 -> trunk/0bbbdf1750567a980634ad907a325357ba8ba8f2 2025-12-04T09:33:40.5237670Z * [new tag] trunk/0c281dd78773b2bc17c58ead0e4cd4ac46e775c5 -> trunk/0c281dd78773b2bc17c58ead0e4cd4ac46e775c5 2025-12-04T09:33:40.5238608Z * [new tag] trunk/135f3753c418a6879b1954904184937b67e61688 -> trunk/135f3753c418a6879b1954904184937b67e61688 2025-12-04T09:33:40.5239696Z * [new tag] trunk/15da21026cb13cd20257dc9e96830db108743c10 -> trunk/15da21026cb13cd20257dc9e96830db108743c10 2025-12-04T09:33:40.5240706Z * [new tag] trunk/166efdad2ac827f30fb02504c6017520257f88ec -> trunk/166efdad2ac827f30fb02504c6017520257f88ec 2025-12-04T09:33:40.5241832Z * [new tag] trunk/174272c15fae553d8488140af931f7d8050a313f -> trunk/174272c15fae553d8488140af931f7d8050a313f 2025-12-04T09:33:40.5242975Z * [new tag] trunk/18f3ca08f13b8de61307f5e8cd7d4cccb67e9d11 -> trunk/18f3ca08f13b8de61307f5e8cd7d4cccb67e9d11 2025-12-04T09:33:40.5243994Z * [new tag] trunk/1902eddfe655a15ebcf2c72bd81ade110fdeef63 -> trunk/1902eddfe655a15ebcf2c72bd81ade110fdeef63 2025-12-04T09:33:40.5245023Z * [new tag] trunk/195f92e98d3d66738577f11f22c4b5c8a1c76dd5 -> trunk/195f92e98d3d66738577f11f22c4b5c8a1c76dd5 2025-12-04T09:33:40.5246033Z * [new tag] trunk/1aa13e17de39e3c768ea7aebaad166ce72a06676 -> trunk/1aa13e17de39e3c768ea7aebaad166ce72a06676 2025-12-04T09:33:40.5247620Z * [new tag] trunk/1afe2832f58e24e54a5bfda5a5afa9b96fdea40e -> trunk/1afe2832f58e24e54a5bfda5a5afa9b96fdea40e 2025-12-04T09:33:40.5248541Z * [new tag] trunk/1c87554d74140eaee964ca8b1832cede67f5f520 -> trunk/1c87554d74140eaee964ca8b1832cede67f5f520 2025-12-04T09:33:40.5249792Z * [new tag] trunk/1ccb743b7b5be955f49736c162c4f5004b8a0dd8 -> trunk/1ccb743b7b5be955f49736c162c4f5004b8a0dd8 2025-12-04T09:33:40.5250902Z * [new tag] trunk/1cee47d6ce0a02227185b566593f002dd639ca0c -> trunk/1cee47d6ce0a02227185b566593f002dd639ca0c 2025-12-04T09:33:40.5251789Z * [new tag] trunk/1d21b4df2babe322e5d085ceb6de884eb260a62d -> trunk/1d21b4df2babe322e5d085ceb6de884eb260a62d 2025-12-04T09:33:40.5252790Z * [new tag] trunk/1e34fb2550e4aa650314f7a6d9f6daf4da7478a8 -> trunk/1e34fb2550e4aa650314f7a6d9f6daf4da7478a8 2025-12-04T09:33:40.5253907Z * [new tag] trunk/1e526fb5b1d93bfc70691c5c3955fdffc1b7b7de -> trunk/1e526fb5b1d93bfc70691c5c3955fdffc1b7b7de 2025-12-04T09:33:40.5254920Z * [new tag] trunk/1ee32a8b1f554a312d79bad01ded24f38cd95543 -> trunk/1ee32a8b1f554a312d79bad01ded24f38cd95543 2025-12-04T09:33:40.5255964Z * [new tag] trunk/201e2c4117eb9744594dad6a5c18213d7b4705d7 -> trunk/201e2c4117eb9744594dad6a5c18213d7b4705d7 2025-12-04T09:33:40.5256999Z * [new tag] trunk/2353a0f60eb4b4cb6675907a7fa9fbedc1c02e7f -> trunk/2353a0f60eb4b4cb6675907a7fa9fbedc1c02e7f 2025-12-04T09:33:40.5258361Z * [new tag] trunk/285779b1621cf9f073a062b0889a642d200308d9 -> trunk/285779b1621cf9f073a062b0889a642d200308d9 2025-12-04T09:33:40.5259205Z * [new tag] trunk/2887faaec6295d081580d09fce161201826c6d87 -> trunk/2887faaec6295d081580d09fce161201826c6d87 2025-12-04T09:33:40.5260208Z * [new tag] trunk/296e67c92635443c67b11c0ae1bd045f03ebb7bc -> trunk/296e67c92635443c67b11c0ae1bd045f03ebb7bc 2025-12-04T09:33:40.5261252Z * [new tag] trunk/29856679769b3dede478767e2fe6cfb51197cb25 -> trunk/29856679769b3dede478767e2fe6cfb51197cb25 2025-12-04T09:33:40.5262283Z * [new tag] trunk/29e5455a4740c326ab187c7aa7b5ef98034ea563 -> trunk/29e5455a4740c326ab187c7aa7b5ef98034ea563 2025-12-04T09:33:40.5263346Z * [new tag] trunk/2ac3ef882afb23136adc188975f0a8802fc68adf -> trunk/2ac3ef882afb23136adc188975f0a8802fc68adf 2025-12-04T09:33:40.5264204Z * [new tag] trunk/2bec68e73b64715354af076ad309335f943e36cd -> trunk/2bec68e73b64715354af076ad309335f943e36cd 2025-12-04T09:33:40.5265242Z * [new tag] trunk/2c87367e6f88662cd5cedbd1537748b7948c38e1 -> trunk/2c87367e6f88662cd5cedbd1537748b7948c38e1 2025-12-04T09:33:40.5266359Z * [new tag] trunk/2d1f78fe3ec13820f136a2e0336da12a25f41708 -> trunk/2d1f78fe3ec13820f136a2e0336da12a25f41708 2025-12-04T09:33:40.5267441Z * [new tag] trunk/2df6058f116a65722a0e03073402feb242572d35 -> trunk/2df6058f116a65722a0e03073402feb242572d35 2025-12-04T09:33:40.5268557Z * [new tag] trunk/2e0c2e170fe658c440775c8e5c44228aafcc47ec -> trunk/2e0c2e170fe658c440775c8e5c44228aafcc47ec 2025-12-04T09:33:40.5269793Z * [new tag] trunk/2f9b7dad7b5419b063bd0f2e204de192720ebb94 -> trunk/2f9b7dad7b5419b063bd0f2e204de192720ebb94 2025-12-04T09:33:40.5270735Z * [new tag] trunk/305168768a95d69c444df5cd334bb774edfe06f1 -> trunk/305168768a95d69c444df5cd334bb774edfe06f1 2025-12-04T09:33:40.5271726Z * [new tag] trunk/31fc12773026e8e00f054dd79ad9b2491e693b48 -> trunk/31fc12773026e8e00f054dd79ad9b2491e693b48 2025-12-04T09:33:40.5272768Z * [new tag] trunk/320de0c6b0a3e7c6d2693ea5c28d5d0156ba7991 -> trunk/320de0c6b0a3e7c6d2693ea5c28d5d0156ba7991 2025-12-04T09:33:40.5273780Z * [new tag] trunk/3418bd29475dff06695045fcdf93e7d0dac67da8 -> trunk/3418bd29475dff06695045fcdf93e7d0dac67da8 2025-12-04T09:33:40.5274789Z * [new tag] trunk/34a98608afa0cb5b48f0d6d30432fdd0a2614ddf -> trunk/34a98608afa0cb5b48f0d6d30432fdd0a2614ddf 2025-12-04T09:33:40.5275743Z * [new tag] trunk/35b7a9a26c5923d98aebaa41a031dae21788a9ee -> trunk/35b7a9a26c5923d98aebaa41a031dae21788a9ee 2025-12-04T09:33:40.5276800Z * [new tag] trunk/39d07dbf03a911bdd45d1af78d8638dc92074938 -> trunk/39d07dbf03a911bdd45d1af78d8638dc92074938 2025-12-04T09:33:40.5277685Z * [new tag] trunk/3cd98b4205ada151042cc7ff097a82d4a4b18725 -> trunk/3cd98b4205ada151042cc7ff097a82d4a4b18725 2025-12-04T09:33:40.5278698Z * [new tag] trunk/3d35fd20a78ff4d016fa80f4e5fad37191d7bcae -> trunk/3d35fd20a78ff4d016fa80f4e5fad37191d7bcae 2025-12-04T09:33:40.5279716Z * [new tag] trunk/409a5fee945c46a3edaf5df162812f201bfd7b2f -> trunk/409a5fee945c46a3edaf5df162812f201bfd7b2f 2025-12-04T09:33:40.5280732Z * [new tag] trunk/42e9005cda22da3f1c559c3649218cebd671027c -> trunk/42e9005cda22da3f1c559c3649218cebd671027c 2025-12-04T09:33:40.5281840Z * [new tag] trunk/43b94713bbf340d3c124fde02d0f73add4021247 -> trunk/43b94713bbf340d3c124fde02d0f73add4021247 2025-12-04T09:33:40.5284794Z * [new tag] trunk/44ac69388a4a5eb463dbd2a13f00d1e3b924566c -> trunk/44ac69388a4a5eb463dbd2a13f00d1e3b924566c 2025-12-04T09:33:40.5285259Z * [new tag] trunk/45d14e2497292be06ad36eaa1aaaf7c630a2586a -> trunk/45d14e2497292be06ad36eaa1aaaf7c630a2586a 2025-12-04T09:33:40.5285716Z * [new tag] trunk/45d310ad84854dff730c0b12e577d7998d978686 -> trunk/45d310ad84854dff730c0b12e577d7998d978686 2025-12-04T09:33:40.5286331Z * [new tag] trunk/47b28ddf7bd74b50fa93b307a7d3b183a6d77f54 -> trunk/47b28ddf7bd74b50fa93b307a7d3b183a6d77f54 2025-12-04T09:33:40.5286997Z * [new tag] trunk/481e5ab336275bd3acd5fa8a611b05b4469012af -> trunk/481e5ab336275bd3acd5fa8a611b05b4469012af 2025-12-04T09:33:40.5288172Z * [new tag] trunk/491731647f6b8a9345dcfb3bc9416aea254a7d96 -> trunk/491731647f6b8a9345dcfb3bc9416aea254a7d96 2025-12-04T09:33:40.5289496Z * [new tag] trunk/49a04d26088acc17d948ddd66920f3e16371e873 -> trunk/49a04d26088acc17d948ddd66920f3e16371e873 2025-12-04T09:33:40.5290449Z * [new tag] trunk/4bebc827c47d2f1f0fa1a417a5201a97aef3d985 -> trunk/4bebc827c47d2f1f0fa1a417a5201a97aef3d985 2025-12-04T09:33:40.5291377Z * [new tag] trunk/4c246677784c6a14bc2dbb9ff8773ef0a3a3222f -> trunk/4c246677784c6a14bc2dbb9ff8773ef0a3a3222f 2025-12-04T09:33:40.5292615Z * [new tag] trunk/4cfb47ff548b6d996641058cf04a70e311a4c3aa -> trunk/4cfb47ff548b6d996641058cf04a70e311a4c3aa 2025-12-04T09:33:40.5293683Z * [new tag] trunk/4e0061c1aa52f606dda8cfab0bd7591e588faf2c -> trunk/4e0061c1aa52f606dda8cfab0bd7591e588faf2c 2025-12-04T09:33:40.5295296Z * [new tag] trunk/4fefb8e7e942386ffac764a41b232241f82bea3a -> trunk/4fefb8e7e942386ffac764a41b232241f82bea3a 2025-12-04T09:33:40.5296213Z * [new tag] trunk/503b2640023521f5a35cd9a52fc8033d73a95d0d -> trunk/503b2640023521f5a35cd9a52fc8033d73a95d0d 2025-12-04T09:33:40.5297292Z * [new tag] trunk/518c2b1b3dab9a2ef2849e04b3bc2f20c1c41db9 -> trunk/518c2b1b3dab9a2ef2849e04b3bc2f20c1c41db9 2025-12-04T09:33:40.5298380Z * [new tag] trunk/5191b2fa68ba19960912bfd7fd721c79d76bb1f3 -> trunk/5191b2fa68ba19960912bfd7fd721c79d76bb1f3 2025-12-04T09:33:40.5299629Z * [new tag] trunk/52ac0f0dc4acacd219f1317fbc28ec631c01e07a -> trunk/52ac0f0dc4acacd219f1317fbc28ec631c01e07a 2025-12-04T09:33:40.5300599Z * [new tag] trunk/539ba711b029de9f191070f4f0d12f18f5b7f292 -> trunk/539ba711b029de9f191070f4f0d12f18f5b7f292 2025-12-04T09:33:40.5301649Z * [new tag] trunk/556375b55deebebbc56cb7aef81f4d52f031ba28 -> trunk/556375b55deebebbc56cb7aef81f4d52f031ba28 2025-12-04T09:33:40.5302893Z * [new tag] trunk/55c4ab554845481d0a69a3811937575fe8bb1a66 -> trunk/55c4ab554845481d0a69a3811937575fe8bb1a66 2025-12-04T09:33:40.5303826Z * [new tag] trunk/5634469fda9e5d98869c82c7d03bb08914245f96 -> trunk/5634469fda9e5d98869c82c7d03bb08914245f96 2025-12-04T09:33:40.5304693Z * [new tag] trunk/5778f6ff894686a975a9a23645178ae4c87ad5dc -> trunk/5778f6ff894686a975a9a23645178ae4c87ad5dc 2025-12-04T09:33:40.5305738Z * [new tag] trunk/587d63a3e07de5dc91065f9ef70bcacda9989068 -> trunk/587d63a3e07de5dc91065f9ef70bcacda9989068 2025-12-04T09:33:40.5306773Z * [new tag] trunk/597930f6b568852356ca9795dac76f9e4653adbd -> trunk/597930f6b568852356ca9795dac76f9e4653adbd 2025-12-04T09:33:40.5307655Z * [new tag] trunk/597df3a4e2a67b9fdbe1a89b2f4d74f822274db6 -> trunk/597df3a4e2a67b9fdbe1a89b2f4d74f822274db6 2025-12-04T09:33:40.5308973Z * [new tag] trunk/59abd50e931f4efb21b053f7a2911f5d8a49d883 -> trunk/59abd50e931f4efb21b053f7a2911f5d8a49d883 2025-12-04T09:33:40.5309876Z * [new tag] trunk/5a607febc04c3a2b5824c75f3f60307867439a2c -> trunk/5a607febc04c3a2b5824c75f3f60307867439a2c 2025-12-04T09:33:40.5310968Z * [new tag] trunk/5bf1cdf4755c54ef462b44cb8041b0a57311556b -> trunk/5bf1cdf4755c54ef462b44cb8041b0a57311556b 2025-12-04T09:33:40.5311791Z * [new tag] trunk/5f0030ba63d334d7e8c93a09e41403b89e4c573c -> trunk/5f0030ba63d334d7e8c93a09e41403b89e4c573c 2025-12-04T09:33:40.5313150Z * [new tag] trunk/5f21d27e71268464d362a96c9ac09ea475f7f202 -> trunk/5f21d27e71268464d362a96c9ac09ea475f7f202 2025-12-04T09:33:40.5314279Z * [new tag] trunk/5fafc13038c9988d9ac21fa793fbd5890604b447 -> trunk/5fafc13038c9988d9ac21fa793fbd5890604b447 2025-12-04T09:33:40.5315908Z * [new tag] trunk/61be54a31dc09b59d99b62176fb935aee0b924ef -> trunk/61be54a31dc09b59d99b62176fb935aee0b924ef 2025-12-04T09:33:40.5316895Z * [new tag] trunk/62d3ccd71484ed6a760d909b41487101bbc65719 -> trunk/62d3ccd71484ed6a760d909b41487101bbc65719 2025-12-04T09:33:40.5317981Z * [new tag] trunk/641cdb68ae27668eb441d0e49c87a0602c120c2b -> trunk/641cdb68ae27668eb441d0e49c87a0602c120c2b 2025-12-04T09:33:40.5319095Z * [new tag] trunk/65c4620d6bb0c6029f69762c22b91dda2294da9a -> trunk/65c4620d6bb0c6029f69762c22b91dda2294da9a 2025-12-04T09:33:40.5320190Z * [new tag] trunk/66004b993744b4106bf8afaba71f3c228a804206 -> trunk/66004b993744b4106bf8afaba71f3c228a804206 2025-12-04T09:33:40.5321234Z * [new tag] trunk/6658a04c7ca67acb64512341342e7b3ee13ee386 -> trunk/6658a04c7ca67acb64512341342e7b3ee13ee386 2025-12-04T09:33:40.5322216Z * [new tag] trunk/6864e309092a71f8ab0ca6a4dc7f8a4073fd31c4 -> trunk/6864e309092a71f8ab0ca6a4dc7f8a4073fd31c4 2025-12-04T09:33:40.5323314Z * [new tag] trunk/6c261c6cb07892c90ca19ed51c9705b1659a3f7d -> trunk/6c261c6cb07892c90ca19ed51c9705b1659a3f7d 2025-12-04T09:33:40.5324313Z * [new tag] trunk/6c8b6a043f1628188b6396b3a2a6e000ca68362b -> trunk/6c8b6a043f1628188b6396b3a2a6e000ca68362b 2025-12-04T09:33:40.5325291Z * [new tag] trunk/6ceb4a32f92ae67ce5d7d97931d17401ebf5ffa5 -> trunk/6ceb4a32f92ae67ce5d7d97931d17401ebf5ffa5 2025-12-04T09:33:40.5326321Z * [new tag] trunk/6e404e9b7d6f5fb0de86aa73888c3038248c17f8 -> trunk/6e404e9b7d6f5fb0de86aa73888c3038248c17f8 2025-12-04T09:33:40.5327435Z * [new tag] trunk/6ec30b490aee1db6bcdc7340abddef25784f08ec -> trunk/6ec30b490aee1db6bcdc7340abddef25784f08ec 2025-12-04T09:33:40.5328483Z * [new tag] trunk/6f2783a6c08e1db34275ff25176ffe9aebc30a71 -> trunk/6f2783a6c08e1db34275ff25176ffe9aebc30a71 2025-12-04T09:33:40.5329513Z * [new tag] trunk/6f53fefeb90ad3281119b5cfc4aa9ffd8a066e3d -> trunk/6f53fefeb90ad3281119b5cfc4aa9ffd8a066e3d 2025-12-04T09:33:40.5330580Z * [new tag] trunk/6f7dcf51e46d0c880db1a2f5c70de57adb576f4a -> trunk/6f7dcf51e46d0c880db1a2f5c70de57adb576f4a 2025-12-04T09:33:40.5331670Z * [new tag] trunk/6ff831180d2fa436c7f1c1af3adac641fce9d60e -> trunk/6ff831180d2fa436c7f1c1af3adac641fce9d60e 2025-12-04T09:33:40.5332687Z * [new tag] trunk/70076464a63ab218a7ceefb0e76ccd7131deb8f8 -> trunk/70076464a63ab218a7ceefb0e76ccd7131deb8f8 2025-12-04T09:33:40.5333699Z * [new tag] trunk/70d797a5fc109b20a517646fcaa819477cd0d485 -> trunk/70d797a5fc109b20a517646fcaa819477cd0d485 2025-12-04T09:33:40.5334709Z * [new tag] trunk/7348cb355ff0a6f79cd4871215aea72185748734 -> trunk/7348cb355ff0a6f79cd4871215aea72185748734 2025-12-04T09:33:40.5335758Z * [new tag] trunk/74fe26a1ebe32931783569f2e762e3c2c974901f -> trunk/74fe26a1ebe32931783569f2e762e3c2c974901f 2025-12-04T09:33:40.5336823Z * [new tag] trunk/76aeb8c7e0f795b3fddca134cbea9a69da3ee696 -> trunk/76aeb8c7e0f795b3fddca134cbea9a69da3ee696 2025-12-04T09:33:40.5337679Z * [new tag] trunk/7716da9fb23f27a65b41f9f016a2afadf281c18f -> trunk/7716da9fb23f27a65b41f9f016a2afadf281c18f 2025-12-04T09:33:40.5339022Z * [new tag] trunk/7741edd4ed665f3988052e260863efb508d61a03 -> trunk/7741edd4ed665f3988052e260863efb508d61a03 2025-12-04T09:33:40.5340085Z * [new tag] trunk/78adb3b3df41b45d2368b67226d2f864b78939a6 -> trunk/78adb3b3df41b45d2368b67226d2f864b78939a6 2025-12-04T09:33:40.5341099Z * [new tag] trunk/79d7b178225e5ed24d4e1db74e5abbff848f5fb7 -> trunk/79d7b178225e5ed24d4e1db74e5abbff848f5fb7 2025-12-04T09:33:40.5341977Z * [new tag] trunk/7a1e316115fc6996b3f2336822ba5d5f6179f0c3 -> trunk/7a1e316115fc6996b3f2336822ba5d5f6179f0c3 2025-12-04T09:33:40.5342984Z * [new tag] trunk/7a41b66367c38d0af3e8a90f7be48d6b281e7bca -> trunk/7a41b66367c38d0af3e8a90f7be48d6b281e7bca 2025-12-04T09:33:40.5344029Z * [new tag] trunk/7b7af390ea8541c611d1ce2018a6934188fc197b -> trunk/7b7af390ea8541c611d1ce2018a6934188fc197b 2025-12-04T09:33:40.5345097Z * [new tag] trunk/7ba4680f3755a560af81aa0f688791e367aa3609 -> trunk/7ba4680f3755a560af81aa0f688791e367aa3609 2025-12-04T09:33:40.5346192Z * [new tag] trunk/7bc2a66ded06a0b2549aa51d807edc5dc3e73d1b -> trunk/7bc2a66ded06a0b2549aa51d807edc5dc3e73d1b 2025-12-04T09:33:40.5347067Z * [new tag] trunk/7c648509a7470ace9fb2bae960dd4790f7e943e9 -> trunk/7c648509a7470ace9fb2bae960dd4790f7e943e9 2025-12-04T09:33:40.5347991Z * [new tag] trunk/7cbc2d034cecd21ab5c9707d0a9c525c17143fb8 -> trunk/7cbc2d034cecd21ab5c9707d0a9c525c17143fb8 2025-12-04T09:33:40.5349038Z * [new tag] trunk/7d1bbaf4ba301ea3fba6f3c7bc02d58f6417aaed -> trunk/7d1bbaf4ba301ea3fba6f3c7bc02d58f6417aaed 2025-12-04T09:33:40.5350152Z * [new tag] trunk/7d2a33e4ebf60b217a3cd77feae19231eb996fc8 -> trunk/7d2a33e4ebf60b217a3cd77feae19231eb996fc8 2025-12-04T09:33:40.5351215Z * [new tag] trunk/7eb625920054b1126a7d2d99818aaa188c6ba95e -> trunk/7eb625920054b1126a7d2d99818aaa188c6ba95e 2025-12-04T09:33:40.5352141Z * [new tag] trunk/7f55ba19c456a3d6cc443dd9edb6bb7cca677ead -> trunk/7f55ba19c456a3d6cc443dd9edb6bb7cca677ead 2025-12-04T09:33:40.5353157Z * [new tag] trunk/81af382128efa094d8702e18f2c133760904c718 -> trunk/81af382128efa094d8702e18f2c133760904c718 2025-12-04T09:33:40.5354619Z * [new tag] trunk/84149583d483e9c973c9a0feda70e4f3964947b0 -> trunk/84149583d483e9c973c9a0feda70e4f3964947b0 2025-12-04T09:33:40.5355990Z * [new tag] trunk/85a315917efe82c24306be805c584ec044951c75 -> trunk/85a315917efe82c24306be805c584ec044951c75 2025-12-04T09:33:40.5357047Z * [new tag] trunk/87329491c82a5f8c1cc4ec11d8f55a5de2551ece -> trunk/87329491c82a5f8c1cc4ec11d8f55a5de2551ece 2025-12-04T09:33:40.5357917Z * [new tag] trunk/892640e25aeefa8007c5af837214b4502b6b62a6 -> trunk/892640e25aeefa8007c5af837214b4502b6b62a6 2025-12-04T09:33:40.5359304Z * [new tag] trunk/89e3bbcb5b5321dc8b9520b4d5a8ee60cea1d0b4 -> trunk/89e3bbcb5b5321dc8b9520b4d5a8ee60cea1d0b4 2025-12-04T09:33:40.5360214Z * [new tag] trunk/8c73bbbb02159223c0c97d268a0a74cb78158a1c -> trunk/8c73bbbb02159223c0c97d268a0a74cb78158a1c 2025-12-04T09:33:40.5361254Z * [new tag] trunk/8d56e98c8db988a22cb2dfaeefb30bc7d2a3cc43 -> trunk/8d56e98c8db988a22cb2dfaeefb30bc7d2a3cc43 2025-12-04T09:33:40.5362367Z * [new tag] trunk/8d9dd9603e5ee26c01007f0cd4f018e584840922 -> trunk/8d9dd9603e5ee26c01007f0cd4f018e584840922 2025-12-04T09:33:40.5363439Z * [new tag] trunk/8ef0c0b02b062d75e7c9be2594914a3e784d23ca -> trunk/8ef0c0b02b062d75e7c9be2594914a3e784d23ca 2025-12-04T09:33:40.5364559Z * [new tag] trunk/90b27e7e8352cde97d32ddad24740ef819633f38 -> trunk/90b27e7e8352cde97d32ddad24740ef819633f38 2025-12-04T09:33:40.5365410Z * [new tag] trunk/90f0139e64b2951815d524b6a373bed20c4fbf90 -> trunk/90f0139e64b2951815d524b6a373bed20c4fbf90 2025-12-04T09:33:40.5366351Z * [new tag] trunk/93d0d6838c56af59b0dba794e6aa08f0c1c7799c -> trunk/93d0d6838c56af59b0dba794e6aa08f0c1c7799c 2025-12-04T09:33:40.5367457Z * [new tag] trunk/94ca8d5f1e81fea3ae488650a0fb6795049a9f87 -> trunk/94ca8d5f1e81fea3ae488650a0fb6795049a9f87 2025-12-04T09:33:40.5368433Z * [new tag] trunk/9844fbeadd5cebdf1281d6fbf79164139c352693 -> trunk/9844fbeadd5cebdf1281d6fbf79164139c352693 2025-12-04T09:33:40.5369462Z * [new tag] trunk/99024dec888ec1e50b546822a32b6fb2f35e5eaa -> trunk/99024dec888ec1e50b546822a32b6fb2f35e5eaa 2025-12-04T09:33:40.5370488Z * [new tag] trunk/9a296e640fc88aa44d275b48cd9cc30c573b169d -> trunk/9a296e640fc88aa44d275b48cd9cc30c573b169d 2025-12-04T09:33:40.5371570Z * [new tag] trunk/9b3e34d8589b29f7b4e7fab6f78711b7ca6e4639 -> trunk/9b3e34d8589b29f7b4e7fab6f78711b7ca6e4639 2025-12-04T09:33:40.5372607Z * [new tag] trunk/9cd055e547e9b67a5f9827f8999c38d7eda1bcb8 -> trunk/9cd055e547e9b67a5f9827f8999c38d7eda1bcb8 2025-12-04T09:33:40.5373669Z * [new tag] trunk/9f0df5686cb4ada94f94620acba2e3c3f363b11d -> trunk/9f0df5686cb4ada94f94620acba2e3c3f363b11d 2025-12-04T09:33:40.5379778Z * [new tag] trunk/9f7fceb887d0cfa0326a59b887821c63ff11340a -> trunk/9f7fceb887d0cfa0326a59b887821c63ff11340a 2025-12-04T09:33:40.5380403Z * [new tag] trunk/9f8ef8855d3078d70f7b782540ff2aaf158d6742 -> trunk/9f8ef8855d3078d70f7b782540ff2aaf158d6742 2025-12-04T09:33:40.5380881Z * [new tag] trunk/9fb52efc797b47a1f425a03aa5e47b866d8b1098 -> trunk/9fb52efc797b47a1f425a03aa5e47b866d8b1098 2025-12-04T09:33:40.5381347Z * [new tag] trunk/9ff4a2ebc5762d46c73e46b1b523d7ff349fedfa -> trunk/9ff4a2ebc5762d46c73e46b1b523d7ff349fedfa 2025-12-04T09:33:40.5381920Z * [new tag] trunk/a0f3937b94422354538ebbd47202d5b0e8a3fd0d -> trunk/a0f3937b94422354538ebbd47202d5b0e8a3fd0d 2025-12-04T09:33:40.5382376Z * [new tag] trunk/a15066c28b3145e6edbfc88359d0411d14cfc70c -> trunk/a15066c28b3145e6edbfc88359d0411d14cfc70c 2025-12-04T09:33:40.5382846Z * [new tag] trunk/a20f775e82564d2a9979221ed7f3b8d7cf54ce90 -> trunk/a20f775e82564d2a9979221ed7f3b8d7cf54ce90 2025-12-04T09:33:40.5383300Z * [new tag] trunk/a2973fb00ec002dd4b6bbf07385f066efb259b8c -> trunk/a2973fb00ec002dd4b6bbf07385f066efb259b8c 2025-12-04T09:33:40.5383770Z * [new tag] trunk/a7dc6dab9ad911259d4801c502907e531594db45 -> trunk/a7dc6dab9ad911259d4801c502907e531594db45 2025-12-04T09:33:40.5385117Z * [new tag] trunk/a951a9cee65c01660bbc6e6fded90ecb10fa6109 -> trunk/a951a9cee65c01660bbc6e6fded90ecb10fa6109 2025-12-04T09:33:40.5386116Z * [new tag] trunk/abfa1a6d65c7c159e35c72c25979b9da4971689e -> trunk/abfa1a6d65c7c159e35c72c25979b9da4971689e 2025-12-04T09:33:40.5387368Z * [new tag] trunk/ae3a2395bf66151078e2d201716f7d63ce1c6f3e -> trunk/ae3a2395bf66151078e2d201716f7d63ce1c6f3e 2025-12-04T09:33:40.5388297Z * [new tag] trunk/afdff7f0325080dedac44d080cb5a3b0e65e6c5e -> trunk/afdff7f0325080dedac44d080cb5a3b0e65e6c5e 2025-12-04T09:33:40.5389257Z * [new tag] trunk/b1aed4e7a72c03a38f44543aaea0dae2e9b76d48 -> trunk/b1aed4e7a72c03a38f44543aaea0dae2e9b76d48 2025-12-04T09:33:40.5390322Z * [new tag] trunk/b1decff555cd50e2123c8c6e25cc0d447c411f62 -> trunk/b1decff555cd50e2123c8c6e25cc0d447c411f62 2025-12-04T09:33:40.5391566Z * [new tag] trunk/b2b6b034c9fd08672c40e63ef243556ad4c49bd2 -> trunk/b2b6b034c9fd08672c40e63ef243556ad4c49bd2 2025-12-04T09:33:40.5392507Z * [new tag] trunk/b39813b4a04931682b0491adba2138d01d716d99 -> trunk/b39813b4a04931682b0491adba2138d01d716d99 2025-12-04T09:33:40.5393600Z * [new tag] trunk/b3a7edb2311367974cc7cd764cfb11a5d6758b24 -> trunk/b3a7edb2311367974cc7cd764cfb11a5d6758b24 2025-12-04T09:33:40.5394960Z * [new tag] trunk/b4cc1329c86acaef6d42c1fac7169b8d870ab0d7 -> trunk/b4cc1329c86acaef6d42c1fac7169b8d870ab0d7 2025-12-04T09:33:40.5395963Z * [new tag] trunk/b555c39217f765759954a4f9f9bd1e9b87bed11a -> trunk/b555c39217f765759954a4f9f9bd1e9b87bed11a 2025-12-04T09:33:40.5397084Z * [new tag] trunk/b6b6c80379388b7f9932c3e6a0f9907bf430e417 -> trunk/b6b6c80379388b7f9932c3e6a0f9907bf430e417 2025-12-04T09:33:40.5398139Z * [new tag] trunk/b6b6d912df0b6f4082f8e50b18bd1de1dd7325f4 -> trunk/b6b6d912df0b6f4082f8e50b18bd1de1dd7325f4 2025-12-04T09:33:40.5399233Z * [new tag] trunk/b7d60685f8cbc939b68a20871e90db67e729329b -> trunk/b7d60685f8cbc939b68a20871e90db67e729329b 2025-12-04T09:33:40.5400369Z * [new tag] trunk/b7f6b9a4fc6259f7af068f31868b3119bb1bac3e -> trunk/b7f6b9a4fc6259f7af068f31868b3119bb1bac3e 2025-12-04T09:33:40.5401666Z * [new tag] trunk/b8c4ba3593761e7b2a3ebd86f040fb07b47c02cf -> trunk/b8c4ba3593761e7b2a3ebd86f040fb07b47c02cf 2025-12-04T09:33:40.5402490Z * [new tag] trunk/b9c8f3a4884befb965ff42620ce44a71b04887f5 -> trunk/b9c8f3a4884befb965ff42620ce44a71b04887f5 2025-12-04T09:33:40.5403537Z * [new tag] trunk/ba1412546f3082c0958c077acc2025e4dbc33f1f -> trunk/ba1412546f3082c0958c077acc2025e4dbc33f1f 2025-12-04T09:33:40.5404668Z * [new tag] trunk/bac403c0b38c63bdbcc0c31f1c2b0bc0260f610f -> trunk/bac403c0b38c63bdbcc0c31f1c2b0bc0260f610f 2025-12-04T09:33:40.5405731Z * [new tag] trunk/bb3034198b459401fabeab254e1b99f0115046e2 -> trunk/bb3034198b459401fabeab254e1b99f0115046e2 2025-12-04T09:33:40.5406763Z * [new tag] trunk/bc39b2b3bc7a6e19a42e62bd576974035086fe55 -> trunk/bc39b2b3bc7a6e19a42e62bd576974035086fe55 2025-12-04T09:33:40.5408159Z * [new tag] trunk/bc43d5b297f207a11d83d77ddf0152bdaabe15a8 -> trunk/bc43d5b297f207a11d83d77ddf0152bdaabe15a8 2025-12-04T09:33:40.5409095Z * [new tag] trunk/bc6a4863c7246a6493d16d4ea6eee71ec07c6a09 -> trunk/bc6a4863c7246a6493d16d4ea6eee71ec07c6a09 2025-12-04T09:33:40.5410149Z * [new tag] trunk/bea4912944defdbcb8b061800caab6cbbbd01df5 -> trunk/bea4912944defdbcb8b061800caab6cbbbd01df5 2025-12-04T09:33:40.5411620Z * [new tag] trunk/c04e2c656f48d82d1521b867bbbf03967b9b7564 -> trunk/c04e2c656f48d82d1521b867bbbf03967b9b7564 2025-12-04T09:33:40.5412617Z * [new tag] trunk/c0660bcee27e7d7731634e274576a7081882bede -> trunk/c0660bcee27e7d7731634e274576a7081882bede 2025-12-04T09:33:40.5413739Z * [new tag] trunk/c178ed43d3d99cbefe84fbfb21d6f282b20d62ac -> trunk/c178ed43d3d99cbefe84fbfb21d6f282b20d62ac 2025-12-04T09:33:40.5414775Z * [new tag] trunk/c55b1e8f61d041ee436d697449eb028931d574fb -> trunk/c55b1e8f61d041ee436d697449eb028931d574fb 2025-12-04T09:33:40.5415689Z * [new tag] trunk/c6ae7579fe12fe75f1a8f7043a494c90567273f1 -> trunk/c6ae7579fe12fe75f1a8f7043a494c90567273f1 2025-12-04T09:33:40.5417107Z * [new tag] trunk/c8210e7d94bad5ae21ac389fa4ba8a463c76c4d0 -> trunk/c8210e7d94bad5ae21ac389fa4ba8a463c76c4d0 2025-12-04T09:33:40.5418270Z * [new tag] trunk/cc0853af42122f8185321f542616f4474e717f09 -> trunk/cc0853af42122f8185321f542616f4474e717f09 2025-12-04T09:33:40.5419268Z * [new tag] trunk/cddec6562eabfa390d014fa3741a5659cf9c94c9 -> trunk/cddec6562eabfa390d014fa3741a5659cf9c94c9 2025-12-04T09:33:40.5420413Z * [new tag] trunk/ce5e7e3bf1f4b69a4f4f93d288ba75b906df492a -> trunk/ce5e7e3bf1f4b69a4f4f93d288ba75b906df492a 2025-12-04T09:33:40.5421717Z * [new tag] trunk/d038b0130ec7c20ebcac219301292fd8e98a1ace -> trunk/d038b0130ec7c20ebcac219301292fd8e98a1ace 2025-12-04T09:33:40.5422634Z * [new tag] trunk/d16447dacaf2420ea175f0c275c75da951f57d39 -> trunk/d16447dacaf2420ea175f0c275c75da951f57d39 2025-12-04T09:33:40.5423653Z * [new tag] trunk/d19f1e8cab6810bb2e99141f9976665954c67a50 -> trunk/d19f1e8cab6810bb2e99141f9976665954c67a50 2025-12-04T09:33:40.5424737Z * [new tag] trunk/d1c9f03b2a5af4104721712f8cdffe9b4f340c01 -> trunk/d1c9f03b2a5af4104721712f8cdffe9b4f340c01 2025-12-04T09:33:40.5425983Z * [new tag] trunk/d40f4950f2b7f7aa380a22fe0f6166e71680fbcf -> trunk/d40f4950f2b7f7aa380a22fe0f6166e71680fbcf 2025-12-04T09:33:40.5426985Z * [new tag] trunk/d5038950bacfe36bbf24a47a455fe76901deb8e8 -> trunk/d5038950bacfe36bbf24a47a455fe76901deb8e8 2025-12-04T09:33:40.5427992Z * [new tag] trunk/d54ff42903c2ae0533931ff11d23b35f875bdb3d -> trunk/d54ff42903c2ae0533931ff11d23b35f875bdb3d 2025-12-04T09:33:40.5429089Z * [new tag] trunk/d76697633a2d2b9cced1ae21161849b33bfe7e47 -> trunk/d76697633a2d2b9cced1ae21161849b33bfe7e47 2025-12-04T09:33:40.5430126Z * [new tag] trunk/d78f52b199c547106d4cd9d2856dd0805c118bf1 -> trunk/d78f52b199c547106d4cd9d2856dd0805c118bf1 2025-12-04T09:33:40.5431203Z * [new tag] trunk/d8fd5c6eed28e5004150691d048a3f6785e19a8e -> trunk/d8fd5c6eed28e5004150691d048a3f6785e19a8e 2025-12-04T09:33:40.5432270Z * [new tag] trunk/d900f5e86745dec76713f4b0ef07005ef36b2f5a -> trunk/d900f5e86745dec76713f4b0ef07005ef36b2f5a 2025-12-04T09:33:40.5433333Z * [new tag] trunk/d973dc6b87d763859fe1c5bd1287e3b6b1c49d1b -> trunk/d973dc6b87d763859fe1c5bd1287e3b6b1c49d1b 2025-12-04T09:33:40.5434427Z * [new tag] trunk/d998c03304cb6ede76e1ed535b4ddeb6c2bf40ec -> trunk/d998c03304cb6ede76e1ed535b4ddeb6c2bf40ec 2025-12-04T09:33:40.5435530Z * [new tag] trunk/d9cb8a70833101dbbe16b99520cfbdd70d0a87bf -> trunk/d9cb8a70833101dbbe16b99520cfbdd70d0a87bf 2025-12-04T09:33:40.5436584Z * [new tag] trunk/d9d5e91b43f70eb8637af55db6856d49be391ffd -> trunk/d9d5e91b43f70eb8637af55db6856d49be391ffd 2025-12-04T09:33:40.5437621Z * [new tag] trunk/dd18a75336a4fbd7497955cc5665904724fce889 -> trunk/dd18a75336a4fbd7497955cc5665904724fce889 2025-12-04T09:33:40.5438717Z * [new tag] trunk/ded9bcd61a059bf723e6e84689552962b480ea77 -> trunk/ded9bcd61a059bf723e6e84689552962b480ea77 2025-12-04T09:33:40.5439772Z * [new tag] trunk/dfbd3714d15c37a7b83b322a6b60f997fc00f50c -> trunk/dfbd3714d15c37a7b83b322a6b60f997fc00f50c 2025-12-04T09:33:40.5441057Z * [new tag] trunk/e115f9f4e4b039f8e9a642aaa2bd8254a920541b -> trunk/e115f9f4e4b039f8e9a642aaa2bd8254a920541b 2025-12-04T09:33:40.5441878Z * [new tag] trunk/e3f24fd73ad74c6e7176687986436956c7c18235 -> trunk/e3f24fd73ad74c6e7176687986436956c7c18235 2025-12-04T09:33:40.5443167Z * [new tag] trunk/e7d24d3ff93d1503ba63860b7057438ad93f918e -> trunk/e7d24d3ff93d1503ba63860b7057438ad93f918e 2025-12-04T09:33:40.5444546Z * [new tag] trunk/ea7035f462a0d2830865ee86c832bd101e1427fc -> trunk/ea7035f462a0d2830865ee86c832bd101e1427fc 2025-12-04T09:33:40.5445210Z * [new tag] trunk/eabb7ad2128580ef674446027b95bcf4e21e8df3 -> trunk/eabb7ad2128580ef674446027b95bcf4e21e8df3 2025-12-04T09:33:40.5446361Z * [new tag] trunk/eb5c63652a33da42e7018c23df5f20a3eb4c6ccf -> trunk/eb5c63652a33da42e7018c23df5f20a3eb4c6ccf 2025-12-04T09:33:40.5447470Z * [new tag] trunk/ec2c71f5c85021b8938cdafadce24c15a36fd93e -> trunk/ec2c71f5c85021b8938cdafadce24c15a36fd93e 2025-12-04T09:33:40.5448533Z * [new tag] trunk/ecbcc3f6bf327856b435b259ac63cc2f328c4b4e -> trunk/ecbcc3f6bf327856b435b259ac63cc2f328c4b4e 2025-12-04T09:33:40.5450111Z * [new tag] trunk/ee87bbe876c42575e961b32a0827d76bc9782ca2 -> trunk/ee87bbe876c42575e961b32a0827d76bc9782ca2 2025-12-04T09:33:40.5451157Z * [new tag] trunk/ef019d1d431c4c5a95b594cb90d40a50cd00f5e4 -> trunk/ef019d1d431c4c5a95b594cb90d40a50cd00f5e4 2025-12-04T09:33:40.5452176Z * [new tag] trunk/ef8ecc13830a86c4b231f1aad9aba7851db61b53 -> trunk/ef8ecc13830a86c4b231f1aad9aba7851db61b53 2025-12-04T09:33:40.5453235Z * [new tag] trunk/f1076f5510920044912247b1abb8760cb820f598 -> trunk/f1076f5510920044912247b1abb8760cb820f598 2025-12-04T09:33:40.5454779Z * [new tag] trunk/f2d6a75a00a1d648ca9a0abc6a33e14c3dea6c40 -> trunk/f2d6a75a00a1d648ca9a0abc6a33e14c3dea6c40 2025-12-04T09:33:40.5455788Z * [new tag] trunk/f47dd0ddef1359e5b43e4b962412f67b30ecde56 -> trunk/f47dd0ddef1359e5b43e4b962412f67b30ecde56 2025-12-04T09:33:40.5457071Z * [new tag] trunk/f49d32dfa4730dcfb1b60eeeb369b5889da983c8 -> trunk/f49d32dfa4730dcfb1b60eeeb369b5889da983c8 2025-12-04T09:33:40.5457946Z * [new tag] trunk/f4dedf78fc30fd4b93975787ca6074ee89db9467 -> trunk/f4dedf78fc30fd4b93975787ca6074ee89db9467 2025-12-04T09:33:40.5459147Z * [new tag] trunk/f7c0d03819ebed05c4038f095d66d1b8c54aca17 -> trunk/f7c0d03819ebed05c4038f095d66d1b8c54aca17 2025-12-04T09:33:40.5460208Z * [new tag] trunk/f7e1bd80a063e17453c361837ba6ea2570920a73 -> trunk/f7e1bd80a063e17453c361837ba6ea2570920a73 2025-12-04T09:33:40.5461112Z * [new tag] trunk/f9bd6c53624c7c0ea3772de78498326e84c2f0e7 -> trunk/f9bd6c53624c7c0ea3772de78498326e84c2f0e7 2025-12-04T09:33:40.5462459Z * [new tag] trunk/fb5be221a46b51bfc9509013b0d85bc5a9d4f15b -> trunk/fb5be221a46b51bfc9509013b0d85bc5a9d4f15b 2025-12-04T09:33:40.5463357Z * [new tag] trunk/fdf863d5e1de3b2688c9511e96876e34581dbfd7 -> trunk/fdf863d5e1de3b2688c9511e96876e34581dbfd7 2025-12-04T09:33:40.5465056Z * [new tag] trunk/fe0e65adfc0e7ca6e5f57e6ea8b16bd5cc967307 -> trunk/fe0e65adfc0e7ca6e5f57e6ea8b16bd5cc967307 2025-12-04T09:33:40.5466033Z * [new tag] trunk/fec710bf89173f5355468a7ce1afe9157c3d9009 -> trunk/fec710bf89173f5355468a7ce1afe9157c3d9009 2025-12-04T09:33:40.5467364Z * [new tag] trunk/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 -> trunk/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:33:40.5467962Z * [new tag] v0.1.1 -> v0.1.1 2025-12-04T09:33:40.5469058Z * [new tag] v0.1.10 -> v0.1.10 2025-12-04T09:33:40.5470095Z * [new tag] v0.1.11 -> v0.1.11 2025-12-04T09:33:40.5471141Z * [new tag] v0.1.12 -> v0.1.12 2025-12-04T09:33:40.5472100Z * [new tag] v0.1.2 -> v0.1.2 2025-12-04T09:33:40.5473148Z * [new tag] v0.1.3 -> v0.1.3 2025-12-04T09:33:40.5473877Z * [new tag] v0.1.4 -> v0.1.4 2025-12-04T09:33:40.5474923Z * [new tag] v0.1.5 -> v0.1.5 2025-12-04T09:33:40.5475911Z * [new tag] v0.1.6 -> v0.1.6 2025-12-04T09:33:40.5476756Z * [new tag] v0.1.7 -> v0.1.7 2025-12-04T09:33:40.5477766Z * [new tag] v0.1.8 -> v0.1.8 2025-12-04T09:33:40.5478747Z * [new tag] v0.1.9 -> v0.1.9 2025-12-04T09:33:40.5479737Z * [new tag] v0.2.0 -> v0.2.0 2025-12-04T09:33:40.5480827Z * [new tag] v0.3.0 -> v0.3.0 2025-12-04T09:33:40.5481877Z * [new tag] v0.3.1 -> v0.3.1 2025-12-04T09:33:40.5482900Z * [new tag] v0.4.0 -> v0.4.0 2025-12-04T09:33:40.5483738Z * [new tag] v0.4.1 -> v0.4.1 2025-12-04T09:33:40.5484833Z * [new tag] v1.0.0 -> v1.0.0 2025-12-04T09:33:40.5485806Z * [new tag] v1.0.0a0 -> v1.0.0a0 2025-12-04T09:33:40.5486765Z * [new tag] v1.0.1 -> v1.0.1 2025-12-04T09:33:40.5487908Z * [new tag] v1.0rc0 -> v1.0rc0 2025-12-04T09:33:40.5491178Z * [new tag] v1.0rc1 -> v1.0rc1 2025-12-04T09:33:40.5492372Z * [new tag] v1.1.0 -> v1.1.0 2025-12-04T09:33:40.5493437Z * [new tag] v1.1.0a0 -> v1.1.0a0 2025-12-04T09:33:40.5494720Z * [new tag] v1.10.0 -> v1.10.0 2025-12-04T09:33:40.5496316Z * [new tag] v1.10.0-rc1 -> v1.10.0-rc1 2025-12-04T09:33:40.5497342Z * [new tag] v1.10.0-rc2 -> v1.10.0-rc2 2025-12-04T09:33:40.5498046Z * [new tag] v1.10.0-rc3 -> v1.10.0-rc3 2025-12-04T09:33:40.5499377Z * [new tag] v1.10.1 -> v1.10.1 2025-12-04T09:33:40.5500343Z * [new tag] v1.10.1-rc1 -> v1.10.1-rc1 2025-12-04T09:33:40.5500919Z * [new tag] v1.10.2 -> v1.10.2 2025-12-04T09:33:40.5501700Z * [new tag] v1.10.2-rc1 -> v1.10.2-rc1 2025-12-04T09:33:40.5502863Z * [new tag] v1.11.0 -> v1.11.0 2025-12-04T09:33:40.5503975Z * [new tag] v1.11.0-rc1 -> v1.11.0-rc1 2025-12-04T09:33:40.5505057Z * [new tag] v1.11.0-rc2 -> v1.11.0-rc2 2025-12-04T09:33:40.5506152Z * [new tag] v1.11.0-rc3 -> v1.11.0-rc3 2025-12-04T09:33:40.5507234Z * [new tag] v1.11.0-rc4 -> v1.11.0-rc4 2025-12-04T09:33:40.5508307Z * [new tag] v1.11.0-rc5 -> v1.11.0-rc5 2025-12-04T09:33:40.5509019Z * [new tag] v1.11.0-rc6 -> v1.11.0-rc6 2025-12-04T09:33:40.5509804Z * [new tag] v1.11.0-rc7 -> v1.11.0-rc7 2025-12-04T09:33:40.5511205Z * [new tag] v1.12.0 -> v1.12.0 2025-12-04T09:33:40.5511907Z * [new tag] v1.12.0-rc1 -> v1.12.0-rc1 2025-12-04T09:33:40.5513101Z * [new tag] v1.12.0-rc2 -> v1.12.0-rc2 2025-12-04T09:33:40.5514154Z * [new tag] v1.12.0-rc3 -> v1.12.0-rc3 2025-12-04T09:33:40.5515245Z * [new tag] v1.12.0-rc4 -> v1.12.0-rc4 2025-12-04T09:33:40.5516227Z * [new tag] v1.12.0-rc5 -> v1.12.0-rc5 2025-12-04T09:33:40.5517380Z * [new tag] v1.12.0-rc6 -> v1.12.0-rc6 2025-12-04T09:33:40.5518068Z * [new tag] v1.12.0-rc7 -> v1.12.0-rc7 2025-12-04T09:33:40.5518873Z * [new tag] v1.12.0-rc8 -> v1.12.0-rc8 2025-12-04T09:33:40.5519702Z * [new tag] v1.12.1 -> v1.12.1 2025-12-04T09:33:40.5520997Z * [new tag] v1.12.1-rc1 -> v1.12.1-rc1 2025-12-04T09:33:40.5522048Z * [new tag] v1.12.1-rc2 -> v1.12.1-rc2 2025-12-04T09:33:40.5523111Z * [new tag] v1.12.1-rc3 -> v1.12.1-rc3 2025-12-04T09:33:40.5524181Z * [new tag] v1.12.1-rc4 -> v1.12.1-rc4 2025-12-04T09:33:40.5524981Z * [new tag] v1.12.1-rc5 -> v1.12.1-rc5 2025-12-04T09:33:40.5526177Z * [new tag] v1.13.0 -> v1.13.0 2025-12-04T09:33:40.5527155Z * [new tag] v1.13.0-rc1 -> v1.13.0-rc1 2025-12-04T09:33:40.5528146Z * [new tag] v1.13.0-rc2 -> v1.13.0-rc2 2025-12-04T09:33:40.5529187Z * [new tag] v1.13.0-rc3 -> v1.13.0-rc3 2025-12-04T09:33:40.5530369Z * [new tag] v1.13.0-rc4 -> v1.13.0-rc4 2025-12-04T09:33:40.5531083Z * [new tag] v1.13.0-rc5 -> v1.13.0-rc5 2025-12-04T09:33:40.5531880Z * [new tag] v1.13.0-rc6 -> v1.13.0-rc6 2025-12-04T09:33:40.5533108Z * [new tag] v1.13.1 -> v1.13.1 2025-12-04T09:33:40.5533789Z * [new tag] v1.13.1-rc1 -> v1.13.1-rc1 2025-12-04T09:33:40.5534925Z * [new tag] v1.2.0 -> v1.2.0 2025-12-04T09:33:40.5535906Z * [new tag] v1.2.0a0 -> v1.2.0a0 2025-12-04T09:33:40.5536948Z * [new tag] v1.3.0 -> v1.3.0 2025-12-04T09:33:40.5537822Z * [new tag] v1.3.0a0 -> v1.3.0a0 2025-12-04T09:33:40.5538738Z * [new tag] v1.3.1 -> v1.3.1 2025-12-04T09:33:40.5539849Z * [new tag] v1.4.0 -> v1.4.0 2025-12-04T09:33:40.5540868Z * [new tag] v1.4.0a0 -> v1.4.0a0 2025-12-04T09:33:40.5541483Z * [new tag] v1.4.1 -> v1.4.1 2025-12-04T09:33:40.5542725Z * [new tag] v1.5.0 -> v1.5.0 2025-12-04T09:33:40.5543858Z * [new tag] v1.5.0-rc1 -> v1.5.0-rc1 2025-12-04T09:33:40.5544935Z * [new tag] v1.5.0-rc2 -> v1.5.0-rc2 2025-12-04T09:33:40.5546035Z * [new tag] v1.5.0-rc3 -> v1.5.0-rc3 2025-12-04T09:33:40.5546894Z * [new tag] v1.5.0-rc4 -> v1.5.0-rc4 2025-12-04T09:33:40.5547696Z * [new tag] v1.5.0-rc5 -> v1.5.0-rc5 2025-12-04T09:33:40.5548917Z * [new tag] v1.5.1 -> v1.5.1 2025-12-04T09:33:40.5549632Z * [new tag] v1.5.1-rc1 -> v1.5.1-rc1 2025-12-04T09:33:40.5550431Z * [new tag] v1.6.0 -> v1.6.0 2025-12-04T09:33:40.5551569Z * [new tag] v1.6.0-rc1 -> v1.6.0-rc1 2025-12-04T09:33:40.5552789Z * [new tag] v1.6.0-rc2 -> v1.6.0-rc2 2025-12-04T09:33:40.5553745Z * [new tag] v1.6.0-rc3 -> v1.6.0-rc3 2025-12-04T09:33:40.5554839Z * [new tag] v1.6.0-rc4 -> v1.6.0-rc4 2025-12-04T09:33:40.5555881Z * [new tag] v1.6.0-rc5 -> v1.6.0-rc5 2025-12-04T09:33:40.5557003Z * [new tag] v1.6.0-rc6 -> v1.6.0-rc6 2025-12-04T09:33:40.5557683Z * [new tag] v1.6.0-rc7 -> v1.6.0-rc7 2025-12-04T09:33:40.5558898Z * [new tag] v1.7.0 -> v1.7.0 2025-12-04T09:33:40.5560418Z * [new tag] v1.7.0-rc1 -> v1.7.0-rc1 2025-12-04T09:33:40.5561584Z * [new tag] v1.7.0-rc2 -> v1.7.0-rc2 2025-12-04T09:33:40.5562633Z * [new tag] v1.7.0-rc3 -> v1.7.0-rc3 2025-12-04T09:33:40.5563319Z * [new tag] v1.7.0-rc4 -> v1.7.0-rc4 2025-12-04T09:33:40.5564503Z * [new tag] v1.7.1 -> v1.7.1 2025-12-04T09:33:40.5565681Z * [new tag] v1.7.1-rc1 -> v1.7.1-rc1 2025-12-04T09:33:40.5566763Z * [new tag] v1.7.1-rc2 -> v1.7.1-rc2 2025-12-04T09:33:40.5567485Z * [new tag] v1.7.1-rc3 -> v1.7.1-rc3 2025-12-04T09:33:40.5568667Z * [new tag] v1.8.0 -> v1.8.0 2025-12-04T09:33:40.5569427Z * [new tag] v1.8.0-rc1 -> v1.8.0-rc1 2025-12-04T09:33:40.5570585Z * [new tag] v1.8.0-rc2 -> v1.8.0-rc2 2025-12-04T09:33:40.5571676Z * [new tag] v1.8.0-rc3 -> v1.8.0-rc3 2025-12-04T09:33:40.5572537Z * [new tag] v1.8.0-rc4 -> v1.8.0-rc4 2025-12-04T09:33:40.5573361Z * [new tag] v1.8.0-rc5 -> v1.8.0-rc5 2025-12-04T09:33:40.5574179Z * [new tag] v1.8.1 -> v1.8.1 2025-12-04T09:33:40.5575331Z * [new tag] v1.8.1-rc1 -> v1.8.1-rc1 2025-12-04T09:33:40.5576039Z * [new tag] v1.8.1-rc2 -> v1.8.1-rc2 2025-12-04T09:33:40.5576838Z * [new tag] v1.8.1-rc3 -> v1.8.1-rc3 2025-12-04T09:33:40.5578622Z * [new tag] v1.8.2 -> v1.8.2 2025-12-04T09:33:40.5579398Z * [new tag] v1.8.2-rc1 -> v1.8.2-rc1 2025-12-04T09:33:40.5580555Z * [new tag] v1.9.0 -> v1.9.0 2025-12-04T09:33:40.5581619Z * [new tag] v1.9.0-rc1 -> v1.9.0-rc1 2025-12-04T09:33:40.5582758Z * [new tag] v1.9.0-rc2 -> v1.9.0-rc2 2025-12-04T09:33:40.5583818Z * [new tag] v1.9.0-rc3 -> v1.9.0-rc3 2025-12-04T09:33:40.5584539Z * [new tag] v1.9.0-rc4 -> v1.9.0-rc4 2025-12-04T09:33:40.5585691Z * [new tag] v1.9.1 -> v1.9.1 2025-12-04T09:33:40.5586982Z * [new tag] v1.9.1-rc1 -> v1.9.1-rc1 2025-12-04T09:33:40.5587675Z * [new tag] v1.9.1-rc2 -> v1.9.1-rc2 2025-12-04T09:33:40.5589073Z * [new tag] v2.0.0 -> v2.0.0 2025-12-04T09:33:40.5590186Z * [new tag] v2.0.0-rc1 -> v2.0.0-rc1 2025-12-04T09:33:40.5591291Z * [new tag] v2.0.0-rc2 -> v2.0.0-rc2 2025-12-04T09:33:40.5592369Z * [new tag] v2.0.0-rc3 -> v2.0.0-rc3 2025-12-04T09:33:40.5593289Z * [new tag] v2.0.0-rc4 -> v2.0.0-rc4 2025-12-04T09:33:40.5594479Z * [new tag] v2.0.0-rc5 -> v2.0.0-rc5 2025-12-04T09:33:40.5595299Z * [new tag] v2.0.0-rc6 -> v2.0.0-rc6 2025-12-04T09:33:40.5596397Z * [new tag] v2.0.1 -> v2.0.1 2025-12-04T09:33:40.5597464Z * [new tag] v2.0.1-rc1 -> v2.0.1-rc1 2025-12-04T09:33:40.5598202Z * [new tag] v2.0.1-rc2 -> v2.0.1-rc2 2025-12-04T09:33:40.5599284Z * [new tag] v2.0.1-rc3 -> v2.0.1-rc3 2025-12-04T09:33:40.5599973Z * [new tag] v2.0.1-rc4 -> v2.0.1-rc4 2025-12-04T09:33:40.5601704Z * [new tag] v2.1.0 -> v2.1.0 2025-12-04T09:33:40.5602670Z * [new tag] v2.1.0-rc1 -> v2.1.0-rc1 2025-12-04T09:33:40.5603799Z * [new tag] v2.1.0-rc2 -> v2.1.0-rc2 2025-12-04T09:33:40.5604976Z * [new tag] v2.1.0-rc3 -> v2.1.0-rc3 2025-12-04T09:33:40.5606027Z * [new tag] v2.1.0-rc4 -> v2.1.0-rc4 2025-12-04T09:33:40.5607110Z * [new tag] v2.1.0-rc5 -> v2.1.0-rc5 2025-12-04T09:33:40.5607793Z * [new tag] v2.1.0-rc6 -> v2.1.0-rc6 2025-12-04T09:33:40.5609081Z * [new tag] v2.1.1 -> v2.1.1 2025-12-04T09:33:40.5610216Z * [new tag] v2.1.1-rc1 -> v2.1.1-rc1 2025-12-04T09:33:40.5611238Z * [new tag] v2.1.1-rc2 -> v2.1.1-rc2 2025-12-04T09:33:40.5612436Z * [new tag] v2.1.1-rc3 -> v2.1.1-rc3 2025-12-04T09:33:40.5613490Z * [new tag] v2.1.1-rc4 -> v2.1.1-rc4 2025-12-04T09:33:40.5614470Z * [new tag] v2.1.1-rc5 -> v2.1.1-rc5 2025-12-04T09:33:40.5615220Z * [new tag] v2.1.1-rc6 -> v2.1.1-rc6 2025-12-04T09:33:40.5616334Z * [new tag] v2.1.2 -> v2.1.2 2025-12-04T09:33:40.5617476Z * [new tag] v2.1.2-rc1 -> v2.1.2-rc1 2025-12-04T09:33:40.5618608Z * [new tag] v2.1.2-rc2 -> v2.1.2-rc2 2025-12-04T09:33:40.5619396Z * [new tag] v2.1.2-rc3 -> v2.1.2-rc3 2025-12-04T09:33:40.5620610Z * [new tag] v2.2.0 -> v2.2.0 2025-12-04T09:33:40.5621631Z * [new tag] v2.2.0-rc1 -> v2.2.0-rc1 2025-12-04T09:33:40.5622699Z * [new tag] v2.2.0-rc2 -> v2.2.0-rc2 2025-12-04T09:33:40.5623803Z * [new tag] v2.2.0-rc3 -> v2.2.0-rc3 2025-12-04T09:33:40.5624750Z * [new tag] v2.2.0-rc4 -> v2.2.0-rc4 2025-12-04T09:33:40.5625758Z * [new tag] v2.2.0-rc5 -> v2.2.0-rc5 2025-12-04T09:33:40.5627355Z * [new tag] v2.2.0-rc6 -> v2.2.0-rc6 2025-12-04T09:33:40.5628046Z * [new tag] v2.2.0-rc7 -> v2.2.0-rc7 2025-12-04T09:33:40.5628864Z * [new tag] v2.2.0-rc8 -> v2.2.0-rc8 2025-12-04T09:33:40.5630078Z * [new tag] v2.2.1 -> v2.2.1 2025-12-04T09:33:40.5631221Z * [new tag] v2.2.1-rc1 -> v2.2.1-rc1 2025-12-04T09:33:40.5631933Z * [new tag] v2.2.1-rc2 -> v2.2.1-rc2 2025-12-04T09:33:40.5632720Z * [new tag] v2.2.1-rc3 -> v2.2.1-rc3 2025-12-04T09:33:40.5633880Z * [new tag] v2.2.2 -> v2.2.2 2025-12-04T09:33:40.5635110Z * [new tag] v2.2.2-rc1 -> v2.2.2-rc1 2025-12-04T09:33:40.5635845Z * [new tag] v2.2.2-rc2 -> v2.2.2-rc2 2025-12-04T09:33:40.5636677Z * [new tag] v2.2.2-rc3 -> v2.2.2-rc3 2025-12-04T09:33:40.5637929Z * [new tag] v2.3.0 -> v2.3.0 2025-12-04T09:33:40.5638902Z * [new tag] v2.3.0-rc1 -> v2.3.0-rc1 2025-12-04T09:33:40.5640054Z * [new tag] v2.3.0-rc10 -> v2.3.0-rc10 2025-12-04T09:33:40.5641167Z * [new tag] v2.3.0-rc11 -> v2.3.0-rc11 2025-12-04T09:33:40.5641908Z * [new tag] v2.3.0-rc12 -> v2.3.0-rc12 2025-12-04T09:33:40.5643082Z * [new tag] v2.3.0-rc2 -> v2.3.0-rc2 2025-12-04T09:33:40.5644172Z * [new tag] v2.3.0-rc3 -> v2.3.0-rc3 2025-12-04T09:33:40.5645198Z * [new tag] v2.3.0-rc4 -> v2.3.0-rc4 2025-12-04T09:33:40.5646266Z * [new tag] v2.3.0-rc5 -> v2.3.0-rc5 2025-12-04T09:33:40.5646988Z * [new tag] v2.3.0-rc6 -> v2.3.0-rc6 2025-12-04T09:33:40.5648146Z * [new tag] v2.3.0-rc7 -> v2.3.0-rc7 2025-12-04T09:33:40.5649169Z * [new tag] v2.3.0-rc8 -> v2.3.0-rc8 2025-12-04T09:33:40.5649858Z * [new tag] v2.3.0-rc9 -> v2.3.0-rc9 2025-12-04T09:33:40.5650677Z * [new tag] v2.3.1 -> v2.3.1 2025-12-04T09:33:40.5651935Z * [new tag] v2.3.1-rc1 -> v2.3.1-rc1 2025-12-04T09:33:40.5652970Z * [new tag] v2.3.1-rc2 -> v2.3.1-rc2 2025-12-04T09:33:40.5654072Z * [new tag] v2.3.1-rc3 -> v2.3.1-rc3 2025-12-04T09:33:40.5655176Z * [new tag] v2.4.0 -> v2.4.0 2025-12-04T09:33:40.5656325Z * [new tag] v2.4.0-rc1 -> v2.4.0-rc1 2025-12-04T09:33:40.5657337Z * [new tag] v2.4.0-rc2 -> v2.4.0-rc2 2025-12-04T09:33:40.5658472Z * [new tag] v2.4.0-rc3 -> v2.4.0-rc3 2025-12-04T09:33:40.5659549Z * [new tag] v2.4.0-rc4 -> v2.4.0-rc4 2025-12-04T09:33:40.5660646Z * [new tag] v2.4.0-rc5 -> v2.4.0-rc5 2025-12-04T09:33:40.5661761Z * [new tag] v2.4.0-rc6 -> v2.4.0-rc6 2025-12-04T09:33:40.5662809Z * [new tag] v2.4.0-rc7 -> v2.4.0-rc7 2025-12-04T09:33:40.5663848Z * [new tag] v2.4.0-rc8 -> v2.4.0-rc8 2025-12-04T09:33:40.5664978Z * [new tag] v2.4.0-rc9 -> v2.4.0-rc9 2025-12-04T09:33:40.5665705Z * [new tag] v2.4.1 -> v2.4.1 2025-12-04T09:33:40.5666970Z * [new tag] v2.4.1-rc1 -> v2.4.1-rc1 2025-12-04T09:33:40.5668082Z * [new tag] v2.4.1-rc2 -> v2.4.1-rc2 2025-12-04T09:33:40.5669163Z * [new tag] v2.4.1-rc3 -> v2.4.1-rc3 2025-12-04T09:33:40.5670285Z * [new tag] v2.5.0 -> v2.5.0 2025-12-04T09:33:40.5671310Z * [new tag] v2.5.0-rc1 -> v2.5.0-rc1 2025-12-04T09:33:40.5672014Z * [new tag] v2.5.0-rc10 -> v2.5.0-rc10 2025-12-04T09:33:40.5673156Z * [new tag] v2.5.0-rc2 -> v2.5.0-rc2 2025-12-04T09:33:40.5674208Z * [new tag] v2.5.0-rc3 -> v2.5.0-rc3 2025-12-04T09:33:40.5675273Z * [new tag] v2.5.0-rc4 -> v2.5.0-rc4 2025-12-04T09:33:40.5676327Z * [new tag] v2.5.0-rc5 -> v2.5.0-rc5 2025-12-04T09:33:40.5677491Z * [new tag] v2.5.0-rc6 -> v2.5.0-rc6 2025-12-04T09:33:40.5678513Z * [new tag] v2.5.0-rc7 -> v2.5.0-rc7 2025-12-04T09:33:40.5679631Z * [new tag] v2.5.0-rc8 -> v2.5.0-rc8 2025-12-04T09:33:40.5680752Z * [new tag] v2.5.0-rc9 -> v2.5.0-rc9 2025-12-04T09:33:40.5681396Z * [new tag] v2.5.1 -> v2.5.1 2025-12-04T09:33:40.5682187Z * [new tag] v2.5.1-rc1 -> v2.5.1-rc1 2025-12-04T09:33:40.5682998Z * [new tag] v2.6.0 -> v2.6.0 2025-12-04T09:33:40.5684226Z * [new tag] v2.6.0-rc1 -> v2.6.0-rc1 2025-12-04T09:33:40.5685373Z * [new tag] v2.6.0-rc2 -> v2.6.0-rc2 2025-12-04T09:33:40.5686474Z * [new tag] v2.6.0-rc3 -> v2.6.0-rc3 2025-12-04T09:33:40.5687507Z * [new tag] v2.6.0-rc4 -> v2.6.0-rc4 2025-12-04T09:33:40.5689025Z * [new tag] v2.6.0-rc5 -> v2.6.0-rc5 2025-12-04T09:33:40.5690274Z * [new tag] v2.6.0-rc6 -> v2.6.0-rc6 2025-12-04T09:33:40.5691390Z * [new tag] v2.6.0-rc7 -> v2.6.0-rc7 2025-12-04T09:33:40.5692623Z * [new tag] v2.6.0-rc8 -> v2.6.0-rc8 2025-12-04T09:33:40.5694115Z * [new tag] v2.6.0-rc9 -> v2.6.0-rc9 2025-12-04T09:33:40.5695414Z * [new tag] v2.7.0 -> v2.7.0 2025-12-04T09:33:40.5696325Z * [new tag] v2.7.0-rc1 -> v2.7.0-rc1 2025-12-04T09:33:40.5697199Z * [new tag] v2.7.0-rc10 -> v2.7.0-rc10 2025-12-04T09:33:40.5698599Z * [new tag] v2.7.0-rc2 -> v2.7.0-rc2 2025-12-04T09:33:40.5699791Z * [new tag] v2.7.0-rc3 -> v2.7.0-rc3 2025-12-04T09:33:40.5700892Z * [new tag] v2.7.0-rc4 -> v2.7.0-rc4 2025-12-04T09:33:40.5701890Z * [new tag] v2.7.0-rc5 -> v2.7.0-rc5 2025-12-04T09:33:40.5702933Z * [new tag] v2.7.0-rc6 -> v2.7.0-rc6 2025-12-04T09:33:40.5704014Z * [new tag] v2.7.0-rc7 -> v2.7.0-rc7 2025-12-04T09:33:40.5705106Z * [new tag] v2.7.0-rc8 -> v2.7.0-rc8 2025-12-04T09:33:40.5706269Z * [new tag] v2.7.0-rc9 -> v2.7.0-rc9 2025-12-04T09:33:40.5707003Z * [new tag] v2.7.1 -> v2.7.1 2025-12-04T09:33:40.5708222Z * [new tag] v2.7.1-rc1 -> v2.7.1-rc1 2025-12-04T09:33:40.5709341Z * [new tag] v2.7.1-rc2 -> v2.7.1-rc2 2025-12-04T09:33:40.5710508Z * [new tag] v2.7.1-rc3 -> v2.7.1-rc3 2025-12-04T09:33:40.5711619Z * [new tag] v2.7.1-rc4 -> v2.7.1-rc4 2025-12-04T09:33:40.5712635Z * [new tag] v2.7.1-rc5 -> v2.7.1-rc5 2025-12-04T09:33:40.5713480Z * [new tag] v2.8.0 -> v2.8.0 2025-12-04T09:33:40.5714556Z * [new tag] v2.8.0-rc1 -> v2.8.0-rc1 2025-12-04T09:33:40.5715566Z * [new tag] v2.8.0-rc2 -> v2.8.0-rc2 2025-12-04T09:33:40.5716706Z * [new tag] v2.8.0-rc3 -> v2.8.0-rc3 2025-12-04T09:33:40.5718080Z * [new tag] v2.8.0-rc4 -> v2.8.0-rc4 2025-12-04T09:33:40.5719081Z * [new tag] v2.8.0-rc5 -> v2.8.0-rc5 2025-12-04T09:33:40.5720225Z * [new tag] v2.8.0-rc6 -> v2.8.0-rc6 2025-12-04T09:33:40.5721342Z * [new tag] v2.8.0-rc7 -> v2.8.0-rc7 2025-12-04T09:33:40.5722396Z * [new tag] v2.8.0-rc8 -> v2.8.0-rc8 2025-12-04T09:33:40.5723547Z * [new tag] v2.9.0 -> v2.9.0 2025-12-04T09:33:40.5724600Z * [new tag] v2.9.0-rc1 -> v2.9.0-rc1 2025-12-04T09:33:40.5726050Z * [new tag] v2.9.0-rc10 -> v2.9.0-rc10 2025-12-04T09:33:40.5726947Z * [new tag] v2.9.0-rc11 -> v2.9.0-rc11 2025-12-04T09:33:40.5728787Z * [new tag] v2.9.0-rc2 -> v2.9.0-rc2 2025-12-04T09:33:40.5729448Z * [new tag] v2.9.0-rc3 -> v2.9.0-rc3 2025-12-04T09:33:40.5730689Z * [new tag] v2.9.0-rc4 -> v2.9.0-rc4 2025-12-04T09:33:40.5731693Z * [new tag] v2.9.0-rc5 -> v2.9.0-rc5 2025-12-04T09:33:40.5733211Z * [new tag] v2.9.0-rc6 -> v2.9.0-rc6 2025-12-04T09:33:40.5734147Z * [new tag] v2.9.0-rc7 -> v2.9.0-rc7 2025-12-04T09:33:40.5735578Z * [new tag] v2.9.0-rc8 -> v2.9.0-rc8 2025-12-04T09:33:40.5736316Z * [new tag] v2.9.0-rc9 -> v2.9.0-rc9 2025-12-04T09:33:40.5737143Z * [new tag] v2.9.1 -> v2.9.1 2025-12-04T09:33:40.5738390Z * [new tag] v2.9.1-rc1 -> v2.9.1-rc1 2025-12-04T09:33:40.5739711Z * [new tag] v2.9.1-rc2 -> v2.9.1-rc2 2025-12-04T09:33:40.5741148Z * [new tag] viable/strict/1759343184 -> viable/strict/1759343184 2025-12-04T09:33:40.5742049Z * [new tag] viable/strict/1759346540 -> viable/strict/1759346540 2025-12-04T09:33:40.5743197Z * [new tag] viable/strict/1759348181 -> viable/strict/1759348181 2025-12-04T09:33:40.5744171Z * [new tag] viable/strict/1759350324 -> viable/strict/1759350324 2025-12-04T09:33:40.5745054Z * [new tag] viable/strict/1759351793 -> viable/strict/1759351793 2025-12-04T09:33:40.5746059Z * [new tag] viable/strict/1759353844 -> viable/strict/1759353844 2025-12-04T09:33:40.5747052Z * [new tag] viable/strict/1759355374 -> viable/strict/1759355374 2025-12-04T09:33:40.5748014Z * [new tag] viable/strict/1759357472 -> viable/strict/1759357472 2025-12-04T09:33:40.5748998Z * [new tag] viable/strict/1759361002 -> viable/strict/1759361002 2025-12-04T09:33:40.5750404Z * [new tag] viable/strict/1759362585 -> viable/strict/1759362585 2025-12-04T09:33:40.5751489Z * [new tag] viable/strict/1759365359 -> viable/strict/1759365359 2025-12-04T09:33:40.5752548Z * [new tag] viable/strict/1759370089 -> viable/strict/1759370089 2025-12-04T09:33:40.5753611Z * [new tag] viable/strict/1759377554 -> viable/strict/1759377554 2025-12-04T09:33:40.5754663Z * [new tag] viable/strict/1759379133 -> viable/strict/1759379133 2025-12-04T09:33:40.5755670Z * [new tag] viable/strict/1759389871 -> viable/strict/1759389871 2025-12-04T09:33:40.5756750Z * [new tag] viable/strict/1759393562 -> viable/strict/1759393562 2025-12-04T09:33:40.5757875Z * [new tag] viable/strict/1759395076 -> viable/strict/1759395076 2025-12-04T09:33:40.5759108Z * [new tag] viable/strict/1759398579 -> viable/strict/1759398579 2025-12-04T09:33:40.5760109Z * [new tag] viable/strict/1759404142 -> viable/strict/1759404142 2025-12-04T09:33:40.5761106Z * [new tag] viable/strict/1759405773 -> viable/strict/1759405773 2025-12-04T09:33:40.5762135Z * [new tag] viable/strict/1759408041 -> viable/strict/1759408041 2025-12-04T09:33:40.5763235Z * [new tag] viable/strict/1759411593 -> viable/strict/1759411593 2025-12-04T09:33:40.5764204Z * [new tag] viable/strict/1759427395 -> viable/strict/1759427395 2025-12-04T09:33:40.5765224Z * [new tag] viable/strict/1759434582 -> viable/strict/1759434582 2025-12-04T09:33:40.5766254Z * [new tag] viable/strict/1759436720 -> viable/strict/1759436720 2025-12-04T09:33:40.5767428Z * [new tag] viable/strict/1759440219 -> viable/strict/1759440219 2025-12-04T09:33:40.5768378Z * [new tag] viable/strict/1759441948 -> viable/strict/1759441948 2025-12-04T09:33:40.5769374Z * [new tag] viable/strict/1759443860 -> viable/strict/1759443860 2025-12-04T09:33:40.5770499Z * [new tag] viable/strict/1759445377 -> viable/strict/1759445377 2025-12-04T09:33:40.5771650Z * [new tag] viable/strict/1759447415 -> viable/strict/1759447415 2025-12-04T09:33:40.5772637Z * [new tag] viable/strict/1759451750 -> viable/strict/1759451750 2025-12-04T09:33:40.5773678Z * [new tag] viable/strict/1759453910 -> viable/strict/1759453910 2025-12-04T09:33:40.5774730Z * [new tag] viable/strict/1759456483 -> viable/strict/1759456483 2025-12-04T09:33:40.5775874Z * [new tag] viable/strict/1759459279 -> viable/strict/1759459279 2025-12-04T09:33:40.5776909Z * [new tag] viable/strict/1759460742 -> viable/strict/1759460742 2025-12-04T09:33:40.5777934Z * [new tag] viable/strict/1759462025 -> viable/strict/1759462025 2025-12-04T09:33:40.5779238Z * [new tag] viable/strict/1759469086 -> viable/strict/1759469086 2025-12-04T09:33:40.5780243Z * [new tag] viable/strict/1759470581 -> viable/strict/1759470581 2025-12-04T09:33:40.5781285Z * [new tag] viable/strict/1759472786 -> viable/strict/1759472786 2025-12-04T09:33:40.5782315Z * [new tag] viable/strict/1759476294 -> viable/strict/1759476294 2025-12-04T09:33:40.5783340Z * [new tag] viable/strict/1759479963 -> viable/strict/1759479963 2025-12-04T09:33:40.5784354Z * [new tag] viable/strict/1759492177 -> viable/strict/1759492177 2025-12-04T09:33:40.5785392Z * [new tag] viable/strict/1759519278 -> viable/strict/1759519278 2025-12-04T09:33:40.5786409Z * [new tag] viable/strict/1759524580 -> viable/strict/1759524580 2025-12-04T09:33:40.5787392Z * [new tag] viable/strict/1759528193 -> viable/strict/1759528193 2025-12-04T09:33:40.5788962Z * [new tag] viable/strict/1759533797 -> viable/strict/1759533797 2025-12-04T09:33:40.5789933Z * [new tag] viable/strict/1759542780 -> viable/strict/1759542780 2025-12-04T09:33:40.5791044Z * [new tag] viable/strict/1759549779 -> viable/strict/1759549779 2025-12-04T09:33:40.5792083Z * [new tag] viable/strict/1759555455 -> viable/strict/1759555455 2025-12-04T09:33:40.5793225Z * [new tag] viable/strict/1759559176 -> viable/strict/1759559176 2025-12-04T09:33:40.5794280Z * [new tag] viable/strict/1759560629 -> viable/strict/1759560629 2025-12-04T09:33:40.5795306Z * [new tag] viable/strict/1759569848 -> viable/strict/1759569848 2025-12-04T09:33:40.5796686Z * [new tag] viable/strict/1759571382 -> viable/strict/1759571382 2025-12-04T09:33:40.5797609Z * [new tag] viable/strict/1759573474 -> viable/strict/1759573474 2025-12-04T09:33:40.5798614Z * [new tag] viable/strict/1759618187 -> viable/strict/1759618187 2025-12-04T09:33:40.5799678Z * [new tag] viable/strict/1759626742 -> viable/strict/1759626742 2025-12-04T09:33:40.5800770Z * [new tag] viable/strict/1759632427 -> viable/strict/1759632427 2025-12-04T09:33:40.5801799Z * [new tag] viable/strict/1759634971 -> viable/strict/1759634971 2025-12-04T09:33:40.5802937Z * [new tag] viable/strict/1759661382 -> viable/strict/1759661382 2025-12-04T09:33:40.5804004Z * [new tag] viable/strict/1759663294 -> viable/strict/1759663294 2025-12-04T09:33:40.5804884Z * [new tag] viable/strict/1759708178 -> viable/strict/1759708178 2025-12-04T09:33:40.5806661Z * [new tag] viable/strict/1759715695 -> viable/strict/1759715695 2025-12-04T09:33:40.5807499Z * [new tag] viable/strict/1759728293 -> viable/strict/1759728293 2025-12-04T09:33:40.5808599Z * [new tag] viable/strict/1759735513 -> viable/strict/1759735513 2025-12-04T09:33:40.5809679Z * [new tag] viable/strict/1759739177 -> viable/strict/1759739177 2025-12-04T09:33:40.5810695Z * [new tag] viable/strict/1759758635 -> viable/strict/1759758635 2025-12-04T09:33:40.5811714Z * [new tag] viable/strict/1759765784 -> viable/strict/1759765784 2025-12-04T09:33:40.5812734Z * [new tag] viable/strict/1759767948 -> viable/strict/1759767948 2025-12-04T09:33:40.5813893Z * [new tag] viable/strict/1759771461 -> viable/strict/1759771461 2025-12-04T09:33:40.5814742Z * [new tag] viable/strict/1759776706 -> viable/strict/1759776706 2025-12-04T09:33:40.5815804Z * [new tag] viable/strict/1759782317 -> viable/strict/1759782317 2025-12-04T09:33:40.5816944Z * [new tag] viable/strict/1759783777 -> viable/strict/1759783777 2025-12-04T09:33:40.5818091Z * [new tag] viable/strict/1759785815 -> viable/strict/1759785815 2025-12-04T09:33:40.5819315Z * [new tag] viable/strict/1759789459 -> viable/strict/1759789459 2025-12-04T09:33:40.5820324Z * [new tag] viable/strict/1759790974 -> viable/strict/1759790974 2025-12-04T09:33:40.5821227Z * [new tag] viable/strict/1759794583 -> viable/strict/1759794583 2025-12-04T09:33:40.5822360Z * [new tag] viable/strict/1759797408 -> viable/strict/1759797408 2025-12-04T09:33:40.5823456Z * [new tag] viable/strict/1759799518 -> viable/strict/1759799518 2025-12-04T09:33:40.5824473Z * [new tag] viable/strict/1759804909 -> viable/strict/1759804909 2025-12-04T09:33:40.5825567Z * [new tag] viable/strict/1759807643 -> viable/strict/1759807643 2025-12-04T09:33:40.5826600Z * [new tag] viable/strict/1759809089 -> viable/strict/1759809089 2025-12-04T09:33:40.5827733Z * [new tag] viable/strict/1759811145 -> viable/strict/1759811145 2025-12-04T09:33:40.5828824Z * [new tag] viable/strict/1759812581 -> viable/strict/1759812581 2025-12-04T09:33:40.5829864Z * [new tag] viable/strict/1759814683 -> viable/strict/1759814683 2025-12-04T09:33:40.5830904Z * [new tag] viable/strict/1759821889 -> viable/strict/1759821889 2025-12-04T09:33:40.5832007Z * [new tag] viable/strict/1759823376 -> viable/strict/1759823376 2025-12-04T09:33:40.5833037Z * [new tag] viable/strict/1759827107 -> viable/strict/1759827107 2025-12-04T09:33:40.5834056Z * [new tag] viable/strict/1759830577 -> viable/strict/1759830577 2025-12-04T09:33:40.5835261Z * [new tag] viable/strict/1759832720 -> viable/strict/1759832720 2025-12-04T09:33:40.5836296Z * [new tag] viable/strict/1759842063 -> viable/strict/1759842063 2025-12-04T09:33:40.5837309Z * [new tag] viable/strict/1759847121 -> viable/strict/1759847121 2025-12-04T09:33:40.5838809Z * [new tag] viable/strict/1759850721 -> viable/strict/1759850721 2025-12-04T09:33:40.5839734Z * [new tag] viable/strict/1759857870 -> viable/strict/1759857870 2025-12-04T09:33:40.5840828Z * [new tag] viable/strict/1759863143 -> viable/strict/1759863143 2025-12-04T09:33:40.5841831Z * [new tag] viable/strict/1759875874 -> viable/strict/1759875874 2025-12-04T09:33:40.5842725Z * [new tag] viable/strict/1759877385 -> viable/strict/1759877385 2025-12-04T09:33:40.5843827Z * [new tag] viable/strict/1759883801 -> viable/strict/1759883801 2025-12-04T09:33:40.5845004Z * [new tag] viable/strict/1759885922 -> viable/strict/1759885922 2025-12-04T09:33:40.5846017Z * [new tag] viable/strict/1759888488 -> viable/strict/1759888488 2025-12-04T09:33:40.5847044Z * [new tag] viable/strict/1759895471 -> viable/strict/1759895471 2025-12-04T09:33:40.5848075Z * [new tag] viable/strict/1759904803 -> viable/strict/1759904803 2025-12-04T09:33:40.5849463Z * [new tag] viable/strict/1759908300 -> viable/strict/1759908300 2025-12-04T09:33:40.5850521Z * [new tag] viable/strict/1759915520 -> viable/strict/1759915520 2025-12-04T09:33:40.5851548Z * [new tag] viable/strict/1759916978 -> viable/strict/1759916978 2025-12-04T09:33:40.5852433Z * [new tag] viable/strict/1759930024 -> viable/strict/1759930024 2025-12-04T09:33:40.5853515Z * [new tag] viable/strict/1759948122 -> viable/strict/1759948122 2025-12-04T09:33:40.5854604Z * [new tag] viable/strict/1759952983 -> viable/strict/1759952983 2025-12-04T09:33:40.5855723Z * [new tag] viable/strict/1759955121 -> viable/strict/1759955121 2025-12-04T09:33:40.5856728Z * [new tag] viable/strict/1759962298 -> viable/strict/1759962298 2025-12-04T09:33:40.5857783Z * [new tag] viable/strict/1759965837 -> viable/strict/1759965837 2025-12-04T09:33:40.5859054Z * [new tag] viable/strict/1759970213 -> viable/strict/1759970213 2025-12-04T09:33:40.5860104Z * [new tag] viable/strict/1759974894 -> viable/strict/1759974894 2025-12-04T09:33:40.5861117Z * [new tag] viable/strict/1759977763 -> viable/strict/1759977763 2025-12-04T09:33:40.5862360Z * [new tag] viable/strict/1759979241 -> viable/strict/1759979241 2025-12-04T09:33:40.5863448Z * [new tag] viable/strict/1759985417 -> viable/strict/1759985417 2025-12-04T09:33:40.5864491Z * [new tag] viable/strict/1759987490 -> viable/strict/1759987490 2025-12-04T09:33:40.5865517Z * [new tag] viable/strict/1759996180 -> viable/strict/1759996180 2025-12-04T09:33:40.5866546Z * [new tag] viable/strict/1760065682 -> viable/strict/1760065682 2025-12-04T09:33:40.5867634Z * [new tag] viable/strict/1760066894 -> viable/strict/1760066894 2025-12-04T09:33:40.5868877Z * [new tag] viable/strict/1760070345 -> viable/strict/1760070345 2025-12-04T09:33:40.5869863Z * [new tag] viable/strict/1760089782 -> viable/strict/1760089782 2025-12-04T09:33:40.5870896Z * [new tag] viable/strict/1760091921 -> viable/strict/1760091921 2025-12-04T09:33:40.5871974Z * [new tag] viable/strict/1760127924 -> viable/strict/1760127924 2025-12-04T09:33:40.5873050Z * [new tag] viable/strict/1760129489 -> viable/strict/1760129489 2025-12-04T09:33:40.5874194Z * [new tag] viable/strict/1760132980 -> viable/strict/1760132980 2025-12-04T09:33:40.5875783Z * [new tag] viable/strict/1760135060 -> viable/strict/1760135060 2025-12-04T09:33:40.5876747Z * [new tag] viable/strict/1760215782 -> viable/strict/1760215782 2025-12-04T09:33:40.5877959Z * [new tag] viable/strict/1760273849 -> viable/strict/1760273849 2025-12-04T09:33:40.5878961Z * [new tag] viable/strict/1760275517 -> viable/strict/1760275517 2025-12-04T09:33:40.5879986Z * [new tag] viable/strict/1760276979 -> viable/strict/1760276979 2025-12-04T09:33:40.5881043Z * [new tag] viable/strict/1760279007 -> viable/strict/1760279007 2025-12-04T09:33:40.5881973Z * [new tag] viable/strict/1760286328 -> viable/strict/1760286328 2025-12-04T09:33:40.5882832Z * [new tag] viable/strict/1760493304 -> viable/strict/1760493304 2025-12-04T09:33:40.5883955Z * [new tag] viable/strict/1760496298 -> viable/strict/1760496298 2025-12-04T09:33:40.5884924Z * [new tag] viable/strict/1760518396 -> viable/strict/1760518396 2025-12-04T09:33:40.5886042Z * [new tag] viable/strict/1760534864 -> viable/strict/1760534864 2025-12-04T09:33:40.5887099Z * [new tag] viable/strict/1760549062 -> viable/strict/1760549062 2025-12-04T09:33:40.5891762Z * [new tag] viable/strict/1760552799 -> viable/strict/1760552799 2025-12-04T09:33:40.5892962Z * [new tag] viable/strict/1760554355 -> viable/strict/1760554355 2025-12-04T09:33:40.5894153Z * [new tag] viable/strict/1760556275 -> viable/strict/1760556275 2025-12-04T09:33:40.5895268Z * [new tag] viable/strict/1760564979 -> viable/strict/1760564979 2025-12-04T09:33:40.5896389Z * [new tag] viable/strict/1760567049 -> viable/strict/1760567049 2025-12-04T09:33:40.5898032Z * [new tag] viable/strict/1760568585 -> viable/strict/1760568585 2025-12-04T09:33:40.5899154Z * [new tag] viable/strict/1760570630 -> viable/strict/1760570630 2025-12-04T09:33:40.5900159Z * [new tag] viable/strict/1760572180 -> viable/strict/1760572180 2025-12-04T09:33:40.5901349Z * [new tag] viable/strict/1760575094 -> viable/strict/1760575094 2025-12-04T09:33:40.5902496Z * [new tag] viable/strict/1760579709 -> viable/strict/1760579709 2025-12-04T09:33:40.5904236Z * [new tag] viable/strict/1760582614 -> viable/strict/1760582614 2025-12-04T09:33:40.5905191Z * [new tag] viable/strict/1760586815 -> viable/strict/1760586815 2025-12-04T09:33:40.5906062Z * [new tag] viable/strict/1760588829 -> viable/strict/1760588829 2025-12-04T09:33:40.5907123Z * [new tag] viable/strict/1760590200 -> viable/strict/1760590200 2025-12-04T09:33:40.5908217Z * [new tag] viable/strict/1760592311 -> viable/strict/1760592311 2025-12-04T09:33:40.5909275Z * [new tag] viable/strict/1760619733 -> viable/strict/1760619733 2025-12-04T09:33:40.5910158Z * [new tag] viable/strict/1760628335 -> viable/strict/1760628335 2025-12-04T09:33:40.5911196Z * [new tag] viable/strict/1760635490 -> viable/strict/1760635490 2025-12-04T09:33:40.5912289Z * [new tag] viable/strict/1760640743 -> viable/strict/1760640743 2025-12-04T09:33:40.5913353Z * [new tag] viable/strict/1760642528 -> viable/strict/1760642528 2025-12-04T09:33:40.5914354Z * [new tag] viable/strict/1760646330 -> viable/strict/1760646330 2025-12-04T09:33:40.5915389Z * [new tag] viable/strict/1760666101 -> viable/strict/1760666101 2025-12-04T09:33:40.5916451Z * [new tag] viable/strict/1760668990 -> viable/strict/1760668990 2025-12-04T09:33:40.5917487Z * [new tag] viable/strict/1760670600 -> viable/strict/1760670600 2025-12-04T09:33:40.5918514Z * [new tag] viable/strict/1760671704 -> viable/strict/1760671704 2025-12-04T09:33:40.5919596Z * [new tag] viable/strict/1760673121 -> viable/strict/1760673121 2025-12-04T09:33:40.5920644Z * [new tag] viable/strict/1760675352 -> viable/strict/1760675352 2025-12-04T09:33:40.5921712Z * [new tag] viable/strict/1760696731 -> viable/strict/1760696731 2025-12-04T09:33:40.5924442Z * [new tag] viable/strict/1760723515 -> viable/strict/1760723515 2025-12-04T09:33:40.5925373Z * [new tag] viable/strict/1760727234 -> viable/strict/1760727234 2025-12-04T09:33:40.5926505Z * [new tag] viable/strict/1760730578 -> viable/strict/1760730578 2025-12-04T09:33:40.5927593Z * [new tag] viable/strict/1760732726 -> viable/strict/1760732726 2025-12-04T09:33:40.5928754Z * [new tag] viable/strict/1760734180 -> viable/strict/1760734180 2025-12-04T09:33:40.5929692Z * [new tag] viable/strict/1760736251 -> viable/strict/1760736251 2025-12-04T09:33:40.5930859Z * [new tag] viable/strict/1760737772 -> viable/strict/1760737772 2025-12-04T09:33:40.5931935Z * [new tag] viable/strict/1760758005 -> viable/strict/1760758005 2025-12-04T09:33:40.5932958Z * [new tag] viable/strict/1760761532 -> viable/strict/1760761532 2025-12-04T09:33:40.5934032Z * [new tag] viable/strict/1760802581 -> viable/strict/1760802581 2025-12-04T09:33:40.5935045Z * [new tag] viable/strict/1760827772 -> viable/strict/1760827772 2025-12-04T09:33:40.5936102Z * [new tag] viable/strict/1760834524 -> viable/strict/1760834524 2025-12-04T09:33:40.5937511Z * [new tag] viable/strict/1760845009 -> viable/strict/1760845009 2025-12-04T09:33:40.5938565Z * [new tag] viable/strict/1760876836 -> viable/strict/1760876836 2025-12-04T09:33:40.5939658Z * [new tag] viable/strict/1760880329 -> viable/strict/1760880329 2025-12-04T09:33:40.5940755Z * [new tag] viable/strict/1760888987 -> viable/strict/1760888987 2025-12-04T09:33:40.5941809Z * [new tag] viable/strict/1760912664 -> viable/strict/1760912664 2025-12-04T09:33:40.5942837Z * [new tag] viable/strict/1760925321 -> viable/strict/1760925321 2025-12-04T09:33:40.5943832Z * [new tag] viable/strict/1760931488 -> viable/strict/1760931488 2025-12-04T09:33:40.5944889Z * [new tag] viable/strict/1760932693 -> viable/strict/1760932693 2025-12-04T09:33:40.5945937Z * [new tag] viable/strict/1761004184 -> viable/strict/1761004184 2025-12-04T09:33:40.5947019Z * [new tag] viable/strict/1761014748 -> viable/strict/1761014748 2025-12-04T09:33:40.5948076Z * [new tag] viable/strict/1761017491 -> viable/strict/1761017491 2025-12-04T09:33:40.5949124Z * [new tag] viable/strict/1761018806 -> viable/strict/1761018806 2025-12-04T09:33:40.5950824Z * [new tag] viable/strict/1761020754 -> viable/strict/1761020754 2025-12-04T09:33:40.5951765Z * [new tag] viable/strict/1761024303 -> viable/strict/1761024303 2025-12-04T09:33:40.5952746Z * [new tag] viable/strict/1761029582 -> viable/strict/1761029582 2025-12-04T09:33:40.5953832Z * [new tag] viable/strict/1761031535 -> viable/strict/1761031535 2025-12-04T09:33:40.5954864Z * [new tag] viable/strict/1761035196 -> viable/strict/1761035196 2025-12-04T09:33:40.5956020Z * [new tag] viable/strict/1761045825 -> viable/strict/1761045825 2025-12-04T09:33:40.5957071Z * [new tag] viable/strict/1761054796 -> viable/strict/1761054796 2025-12-04T09:33:40.5958158Z * [new tag] viable/strict/1761060314 -> viable/strict/1761060314 2025-12-04T09:33:40.5959232Z * [new tag] viable/strict/1761071198 -> viable/strict/1761071198 2025-12-04T09:33:40.5960357Z * [new tag] viable/strict/1761074628 -> viable/strict/1761074628 2025-12-04T09:33:40.5961378Z * [new tag] viable/strict/1761078351 -> viable/strict/1761078351 2025-12-04T09:33:40.5962486Z * [new tag] viable/strict/1761079822 -> viable/strict/1761079822 2025-12-04T09:33:40.5963506Z * [new tag] viable/strict/1761081873 -> viable/strict/1761081873 2025-12-04T09:33:40.5964603Z * [new tag] viable/strict/1761083392 -> viable/strict/1761083392 2025-12-04T09:33:40.5965983Z * [new tag] viable/strict/1761085465 -> viable/strict/1761085465 2025-12-04T09:33:40.5967051Z * [new tag] viable/strict/1761089099 -> viable/strict/1761089099 2025-12-04T09:33:40.5968164Z * [new tag] viable/strict/1761095535 -> viable/strict/1761095535 2025-12-04T09:33:40.5969117Z * [new tag] viable/strict/1761098119 -> viable/strict/1761098119 2025-12-04T09:33:40.5970787Z * [new tag] viable/strict/1761101330 -> viable/strict/1761101330 2025-12-04T09:33:40.5971719Z * [new tag] viable/strict/1761114425 -> viable/strict/1761114425 2025-12-04T09:33:40.5972874Z * [new tag] viable/strict/1761116036 -> viable/strict/1761116036 2025-12-04T09:33:40.5973922Z * [new tag] viable/strict/1761119379 -> viable/strict/1761119379 2025-12-04T09:33:40.5974976Z * [new tag] viable/strict/1761121601 -> viable/strict/1761121601 2025-12-04T09:33:40.5976000Z * [new tag] viable/strict/1761123234 -> viable/strict/1761123234 2025-12-04T09:33:40.5977031Z * [new tag] viable/strict/1761126621 -> viable/strict/1761126621 2025-12-04T09:33:40.5978089Z * [new tag] viable/strict/1761132259 -> viable/strict/1761132259 2025-12-04T09:33:40.5979309Z * [new tag] viable/strict/1761146746 -> viable/strict/1761146746 2025-12-04T09:33:40.5980348Z * [new tag] viable/strict/1761164752 -> viable/strict/1761164752 2025-12-04T09:33:40.5981366Z * [new tag] viable/strict/1761166198 -> viable/strict/1761166198 2025-12-04T09:33:40.5982462Z * [new tag] viable/strict/1761175424 -> viable/strict/1761175424 2025-12-04T09:33:40.5983988Z * [new tag] viable/strict/1761176983 -> viable/strict/1761176983 2025-12-04T09:33:40.5984877Z * [new tag] viable/strict/1761179891 -> viable/strict/1761179891 2025-12-04T09:33:40.5985965Z * [new tag] viable/strict/1761181930 -> viable/strict/1761181930 2025-12-04T09:33:40.5987004Z * [new tag] viable/strict/1761184516 -> viable/strict/1761184516 2025-12-04T09:33:40.5988306Z * [new tag] viable/strict/1761190179 -> viable/strict/1761190179 2025-12-04T09:33:40.5989395Z * [new tag] viable/strict/1761193558 -> viable/strict/1761193558 2025-12-04T09:33:40.5990436Z * [new tag] viable/strict/1761207990 -> viable/strict/1761207990 2025-12-04T09:33:40.5991470Z * [new tag] viable/strict/1761229539 -> viable/strict/1761229539 2025-12-04T09:33:40.5992918Z * [new tag] viable/strict/1761244031 -> viable/strict/1761244031 2025-12-04T09:33:40.5993841Z * [new tag] viable/strict/1761248986 -> viable/strict/1761248986 2025-12-04T09:33:40.5994869Z * [new tag] viable/strict/1761259791 -> viable/strict/1761259791 2025-12-04T09:33:40.5995903Z * [new tag] viable/strict/1761266139 -> viable/strict/1761266139 2025-12-04T09:33:40.5997043Z * [new tag] viable/strict/1761268316 -> viable/strict/1761268316 2025-12-04T09:33:40.5998074Z * [new tag] viable/strict/1761273805 -> viable/strict/1761273805 2025-12-04T09:33:40.5999110Z * [new tag] viable/strict/1761275261 -> viable/strict/1761275261 2025-12-04T09:33:40.6000192Z * [new tag] viable/strict/1761277913 -> viable/strict/1761277913 2025-12-04T09:33:40.6001350Z * [new tag] viable/strict/1761290701 -> viable/strict/1761290701 2025-12-04T09:33:40.6002447Z * [new tag] viable/strict/1761294396 -> viable/strict/1761294396 2025-12-04T09:33:40.6003475Z * [new tag] viable/strict/1761303047 -> viable/strict/1761303047 2025-12-04T09:33:40.6004569Z * [new tag] viable/strict/1761335388 -> viable/strict/1761335388 2025-12-04T09:33:40.6005634Z * [new tag] viable/strict/1761337551 -> viable/strict/1761337551 2025-12-04T09:33:40.6006792Z * [new tag] viable/strict/1761339007 -> viable/strict/1761339007 2025-12-04T09:33:40.6007948Z * [new tag] viable/strict/1761341050 -> viable/strict/1761341050 2025-12-04T09:33:40.6009017Z * [new tag] viable/strict/1761346188 -> viable/strict/1761346188 2025-12-04T09:33:40.6010204Z * [new tag] viable/strict/1761349792 -> viable/strict/1761349792 2025-12-04T09:33:40.6011247Z * [new tag] viable/strict/1761352620 -> viable/strict/1761352620 2025-12-04T09:33:40.6012289Z * [new tag] viable/strict/1761354730 -> viable/strict/1761354730 2025-12-04T09:33:40.6013390Z * [new tag] viable/strict/1761357298 -> viable/strict/1761357298 2025-12-04T09:33:40.6014425Z * [new tag] viable/strict/1761360201 -> viable/strict/1761360201 2025-12-04T09:33:40.6015497Z * [new tag] viable/strict/1761361753 -> viable/strict/1761361753 2025-12-04T09:33:40.6016588Z * [new tag] viable/strict/1761364351 -> viable/strict/1761364351 2025-12-04T09:33:40.6017581Z * [new tag] viable/strict/1761366338 -> viable/strict/1761366338 2025-12-04T09:33:40.6019095Z * [new tag] viable/strict/1761367802 -> viable/strict/1761367802 2025-12-04T09:33:40.6020059Z * [new tag] viable/strict/1761369889 -> viable/strict/1761369889 2025-12-04T09:33:40.6021743Z * [new tag] viable/strict/1761371385 -> viable/strict/1761371385 2025-12-04T09:33:40.6022654Z * [new tag] viable/strict/1761373581 -> viable/strict/1761373581 2025-12-04T09:33:40.6023852Z * [new tag] viable/strict/1761375054 -> viable/strict/1761375054 2025-12-04T09:33:40.6024983Z * [new tag] viable/strict/1761421785 -> viable/strict/1761421785 2025-12-04T09:33:40.6026190Z * [new tag] viable/strict/1761434614 -> viable/strict/1761434614 2025-12-04T09:33:40.6027746Z * [new tag] viable/strict/1761439254 -> viable/strict/1761439254 2025-12-04T09:33:40.6028704Z * [new tag] viable/strict/1761454187 -> viable/strict/1761454187 2025-12-04T09:33:40.6029834Z * [new tag] viable/strict/1761459991 -> viable/strict/1761459991 2025-12-04T09:33:40.6031255Z * [new tag] viable/strict/1761470668 -> viable/strict/1761470668 2025-12-04T09:33:40.6032753Z * [new tag] viable/strict/1761472188 -> viable/strict/1761472188 2025-12-04T09:33:40.6033739Z * [new tag] viable/strict/1761503178 -> viable/strict/1761503178 2025-12-04T09:33:40.6034801Z * [new tag] viable/strict/1761517492 -> viable/strict/1761517492 2025-12-04T09:33:40.6035882Z * [new tag] viable/strict/1761518981 -> viable/strict/1761518981 2025-12-04T09:33:40.6037012Z * [new tag] viable/strict/1761533609 -> viable/strict/1761533609 2025-12-04T09:33:40.6037934Z * [new tag] viable/strict/1761546438 -> viable/strict/1761546438 2025-12-04T09:33:40.6039193Z * [new tag] viable/strict/1761548133 -> viable/strict/1761548133 2025-12-04T09:33:40.6040687Z * [new tag] viable/strict/1761555186 -> viable/strict/1761555186 2025-12-04T09:33:40.6041705Z * [new tag] viable/strict/1761557178 -> viable/strict/1761557178 2025-12-04T09:33:40.6042743Z * [new tag] viable/strict/1761560772 -> viable/strict/1761560772 2025-12-04T09:33:40.6043824Z * [new tag] viable/strict/1761562266 -> viable/strict/1761562266 2025-12-04T09:33:40.6045080Z * [new tag] viable/strict/1761564260 -> viable/strict/1761564260 2025-12-04T09:33:40.6046165Z * [new tag] viable/strict/1761568072 -> viable/strict/1761568072 2025-12-04T09:33:40.6047220Z * [new tag] viable/strict/1761571683 -> viable/strict/1761571683 2025-12-04T09:33:40.6048206Z * [new tag] viable/strict/1761580199 -> viable/strict/1761580199 2025-12-04T09:33:40.6049220Z * [new tag] viable/strict/1761587383 -> viable/strict/1761587383 2025-12-04T09:33:40.6050350Z * [new tag] viable/strict/1761591165 -> viable/strict/1761591165 2025-12-04T09:33:40.6051410Z * [new tag] viable/strict/1761594575 -> viable/strict/1761594575 2025-12-04T09:33:40.6052481Z * [new tag] viable/strict/1761596710 -> viable/strict/1761596710 2025-12-04T09:33:40.6053581Z * [new tag] viable/strict/1761598189 -> viable/strict/1761598189 2025-12-04T09:33:40.6054669Z * [new tag] viable/strict/1761600254 -> viable/strict/1761600254 2025-12-04T09:33:40.6055724Z * [new tag] viable/strict/1761603879 -> viable/strict/1761603879 2025-12-04T09:33:40.6056818Z * [new tag] viable/strict/1761605429 -> viable/strict/1761605429 2025-12-04T09:33:40.6058020Z * [new tag] viable/strict/1761607468 -> viable/strict/1761607468 2025-12-04T09:33:40.6059288Z * [new tag] viable/strict/1761608983 -> viable/strict/1761608983 2025-12-04T09:33:40.6060343Z * [new tag] viable/strict/1761611846 -> viable/strict/1761611846 2025-12-04T09:33:40.6061495Z * [new tag] viable/strict/1761613922 -> viable/strict/1761613922 2025-12-04T09:33:40.6062409Z * [new tag] viable/strict/1761616504 -> viable/strict/1761616504 2025-12-04T09:33:40.6063312Z * [new tag] viable/strict/1761619599 -> viable/strict/1761619599 2025-12-04T09:33:40.6064399Z * [new tag] viable/strict/1761686693 -> viable/strict/1761686693 2025-12-04T09:33:40.6065472Z * [new tag] viable/strict/1761688179 -> viable/strict/1761688179 2025-12-04T09:33:40.6066510Z * [new tag] viable/strict/1761691973 -> viable/strict/1761691973 2025-12-04T09:33:40.6067743Z * [new tag] viable/strict/1761693884 -> viable/strict/1761693884 2025-12-04T09:33:40.6068845Z * [new tag] viable/strict/1761695389 -> viable/strict/1761695389 2025-12-04T09:33:40.6069928Z * [new tag] viable/strict/1761698408 -> viable/strict/1761698408 2025-12-04T09:33:40.6071007Z * [new tag] viable/strict/1761702931 -> viable/strict/1761702931 2025-12-04T09:33:40.6072076Z * [new tag] viable/strict/1761706307 -> viable/strict/1761706307 2025-12-04T09:33:40.6073167Z * [new tag] viable/strict/1761709065 -> viable/strict/1761709065 2025-12-04T09:33:40.6074380Z * [new tag] viable/strict/1761710285 -> viable/strict/1761710285 2025-12-04T09:33:40.6075489Z * [new tag] viable/strict/1761711983 -> viable/strict/1761711983 2025-12-04T09:33:40.6076673Z * [new tag] viable/strict/1761713514 -> viable/strict/1761713514 2025-12-04T09:33:40.6077881Z * [new tag] viable/strict/1761715523 -> viable/strict/1761715523 2025-12-04T09:33:40.6079027Z * [new tag] viable/strict/1761727973 -> viable/strict/1761727973 2025-12-04T09:33:40.6080266Z * [new tag] viable/strict/1761751558 -> viable/strict/1761751558 2025-12-04T09:33:40.6081379Z * [new tag] viable/strict/1761755187 -> viable/strict/1761755187 2025-12-04T09:33:40.6082550Z * [new tag] viable/strict/1761756826 -> viable/strict/1761756826 2025-12-04T09:33:40.6083696Z * [new tag] viable/strict/1761769551 -> viable/strict/1761769551 2025-12-04T09:33:40.6084957Z * [new tag] viable/strict/1761771032 -> viable/strict/1761771032 2025-12-04T09:33:40.6085896Z * [new tag] viable/strict/1761773101 -> viable/strict/1761773101 2025-12-04T09:33:40.6087036Z * [new tag] viable/strict/1761781792 -> viable/strict/1761781792 2025-12-04T09:33:40.6088499Z * [new tag] viable/strict/1761784788 -> viable/strict/1761784788 2025-12-04T09:33:40.6089495Z * [new tag] viable/strict/1761786740 -> viable/strict/1761786740 2025-12-04T09:33:40.6090580Z * [new tag] viable/strict/1761789332 -> viable/strict/1761789332 2025-12-04T09:33:40.6092351Z * [new tag] viable/strict/1761792569 -> viable/strict/1761792569 2025-12-04T09:33:40.6093346Z * [new tag] viable/strict/1761795289 -> viable/strict/1761795289 2025-12-04T09:33:40.6095040Z * [new tag] viable/strict/1761798345 -> viable/strict/1761798345 2025-12-04T09:33:40.6096072Z * [new tag] viable/strict/1761799827 -> viable/strict/1761799827 2025-12-04T09:33:40.6097238Z * [new tag] viable/strict/1761805604 -> viable/strict/1761805604 2025-12-04T09:33:40.6098405Z * [new tag] viable/strict/1761807202 -> viable/strict/1761807202 2025-12-04T09:33:40.6099593Z * [new tag] viable/strict/1761809094 -> viable/strict/1761809094 2025-12-04T09:33:40.6100674Z * [new tag] viable/strict/1761810576 -> viable/strict/1761810576 2025-12-04T09:33:40.6101817Z * [new tag] viable/strict/1761812771 -> viable/strict/1761812771 2025-12-04T09:33:40.6102912Z * [new tag] viable/strict/1761814363 -> viable/strict/1761814363 2025-12-04T09:33:40.6104018Z * [new tag] viable/strict/1761857410 -> viable/strict/1761857410 2025-12-04T09:33:40.6105127Z * [new tag] viable/strict/1761860985 -> viable/strict/1761860985 2025-12-04T09:33:40.6106289Z * [new tag] viable/strict/1761863094 -> viable/strict/1761863094 2025-12-04T09:33:40.6107376Z * [new tag] viable/strict/1761864590 -> viable/strict/1761864590 2025-12-04T09:33:40.6108500Z * [new tag] viable/strict/1761866675 -> viable/strict/1761866675 2025-12-04T09:33:40.6109912Z * [new tag] viable/strict/1761868178 -> viable/strict/1761868178 2025-12-04T09:33:40.6110997Z * [new tag] viable/strict/1761871111 -> viable/strict/1761871111 2025-12-04T09:33:40.6112122Z * [new tag] viable/strict/1761873126 -> viable/strict/1761873126 2025-12-04T09:33:40.6113228Z * [new tag] viable/strict/1761875714 -> viable/strict/1761875714 2025-12-04T09:33:40.6114356Z * [new tag] viable/strict/1761878924 -> viable/strict/1761878924 2025-12-04T09:33:40.6115477Z * [new tag] viable/strict/1761881727 -> viable/strict/1761881727 2025-12-04T09:33:40.6116655Z * [new tag] viable/strict/1761882959 -> viable/strict/1761882959 2025-12-04T09:33:40.6117891Z * [new tag] viable/strict/1761886268 -> viable/strict/1761886268 2025-12-04T09:33:40.6119002Z * [new tag] viable/strict/1761893641 -> viable/strict/1761893641 2025-12-04T09:33:40.6120105Z * [new tag] viable/strict/1761931517 -> viable/strict/1761931517 2025-12-04T09:33:40.6121209Z * [new tag] viable/strict/1761933080 -> viable/strict/1761933080 2025-12-04T09:33:40.6122316Z * [new tag] viable/strict/1761935217 -> viable/strict/1761935217 2025-12-04T09:33:40.6123461Z * [new tag] viable/strict/1761938533 -> viable/strict/1761938533 2025-12-04T09:33:40.6124646Z * [new tag] viable/strict/1761940184 -> viable/strict/1761940184 2025-12-04T09:33:40.6125761Z * [new tag] viable/strict/1761942338 -> viable/strict/1761942338 2025-12-04T09:33:40.6126817Z * [new tag] viable/strict/1761946100 -> viable/strict/1761946100 2025-12-04T09:33:40.6127938Z * [new tag] viable/strict/1761947374 -> viable/strict/1761947374 2025-12-04T09:33:40.6129038Z * [new tag] viable/strict/1761950978 -> viable/strict/1761950978 2025-12-04T09:33:40.6130177Z * [new tag] viable/strict/1761957727 -> viable/strict/1761957727 2025-12-04T09:33:40.6131197Z * [new tag] viable/strict/1761959532 -> viable/strict/1761959532 2025-12-04T09:33:40.6132374Z * [new tag] viable/strict/1761965366 -> viable/strict/1761965366 2025-12-04T09:33:40.6133783Z * [new tag] viable/strict/1761968066 -> viable/strict/1761968066 2025-12-04T09:33:40.6134793Z * [new tag] viable/strict/1761969322 -> viable/strict/1761969322 2025-12-04T09:33:40.6135886Z * [new tag] viable/strict/1761974723 -> viable/strict/1761974723 2025-12-04T09:33:40.6137051Z * [new tag] viable/strict/1761981837 -> viable/strict/1761981837 2025-12-04T09:33:40.6138283Z * [new tag] viable/strict/1761985546 -> viable/strict/1761985546 2025-12-04T09:33:40.6139483Z * [new tag] viable/strict/1761987030 -> viable/strict/1761987030 2025-12-04T09:33:40.6140624Z * [new tag] viable/strict/1762003554 -> viable/strict/1762003554 2025-12-04T09:33:40.6141771Z * [new tag] viable/strict/1762021560 -> viable/strict/1762021560 2025-12-04T09:33:40.6142899Z * [new tag] viable/strict/1762032190 -> viable/strict/1762032190 2025-12-04T09:33:40.6144010Z * [new tag] viable/strict/1762040981 -> viable/strict/1762040981 2025-12-04T09:33:40.6145109Z * [new tag] viable/strict/1762048525 -> viable/strict/1762048525 2025-12-04T09:33:40.6146271Z * [new tag] viable/strict/1762104223 -> viable/strict/1762104223 2025-12-04T09:33:40.6147434Z * [new tag] viable/strict/1762105778 -> viable/strict/1762105778 2025-12-04T09:33:40.6148543Z * [new tag] viable/strict/1762115109 -> viable/strict/1762115109 2025-12-04T09:33:40.6149637Z * [new tag] viable/strict/1762125840 -> viable/strict/1762125840 2025-12-04T09:33:40.6150604Z * [new tag] viable/strict/1762127377 -> viable/strict/1762127377 2025-12-04T09:33:40.6152302Z * [new tag] viable/strict/1762134925 -> viable/strict/1762134925 2025-12-04T09:33:40.6153181Z * [new tag] viable/strict/1762138338 -> viable/strict/1762138338 2025-12-04T09:33:40.6154404Z * [new tag] viable/strict/1762148993 -> viable/strict/1762148993 2025-12-04T09:33:40.6155513Z * [new tag] viable/strict/1762152871 -> viable/strict/1762152871 2025-12-04T09:33:40.6156717Z * [new tag] viable/strict/1762156183 -> viable/strict/1762156183 2025-12-04T09:33:40.6157808Z * [new tag] viable/strict/1762163457 -> viable/strict/1762163457 2025-12-04T09:33:40.6158909Z * [new tag] viable/strict/1762165569 -> viable/strict/1762165569 2025-12-04T09:33:40.6160018Z * [new tag] viable/strict/1762169035 -> viable/strict/1762169035 2025-12-04T09:33:40.6161133Z * [new tag] viable/strict/1762174936 -> viable/strict/1762174936 2025-12-04T09:33:40.6163859Z * [new tag] viable/strict/1762194412 -> viable/strict/1762194412 2025-12-04T09:33:40.6164452Z * [new tag] viable/strict/1762195876 -> viable/strict/1762195876 2025-12-04T09:33:40.6165036Z * [new tag] viable/strict/1762197788 -> viable/strict/1762197788 2025-12-04T09:33:40.6165617Z * [new tag] viable/strict/1762199389 -> viable/strict/1762199389 2025-12-04T09:33:40.6167126Z * [new tag] viable/strict/1762206585 -> viable/strict/1762206585 2025-12-04T09:33:40.6168732Z * [new tag] viable/strict/1762210184 -> viable/strict/1762210184 2025-12-04T09:33:40.6169607Z * [new tag] viable/strict/1762218736 -> viable/strict/1762218736 2025-12-04T09:33:40.6170752Z * [new tag] viable/strict/1762224529 -> viable/strict/1762224529 2025-12-04T09:33:40.6172056Z * [new tag] viable/strict/1762227253 -> viable/strict/1762227253 2025-12-04T09:33:40.6172944Z * [new tag] viable/strict/1762228515 -> viable/strict/1762228515 2025-12-04T09:33:40.6174159Z * [new tag] viable/strict/1762230349 -> viable/strict/1762230349 2025-12-04T09:33:40.6175256Z * [new tag] viable/strict/1762231859 -> viable/strict/1762231859 2025-12-04T09:33:40.6176476Z * [new tag] viable/strict/1762233925 -> viable/strict/1762233925 2025-12-04T09:33:40.6177886Z * [new tag] viable/strict/1762237630 -> viable/strict/1762237630 2025-12-04T09:33:40.6178891Z * [new tag] viable/strict/1762253522 -> viable/strict/1762253522 2025-12-04T09:33:40.6180115Z * [new tag] viable/strict/1762278588 -> viable/strict/1762278588 2025-12-04T09:33:40.6181275Z * [new tag] viable/strict/1762284203 -> viable/strict/1762284203 2025-12-04T09:33:40.6182433Z * [new tag] viable/strict/1762289446 -> viable/strict/1762289446 2025-12-04T09:33:40.6183546Z * [new tag] viable/strict/1762291515 -> viable/strict/1762291515 2025-12-04T09:33:40.6184720Z * [new tag] viable/strict/1762295100 -> viable/strict/1762295100 2025-12-04T09:33:40.6185681Z * [new tag] viable/strict/1762296590 -> viable/strict/1762296590 2025-12-04T09:33:40.6186669Z * [new tag] viable/strict/1762300179 -> viable/strict/1762300179 2025-12-04T09:33:40.6187676Z * [new tag] viable/strict/1762303207 -> viable/strict/1762303207 2025-12-04T09:33:40.6188948Z * [new tag] viable/strict/1762386584 -> viable/strict/1762386584 2025-12-04T09:33:40.6190049Z * [new tag] viable/strict/1762391537 -> viable/strict/1762391537 2025-12-04T09:33:40.6191116Z * [new tag] viable/strict/1762394119 -> viable/strict/1762394119 2025-12-04T09:33:40.6192742Z * [new tag] viable/strict/1762397437 -> viable/strict/1762397437 2025-12-04T09:33:40.6193757Z * [new tag] viable/strict/1762400256 -> viable/strict/1762400256 2025-12-04T09:33:40.6194863Z * [new tag] viable/strict/1762401469 -> viable/strict/1762401469 2025-12-04T09:33:40.6196109Z * [new tag] viable/strict/1762408195 -> viable/strict/1762408195 2025-12-04T09:33:40.6197282Z * [new tag] viable/strict/1762410411 -> viable/strict/1762410411 2025-12-04T09:33:40.6198398Z * [new tag] viable/strict/1762417613 -> viable/strict/1762417613 2025-12-04T09:33:40.6199507Z * [new tag] viable/strict/1762419198 -> viable/strict/1762419198 2025-12-04T09:33:40.6200646Z * [new tag] viable/strict/1762422656 -> viable/strict/1762422656 2025-12-04T09:33:40.6202346Z * [new tag] viable/strict/1762424746 -> viable/strict/1762424746 2025-12-04T09:33:40.6203473Z * [new tag] viable/strict/1762446386 -> viable/strict/1762446386 2025-12-04T09:33:40.6204568Z * [new tag] viable/strict/1762449912 -> viable/strict/1762449912 2025-12-04T09:33:40.6205718Z * [new tag] viable/strict/1762457031 -> viable/strict/1762457031 2025-12-04T09:33:40.6206882Z * [new tag] viable/strict/1762462441 -> viable/strict/1762462441 2025-12-04T09:33:40.6208057Z * [new tag] viable/strict/1762467909 -> viable/strict/1762467909 2025-12-04T09:33:40.6209257Z * [new tag] viable/strict/1762471493 -> viable/strict/1762471493 2025-12-04T09:33:40.6210400Z * [new tag] viable/strict/1762475990 -> viable/strict/1762475990 2025-12-04T09:33:40.6211632Z * [new tag] viable/strict/1762477933 -> viable/strict/1762477933 2025-12-04T09:33:40.6212825Z * [new tag] viable/strict/1762491053 -> viable/strict/1762491053 2025-12-04T09:33:40.6214058Z * [new tag] viable/strict/1762493118 -> viable/strict/1762493118 2025-12-04T09:33:40.6215097Z * [new tag] viable/strict/1762498442 -> viable/strict/1762498442 2025-12-04T09:33:40.6216220Z * [new tag] viable/strict/1762501778 -> viable/strict/1762501778 2025-12-04T09:33:40.6217392Z * [new tag] viable/strict/1762504001 -> viable/strict/1762504001 2025-12-04T09:33:40.6218849Z * [new tag] viable/strict/1762505583 -> viable/strict/1762505583 2025-12-04T09:33:40.6219976Z * [new tag] viable/strict/1762507523 -> viable/strict/1762507523 2025-12-04T09:33:40.6221157Z * [new tag] viable/strict/1762511140 -> viable/strict/1762511140 2025-12-04T09:33:40.6222527Z * [new tag] viable/strict/1762512632 -> viable/strict/1762512632 2025-12-04T09:33:40.6223623Z * [new tag] viable/strict/1762520467 -> viable/strict/1762520467 2025-12-04T09:33:40.6224767Z * [new tag] viable/strict/1762522016 -> viable/strict/1762522016 2025-12-04T09:33:40.6225838Z * [new tag] viable/strict/1762530591 -> viable/strict/1762530591 2025-12-04T09:33:40.6226967Z * [new tag] viable/strict/1762543405 -> viable/strict/1762543405 2025-12-04T09:33:40.6227940Z * [new tag] viable/strict/1762544998 -> viable/strict/1762544998 2025-12-04T09:33:40.6229140Z * [new tag] viable/strict/1762552182 -> viable/strict/1762552182 2025-12-04T09:33:40.6230325Z * [new tag] viable/strict/1762554297 -> viable/strict/1762554297 2025-12-04T09:33:40.6231339Z * [new tag] viable/strict/1762559381 -> viable/strict/1762559381 2025-12-04T09:33:40.6232474Z * [new tag] viable/strict/1762562222 -> viable/strict/1762562222 2025-12-04T09:33:40.6233564Z * [new tag] viable/strict/1762564319 -> viable/strict/1762564319 2025-12-04T09:33:40.6234544Z * [new tag] viable/strict/1762566904 -> viable/strict/1762566904 2025-12-04T09:33:40.6235675Z * [new tag] viable/strict/1762569781 -> viable/strict/1762569781 2025-12-04T09:33:40.6236690Z * [new tag] viable/strict/1762575940 -> viable/strict/1762575940 2025-12-04T09:33:40.6237815Z * [new tag] viable/strict/1762580974 -> viable/strict/1762580974 2025-12-04T09:33:40.6239000Z * [new tag] viable/strict/1762583185 -> viable/strict/1762583185 2025-12-04T09:33:40.6240135Z * [new tag] viable/strict/1762586647 -> viable/strict/1762586647 2025-12-04T09:33:40.6241215Z * [new tag] viable/strict/1762588183 -> viable/strict/1762588183 2025-12-04T09:33:40.6242904Z * [new tag] viable/strict/1762593886 -> viable/strict/1762593886 2025-12-04T09:33:40.6243936Z * [new tag] viable/strict/1762650743 -> viable/strict/1762650743 2025-12-04T09:33:40.6245149Z * [new tag] viable/strict/1762653328 -> viable/strict/1762653328 2025-12-04T09:33:40.6246309Z * [new tag] viable/strict/1762659342 -> viable/strict/1762659342 2025-12-04T09:33:40.6247441Z * [new tag] viable/strict/1762662360 -> viable/strict/1762662360 2025-12-04T09:33:40.6248545Z * [new tag] viable/strict/1762667377 -> viable/strict/1762667377 2025-12-04T09:33:40.6249667Z * [new tag] viable/strict/1762671090 -> viable/strict/1762671090 2025-12-04T09:33:40.6250767Z * [new tag] viable/strict/1762680284 -> viable/strict/1762680284 2025-12-04T09:33:40.6251893Z * [new tag] viable/strict/1762683900 -> viable/strict/1762683900 2025-12-04T09:33:40.6252987Z * [new tag] viable/strict/1762705541 -> viable/strict/1762705541 2025-12-04T09:33:40.6254161Z * [new tag] viable/strict/1762709004 -> viable/strict/1762709004 2025-12-04T09:33:40.6255370Z * [new tag] viable/strict/1762746004 -> viable/strict/1762746004 2025-12-04T09:33:40.6256576Z * [new tag] viable/strict/1762748799 -> viable/strict/1762748799 2025-12-04T09:33:40.6257698Z * [new tag] viable/strict/1762759504 -> viable/strict/1762759504 2025-12-04T09:33:40.6259184Z * [new tag] viable/strict/1762760973 -> viable/strict/1762760973 2025-12-04T09:33:40.6260214Z * [new tag] viable/strict/1762775374 -> viable/strict/1762775374 2025-12-04T09:33:40.6261368Z * [new tag] viable/strict/1762777661 -> viable/strict/1762777661 2025-12-04T09:33:40.6262480Z * [new tag] viable/strict/1762779774 -> viable/strict/1762779774 2025-12-04T09:33:40.6263880Z * [new tag] viable/strict/1762781259 -> viable/strict/1762781259 2025-12-04T09:33:40.6264996Z * [new tag] viable/strict/1762793628 -> viable/strict/1762793628 2025-12-04T09:33:40.6266263Z * [new tag] viable/strict/1762800711 -> viable/strict/1762800711 2025-12-04T09:33:40.6267513Z * [new tag] viable/strict/1762809894 -> viable/strict/1762809894 2025-12-04T09:33:40.6268615Z * [new tag] viable/strict/1762811384 -> viable/strict/1762811384 2025-12-04T09:33:40.6269921Z * [new tag] viable/strict/1762813841 -> viable/strict/1762813841 2025-12-04T09:33:40.6270966Z * [new tag] viable/strict/1762815047 -> viable/strict/1762815047 2025-12-04T09:33:40.6272442Z * [new tag] viable/strict/1762817094 -> viable/strict/1762817094 2025-12-04T09:33:40.6273502Z * [new tag] viable/strict/1762818582 -> viable/strict/1762818582 2025-12-04T09:33:40.6274721Z * [new tag] viable/strict/1762821623 -> viable/strict/1762821623 2025-12-04T09:33:40.6275695Z * [new tag] viable/strict/1762823531 -> viable/strict/1762823531 2025-12-04T09:33:40.6276901Z * [new tag] viable/strict/1762849583 -> viable/strict/1762849583 2025-12-04T09:33:40.6278013Z * [new tag] viable/strict/1762851200 -> viable/strict/1762851200 2025-12-04T09:33:40.6279151Z * [new tag] viable/strict/1762854603 -> viable/strict/1762854603 2025-12-04T09:33:40.6280349Z * [new tag] viable/strict/1762858276 -> viable/strict/1762858276 2025-12-04T09:33:40.6281670Z * [new tag] viable/strict/1762860891 -> viable/strict/1762860891 2025-12-04T09:33:40.6283491Z * [new tag] viable/strict/1762866174 -> viable/strict/1762866174 2025-12-04T09:33:40.6284601Z * [new tag] viable/strict/1762867653 -> viable/strict/1762867653 2025-12-04T09:33:40.6285724Z * [new tag] viable/strict/1762872669 -> viable/strict/1762872669 2025-12-04T09:33:40.6286560Z * [new tag] viable/strict/1762878380 -> viable/strict/1762878380 2025-12-04T09:33:40.6287981Z * [new tag] viable/strict/1762889003 -> viable/strict/1762889003 2025-12-04T09:33:40.6293139Z * [new tag] viable/strict/1762890589 -> viable/strict/1762890589 2025-12-04T09:33:40.6294387Z * [new tag] viable/strict/1762892743 -> viable/strict/1762892743 2025-12-04T09:33:40.6295595Z * [new tag] viable/strict/1762894271 -> viable/strict/1762894271 2025-12-04T09:33:40.6296449Z * [new tag] viable/strict/1762896287 -> viable/strict/1762896287 2025-12-04T09:33:40.6297733Z * [new tag] viable/strict/1762915871 -> viable/strict/1762915871 2025-12-04T09:33:40.6299060Z * [new tag] viable/strict/1762918569 -> viable/strict/1762918569 2025-12-04T09:33:40.6299945Z * [new tag] viable/strict/1762919776 -> viable/strict/1762919776 2025-12-04T09:33:40.6301193Z * [new tag] viable/strict/1762923072 -> viable/strict/1762923072 2025-12-04T09:33:40.6302468Z * [new tag] viable/strict/1762928826 -> viable/strict/1762928826 2025-12-04T09:33:40.6303663Z * [new tag] viable/strict/1762930451 -> viable/strict/1762930451 2025-12-04T09:33:40.6304778Z * [new tag] viable/strict/1762933780 -> viable/strict/1762933780 2025-12-04T09:33:40.6305949Z * [new tag] viable/strict/1762937638 -> viable/strict/1762937638 2025-12-04T09:33:40.6307318Z * [new tag] viable/strict/1762939545 -> viable/strict/1762939545 2025-12-04T09:33:40.6308594Z * [new tag] viable/strict/1762962692 -> viable/strict/1762962692 2025-12-04T09:33:40.6309756Z * [new tag] viable/strict/1762979143 -> viable/strict/1762979143 2025-12-04T09:33:40.6310860Z * [new tag] viable/strict/1762984188 -> viable/strict/1762984188 2025-12-04T09:33:40.6311724Z * [new tag] viable/strict/1762986306 -> viable/strict/1762986306 2025-12-04T09:33:40.6312993Z * [new tag] viable/strict/1762989903 -> viable/strict/1762989903 2025-12-04T09:33:40.6314150Z * [new tag] viable/strict/1762991377 -> viable/strict/1762991377 2025-12-04T09:33:40.6315257Z * [new tag] viable/strict/1762998921 -> viable/strict/1762998921 2025-12-04T09:33:40.6316612Z * [new tag] viable/strict/1763002287 -> viable/strict/1763002287 2025-12-04T09:33:40.6318123Z * [new tag] viable/strict/1763016840 -> viable/strict/1763016840 2025-12-04T09:33:40.6319314Z * [new tag] viable/strict/1763020180 -> viable/strict/1763020180 2025-12-04T09:33:40.6320555Z * [new tag] viable/strict/1763027421 -> viable/strict/1763027421 2025-12-04T09:33:40.6321687Z * [new tag] viable/strict/1763031120 -> viable/strict/1763031120 2025-12-04T09:33:40.6323290Z * [new tag] viable/strict/1763036861 -> viable/strict/1763036861 2025-12-04T09:33:40.6324488Z * [new tag] viable/strict/1763038993 -> viable/strict/1763038993 2025-12-04T09:33:40.6325684Z * [new tag] viable/strict/1763054703 -> viable/strict/1763054703 2025-12-04T09:33:40.6326546Z * [new tag] viable/strict/1763067061 -> viable/strict/1763067061 2025-12-04T09:33:40.6327842Z * [new tag] viable/strict/1763070847 -> viable/strict/1763070847 2025-12-04T09:33:40.6329035Z * [new tag] viable/strict/1763072706 -> viable/strict/1763072706 2025-12-04T09:33:40.6330216Z * [new tag] viable/strict/1763076302 -> viable/strict/1763076302 2025-12-04T09:33:40.6331485Z * [new tag] viable/strict/1763080816 -> viable/strict/1763080816 2025-12-04T09:33:40.6332589Z * [new tag] viable/strict/1763082732 -> viable/strict/1763082732 2025-12-04T09:33:40.6333773Z * [new tag] viable/strict/1763085329 -> viable/strict/1763085329 2025-12-04T09:33:40.6334889Z * [new tag] viable/strict/1763088623 -> viable/strict/1763088623 2025-12-04T09:33:40.6336122Z * [new tag] viable/strict/1763091402 -> viable/strict/1763091402 2025-12-04T09:33:40.6337249Z * [new tag] viable/strict/1763092602 -> viable/strict/1763092602 2025-12-04T09:33:40.6338465Z * [new tag] viable/strict/1763094355 -> viable/strict/1763094355 2025-12-04T09:33:40.6339691Z * [new tag] viable/strict/1763099390 -> viable/strict/1763099390 2025-12-04T09:33:40.6340851Z * [new tag] viable/strict/1763101608 -> viable/strict/1763101608 2025-12-04T09:33:40.6342004Z * [new tag] viable/strict/1763105102 -> viable/strict/1763105102 2025-12-04T09:33:40.6343237Z * [new tag] viable/strict/1763112347 -> viable/strict/1763112347 2025-12-04T09:33:40.6344409Z * [new tag] viable/strict/1763119471 -> viable/strict/1763119471 2025-12-04T09:33:40.6345370Z * [new tag] viable/strict/1763126835 -> viable/strict/1763126835 2025-12-04T09:33:40.6346463Z * [new tag] viable/strict/1763149779 -> viable/strict/1763149779 2025-12-04T09:33:40.6347617Z * [new tag] viable/strict/1763164178 -> viable/strict/1763164178 2025-12-04T09:33:40.6348735Z * [new tag] viable/strict/1763167104 -> viable/strict/1763167104 2025-12-04T09:33:40.6349832Z * [new tag] viable/strict/1763169132 -> viable/strict/1763169132 2025-12-04T09:33:40.6350987Z * [new tag] viable/strict/1763171708 -> viable/strict/1763171708 2025-12-04T09:33:40.6352082Z * [new tag] viable/strict/1763174759 -> viable/strict/1763174759 2025-12-04T09:33:40.6353306Z * [new tag] viable/strict/1763180744 -> viable/strict/1763180744 2025-12-04T09:33:40.6354430Z * [new tag] viable/strict/1763182227 -> viable/strict/1763182227 2025-12-04T09:33:40.6355544Z * [new tag] viable/strict/1763184309 -> viable/strict/1763184309 2025-12-04T09:33:40.6357211Z * [new tag] viable/strict/1763187991 -> viable/strict/1763187991 2025-12-04T09:33:40.6358325Z * [new tag] viable/strict/1763191445 -> viable/strict/1763191445 2025-12-04T09:33:40.6359759Z * [new tag] viable/strict/1763195152 -> viable/strict/1763195152 2025-12-04T09:33:40.6360776Z * [new tag] viable/strict/1763205769 -> viable/strict/1763205769 2025-12-04T09:33:40.6361943Z * [new tag] viable/strict/1763246990 -> viable/strict/1763246990 2025-12-04T09:33:40.6363173Z * [new tag] viable/strict/1763261578 -> viable/strict/1763261578 2025-12-04T09:33:40.6364195Z * [new tag] viable/strict/1763286573 -> viable/strict/1763286573 2025-12-04T09:33:40.6365093Z * [new tag] viable/strict/1763292167 -> viable/strict/1763292167 2025-12-04T09:33:40.6366350Z * [new tag] viable/strict/1763333386 -> viable/strict/1763333386 2025-12-04T09:33:40.6367480Z * [new tag] viable/strict/1763340082 -> viable/strict/1763340082 2025-12-04T09:33:40.6369430Z * [new tag] viable/strict/1763364324 -> viable/strict/1763364324 2025-12-04T09:33:40.6370590Z * [new tag] viable/strict/1763371569 -> viable/strict/1763371569 2025-12-04T09:33:40.6371737Z * [new tag] viable/strict/1763373067 -> viable/strict/1763373067 2025-12-04T09:33:40.6372896Z * [new tag] viable/strict/1763375157 -> viable/strict/1763375157 2025-12-04T09:33:40.6374051Z * [new tag] viable/strict/1763382462 -> viable/strict/1763382462 2025-12-04T09:33:40.6375255Z * [new tag] viable/strict/1763394661 -> viable/strict/1763394661 2025-12-04T09:33:40.6376604Z * [new tag] viable/strict/1763396797 -> viable/strict/1763396797 2025-12-04T09:33:40.6377792Z * [new tag] viable/strict/1763398542 -> viable/strict/1763398542 2025-12-04T09:33:40.6379088Z * [new tag] viable/strict/1763401807 -> viable/strict/1763401807 2025-12-04T09:33:40.6379971Z * [new tag] viable/strict/1763414698 -> viable/strict/1763414698 2025-12-04T09:33:40.6381255Z * [new tag] viable/strict/1763419807 -> viable/strict/1763419807 2025-12-04T09:33:40.6382433Z * [new tag] viable/strict/1763426369 -> viable/strict/1763426369 2025-12-04T09:33:40.6383670Z * [new tag] viable/strict/1763428331 -> viable/strict/1763428331 2025-12-04T09:33:40.6385012Z * [new tag] viable/strict/1763430922 -> viable/strict/1763430922 2025-12-04T09:33:40.6385854Z * [new tag] viable/strict/1763434184 -> viable/strict/1763434184 2025-12-04T09:33:40.6387162Z * [new tag] viable/strict/1763439973 -> viable/strict/1763439973 2025-12-04T09:33:40.6388580Z * [new tag] viable/strict/1763444995 -> viable/strict/1763444995 2025-12-04T09:33:40.6389695Z * [new tag] viable/strict/1763447206 -> viable/strict/1763447206 2025-12-04T09:33:40.6390930Z * [new tag] viable/strict/1763448826 -> viable/strict/1763448826 2025-12-04T09:33:40.6392053Z * [new tag] viable/strict/1763450717 -> viable/strict/1763450717 2025-12-04T09:33:40.6393217Z * [new tag] viable/strict/1763452183 -> viable/strict/1763452183 2025-12-04T09:33:40.6394464Z * [new tag] viable/strict/1763457945 -> viable/strict/1763457945 2025-12-04T09:33:40.6395555Z * [new tag] viable/strict/1763459439 -> viable/strict/1763459439 2025-12-04T09:33:40.6396433Z * [new tag] viable/strict/1763461556 -> viable/strict/1763461556 2025-12-04T09:33:40.6397745Z * [new tag] viable/strict/1763463103 -> viable/strict/1763463103 2025-12-04T09:33:40.6399463Z * [new tag] viable/strict/1763465100 -> viable/strict/1763465100 2025-12-04T09:33:40.6400295Z * [new tag] viable/strict/1763468866 -> viable/strict/1763468866 2025-12-04T09:33:40.6401393Z * [new tag] viable/strict/1763493823 -> viable/strict/1763493823 2025-12-04T09:33:40.6402271Z * [new tag] viable/strict/1763496249 -> viable/strict/1763496249 2025-12-04T09:33:40.6403554Z * [new tag] viable/strict/1763502620 -> viable/strict/1763502620 2025-12-04T09:33:40.6404784Z * [new tag] viable/strict/1763504715 -> viable/strict/1763504715 2025-12-04T09:33:40.6405921Z * [new tag] viable/strict/1763506208 -> viable/strict/1763506208 2025-12-04T09:33:40.6407088Z * [new tag] viable/strict/1763520590 -> viable/strict/1763520590 2025-12-04T09:33:40.6408284Z * [new tag] viable/strict/1763523357 -> viable/strict/1763523357 2025-12-04T09:33:40.6409443Z * [new tag] viable/strict/1763529922 -> viable/strict/1763529922 2025-12-04T09:33:40.6410686Z * [new tag] viable/strict/1763531408 -> viable/strict/1763531408 2025-12-04T09:33:40.6411829Z * [new tag] viable/strict/1763533622 -> viable/strict/1763533622 2025-12-04T09:33:40.6412976Z * [new tag] viable/strict/1763538576 -> viable/strict/1763538576 2025-12-04T09:33:40.6414228Z * [new tag] viable/strict/1763545823 -> viable/strict/1763545823 2025-12-04T09:33:40.6415086Z * [new tag] viable/strict/1763547951 -> viable/strict/1763547951 2025-12-04T09:33:40.6416402Z * [new tag] viable/strict/1763551477 -> viable/strict/1763551477 2025-12-04T09:33:40.6417529Z * [new tag] viable/strict/1763552982 -> viable/strict/1763552982 2025-12-04T09:33:40.6418746Z * [new tag] viable/strict/1763594698 -> viable/strict/1763594698 2025-12-04T09:33:40.6419929Z * [new tag] viable/strict/1763596178 -> viable/strict/1763596178 2025-12-04T09:33:40.6421107Z * [new tag] viable/strict/1763599155 -> viable/strict/1763599155 2025-12-04T09:33:40.6422351Z * [new tag] viable/strict/1763603717 -> viable/strict/1763603717 2025-12-04T09:33:40.6423532Z * [new tag] viable/strict/1763606923 -> viable/strict/1763606923 2025-12-04T09:33:40.6424649Z * [new tag] viable/strict/1763609715 -> viable/strict/1763609715 2025-12-04T09:33:40.6425782Z * [new tag] viable/strict/1763612757 -> viable/strict/1763612757 2025-12-04T09:33:40.6426925Z * [new tag] viable/strict/1763616325 -> viable/strict/1763616325 2025-12-04T09:33:40.6428076Z * [new tag] viable/strict/1763623509 -> viable/strict/1763623509 2025-12-04T09:33:40.6429370Z * [new tag] viable/strict/1763624984 -> viable/strict/1763624984 2025-12-04T09:33:40.6430574Z * [new tag] viable/strict/1763628796 -> viable/strict/1763628796 2025-12-04T09:33:40.6431658Z * [new tag] viable/strict/1763634343 -> viable/strict/1763634343 2025-12-04T09:33:40.6432728Z * [new tag] viable/strict/1763635867 -> viable/strict/1763635867 2025-12-04T09:33:40.6434030Z * [new tag] viable/strict/1763639382 -> viable/strict/1763639382 2025-12-04T09:33:40.6435156Z * [new tag] viable/strict/1763646626 -> viable/strict/1763646626 2025-12-04T09:33:40.6436459Z * [new tag] viable/strict/1763655997 -> viable/strict/1763655997 2025-12-04T09:33:40.6437630Z * [new tag] viable/strict/1763659444 -> viable/strict/1763659444 2025-12-04T09:33:40.6438731Z * [new tag] viable/strict/1763660992 -> viable/strict/1763660992 2025-12-04T09:33:40.6439917Z * [new tag] viable/strict/1763663201 -> viable/strict/1763663201 2025-12-04T09:33:40.6441116Z * [new tag] viable/strict/1763670362 -> viable/strict/1763670362 2025-12-04T09:33:40.6441985Z * [new tag] viable/strict/1763675378 -> viable/strict/1763675378 2025-12-04T09:33:40.6443244Z * [new tag] viable/strict/1763693343 -> viable/strict/1763693343 2025-12-04T09:33:40.6444205Z * [new tag] viable/strict/1763696088 -> viable/strict/1763696088 2025-12-04T09:33:40.6445738Z * [new tag] viable/strict/1763697343 -> viable/strict/1763697343 2025-12-04T09:33:40.6446846Z * [new tag] viable/strict/1763699165 -> viable/strict/1763699165 2025-12-04T09:33:40.6448005Z * [new tag] viable/strict/1763700660 -> viable/strict/1763700660 2025-12-04T09:33:40.6449089Z * [new tag] viable/strict/1763704209 -> viable/strict/1763704209 2025-12-04T09:33:40.6450228Z * [new tag] viable/strict/1763706411 -> viable/strict/1763706411 2025-12-04T09:33:40.6451366Z * [new tag] viable/strict/1763708082 -> viable/strict/1763708082 2025-12-04T09:33:40.6452262Z * [new tag] viable/strict/1763711381 -> viable/strict/1763711381 2025-12-04T09:33:40.6453437Z * [new tag] viable/strict/1763713593 -> viable/strict/1763713593 2025-12-04T09:33:40.6454574Z * [new tag] viable/strict/1763715201 -> viable/strict/1763715201 2025-12-04T09:33:40.6455669Z * [new tag] viable/strict/1763733017 -> viable/strict/1763733017 2025-12-04T09:33:40.6456834Z * [new tag] viable/strict/1763735108 -> viable/strict/1763735108 2025-12-04T09:33:40.6457979Z * [new tag] viable/strict/1763749579 -> viable/strict/1763749579 2025-12-04T09:33:40.6459332Z * [new tag] viable/strict/1763751113 -> viable/strict/1763751113 2025-12-04T09:33:40.6460499Z * [new tag] viable/strict/1763753035 -> viable/strict/1763753035 2025-12-04T09:33:40.6461695Z * [new tag] viable/strict/1763754578 -> viable/strict/1763754578 2025-12-04T09:33:40.6462834Z * [new tag] viable/strict/1763756748 -> viable/strict/1763756748 2025-12-04T09:33:40.6463926Z * [new tag] viable/strict/1763758205 -> viable/strict/1763758205 2025-12-04T09:33:40.6464784Z * [new tag] viable/strict/1763764050 -> viable/strict/1763764050 2025-12-04T09:33:40.6465993Z * [new tag] viable/strict/1763771887 -> viable/strict/1763771887 2025-12-04T09:33:40.6467308Z * [new tag] viable/strict/1763773920 -> viable/strict/1763773920 2025-12-04T09:33:40.6468386Z * [new tag] viable/strict/1763776501 -> viable/strict/1763776501 2025-12-04T09:33:40.6469511Z * [new tag] viable/strict/1763779437 -> viable/strict/1763779437 2025-12-04T09:33:40.6470898Z * [new tag] viable/strict/1763781038 -> viable/strict/1763781038 2025-12-04T09:33:40.6471813Z * [new tag] viable/strict/1763782245 -> viable/strict/1763782245 2025-12-04T09:33:40.6473434Z * [new tag] viable/strict/1763785568 -> viable/strict/1763785568 2025-12-04T09:33:40.6474641Z * [new tag] viable/strict/1763787006 -> viable/strict/1763787006 2025-12-04T09:33:40.6475847Z * [new tag] viable/strict/1763789103 -> viable/strict/1763789103 2025-12-04T09:33:40.6476946Z * [new tag] viable/strict/1763790578 -> viable/strict/1763790578 2025-12-04T09:33:40.6478060Z * [new tag] viable/strict/1763796275 -> viable/strict/1763796275 2025-12-04T09:33:40.6479465Z * [new tag] viable/strict/1763801465 -> viable/strict/1763801465 2025-12-04T09:33:40.6480569Z * [new tag] viable/strict/1763803522 -> viable/strict/1763803522 2025-12-04T09:33:40.6481683Z * [new tag] viable/strict/1763808581 -> viable/strict/1763808581 2025-12-04T09:33:40.6482823Z * [new tag] viable/strict/1763840977 -> viable/strict/1763840977 2025-12-04T09:33:40.6483940Z * [new tag] viable/strict/1763846659 -> viable/strict/1763846659 2025-12-04T09:33:40.6484995Z * [new tag] viable/strict/1763872065 -> viable/strict/1763872065 2025-12-04T09:33:40.6486175Z * [new tag] viable/strict/1763873648 -> viable/strict/1763873648 2025-12-04T09:33:40.6487341Z * [new tag] viable/strict/1763875506 -> viable/strict/1763875506 2025-12-04T09:33:40.6488304Z * [new tag] viable/strict/1763889904 -> viable/strict/1763889904 2025-12-04T09:33:40.6489668Z * [new tag] viable/strict/1763930999 -> viable/strict/1763930999 2025-12-04T09:33:40.6490796Z * [new tag] viable/strict/1763944964 -> viable/strict/1763944964 2025-12-04T09:33:40.6491667Z * [new tag] viable/strict/1763958474 -> viable/strict/1763958474 2025-12-04T09:33:40.6492880Z * [new tag] viable/strict/1763967263 -> viable/strict/1763967263 2025-12-04T09:33:40.6494018Z * [new tag] viable/strict/1763972803 -> viable/strict/1763972803 2025-12-04T09:33:40.6495099Z * [new tag] viable/strict/1763976376 -> viable/strict/1763976376 2025-12-04T09:33:40.6496365Z * [new tag] viable/strict/1763989404 -> viable/strict/1763989404 2025-12-04T09:33:40.6497513Z * [new tag] viable/strict/1763990887 -> viable/strict/1763990887 2025-12-04T09:33:40.6498685Z * [new tag] viable/strict/1764019919 -> viable/strict/1764019919 2025-12-04T09:33:40.6499975Z * [new tag] viable/strict/1764023134 -> viable/strict/1764023134 2025-12-04T09:33:40.6500817Z * [new tag] viable/strict/1764024593 -> viable/strict/1764024593 2025-12-04T09:33:40.6502025Z * [new tag] viable/strict/1764026706 -> viable/strict/1764026706 2025-12-04T09:33:40.6503543Z * [new tag] viable/strict/1764031139 -> viable/strict/1764031139 2025-12-04T09:33:40.6504678Z * [new tag] viable/strict/1764033131 -> viable/strict/1764033131 2025-12-04T09:33:40.6505544Z * [new tag] viable/strict/1764035725 -> viable/strict/1764035725 2025-12-04T09:33:40.6506603Z * [new tag] viable/strict/1764624265 -> viable/strict/1764624265 2025-12-04T09:33:40.6507488Z * [new tag] viable/strict/1764631514 -> viable/strict/1764631514 2025-12-04T09:33:40.6508522Z * [new tag] viable/strict/1764632987 -> viable/strict/1764632987 2025-12-04T09:33:40.6509362Z * [new tag] viable/strict/1764636063 -> viable/strict/1764636063 2025-12-04T09:33:40.6510436Z * [new tag] viable/strict/1764643975 -> viable/strict/1764643975 2025-12-04T09:33:40.6511301Z * [new tag] viable/strict/1764646859 -> viable/strict/1764646859 2025-12-04T09:33:40.6512459Z * [new tag] viable/strict/1764653120 -> viable/strict/1764653120 2025-12-04T09:33:40.6513171Z * [new tag] viable/strict/1764654632 -> viable/strict/1764654632 2025-12-04T09:33:40.6514234Z * [new tag] viable/strict/1764656821 -> viable/strict/1764656821 2025-12-04T09:33:40.6515081Z * [new tag] viable/strict/1764658557 -> viable/strict/1764658557 2025-12-04T09:33:40.6516091Z * [new tag] viable/strict/1764660333 -> viable/strict/1764660333 2025-12-04T09:33:40.6516991Z * [new tag] viable/strict/1764661812 -> viable/strict/1764661812 2025-12-04T09:33:40.6518002Z * [new tag] viable/strict/1764664023 -> viable/strict/1764664023 2025-12-04T09:33:40.6518883Z * [new tag] viable/strict/1764669150 -> viable/strict/1764669150 2025-12-04T09:33:40.6519897Z * [new tag] viable/strict/1764680709 -> viable/strict/1764680709 2025-12-04T09:33:40.6520749Z * [new tag] viable/strict/1764687619 -> viable/strict/1764687619 2025-12-04T09:33:40.6521805Z * [new tag] viable/strict/1764696355 -> viable/strict/1764696355 2025-12-04T09:33:40.6522688Z * [new tag] viable/strict/1764701767 -> viable/strict/1764701767 2025-12-04T09:33:40.6523619Z * [new tag] viable/strict/1764710768 -> viable/strict/1764710768 2025-12-04T09:33:40.6524663Z * [new tag] viable/strict/1764716202 -> viable/strict/1764716202 2025-12-04T09:33:40.6525509Z * [new tag] viable/strict/1764793566 -> viable/strict/1764793566 2025-12-04T09:33:40.6526513Z * [new tag] viable/strict/1764797093 -> viable/strict/1764797093 2025-12-04T09:33:40.6527382Z * [new tag] viable/strict/1764800729 -> viable/strict/1764800729 2025-12-04T09:33:40.6528784Z * [new tag] whc_flight_1 -> whc_flight_1 2025-12-04T09:33:40.6529918Z * [new tag] whc_flight_2 -> whc_flight_2 2025-12-04T09:33:40.6531246Z * [new tag] whc_flight_4 -> whc_flight_4 2025-12-04T09:33:40.7387118Z [command]/usr/bin/git rev-parse --verify --quiet ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32^{object} 2025-12-04T09:33:40.7415499Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:33:40.7419739Z ##[endgroup] 2025-12-04T09:33:40.7420089Z ##[group]Determining the checkout info 2025-12-04T09:33:40.7420862Z ##[endgroup] 2025-12-04T09:33:40.7425148Z [command]/usr/bin/git sparse-checkout disable 2025-12-04T09:33:40.7460105Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-12-04T09:33:40.7486271Z ##[group]Checking out the ref 2025-12-04T09:33:40.7490327Z [command]/usr/bin/git checkout --progress --force ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:33:41.7893981Z Updating files: 75% (15234/20121) 2025-12-04T09:33:41.8051470Z Updating files: 76% (15292/20121) 2025-12-04T09:33:41.8193641Z Updating files: 77% (15494/20121) 2025-12-04T09:33:41.8420774Z Updating files: 78% (15695/20121) 2025-12-04T09:33:41.8713568Z Updating files: 79% (15896/20121) 2025-12-04T09:33:41.9069692Z Updating files: 80% (16097/20121) 2025-12-04T09:33:41.9386587Z Updating files: 81% (16299/20121) 2025-12-04T09:33:41.9621008Z Updating files: 82% (16500/20121) 2025-12-04T09:33:41.9788136Z Updating files: 83% (16701/20121) 2025-12-04T09:33:41.9939286Z Updating files: 84% (16902/20121) 2025-12-04T09:33:42.0116815Z Updating files: 85% (17103/20121) 2025-12-04T09:33:42.0284087Z Updating files: 86% (17305/20121) 2025-12-04T09:33:42.0434160Z Updating files: 87% (17506/20121) 2025-12-04T09:33:42.0557238Z Updating files: 88% (17707/20121) 2025-12-04T09:33:42.0709493Z Updating files: 89% (17908/20121) 2025-12-04T09:33:42.0899650Z Updating files: 90% (18109/20121) 2025-12-04T09:33:42.1025036Z Updating files: 91% (18311/20121) 2025-12-04T09:33:42.1194226Z Updating files: 92% (18512/20121) 2025-12-04T09:33:42.1397524Z Updating files: 93% (18713/20121) 2025-12-04T09:33:42.1622401Z Updating files: 94% (18914/20121) 2025-12-04T09:33:42.1815328Z Updating files: 95% (19115/20121) 2025-12-04T09:33:42.1987624Z Updating files: 96% (19317/20121) 2025-12-04T09:33:42.2169076Z Updating files: 97% (19518/20121) 2025-12-04T09:33:42.2483416Z Updating files: 98% (19719/20121) 2025-12-04T09:33:42.2678314Z Updating files: 99% (19920/20121) 2025-12-04T09:33:42.2678674Z Updating files: 100% (20121/20121) 2025-12-04T09:33:42.2679031Z Updating files: 100% (20121/20121), done. 2025-12-04T09:33:42.2981960Z Note: switching to 'ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32'. 2025-12-04T09:33:42.2982329Z 2025-12-04T09:33:42.2982585Z You are in 'detached HEAD' state. You can look around, make experimental 2025-12-04T09:33:42.2983252Z changes and commit them, and you can discard any commits you make in this 2025-12-04T09:33:42.2983902Z state without impacting any branches by switching back to a branch. 2025-12-04T09:33:42.2984301Z 2025-12-04T09:33:42.2984553Z If you want to create a new branch to retain commits you create, you may 2025-12-04T09:33:42.2985137Z do so (now or later) by using -c with the switch command. Example: 2025-12-04T09:33:42.2985489Z 2025-12-04T09:33:42.2985616Z git switch -c 2025-12-04T09:33:42.2986107Z 2025-12-04T09:33:42.2986229Z Or undo this operation with: 2025-12-04T09:33:42.2986438Z 2025-12-04T09:33:42.2986552Z git switch - 2025-12-04T09:33:42.2986706Z 2025-12-04T09:33:42.2986983Z Turn off this advice by setting config variable advice.detachedHead to false 2025-12-04T09:33:42.2987407Z 2025-12-04T09:33:42.2989492Z HEAD is now at ffd9b0fb435 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T09:33:42.3074581Z ##[endgroup] 2025-12-04T09:33:42.3075072Z ##[group]Setting up auth for fetching submodules 2025-12-04T09:33:42.3081753Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T09:33:42.3131122Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-12-04T09:33:42.3160078Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-12-04T09:33:42.3188724Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-12-04T09:33:42.3213739Z ##[endgroup] 2025-12-04T09:33:42.3214184Z ##[group]Fetching submodules 2025-12-04T09:33:42.3218281Z [command]/usr/bin/git submodule sync --recursive 2025-12-04T09:33:42.3559732Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2025-12-04T09:33:42.3889695Z Submodule 'android/libs/fbjni' (https://github.com/facebookincubator/fbjni.git) registered for path 'android/libs/fbjni' 2025-12-04T09:33:42.3891915Z Submodule 'third_party/NNPACK_deps/FP16' (https://github.com/Maratyszcza/FP16.git) registered for path 'third_party/FP16' 2025-12-04T09:33:42.3894863Z Submodule 'third_party/NNPACK_deps/FXdiv' (https://github.com/Maratyszcza/FXdiv.git) registered for path 'third_party/FXdiv' 2025-12-04T09:33:42.3897822Z Submodule 'third_party/NNPACK' (https://github.com/Maratyszcza/NNPACK.git) registered for path 'third_party/NNPACK' 2025-12-04T09:33:42.3901082Z Submodule 'third_party/NVTX' (https://github.com/NVIDIA/NVTX.git) registered for path 'third_party/NVTX' 2025-12-04T09:33:42.3904766Z Submodule 'third_party/VulkanMemoryAllocator' (https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator.git) registered for path 'third_party/VulkanMemoryAllocator' 2025-12-04T09:33:42.3907819Z Submodule 'third_party/XNNPACK' (https://github.com/google/XNNPACK.git) registered for path 'third_party/XNNPACK' 2025-12-04T09:33:42.3911211Z Submodule 'third_party/aiter' (https://github.com/ROCm/aiter.git) registered for path 'third_party/aiter' 2025-12-04T09:33:42.3914901Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/benchmark' 2025-12-04T09:33:42.3919011Z Submodule 'third_party/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/composable_kernel' 2025-12-04T09:33:42.3922574Z Submodule 'third_party/cpp-httplib' (https://github.com/yhirose/cpp-httplib.git) registered for path 'third_party/cpp-httplib' 2025-12-04T09:33:42.3926548Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo.git) registered for path 'third_party/cpuinfo' 2025-12-04T09:33:42.3930662Z Submodule 'third_party/cudnn_frontend' (https://github.com/NVIDIA/cudnn-frontend.git) registered for path 'third_party/cudnn_frontend' 2025-12-04T09:33:42.3934855Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/cutlass' 2025-12-04T09:33:42.3939363Z Submodule 'third_party/fbgemm' (https://github.com/pytorch/fbgemm) registered for path 'third_party/fbgemm' 2025-12-04T09:33:42.3944532Z Submodule 'third_party/flash-attention' (https://github.com/Dao-AILab/flash-attention.git) registered for path 'third_party/flash-attention' 2025-12-04T09:33:42.3950757Z Submodule 'third_party/flatbuffers' (https://github.com/google/flatbuffers.git) registered for path 'third_party/flatbuffers' 2025-12-04T09:33:42.3955345Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/fmt' 2025-12-04T09:33:42.3960240Z Submodule 'third_party/gemmlowp/gemmlowp' (https://github.com/google/gemmlowp.git) registered for path 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:33:42.3964881Z Submodule 'third_party/gloo' (https://github.com/pytorch/gloo) registered for path 'third_party/gloo' 2025-12-04T09:33:42.3969914Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/googletest' 2025-12-04T09:33:42.3974777Z Submodule 'third_party/ideep' (https://github.com/intel/ideep) registered for path 'third_party/ideep' 2025-12-04T09:33:42.3980097Z Submodule 'third_party/ittapi' (https://github.com/intel/ittapi.git) registered for path 'third_party/ittapi' 2025-12-04T09:33:42.3985312Z Submodule 'third_party/kineto' (https://github.com/pytorch/kineto) registered for path 'third_party/kineto' 2025-12-04T09:33:42.3991150Z Submodule 'third_party/kleidiai' (https://github.com/ARM-software/kleidiai.git) registered for path 'third_party/kleidiai' 2025-12-04T09:33:42.3998277Z Submodule 'third_party/mimalloc' (https://github.com/microsoft/mimalloc.git) registered for path 'third_party/mimalloc' 2025-12-04T09:33:42.4002034Z Submodule 'third_party/nlohmann' (https://github.com/nlohmann/json.git) registered for path 'third_party/nlohmann' 2025-12-04T09:33:42.4007517Z Submodule 'third_party/onnx' (https://github.com/onnx/onnx.git) registered for path 'third_party/onnx' 2025-12-04T09:33:42.4013569Z Submodule 'third_party/opentelemetry-cpp' (https://github.com/open-telemetry/opentelemetry-cpp.git) registered for path 'third_party/opentelemetry-cpp' 2025-12-04T09:33:42.4019383Z Submodule 'third_party/pocketfft' (https://github.com/mreineck/pocketfft) registered for path 'third_party/pocketfft' 2025-12-04T09:33:42.4025489Z Submodule 'third_party/protobuf' (https://github.com/protocolbuffers/protobuf.git) registered for path 'third_party/protobuf' 2025-12-04T09:33:42.4031546Z Submodule 'third_party/NNPACK_deps/psimd' (https://github.com/Maratyszcza/psimd.git) registered for path 'third_party/psimd' 2025-12-04T09:33:42.4037858Z Submodule 'third_party/NNPACK_deps/pthreadpool' (https://github.com/Maratyszcza/pthreadpool.git) registered for path 'third_party/pthreadpool' 2025-12-04T09:33:42.4045608Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/pybind11' 2025-12-04T09:33:42.4052198Z Submodule 'third_party/python-peachpy' (https://github.com/malfet/PeachPy.git) registered for path 'third_party/python-peachpy' 2025-12-04T09:33:42.4058667Z Submodule 'third_party/sleef' (https://github.com/shibatch/sleef) registered for path 'third_party/sleef' 2025-12-04T09:33:42.4065632Z Submodule 'third_party/tensorpipe' (https://github.com/pytorch/tensorpipe.git) registered for path 'third_party/tensorpipe' 2025-12-04T09:33:42.4100093Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/android/libs/fbjni'... 2025-12-04T09:33:42.6561330Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/FP16'... 2025-12-04T09:33:42.6562402Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/FXdiv'... 2025-12-04T09:33:42.6563347Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/NNPACK'... 2025-12-04T09:33:42.6593105Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flatbuffers'... 2025-12-04T09:33:46.5704603Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/benchmark'... 2025-12-04T09:33:46.5706340Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/NVTX'... 2025-12-04T09:33:46.5707967Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/gloo'... 2025-12-04T09:33:46.5709617Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/gemmlowp/gemmlowp'... 2025-12-04T09:33:46.5711247Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cpuinfo'... 2025-12-04T09:33:46.5712936Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flash-attention'... 2025-12-04T09:33:46.5714855Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cpp-httplib'... 2025-12-04T09:33:46.5716496Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ideep'... 2025-12-04T09:33:46.5718052Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ittapi'... 2025-12-04T09:33:46.5719792Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kleidiai'... 2025-12-04T09:33:46.5721204Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pocketfft'... 2025-12-04T09:33:46.5722932Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cudnn_frontend'... 2025-12-04T09:33:46.5724572Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/psimd'... 2025-12-04T09:33:46.5726062Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pthreadpool'... 2025-12-04T09:33:46.5727669Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/googletest'... 2025-12-04T09:33:46.5729309Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/mimalloc'... 2025-12-04T09:33:46.5731053Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/VulkanMemoryAllocator'... 2025-12-04T09:33:46.6047007Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fmt'... 2025-12-04T09:33:46.6742820Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto'... 2025-12-04T09:33:47.3478233Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/python-peachpy'... 2025-12-04T09:33:47.3480193Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe'... 2025-12-04T09:33:47.4479536Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx'... 2025-12-04T09:33:50.5136282Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pybind11'... 2025-12-04T09:33:50.5138508Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/sleef'... 2025-12-04T09:33:50.5140377Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm'... 2025-12-04T09:33:50.5142197Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cutlass'... 2025-12-04T09:33:50.6137327Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/XNNPACK'... 2025-12-04T09:34:12.0587342Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/composable_kernel'... 2025-12-04T09:34:12.0588742Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/nlohmann'... 2025-12-04T09:34:12.0589673Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp'... 2025-12-04T09:34:12.0590571Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/aiter'... 2025-12-04T09:34:12.0591541Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf'... 2025-12-04T09:34:12.0768355Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-12-04T09:34:12.0909777Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-12-04T09:34:12.1019256Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-12-04T09:34:12.1306356Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-12-04T09:34:12.2261572Z Submodule path 'third_party/NVTX': checked out '3ebbc93ded7285963bff932c678fa367eb393ba6' 2025-12-04T09:34:12.2833689Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-12-04T09:34:13.1237998Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-12-04T09:34:13.3385630Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-12-04T09:34:13.3408281Z Submodule '3rdparty/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:34:13.3438731Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/aiter/3rdparty/composable_kernel'... 2025-12-04T09:34:18.6500448Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-12-04T09:34:18.6770167Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-12-04T09:34:19.0810813Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T09:34:19.1360441Z Submodule path 'third_party/cpp-httplib': checked out '89c932f313c6437c38f2982869beacc89c2f2246' 2025-12-04T09:34:19.2470049Z Submodule path 'third_party/cpuinfo': checked out 'f858c30bcb16f8effd5ff46996f0514539e17abc' 2025-12-04T09:34:19.3019901Z Submodule path 'third_party/cudnn_frontend': checked out '0b1577c8c83401237d601d0d0db5210506705396' 2025-12-04T09:34:20.0452646Z Submodule path 'third_party/cutlass': checked out 'f88806b1e31dfa579842638740216dd41fc6c588' 2025-12-04T09:34:20.2228451Z Submodule path 'third_party/fbgemm': checked out 'c0b988d39a9e47c794d699f29930ed4d7c7e13a4' 2025-12-04T09:34:20.2252993Z Submodule 'external/asmjit' (https://github.com/asmjit/asmjit.git) registered for path 'third_party/fbgemm/external/asmjit' 2025-12-04T09:34:20.2256203Z Submodule 'external/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:34:20.2259164Z Submodule 'external/cpuinfo' (https://github.com/pytorch/cpuinfo) registered for path 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:34:20.2262336Z Submodule 'external/cutlass' (https://github.com/jwfromm/cutlass) registered for path 'third_party/fbgemm/external/cutlass' 2025-12-04T09:34:20.2265653Z Submodule 'external/googletest' (https://github.com/google/googletest) registered for path 'third_party/fbgemm/external/googletest' 2025-12-04T09:34:20.2269107Z Submodule 'external/hipify_torch' (https://github.com/ROCmSoftwarePlatform/hipify_torch.git) registered for path 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:34:20.2273241Z Submodule 'external/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/fbgemm/external/json' 2025-12-04T09:34:20.2305360Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/asmjit'... 2025-12-04T09:34:21.5479980Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/hipify_torch'... 2025-12-04T09:34:21.5481172Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/cpuinfo'... 2025-12-04T09:34:21.5482213Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/googletest'... 2025-12-04T09:34:21.6480030Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/composable_kernel'... 2025-12-04T09:34:25.1431644Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/cutlass'... 2025-12-04T09:34:25.2432126Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/json'... 2025-12-04T09:34:28.6896927Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-12-04T09:34:29.0947578Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T09:34:29.2094734Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-12-04T09:34:29.9367432Z Submodule path 'third_party/fbgemm/external/cutlass': checked out '98125ce499b0fdf7ffbe0e3052f5b8709f4840f8' 2025-12-04T09:34:29.9896434Z Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:34:30.0032678Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out '63b6a7b541fa7f08f8475ca7d74054db36ff2691' 2025-12-04T09:34:30.1189080Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-12-04T09:34:30.1988040Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-12-04T09:34:30.2010057Z Submodule 'csrc/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:34:30.2012842Z Submodule 'csrc/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:34:30.2042491Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flash-attention/csrc/composable_kernel'... 2025-12-04T09:34:34.9404557Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flash-attention/csrc/cutlass'... 2025-12-04T09:34:35.2202600Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-12-04T09:34:35.8612564Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-12-04T09:34:36.0198559Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-12-04T09:34:36.0537738Z Submodule path 'third_party/fmt': checked out '407c905e45ad75fc29bf0f9bb7c5c2fd3475976f' 2025-12-04T09:34:36.0992296Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-12-04T09:34:36.1285363Z Submodule path 'third_party/gloo': checked out '54cbae0d3a67fa890b4c3d9ee162b7860315e341' 2025-12-04T09:34:36.1799281Z Submodule path 'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:34:36.1947301Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-12-04T09:34:36.1966418Z Submodule 'mkl-dnn' (https://github.com/intel/mkl-dnn.git) registered for path 'third_party/ideep/mkl-dnn' 2025-12-04T09:34:36.1993789Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ideep/mkl-dnn'... 2025-12-04T09:34:53.2365571Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-12-04T09:34:53.2619153Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-12-04T09:34:53.3592060Z Submodule path 'third_party/kineto': checked out '31f85df8fbd89c188f14ef10f1ec65379786b943' 2025-12-04T09:34:53.3612569Z Submodule 'libkineto/third_party/dynolog' (https://github.com/facebookincubator/dynolog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:34:53.3614581Z Submodule 'libkineto/third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:34:53.3617999Z Submodule 'libkineto/third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:34:53.3648187Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog'... 2025-12-04T09:34:54.4064602Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/fmt'... 2025-12-04T09:34:55.0429971Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/googletest'... 2025-12-04T09:34:55.1543294Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out 'd2ffe0a4e3acace628db49974246b66fc3e85fb1' 2025-12-04T09:34:55.1563316Z Submodule 'third_party/DCGM' (https://github.com/NVIDIA/DCGM.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:34:55.1566353Z Submodule 'third_party/cpr' (https://github.com/libcpr/cpr.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:34:55.1569555Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:34:55.1573379Z Submodule 'third_party/gflags' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:34:55.1576485Z Submodule 'third_party/glog' (https://github.com/google/glog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:34:55.1580254Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:34:55.1583858Z Submodule 'third_party/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:34:55.1587564Z Submodule 'third_party/pfs' (https://github.com/dtrugman/pfs.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:34:55.1592087Z Submodule 'third_party/prometheus-cpp' (https://github.com/jupp0r/prometheus-cpp.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:34:55.1625601Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM'... 2025-12-04T09:34:57.1303427Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/pfs'... 2025-12-04T09:34:57.1304847Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags'... 2025-12-04T09:34:57.1306317Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp'... 2025-12-04T09:34:57.1307712Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/cpr'... 2025-12-04T09:34:57.1309059Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/glog'... 2025-12-04T09:34:57.1310443Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/googletest'... 2025-12-04T09:34:57.1312079Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/fmt'... 2025-12-04T09:34:57.2305162Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/json'... 2025-12-04T09:35:04.2572389Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-12-04T09:35:04.2780976Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-12-04T09:35:04.3197222Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-12-04T09:35:04.3354806Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-12-04T09:35:04.3372573Z Submodule 'doc' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:04.3402200Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc'... 2025-12-04T09:35:04.6114528Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-12-04T09:35:04.6328013Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-12-04T09:35:04.6843245Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:35:04.7981440Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-12-04T09:35:04.8178371Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-12-04T09:35:04.8375634Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp': checked out 'b1234816facfdda29845c46696a02998a4af115a' 2025-12-04T09:35:04.8394215Z Submodule 'civetweb' (https://github.com/civetweb/civetweb.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:04.8397252Z Submodule 'googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:04.8426270Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb'... 2025-12-04T09:35:07.1026306Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest'... 2025-12-04T09:35:07.3903126Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'd7ba35bbb649209c66e582d5a0244ba988a15159' 2025-12-04T09:35:07.4444109Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T09:35:07.4808445Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-12-04T09:35:07.5333184Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T09:35:07.5935917Z Submodule path 'third_party/kleidiai': checked out 'd7770c89632329a9914ef1a90289917597639cbe' 2025-12-04T09:35:07.6372465Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-12-04T09:35:07.7537796Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-12-04T09:35:08.2227638Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-12-04T09:35:08.2269168Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:08.2299646Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/pybind11'... 2025-12-04T09:35:09.1605005Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-12-04T09:35:09.2411001Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-12-04T09:35:09.2432161Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark) registered for path 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:09.2434953Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:09.2437889Z Submodule 'third_party/ms-gsl' (https://github.com/microsoft/GSL) registered for path 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:09.2441177Z Submodule 'third_party/nlohmann-json' (https://github.com/nlohmann/json) registered for path 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:09.2444700Z Submodule 'third_party/opentelemetry-proto' (https://github.com/open-telemetry/opentelemetry-proto) registered for path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:09.2447936Z Submodule 'third_party/opentracing-cpp' (https://github.com/opentracing/opentracing-cpp.git) registered for path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:09.2451433Z Submodule 'third_party/prometheus-cpp' (https://github.com/jupp0r/prometheus-cpp) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:09.2454767Z Submodule 'tools/vcpkg' (https://github.com/Microsoft/vcpkg) registered for path 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:09.2486617Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/benchmark'... 2025-12-04T09:35:09.6825667Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentracing-cpp'... 2025-12-04T09:35:09.6827131Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentelemetry-proto'... 2025-12-04T09:35:09.6828462Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/ms-gsl'... 2025-12-04T09:35:09.6829703Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp'... 2025-12-04T09:35:09.7826940Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/googletest'... 2025-12-04T09:35:10.5228227Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/nlohmann-json'... 2025-12-04T09:35:18.2030872Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/tools/vcpkg'... 2025-12-04T09:35:18.9425502Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-12-04T09:35:18.9888866Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-12-04T09:35:19.0079652Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-12-04T09:35:19.1265784Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-12-04T09:35:19.1422963Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-12-04T09:35:19.1593673Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-12-04T09:35:19.1777403Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-12-04T09:35:19.1795985Z Submodule 'civetweb' (https://github.com/civetweb/civetweb.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:19.1798972Z Submodule 'googletest' (https://github.com/google/googletest.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:19.1827435Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb'... 2025-12-04T09:35:21.4296989Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest'... 2025-12-04T09:35:21.7154582Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-12-04T09:35:21.7702353Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T09:35:22.3230072Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-12-04T09:35:22.3369442Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-12-04T09:35:22.6460269Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-12-04T09:35:22.6486089Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:22.6489242Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:22.6519847Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/benchmark'... 2025-12-04T09:35:23.1962720Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/googletest'... 2025-12-04T09:35:23.6914902Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-12-04T09:35:23.7737265Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-12-04T09:35:23.7847403Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-12-04T09:35:23.7992207Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-12-04T09:35:23.8480698Z Submodule path 'third_party/pybind11': checked out 'f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8' 2025-12-04T09:35:23.8811936Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2025-12-04T09:35:23.9309012Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-12-04T09:35:23.9621954Z Submodule path 'third_party/tensorpipe': checked out '2b4cd91092d335a697416b2a3cb398283246849d' 2025-12-04T09:35:23.9641182Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:23.9644174Z Submodule 'third_party/libnop' (https://github.com/google/libnop.git) registered for path 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:23.9647419Z Submodule 'third_party/libuv' (https://github.com/libuv/libuv.git) registered for path 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:23.9650544Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:23.9680654Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/googletest'... 2025-12-04T09:35:25.2175898Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libnop'... 2025-12-04T09:35:25.2177045Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11'... 2025-12-04T09:35:25.2592762Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libuv'... 2025-12-04T09:35:25.3245030Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-12-04T09:35:25.3423737Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-12-04T09:35:25.4257597Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-12-04T09:35:25.4585954Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-12-04T09:35:25.4604550Z Submodule 'tools/clang' (https://github.com/wjakob/clang-cindex-python3) registered for path 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:25.4632704Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11/tools/clang'... 2025-12-04T09:35:25.6582900Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-12-04T09:35:25.6624920Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-12-04T09:35:25.6961205Z Entering 'android/libs/fbjni' 2025-12-04T09:35:25.7010217Z Entering 'third_party/FP16' 2025-12-04T09:35:25.7058575Z Entering 'third_party/FXdiv' 2025-12-04T09:35:25.7106454Z Entering 'third_party/NNPACK' 2025-12-04T09:35:25.7154778Z Entering 'third_party/NVTX' 2025-12-04T09:35:25.7203041Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:35:25.7249980Z Entering 'third_party/XNNPACK' 2025-12-04T09:35:25.7315330Z Entering 'third_party/aiter' 2025-12-04T09:35:25.7362969Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:35:25.7419142Z Entering 'third_party/benchmark' 2025-12-04T09:35:25.7465851Z Entering 'third_party/composable_kernel' 2025-12-04T09:35:25.7522265Z Entering 'third_party/cpp-httplib' 2025-12-04T09:35:25.7568778Z Entering 'third_party/cpuinfo' 2025-12-04T09:35:25.7616502Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:35:25.7663952Z Entering 'third_party/cutlass' 2025-12-04T09:35:25.7719469Z Entering 'third_party/fbgemm' 2025-12-04T09:35:25.7772538Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:35:25.7819255Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:35:25.7874760Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:35:25.7921072Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:35:25.7975118Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:35:25.8022215Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:35:25.8068438Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:35:25.8118014Z Entering 'third_party/flash-attention' 2025-12-04T09:35:25.8165581Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:35:25.8218007Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:35:25.8273080Z Entering 'third_party/flatbuffers' 2025-12-04T09:35:25.8324886Z Entering 'third_party/fmt' 2025-12-04T09:35:25.8372061Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:35:25.8419493Z Entering 'third_party/gloo' 2025-12-04T09:35:25.8468376Z Entering 'third_party/googletest' 2025-12-04T09:35:25.8517227Z Entering 'third_party/ideep' 2025-12-04T09:35:25.8563374Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:35:25.8618103Z Entering 'third_party/ittapi' 2025-12-04T09:35:25.8665424Z Entering 'third_party/kineto' 2025-12-04T09:35:25.8711994Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:35:25.8757596Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:35:25.8805969Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:35:25.8853015Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:35:25.8899472Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:35:25.8943961Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:25.8991881Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:35:25.9037103Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:35:25.9086963Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:35:25.9134138Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:35:25.9179621Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:35:25.9226610Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:25.9274711Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:25.9326600Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:35:25.9373244Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:35:25.9420659Z Entering 'third_party/kleidiai' 2025-12-04T09:35:25.9468612Z Entering 'third_party/mimalloc' 2025-12-04T09:35:25.9515868Z Entering 'third_party/nlohmann' 2025-12-04T09:35:25.9563917Z Entering 'third_party/onnx' 2025-12-04T09:35:25.9631450Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:25.9680044Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:35:25.9731221Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:25.9776179Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:25.9822457Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:25.9867744Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:25.9913865Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:25.9958924Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:26.0004172Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:26.0048720Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:26.0096874Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:26.0143943Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:26.0213397Z Entering 'third_party/pocketfft' 2025-12-04T09:35:26.0259932Z Entering 'third_party/protobuf' 2025-12-04T09:35:26.0310890Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:26.0356842Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:26.0404523Z Entering 'third_party/psimd' 2025-12-04T09:35:26.0451335Z Entering 'third_party/pthreadpool' 2025-12-04T09:35:26.0498811Z Entering 'third_party/pybind11' 2025-12-04T09:35:26.0546472Z Entering 'third_party/python-peachpy' 2025-12-04T09:35:26.0594933Z Entering 'third_party/sleef' 2025-12-04T09:35:26.0641899Z Entering 'third_party/tensorpipe' 2025-12-04T09:35:26.0688307Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:26.0733628Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:26.0778805Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:26.0825037Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:26.0868980Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:26.0929249Z ##[endgroup] 2025-12-04T09:35:26.0929791Z ##[group]Persisting credentials for submodules 2025-12-04T09:35:26.0936877Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-12-04T09:35:26.1268262Z Entering 'android/libs/fbjni' 2025-12-04T09:35:26.1330396Z Entering 'third_party/FP16' 2025-12-04T09:35:26.1393713Z Entering 'third_party/FXdiv' 2025-12-04T09:35:26.1456096Z Entering 'third_party/NNPACK' 2025-12-04T09:35:26.1518873Z Entering 'third_party/NVTX' 2025-12-04T09:35:26.1583188Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:35:26.1644965Z Entering 'third_party/XNNPACK' 2025-12-04T09:35:26.1723377Z Entering 'third_party/aiter' 2025-12-04T09:35:26.1785747Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:35:26.1856110Z Entering 'third_party/benchmark' 2025-12-04T09:35:26.1918962Z Entering 'third_party/composable_kernel' 2025-12-04T09:35:26.1991192Z Entering 'third_party/cpp-httplib' 2025-12-04T09:35:26.2052590Z Entering 'third_party/cpuinfo' 2025-12-04T09:35:26.2115027Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:35:26.2177353Z Entering 'third_party/cutlass' 2025-12-04T09:35:26.2249661Z Entering 'third_party/fbgemm' 2025-12-04T09:35:26.2314338Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:35:26.2375764Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:35:26.2444293Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:35:26.2505561Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:35:26.2573867Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:35:26.2634664Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:35:26.2694814Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:35:26.2758523Z Entering 'third_party/flash-attention' 2025-12-04T09:35:26.2821572Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:35:26.2886852Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:35:26.2956328Z Entering 'third_party/flatbuffers' 2025-12-04T09:35:26.3021584Z Entering 'third_party/fmt' 2025-12-04T09:35:26.3083646Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:35:26.3145910Z Entering 'third_party/gloo' 2025-12-04T09:35:26.3209587Z Entering 'third_party/googletest' 2025-12-04T09:35:26.3271780Z Entering 'third_party/ideep' 2025-12-04T09:35:26.3332334Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:35:26.3402514Z Entering 'third_party/ittapi' 2025-12-04T09:35:26.3464975Z Entering 'third_party/kineto' 2025-12-04T09:35:26.3527483Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:35:26.3589393Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:35:26.3651527Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:35:26.3712564Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:35:26.3773611Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:35:26.3834043Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:26.3898743Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:35:26.3959332Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:35:26.4019936Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:35:26.4081027Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:35:26.4143571Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:35:26.4204372Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:26.4267275Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:26.4334164Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:35:26.4394236Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:35:26.4456209Z Entering 'third_party/kleidiai' 2025-12-04T09:35:26.4520147Z Entering 'third_party/mimalloc' 2025-12-04T09:35:26.4582213Z Entering 'third_party/nlohmann' 2025-12-04T09:35:26.4645493Z Entering 'third_party/onnx' 2025-12-04T09:35:26.4726862Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:26.4790050Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:35:26.4852991Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:26.4912972Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:26.4973528Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:26.5034412Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:26.5095373Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:26.5154836Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:26.5215575Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:26.5274778Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:26.5336761Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:26.5400011Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:26.5482161Z Entering 'third_party/pocketfft' 2025-12-04T09:35:26.5544406Z Entering 'third_party/protobuf' 2025-12-04T09:35:26.5610110Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:26.5670836Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:26.5733888Z Entering 'third_party/psimd' 2025-12-04T09:35:26.5796885Z Entering 'third_party/pthreadpool' 2025-12-04T09:35:26.5858457Z Entering 'third_party/pybind11' 2025-12-04T09:35:26.5921380Z Entering 'third_party/python-peachpy' 2025-12-04T09:35:26.5983051Z Entering 'third_party/sleef' 2025-12-04T09:35:26.6045097Z Entering 'third_party/tensorpipe' 2025-12-04T09:35:26.6106799Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:26.6165926Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:26.6227623Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:26.6288066Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:26.6346711Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:26.6426933Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-12-04T09:35:26.6753615Z Entering 'android/libs/fbjni' 2025-12-04T09:35:26.6811897Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T09:35:26.6831006Z Entering 'third_party/FP16' 2025-12-04T09:35:26.6889199Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T09:35:26.6907868Z Entering 'third_party/FXdiv' 2025-12-04T09:35:26.6965364Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T09:35:26.6984357Z Entering 'third_party/NNPACK' 2025-12-04T09:35:26.7041746Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T09:35:26.7060820Z Entering 'third_party/NVTX' 2025-12-04T09:35:26.7118259Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T09:35:26.7138280Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:35:26.7197177Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T09:35:26.7215261Z Entering 'third_party/XNNPACK' 2025-12-04T09:35:26.7272347Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T09:35:26.7309022Z Entering 'third_party/aiter' 2025-12-04T09:35:26.7365855Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T09:35:26.7384785Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:35:26.7442282Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T09:35:26.7470118Z Entering 'third_party/benchmark' 2025-12-04T09:35:26.7529037Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:35:26.7547824Z Entering 'third_party/composable_kernel' 2025-12-04T09:35:26.7607056Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T09:35:26.7634306Z Entering 'third_party/cpp-httplib' 2025-12-04T09:35:26.7692961Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T09:35:26.7711025Z Entering 'third_party/cpuinfo' 2025-12-04T09:35:26.7768609Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T09:35:26.7788132Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:35:26.7846741Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T09:35:26.7865119Z Entering 'third_party/cutlass' 2025-12-04T09:35:26.7924004Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T09:35:26.7952305Z Entering 'third_party/fbgemm' 2025-12-04T09:35:26.8011927Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T09:35:26.8031597Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:35:26.8088999Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T09:35:26.8106937Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:35:26.8164965Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T09:35:26.8192400Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:35:26.8250293Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T09:35:26.8268154Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:35:26.8325936Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T09:35:26.8353320Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:35:26.8412121Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T09:35:26.8429606Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:35:26.8487074Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T09:35:26.8504460Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:35:26.8561534Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T09:35:26.8582742Z Entering 'third_party/flash-attention' 2025-12-04T09:35:26.8641230Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T09:35:26.8659547Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:35:26.8717490Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T09:35:26.8741818Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:35:26.8799960Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T09:35:26.8827990Z Entering 'third_party/flatbuffers' 2025-12-04T09:35:26.8885534Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T09:35:26.8908086Z Entering 'third_party/fmt' 2025-12-04T09:35:26.8965933Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T09:35:26.8984483Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:35:26.9042755Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T09:35:26.9061039Z Entering 'third_party/gloo' 2025-12-04T09:35:26.9119042Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T09:35:26.9137176Z Entering 'third_party/googletest' 2025-12-04T09:35:26.9194952Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:35:26.9212979Z Entering 'third_party/ideep' 2025-12-04T09:35:26.9270349Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T09:35:26.9287094Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:35:26.9344211Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T09:35:26.9370396Z Entering 'third_party/ittapi' 2025-12-04T09:35:26.9429007Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T09:35:26.9448037Z Entering 'third_party/kineto' 2025-12-04T09:35:26.9506186Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T09:35:26.9523790Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:35:26.9581241Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T09:35:26.9598709Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:35:26.9655572Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T09:35:26.9674450Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:35:26.9734018Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T09:35:26.9752357Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:35:26.9810878Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T09:35:26.9828184Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:35:26.9886326Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T09:35:26.9902786Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:26.9960129Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T09:35:26.9979547Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:35:27.0039113Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T09:35:27.0057098Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:35:27.0114980Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:35:27.0132555Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:35:27.0191077Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T09:35:27.0209606Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:35:27.0267382Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T09:35:27.0285517Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:35:27.0343417Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T09:35:27.0360060Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:27.0417847Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T09:35:27.0438216Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:27.0497097Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T09:35:27.0518653Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:35:27.0575470Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T09:35:27.0592996Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:35:27.0648751Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T09:35:27.0668110Z Entering 'third_party/kleidiai' 2025-12-04T09:35:27.0726497Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T09:35:27.0744959Z Entering 'third_party/mimalloc' 2025-12-04T09:35:27.0801303Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T09:35:27.0819980Z Entering 'third_party/nlohmann' 2025-12-04T09:35:27.0877991Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T09:35:27.0899162Z Entering 'third_party/onnx' 2025-12-04T09:35:27.0956292Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T09:35:27.0993885Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:27.1051625Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:35:27.1071869Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:35:27.1130951Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T09:35:27.1149924Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:27.1206323Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:35:27.1223508Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:27.1280119Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:35:27.1297633Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:27.1354119Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T09:35:27.1370936Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:27.1427272Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T09:35:27.1446217Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:27.1502518Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T09:35:27.1519331Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:27.1575948Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T09:35:27.1594011Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:27.1650544Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T09:35:27.1666847Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:27.1724111Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T09:35:27.1743314Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:27.1801999Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T09:35:27.1821360Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:27.1877870Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T09:35:27.1918431Z Entering 'third_party/pocketfft' 2025-12-04T09:35:27.1975855Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T09:35:27.1994129Z Entering 'third_party/protobuf' 2025-12-04T09:35:27.2051199Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T09:35:27.2071830Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:27.2128516Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T09:35:27.2146180Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:27.2202513Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:35:27.2221855Z Entering 'third_party/psimd' 2025-12-04T09:35:27.2278290Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T09:35:27.2296686Z Entering 'third_party/pthreadpool' 2025-12-04T09:35:27.2353927Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T09:35:27.2371501Z Entering 'third_party/pybind11' 2025-12-04T09:35:27.2428484Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:35:27.2447888Z Entering 'third_party/python-peachpy' 2025-12-04T09:35:27.2506244Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T09:35:27.2524331Z Entering 'third_party/sleef' 2025-12-04T09:35:27.2581624Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T09:35:27.2600937Z Entering 'third_party/tensorpipe' 2025-12-04T09:35:27.2657447Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T09:35:27.2675414Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:27.2732571Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T09:35:27.2750246Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:27.2807511Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T09:35:27.2824425Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:27.2880468Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T09:35:27.2899578Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:27.2955904Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T09:35:27.2971657Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:27.3029567Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T09:35:27.3954525Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-12-04T09:35:27.4283747Z Entering 'android/libs/fbjni' 2025-12-04T09:35:27.4330561Z Entering 'third_party/FP16' 2025-12-04T09:35:27.4376998Z Entering 'third_party/FXdiv' 2025-12-04T09:35:27.4427392Z Entering 'third_party/NNPACK' 2025-12-04T09:35:27.4473683Z Entering 'third_party/NVTX' 2025-12-04T09:35:27.4520754Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:35:27.4567209Z Entering 'third_party/XNNPACK' 2025-12-04T09:35:27.4632538Z Entering 'third_party/aiter' 2025-12-04T09:35:27.4682166Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:35:27.4737795Z Entering 'third_party/benchmark' 2025-12-04T09:35:27.4785462Z Entering 'third_party/composable_kernel' 2025-12-04T09:35:27.4840078Z Entering 'third_party/cpp-httplib' 2025-12-04T09:35:27.4886279Z Entering 'third_party/cpuinfo' 2025-12-04T09:35:27.4933861Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:35:27.4980220Z Entering 'third_party/cutlass' 2025-12-04T09:35:27.5035826Z Entering 'third_party/fbgemm' 2025-12-04T09:35:27.5086539Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:35:27.5132042Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:35:27.5188904Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:35:27.5233713Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:35:27.5289088Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:35:27.5334629Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:35:27.5379509Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:35:27.5428595Z Entering 'third_party/flash-attention' 2025-12-04T09:35:27.5475896Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:35:27.5528184Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:35:27.5585510Z Entering 'third_party/flatbuffers' 2025-12-04T09:35:27.5637190Z Entering 'third_party/fmt' 2025-12-04T09:35:27.5684206Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:35:27.5731797Z Entering 'third_party/gloo' 2025-12-04T09:35:27.5779203Z Entering 'third_party/googletest' 2025-12-04T09:35:27.5826879Z Entering 'third_party/ideep' 2025-12-04T09:35:27.5871683Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:35:27.5926954Z Entering 'third_party/ittapi' 2025-12-04T09:35:27.5973686Z Entering 'third_party/kineto' 2025-12-04T09:35:27.6021821Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:35:27.6068573Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:35:27.6116719Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:35:27.6164071Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:35:27.6209938Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:35:27.6255072Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:27.6306461Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:35:27.6352462Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:35:27.6398982Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:35:27.6445726Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:35:27.6493103Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:35:27.6538182Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:27.6591618Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:27.6642273Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:35:27.6687884Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:35:27.6735034Z Entering 'third_party/kleidiai' 2025-12-04T09:35:27.6783669Z Entering 'third_party/mimalloc' 2025-12-04T09:35:27.6831223Z Entering 'third_party/nlohmann' 2025-12-04T09:35:27.6881523Z Entering 'third_party/onnx' 2025-12-04T09:35:27.6947446Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:27.6997092Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:35:27.7046155Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:27.7091787Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:27.7137550Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:27.7183546Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:27.7231413Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:27.7276372Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:27.7322855Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:27.7367073Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:27.7416398Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:27.7462476Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:27.7531267Z Entering 'third_party/pocketfft' 2025-12-04T09:35:27.7578173Z Entering 'third_party/protobuf' 2025-12-04T09:35:27.7628989Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:27.7676424Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:27.7724062Z Entering 'third_party/psimd' 2025-12-04T09:35:27.7771584Z Entering 'third_party/pthreadpool' 2025-12-04T09:35:27.7819595Z Entering 'third_party/pybind11' 2025-12-04T09:35:27.7866321Z Entering 'third_party/python-peachpy' 2025-12-04T09:35:27.7914169Z Entering 'third_party/sleef' 2025-12-04T09:35:27.7962759Z Entering 'third_party/tensorpipe' 2025-12-04T09:35:27.8010062Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:27.8054928Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:27.8102054Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:27.8146695Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:27.8194954Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:27.8258674Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-12-04T09:35:27.8591323Z Entering 'android/libs/fbjni' 2025-12-04T09:35:27.8637349Z Entering 'third_party/FP16' 2025-12-04T09:35:27.8688238Z Entering 'third_party/FXdiv' 2025-12-04T09:35:27.8735003Z Entering 'third_party/NNPACK' 2025-12-04T09:35:27.8782951Z Entering 'third_party/NVTX' 2025-12-04T09:35:27.8832636Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:35:27.8880702Z Entering 'third_party/XNNPACK' 2025-12-04T09:35:27.8944140Z Entering 'third_party/aiter' 2025-12-04T09:35:27.8992444Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:35:27.9046775Z Entering 'third_party/benchmark' 2025-12-04T09:35:27.9096630Z Entering 'third_party/composable_kernel' 2025-12-04T09:35:27.9152917Z Entering 'third_party/cpp-httplib' 2025-12-04T09:35:27.9201185Z Entering 'third_party/cpuinfo' 2025-12-04T09:35:27.9247860Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:35:27.9295017Z Entering 'third_party/cutlass' 2025-12-04T09:35:27.9350963Z Entering 'third_party/fbgemm' 2025-12-04T09:35:27.9403154Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:35:27.9448036Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:35:27.9503695Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:35:27.9548765Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:35:27.9605611Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:35:27.9650710Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:35:27.9695698Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:35:27.9744452Z Entering 'third_party/flash-attention' 2025-12-04T09:35:27.9792057Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:35:27.9846192Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:35:27.9902429Z Entering 'third_party/flatbuffers' 2025-12-04T09:35:27.9952149Z Entering 'third_party/fmt' 2025-12-04T09:35:27.9999730Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:35:28.0047656Z Entering 'third_party/gloo' 2025-12-04T09:35:28.0095699Z Entering 'third_party/googletest' 2025-12-04T09:35:28.0142867Z Entering 'third_party/ideep' 2025-12-04T09:35:28.0188500Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:35:28.0242471Z Entering 'third_party/ittapi' 2025-12-04T09:35:28.0291598Z Entering 'third_party/kineto' 2025-12-04T09:35:28.0338538Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:35:28.0385153Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:35:28.0432894Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:35:28.0479404Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:35:28.0525460Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:35:28.0570103Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:28.0621874Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:35:28.0667746Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:35:28.0713765Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:35:28.0761093Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:35:28.0806405Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:35:28.0851954Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:28.0901184Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:28.0951710Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:35:28.0998075Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:35:28.1045695Z Entering 'third_party/kleidiai' 2025-12-04T09:35:28.1095304Z Entering 'third_party/mimalloc' 2025-12-04T09:35:28.1144456Z Entering 'third_party/nlohmann' 2025-12-04T09:35:28.1195006Z Entering 'third_party/onnx' 2025-12-04T09:35:28.1262540Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:28.1312470Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:35:28.1363279Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:28.1410184Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:28.1457649Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:28.1503647Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:28.1550106Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:28.1596682Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:28.1641874Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:28.1686871Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:28.1735234Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:28.1783905Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:28.1851035Z Entering 'third_party/pocketfft' 2025-12-04T09:35:28.1898643Z Entering 'third_party/protobuf' 2025-12-04T09:35:28.1949384Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:28.1997174Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:28.2044919Z Entering 'third_party/psimd' 2025-12-04T09:35:28.2094229Z Entering 'third_party/pthreadpool' 2025-12-04T09:35:28.2140716Z Entering 'third_party/pybind11' 2025-12-04T09:35:28.2188551Z Entering 'third_party/python-peachpy' 2025-12-04T09:35:28.2236294Z Entering 'third_party/sleef' 2025-12-04T09:35:28.2284157Z Entering 'third_party/tensorpipe' 2025-12-04T09:35:28.2330951Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:28.2376278Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:28.2422547Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:28.2469362Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:28.2514029Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:28.2574646Z ##[endgroup] 2025-12-04T09:35:28.2611686Z [command]/usr/bin/git log -1 --format=%H 2025-12-04T09:35:28.2636324Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:35:28.2743285Z ##[group]Run cd "${GITHUB_WORKSPACE}" 2025-12-04T09:35:28.2743699Z cd "${GITHUB_WORKSPACE}" 2025-12-04T09:35:28.2744184Z # Clean stale submodule dirs 2025-12-04T09:35:28.2744560Z if [ -z "${NO_SUDO}" ]; then 2025-12-04T09:35:28.2745025Z  sudo git submodule foreach --recursive git clean -ffdx 2025-12-04T09:35:28.2745469Z else 2025-12-04T09:35:28.2745821Z  git submodule foreach --recursive git clean -ffdx 2025-12-04T09:35:28.2746271Z fi 2025-12-04T09:35:28.2756505Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:28.2756955Z env: 2025-12-04T09:35:28.2757208Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:28.2757503Z NO_SUDO: true 2025-12-04T09:35:28.2757761Z ##[endgroup] 2025-12-04T09:35:28.3107798Z Entering 'android/libs/fbjni' 2025-12-04T09:35:28.3143846Z Entering 'third_party/FP16' 2025-12-04T09:35:28.3177806Z Entering 'third_party/FXdiv' 2025-12-04T09:35:28.3211477Z Entering 'third_party/NNPACK' 2025-12-04T09:35:28.3249466Z Entering 'third_party/NVTX' 2025-12-04T09:35:28.3291936Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T09:35:28.3327841Z Entering 'third_party/XNNPACK' 2025-12-04T09:35:28.3464604Z Entering 'third_party/aiter' 2025-12-04T09:35:28.3511616Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T09:35:28.3631110Z Entering 'third_party/benchmark' 2025-12-04T09:35:28.3667139Z Entering 'third_party/composable_kernel' 2025-12-04T09:35:28.3796256Z Entering 'third_party/cpp-httplib' 2025-12-04T09:35:28.3832531Z Entering 'third_party/cpuinfo' 2025-12-04T09:35:28.3872959Z Entering 'third_party/cudnn_frontend' 2025-12-04T09:35:28.3911833Z Entering 'third_party/cutlass' 2025-12-04T09:35:28.4023714Z Entering 'third_party/fbgemm' 2025-12-04T09:35:28.4092643Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T09:35:28.4127119Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T09:35:28.4254823Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T09:35:28.4292655Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T09:35:28.4402498Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T09:35:28.4438352Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T09:35:28.4470132Z Entering 'third_party/fbgemm/external/json' 2025-12-04T09:35:28.4522544Z Entering 'third_party/flash-attention' 2025-12-04T09:35:28.4566950Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T09:35:28.4675549Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T09:35:28.4775953Z Entering 'third_party/flatbuffers' 2025-12-04T09:35:28.4854187Z Entering 'third_party/fmt' 2025-12-04T09:35:28.4890741Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T09:35:28.4927585Z Entering 'third_party/gloo' 2025-12-04T09:35:28.4965494Z Entering 'third_party/googletest' 2025-12-04T09:35:28.5006185Z Entering 'third_party/ideep' 2025-12-04T09:35:28.5042686Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T09:35:28.5138517Z Entering 'third_party/ittapi' 2025-12-04T09:35:28.5176829Z Entering 'third_party/kineto' 2025-12-04T09:35:28.5216458Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T09:35:28.5256743Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T09:35:28.5308328Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T09:35:28.5343842Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T09:35:28.5379760Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T09:35:28.5412158Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T09:35:28.5447438Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T09:35:28.5482381Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T09:35:28.5520653Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T09:35:28.5566882Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T09:35:28.5603360Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T09:35:28.5637964Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:28.5694521Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:28.5736570Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T09:35:28.5772778Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T09:35:28.5810235Z Entering 'third_party/kleidiai' 2025-12-04T09:35:28.5852878Z Entering 'third_party/mimalloc' 2025-12-04T09:35:28.5890658Z Entering 'third_party/nlohmann' 2025-12-04T09:35:28.5939966Z Entering 'third_party/onnx' 2025-12-04T09:35:28.6300938Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T09:35:28.6339868Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T09:35:28.6400420Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T09:35:28.6434647Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T09:35:28.6470989Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T09:35:28.6503956Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T09:35:28.6550262Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T09:35:28.6583434Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T09:35:28.6618013Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T09:35:28.6651723Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T09:35:28.6702937Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T09:35:28.6740921Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T09:35:28.7022278Z Entering 'third_party/pocketfft' 2025-12-04T09:35:28.7055171Z Entering 'third_party/protobuf' 2025-12-04T09:35:28.7138907Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T09:35:28.7171873Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T09:35:28.7211109Z Entering 'third_party/psimd' 2025-12-04T09:35:28.7244560Z Entering 'third_party/pthreadpool' 2025-12-04T09:35:28.7279543Z Entering 'third_party/pybind11' 2025-12-04T09:35:28.7325529Z Entering 'third_party/python-peachpy' 2025-12-04T09:35:28.7359496Z Entering 'third_party/sleef' 2025-12-04T09:35:28.7396636Z Entering 'third_party/tensorpipe' 2025-12-04T09:35:28.7433848Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T09:35:28.7469423Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T09:35:28.7503038Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T09:35:28.7541983Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T09:35:28.7574769Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T09:35:28.7749372Z Prepare all required actions 2025-12-04T09:35:28.7750006Z Getting action download info 2025-12-04T09:35:28.9277666Z ##[group]Run ./.github/actions/setup-linux 2025-12-04T09:35:28.9278029Z env: 2025-12-04T09:35:28.9278283Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:28.9278586Z ##[endgroup] 2025-12-04T09:35:28.9322106Z ##[group]Run set -euo pipefail 2025-12-04T09:35:28.9322533Z set -euo pipefail 2025-12-04T09:35:28.9322877Z function get_ec2_metadata() { 2025-12-04T09:35:28.9323326Z  # Pulled from instance metadata endpoint for EC2 2025-12-04T09:35:28.9324066Z  # see https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/instancedata-data-retrieval.html 2025-12-04T09:35:28.9324730Z  category=$1 2025-12-04T09:35:28.9325147Z  # If it is GCP runner (runner name contains gcp), do not run this 2025-12-04T09:35:28.9325647Z  runner_name_str=i-0144021b78afd54ea 2025-12-04T09:35:28.9326093Z  if [[ -f /.inarc ]]; then 2025-12-04T09:35:28.9326487Z  echo "ARC Runner, no info on ec2 metadata" 2025-12-04T09:35:28.9326944Z  elif [[ $runner_name_str == *"gcp"* ]]; then 2025-12-04T09:35:28.9327493Z  echo "Runner is from Google Cloud Platform, No info on ec2 metadata" 2025-12-04T09:35:28.9328004Z  else 2025-12-04T09:35:28.9329016Z  curl -H "X-aws-ec2-metadata-token: $(curl -s -X PUT "http://169.254.169.254/latest/api/token" -H "X-aws-ec2-metadata-token-ttl-seconds: 30")" -fsSL "http://169.254.169.254/latest/meta-data/${category}" 2025-12-04T09:35:28.9330118Z  fi 2025-12-04T09:35:28.9330370Z } 2025-12-04T09:35:28.9330677Z echo "ami-id: $(get_ec2_metadata ami-id)" 2025-12-04T09:35:28.9331170Z echo "instance-id: $(get_ec2_metadata instance-id)" 2025-12-04T09:35:28.9331738Z echo "instance-type: $(get_ec2_metadata instance-type)" 2025-12-04T09:35:28.9332228Z echo "system info $(uname -a)" 2025-12-04T09:35:28.9340014Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:28.9340458Z env: 2025-12-04T09:35:28.9340711Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:28.9340999Z ##[endgroup] 2025-12-04T09:35:28.9497533Z ami-id: ami-08982f1c5bf93d976 2025-12-04T09:35:28.9645887Z instance-id: i-0144021b78afd54ea 2025-12-04T09:35:28.9752365Z instance-type: g4dn.4xlarge 2025-12-04T09:35:28.9763921Z system info Linux ip-10-0-6-82.ec2.internal 6.1.150-174.273.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Sep 9 12:21:26 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-12-04T09:35:28.9795711Z ##[group]Run if [ -f /usr/bin/nvidia-smi ]; then nvidia-smi; fi 2025-12-04T09:35:28.9796889Z if [ -f /usr/bin/nvidia-smi ]; then nvidia-smi; fi 2025-12-04T09:35:28.9808611Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:28.9809459Z env: 2025-12-04T09:35:28.9809931Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:28.9810512Z ##[endgroup] 2025-12-04T09:35:30.3442706Z Thu Dec 4 09:35:30 2025 2025-12-04T09:35:30.3443429Z +-----------------------------------------------------------------------------------------+ 2025-12-04T09:35:30.3444061Z | NVIDIA-SMI 580.82.07 Driver Version: 580.82.07 CUDA Version: 13.0 | 2025-12-04T09:35:30.3444697Z +-----------------------------------------+------------------------+----------------------+ 2025-12-04T09:35:30.3445344Z | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | 2025-12-04T09:35:30.3446020Z | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | 2025-12-04T09:35:30.3446553Z | | | MIG M. | 2025-12-04T09:35:30.3446972Z |=========================================+========================+======================| 2025-12-04T09:35:30.3544739Z | 0 Tesla T4 Off | 00000000:00:1E.0 Off | 0 | 2025-12-04T09:35:30.3545910Z | N/A 30C P0 25W / 70W | 0MiB / 15360MiB | 9% Default | 2025-12-04T09:35:30.3546677Z | | | N/A | 2025-12-04T09:35:30.3547184Z +-----------------------------------------+------------------------+----------------------+ 2025-12-04T09:35:30.3547574Z 2025-12-04T09:35:30.3547800Z +-----------------------------------------------------------------------------------------+ 2025-12-04T09:35:30.3548491Z | Processes: | 2025-12-04T09:35:30.3549404Z | GPU GI CI PID Type Process name GPU Memory | 2025-12-04T09:35:30.3549927Z | ID ID Usage | 2025-12-04T09:35:30.3550356Z |=========================================================================================| 2025-12-04T09:35:30.3550909Z | No running processes found | 2025-12-04T09:35:30.3551510Z +-----------------------------------------------------------------------------------------+ 2025-12-04T09:35:30.7681437Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:35:30.7682721Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:35:30.7691440Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:30.7691894Z env: 2025-12-04T09:35:30.7692136Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:30.7692444Z ##[endgroup] 2025-12-04T09:35:30.7768433Z ##[group]Run if systemctl is-active --quiet docker; then 2025-12-04T09:35:30.7768961Z if systemctl is-active --quiet docker; then 2025-12-04T09:35:30.7769422Z  echo "Docker daemon is running..."; 2025-12-04T09:35:30.7769812Z else 2025-12-04T09:35:30.7770231Z  echo "Starting docker daemon..." && sudo systemctl start docker; 2025-12-04T09:35:30.7770740Z fi 2025-12-04T09:35:30.7777392Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:30.7777838Z env: 2025-12-04T09:35:30.7778087Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:30.7778482Z ##[endgroup] 2025-12-04T09:35:30.7870242Z Docker daemon is running... 2025-12-04T09:35:30.7914437Z ##[group]Run nick-fields/retry@v3.0.0 2025-12-04T09:35:30.7914793Z with: 2025-12-04T09:35:30.7915025Z shell: bash 2025-12-04T09:35:30.7915279Z timeout_minutes: 5 2025-12-04T09:35:30.7915544Z max_attempts: 3 2025-12-04T09:35:30.7915815Z retry_wait_seconds: 30 2025-12-04T09:35:30.7918567Z command: AWS_ACCOUNT_ID=$(aws sts get-caller-identity|grep Account|cut -f4 -d\") aws ecr get-login-password --region "$AWS_DEFAULT_REGION" | docker login --username AWS \ --password-stdin "$AWS_ACCOUNT_ID.dkr.ecr.$AWS_DEFAULT_REGION.amazonaws.com" # For LF Runners we need to make sure we also login to Meta's ECR docker registry too. META_AWS_ACCOUNT_ID=308535385114 if [ "$AWS_ACCOUNT_ID" != "$META_AWS_ACCOUNT_ID" ] ; then aws ecr get-login-password --region "$AWS_DEFAULT_REGION" | docker login --username AWS \ --password-stdin "$META_AWS_ACCOUNT_ID.dkr.ecr.$AWS_DEFAULT_REGION.amazonaws.com" fi 2025-12-04T09:35:30.7921352Z polling_interval_seconds: 1 2025-12-04T09:35:30.7921682Z warning_on_retry: true 2025-12-04T09:35:30.7921974Z continue_on_error: false 2025-12-04T09:35:30.7922264Z env: 2025-12-04T09:35:30.7922500Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:30.7922806Z AWS_RETRY_MODE: standard 2025-12-04T09:35:30.7923094Z AWS_MAX_ATTEMPTS: 5 2025-12-04T09:35:30.7923388Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:35:30.7923707Z ##[endgroup] 2025-12-04T09:35:32.0467358Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-12-04T09:35:32.0468450Z Configure a credential helper to remove this warning. See 2025-12-04T09:35:32.0469142Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-12-04T09:35:32.0469603Z 2025-12-04T09:35:32.0469721Z Login Succeeded 2025-12-04T09:35:32.8867132Z Command completed after 1 attempt(s). 2025-12-04T09:35:32.8922831Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:35:32.8923499Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:35:32.8924051Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:35:32.8931654Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:32.8932106Z env: 2025-12-04T09:35:32.8932364Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:32.8932664Z ##[endgroup] 2025-12-04T09:35:32.9021032Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T09:35:32.9021697Z # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T09:35:32.9022231Z # shellcheck disable=SC2046 2025-12-04T09:35:32.9022635Z docker stop $(docker ps -q) || true 2025-12-04T09:35:32.9023052Z # Prune all of the docker images 2025-12-04T09:35:32.9023429Z docker system prune -af 2025-12-04T09:35:32.9030209Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:32.9030660Z env: 2025-12-04T09:35:32.9030921Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:32.9031230Z ##[endgroup] 2025-12-04T09:35:32.9314457Z "docker stop" requires at least 1 argument. 2025-12-04T09:35:32.9314919Z See 'docker stop --help'. 2025-12-04T09:35:32.9315229Z 2025-12-04T09:35:32.9315429Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 2025-12-04T09:35:32.9315746Z 2025-12-04T09:35:32.9315871Z Stop one or more running containers 2025-12-04T09:35:32.9531985Z Total reclaimed space: 0B 2025-12-04T09:35:32.9739167Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2025-12-04T09:35:32.9739726Z with: 2025-12-04T09:35:32.9740671Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9741744Z use-custom-docker-registry: true 2025-12-04T09:35:32.9742111Z docker-build-dir: .ci/docker 2025-12-04T09:35:32.9742440Z docker-build-script: ./build.sh 2025-12-04T09:35:32.9742781Z working-directory: . 2025-12-04T09:35:32.9743330Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:32.9743784Z force-push: false 2025-12-04T09:35:32.9744047Z env: 2025-12-04T09:35:32.9744290Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:32.9744576Z ##[endgroup] 2025-12-04T09:35:32.9765354Z ##[group]Run set -ex 2025-12-04T09:35:32.9765685Z set -ex 2025-12-04T09:35:32.9765942Z  2025-12-04T09:35:32.9766444Z # If the docker build directory or the build script doesn't exist, the action will 2025-12-04T09:35:32.9767227Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2025-12-04T09:35:32.9767898Z # job could then download the pre-built image as usual 2025-12-04T09:35:32.9768711Z if [[ -d "${DOCKER_BUILD_DIR}" ]] && [[ -f "${DOCKER_BUILD_DIR}/${DOCKER_BUILD_SCRIPT}" ]] && [[ "${USE_CUSTOM_DOCKER_REGISTRY}" == "true" ]]; then 2025-12-04T09:35:32.9769460Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9769863Z else 2025-12-04T09:35:32.9770161Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9770683Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9771147Z  2025-12-04T09:35:32.9771799Z  echo "Not using custom ECR registry. Either it was not requested or there is no Docker build script in the ${REPO_NAME} repo..." 2025-12-04T09:35:32.9772559Z  exit 0 2025-12-04T09:35:32.9772802Z fi 2025-12-04T09:35:32.9773043Z  2025-12-04T09:35:32.9773434Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2025-12-04T09:35:32.9774146Z  # The docker image name already includes the ECR prefix and tag, so we can just 2025-12-04T09:35:32.9774764Z  # use it as it is, but first let's extract the tag 2025-12-04T09:35:32.9775325Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2025-12-04T09:35:32.9775930Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9776488Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9776961Z else 2025-12-04T09:35:32.9777260Z  if [[ "${DOCKER_IMAGE_NAME}" == *:* ]]; then 2025-12-04T09:35:32.9777706Z  CUSTOM_TAG_PREFIX=${DOCKER_IMAGE_NAME#*:} 2025-12-04T09:35:32.9778275Z  DOCKER_IMAGE_NAME=${DOCKER_IMAGE_NAME%%:*} 2025-12-04T09:35:32.9778684Z  fi 2025-12-04T09:35:32.9779221Z  DOCKER_TAG=${CUSTOM_TAG_PREFIX:+${CUSTOM_TAG_PREFIX}-}$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2025-12-04T09:35:32.9779934Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9780700Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9781541Z  echo "custom-tag-prefix=${CUSTOM_TAG_PREFIX}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9782060Z fi 2025-12-04T09:35:32.9789216Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:32.9789657Z env: 2025-12-04T09:35:32.9789902Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:32.9790196Z REPO_NAME: pytorch 2025-12-04T09:35:32.9791314Z DOCKER_IMAGE_NAME: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9792365Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T09:35:32.9792701Z DOCKER_BUILD_SCRIPT: ./build.sh 2025-12-04T09:35:32.9793132Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:32.9793607Z USE_CUSTOM_DOCKER_REGISTRY: true 2025-12-04T09:35:32.9793946Z CUSTOM_TAG_PREFIX: 2025-12-04T09:35:32.9794208Z ##[endgroup] 2025-12-04T09:35:32.9820919Z + [[ -d .ci/docker ]] 2025-12-04T09:35:32.9821228Z + [[ -f .ci/docker/./build.sh ]] 2025-12-04T09:35:32.9821708Z + [[ true == \t\r\u\e ]] 2025-12-04T09:35:32.9822017Z + echo skip=false 2025-12-04T09:35:32.9823278Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2025-12-04T09:35:32.9828725Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9829715Z ++ awk -F '[:,]' '{print $2}' 2025-12-04T09:35:32.9853304Z + DOCKER_TAG=pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9854358Z + echo docker-tag=pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9855880Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9880778Z ##[group]Run set +e 2025-12-04T09:35:32.9881123Z set +e 2025-12-04T09:35:32.9881369Z set -x 2025-12-04T09:35:32.9881622Z  2025-12-04T09:35:32.9881860Z login() { 2025-12-04T09:35:32.9882408Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T09:35:32.9883028Z } 2025-12-04T09:35:32.9883276Z  2025-12-04T09:35:32.9883500Z retry () { 2025-12-04T09:35:32.9883810Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T09:35:32.9884176Z } 2025-12-04T09:35:32.9884398Z  2025-12-04T09:35:32.9884666Z retry login "${DOCKER_REGISTRY}" 2025-12-04T09:35:32.9885019Z  2025-12-04T09:35:32.9885266Z START_TIME=$(date +%s) 2025-12-04T09:35:32.9885593Z # Wait up to 120 minutes 2025-12-04T09:35:32.9886012Z while [[ $(( $(date +%s) - 7200 )) -lt $START_TIME ]]; do 2025-12-04T09:35:32.9886599Z  # Check if image already exists, if it does then skip building it 2025-12-04T09:35:32.9887164Z  if docker manifest inspect "${DOCKER_IMAGE}"; then 2025-12-04T09:35:32.9887588Z  exit 0 2025-12-04T09:35:32.9888616Z  fi 2025-12-04T09:35:32.9888855Z  2025-12-04T09:35:32.9889313Z  # NB: This flag is used by Docker build workflow to push the image to ECR, so we can 2025-12-04T09:35:32.9890101Z  # use this to differentiate between the Docker build and regular build jobs. For the 2025-12-04T09:35:32.9890883Z  # latter, it will wait for the Docker images to become available before continuing 2025-12-04T09:35:32.9891484Z  if [ "${DOCKER_PUSH:-false}" == "true" ]; then 2025-12-04T09:35:32.9891951Z  # It's a Docker build job, let's build the image 2025-12-04T09:35:32.9892354Z  break 2025-12-04T09:35:32.9892618Z  else 2025-12-04T09:35:32.9893004Z  # It's a regular build job, wait for the image to become available 2025-12-04T09:35:32.9893491Z  sleep 300 2025-12-04T09:35:32.9893775Z  fi 2025-12-04T09:35:32.9894011Z done 2025-12-04T09:35:32.9894253Z  2025-12-04T09:35:32.9894664Z # NB: This part requires a full checkout. Otherwise, the merge base will 2025-12-04T09:35:32.9895492Z # be empty. The default action would be to continue rebuild the image 2025-12-04T09:35:32.9896103Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2025-12-04T09:35:32.9896638Z  # if we're on the base branch then use the parent commit 2025-12-04T09:35:32.9897112Z  MERGE_BASE=$(git rev-parse HEAD~) 2025-12-04T09:35:32.9897468Z else 2025-12-04T09:35:32.9897847Z  # otherwise we're on a PR, so use the most recent base commit 2025-12-04T09:35:32.9898482Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2025-12-04T09:35:32.9898996Z fi 2025-12-04T09:35:32.9899232Z  2025-12-04T09:35:32.9899497Z if [[ -z "${MERGE_BASE}" ]]; then 2025-12-04T09:35:32.9899911Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9900280Z  2025-12-04T09:35:32.9900826Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2025-12-04T09:35:32.9901485Z  exit 0 2025-12-04T09:35:32.9901729Z fi 2025-12-04T09:35:32.9901967Z  2025-12-04T09:35:32.9902320Z if ! git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2025-12-04T09:35:32.9903129Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2025-12-04T09:35:32.9903814Z  exit 1 2025-12-04T09:35:32.9904069Z fi 2025-12-04T09:35:32.9904307Z  2025-12-04T09:35:32.9904715Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2025-12-04T09:35:32.9905493Z # If no image exists but the hash is the same as the previous hash then we should error out here 2025-12-04T09:35:32.9906185Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2025-12-04T09:35:32.9906990Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2025-12-04T09:35:32.9907893Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2025-12-04T09:35:32.9908429Z fi 2025-12-04T09:35:32.9908670Z  2025-12-04T09:35:32.9908957Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T09:35:32.9915326Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:32.9915769Z env: 2025-12-04T09:35:32.9916022Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:32.9916330Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T09:35:32.9916727Z BASE_REVISION: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:35:32.9917829Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9919154Z DOCKER_TAG: pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:32.9919952Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:32.9920409Z DOCKER_PUSH: 2025-12-04T09:35:32.9920667Z ##[endgroup] 2025-12-04T09:35:32.9946459Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:32.9946971Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:32.9949445Z + aws ecr get-login-password --region us-east-1 2025-12-04T09:35:32.9950986Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:33.5798550Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-12-04T09:35:33.5799295Z Configure a credential helper to remove this warning. See 2025-12-04T09:35:33.5799966Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-12-04T09:35:33.5800423Z 2025-12-04T09:35:33.5805029Z Login Succeeded 2025-12-04T09:35:33.5819481Z ++ date +%s 2025-12-04T09:35:33.5830145Z + START_TIME=1764840933 2025-12-04T09:35:33.5833575Z ++ date +%s 2025-12-04T09:35:33.5844038Z + [[ 1764833733 -lt 1764840933 ]] 2025-12-04T09:35:33.5845105Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:33.8068063Z { 2025-12-04T09:35:33.8068543Z "schemaVersion": 2, 2025-12-04T09:35:33.8069061Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2025-12-04T09:35:33.8069586Z "config": { 2025-12-04T09:35:33.8069964Z "mediaType": "application/vnd.docker.container.image.v1+json", 2025-12-04T09:35:33.8070443Z "size": 34787, 2025-12-04T09:35:33.8071248Z "digest": "sha256:5465aa79632b68f6240c23f0d0b021df4d0fd595333b61a40d36a0cf73656024" 2025-12-04T09:35:33.8071789Z }, 2025-12-04T09:35:33.8072016Z "layers": [ 2025-12-04T09:35:33.8072263Z { 2025-12-04T09:35:33.8072630Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8073121Z "size": 30447951, 2025-12-04T09:35:33.8073634Z "digest": "sha256:63e5bc7682b85ae57a1221210f64d62e7a90b0a30f19af4ca734b8242ae49d63" 2025-12-04T09:35:33.8074173Z }, 2025-12-04T09:35:33.8074393Z { 2025-12-04T09:35:33.8074765Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8075249Z "size": 1554, 2025-12-04T09:35:33.8075705Z "digest": "sha256:835841cca3b7e1464290cdb78e48773e03583413fbed852c3cc5165a392ea44d" 2025-12-04T09:35:33.8076254Z }, 2025-12-04T09:35:33.8076469Z { 2025-12-04T09:35:33.8076854Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8077343Z "size": 313276213, 2025-12-04T09:35:33.8077859Z "digest": "sha256:1bf1bb125deaa5b8a3adf121671e87ba2fa7e229f9eb1dff7ade581cb737175a" 2025-12-04T09:35:33.8078416Z }, 2025-12-04T09:35:33.8078638Z { 2025-12-04T09:35:33.8079011Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8079485Z "size": 787, 2025-12-04T09:35:33.8079968Z "digest": "sha256:b21856d1bf420da6fa8ec7331b82ab355d4f4178644e7d3a3d3d0fbc3610109a" 2025-12-04T09:35:33.8080529Z }, 2025-12-04T09:35:33.8080732Z { 2025-12-04T09:35:33.8081100Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8081581Z "size": 106, 2025-12-04T09:35:33.8082065Z "digest": "sha256:848ba2c095e2b9e6acfb0ecf077adb526fb2fa82ed44cf6648ebde97f296f8ec" 2025-12-04T09:35:33.8082613Z }, 2025-12-04T09:35:33.8082828Z { 2025-12-04T09:35:33.8083199Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8083670Z "size": 704, 2025-12-04T09:35:33.8084150Z "digest": "sha256:029495b23122c840ca0e52d487afa8d2c4dbf1991cd7f204ec3e434dcf947bf4" 2025-12-04T09:35:33.8084702Z }, 2025-12-04T09:35:33.8084904Z { 2025-12-04T09:35:33.8085274Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8086005Z "size": 1216, 2025-12-04T09:35:33.8086493Z "digest": "sha256:073bb82063cfba4639b11fea43753dbb128f9238353189fc02d2e2aa0b2ad359" 2025-12-04T09:35:33.8087029Z }, 2025-12-04T09:35:33.8087255Z { 2025-12-04T09:35:33.8087629Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8088306Z "size": 484, 2025-12-04T09:35:33.8088781Z "digest": "sha256:59b63930883363c7d2aaab27cc61555d9f3e119dc18247a8624c98ebdaa354a5" 2025-12-04T09:35:33.8089328Z }, 2025-12-04T09:35:33.8089532Z { 2025-12-04T09:35:33.8089913Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8090401Z "size": 110362071, 2025-12-04T09:35:33.8090878Z "digest": "sha256:1c6177b2970db2d7743b4337c420a35f2ec79f338c30d97d534a1f0987c00913" 2025-12-04T09:35:33.8091432Z }, 2025-12-04T09:35:33.8091648Z { 2025-12-04T09:35:33.8092020Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8092492Z "size": 4961, 2025-12-04T09:35:33.8092972Z "digest": "sha256:fabe466dd5f33c3209a56abf5cb46b9b07fe21c57fb43b98e13308c8665c0864" 2025-12-04T09:35:33.8093528Z }, 2025-12-04T09:35:33.8093728Z { 2025-12-04T09:35:33.8094289Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8094779Z "size": 1755, 2025-12-04T09:35:33.8095235Z "digest": "sha256:2b5a11b41761d8ea3b829e4772e4064cb6c4e4989126af324d0057661e4493a1" 2025-12-04T09:35:33.8095781Z }, 2025-12-04T09:35:33.8095998Z { 2025-12-04T09:35:33.8096356Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8096837Z "size": 724, 2025-12-04T09:35:33.8097303Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:35:33.8097948Z }, 2025-12-04T09:35:33.8098239Z { 2025-12-04T09:35:33.8098623Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8099114Z "size": 544, 2025-12-04T09:35:33.8099572Z "digest": "sha256:dc0780902fca810498f16efa71f8e5990385f141a0cfcc552616a4acc434f79a" 2025-12-04T09:35:33.8100123Z }, 2025-12-04T09:35:33.8100343Z { 2025-12-04T09:35:33.8100710Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8115128Z "size": 3185191720, 2025-12-04T09:35:33.8115729Z "digest": "sha256:5b09a2b135c8e540e2b9374b68991afdd63a5dfaba75fb44efe054a591f400c2" 2025-12-04T09:35:33.8116312Z }, 2025-12-04T09:35:33.8116541Z { 2025-12-04T09:35:33.8116925Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8117402Z "size": 32, 2025-12-04T09:35:33.8117882Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8118442Z }, 2025-12-04T09:35:33.8118664Z { 2025-12-04T09:35:33.8119040Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8119523Z "size": 396, 2025-12-04T09:35:33.8120004Z "digest": "sha256:5bfdaeb5578d6ffcd7db29c48303cbceb13c591210feaa216a8daa7a6d445b4b" 2025-12-04T09:35:33.8120574Z }, 2025-12-04T09:35:33.8120790Z { 2025-12-04T09:35:33.8121148Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8121642Z "size": 236865, 2025-12-04T09:35:33.8122117Z "digest": "sha256:0ef42867f370b8a14b8c301388793b78a0bd2533bb2a317b129b03c8667dc767" 2025-12-04T09:35:33.8122660Z }, 2025-12-04T09:35:33.8122864Z { 2025-12-04T09:35:33.8123238Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8123720Z "size": 230, 2025-12-04T09:35:33.8124172Z "digest": "sha256:446083e497f322789c2d87933a77fb2dfd94e18d2e85f6d4362e6e9521b82c4e" 2025-12-04T09:35:33.8124716Z }, 2025-12-04T09:35:33.8124931Z { 2025-12-04T09:35:33.8125289Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8125769Z "size": 3043500, 2025-12-04T09:35:33.8126256Z "digest": "sha256:d8a170bef0f4e0e28f5ba0952320dd465552adf74f0864b4f47cc11f4c4f82f7" 2025-12-04T09:35:33.8126800Z }, 2025-12-04T09:35:33.8127010Z { 2025-12-04T09:35:33.8127378Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8127848Z "size": 1472, 2025-12-04T09:35:33.8128332Z "digest": "sha256:e2b6cd6a5bd0418a1e4aca3f37942324d4d9f9b0177597e37fc8d1a5626048e1" 2025-12-04T09:35:33.8128891Z }, 2025-12-04T09:35:33.8129112Z { 2025-12-04T09:35:33.8129473Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8129955Z "size": 481, 2025-12-04T09:35:33.8130326Z + exit 0 2025-12-04T09:35:33.8130757Z "digest": "sha256:93efc0181a22218a544413f1d57e9e0e7a0f492e41bef598084c5b9177e3987a" 2025-12-04T09:35:33.8131304Z }, 2025-12-04T09:35:33.8131522Z { 2025-12-04T09:35:33.8131888Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8132371Z "size": 202, 2025-12-04T09:35:33.8132836Z "digest": "sha256:7454c938f17425bcf167ad28a62b42b95f638a7d2cf0840885cfe5ffe8480a12" 2025-12-04T09:35:33.8133370Z }, 2025-12-04T09:35:33.8133584Z { 2025-12-04T09:35:33.8133955Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8134424Z "size": 607, 2025-12-04T09:35:33.8135050Z "digest": "sha256:4d57ff55f6d4161cb6c29e2c0b08d47e65898427db3938479158684899f0023d" 2025-12-04T09:35:33.8135607Z }, 2025-12-04T09:35:33.8135832Z { 2025-12-04T09:35:33.8136196Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8136689Z "size": 6243016141, 2025-12-04T09:35:33.8137191Z "digest": "sha256:b0301534b4a58072d5b140b08a7608bbead41d126fa29fdc78c1e8a43ebb865d" 2025-12-04T09:35:33.8137729Z }, 2025-12-04T09:35:33.8137955Z { 2025-12-04T09:35:33.8138440Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8139005Z "size": 829, 2025-12-04T09:35:33.8139481Z "digest": "sha256:1969e15d0c13874ea5883ed829235a19ef6dc21c8aa6172032b78a8ffa6ff262" 2025-12-04T09:35:33.8140028Z }, 2025-12-04T09:35:33.8140232Z { 2025-12-04T09:35:33.8140606Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8141086Z "size": 33450177, 2025-12-04T09:35:33.8141587Z "digest": "sha256:73180a0f2d5a961a0cc0ba2c3cf375fdcfb43ae5e4e5c63a000c4b4366d52a64" 2025-12-04T09:35:33.8142129Z }, 2025-12-04T09:35:33.8142343Z { 2025-12-04T09:35:33.8142708Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8143174Z "size": 104, 2025-12-04T09:35:33.8143648Z "digest": "sha256:ad81b25cb69f8cf42a4a96678a64b7d0598a8f95236a3e63d1fec4e53edff613" 2025-12-04T09:35:33.8144202Z }, 2025-12-04T09:35:33.8144402Z { 2025-12-04T09:35:33.8144771Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8145252Z "size": 1496, 2025-12-04T09:35:33.8145714Z "digest": "sha256:8165374f8dccf88a7791a5d31afbe29e4d4542b4f1cf1904945e07f9af6bf8ba" 2025-12-04T09:35:33.8146262Z }, 2025-12-04T09:35:33.8146475Z { 2025-12-04T09:35:33.8146832Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8147342Z "size": 458786969, 2025-12-04T09:35:33.8147845Z "digest": "sha256:7779c0bb9be2030df9060b526b98d0afeed1ce5b61ee0530321ef04a4e145e8c" 2025-12-04T09:35:33.8148403Z }, 2025-12-04T09:35:33.8148604Z { 2025-12-04T09:35:33.8148973Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8149450Z "size": 164, 2025-12-04T09:35:33.8149904Z "digest": "sha256:4d0a1c027262ed8c83181b931b64afa1c41c3cac97580231c4cae3a524ebd7d5" 2025-12-04T09:35:33.8150453Z }, 2025-12-04T09:35:33.8150667Z { 2025-12-04T09:35:33.8151026Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8151507Z "size": 346, 2025-12-04T09:35:33.8151973Z "digest": "sha256:a51e0dab2d596e6563483f27c12660007160847d177ba4c31812a8f44ada5754" 2025-12-04T09:35:33.8152502Z }, 2025-12-04T09:35:33.8152716Z { 2025-12-04T09:35:33.8153085Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8153568Z "size": 32, 2025-12-04T09:35:33.8154031Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8154593Z }, 2025-12-04T09:35:33.8154813Z { 2025-12-04T09:35:33.8155173Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8155655Z "size": 106, 2025-12-04T09:35:33.8156132Z "digest": "sha256:3eb6d4ff040b8761b1e3e1da768bdb884ce0e5324e3d0f6471b0a8b2ddf4736f" 2025-12-04T09:35:33.8156676Z }, 2025-12-04T09:35:33.8156893Z { 2025-12-04T09:35:33.8157264Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8157729Z "size": 424, 2025-12-04T09:35:33.8158199Z "digest": "sha256:b168858b85373f8ddca549d79267a06de4fa945d04bf791c55c9ddc93957fa3c" 2025-12-04T09:35:33.8158752Z }, 2025-12-04T09:35:33.8158953Z { 2025-12-04T09:35:33.8159322Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8159802Z "size": 19309367, 2025-12-04T09:35:33.8160284Z "digest": "sha256:d77a39278026a8899e2f97643918bdcf96e711ca26951880b4841b319dc71321" 2025-12-04T09:35:33.8160813Z }, 2025-12-04T09:35:33.8161124Z { 2025-12-04T09:35:33.8161501Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8161968Z "size": 108, 2025-12-04T09:35:33.8162448Z "digest": "sha256:36fbd357280b6b40e90f36ac3d19da3da10e5dbf0027a5cfe8e2f29d1870d347" 2025-12-04T09:35:33.8163005Z }, 2025-12-04T09:35:33.8163209Z { 2025-12-04T09:35:33.8163579Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8164060Z "size": 826, 2025-12-04T09:35:33.8164519Z "digest": "sha256:4e3b10a5dd6aed29f238d604925e2a4f873141c1087c8dd4fdde5c61e7560893" 2025-12-04T09:35:33.8165140Z }, 2025-12-04T09:35:33.8165351Z { 2025-12-04T09:35:33.8165709Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8166187Z "size": 724, 2025-12-04T09:35:33.8166648Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:35:33.8167184Z }, 2025-12-04T09:35:33.8167386Z { 2025-12-04T09:35:33.8167771Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8168251Z "size": 149, 2025-12-04T09:35:33.8168714Z "digest": "sha256:3092fab73b59190b9facfc49bf18f58612172bc2fd68dfa339a1118632616939" 2025-12-04T09:35:33.8169265Z }, 2025-12-04T09:35:33.8169481Z { 2025-12-04T09:35:33.8169841Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8170321Z "size": 136, 2025-12-04T09:35:33.8170804Z "digest": "sha256:20020dd28a15ba092fcbfe906ee39cdddfcc9d0b7eb42fdd6f4c08a984fa9c00" 2025-12-04T09:35:33.8171363Z }, 2025-12-04T09:35:33.8171577Z { 2025-12-04T09:35:33.8171948Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8172417Z "size": 140, 2025-12-04T09:35:33.8172892Z "digest": "sha256:ae5280ce969dcff08c091e9a5f7641f13561b2b0ee44d78b7c3f81d8fe8e6d32" 2025-12-04T09:35:33.8173444Z }, 2025-12-04T09:35:33.8173659Z { 2025-12-04T09:35:33.8174023Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8174503Z "size": 32, 2025-12-04T09:35:33.8174971Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8175513Z }, 2025-12-04T09:35:33.8175728Z { 2025-12-04T09:35:33.8176098Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8176565Z "size": 223, 2025-12-04T09:35:33.8177031Z "digest": "sha256:026e4484b749dfc556dcf7c8f45c1759518a89072e4dbc974d9405ada1582d03" 2025-12-04T09:35:33.8177579Z }, 2025-12-04T09:35:33.8177785Z { 2025-12-04T09:35:33.8178152Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8178736Z "size": 256, 2025-12-04T09:35:33.8179225Z "digest": "sha256:1be9da2ce53d20d8befad5c024ee0eb41ee35984307cbd5621d8effae0353073" 2025-12-04T09:35:33.8179777Z }, 2025-12-04T09:35:33.8179992Z { 2025-12-04T09:35:33.8180367Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8180838Z "size": 32, 2025-12-04T09:35:33.8181315Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8181871Z }, 2025-12-04T09:35:33.8182073Z { 2025-12-04T09:35:33.8182442Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8182924Z "size": 106, 2025-12-04T09:35:33.8183379Z "digest": "sha256:6481b7a1d9fb4001fd6f9e2a8d1600192529ddb957128e41671ca4630fa06ad4" 2025-12-04T09:35:33.8183930Z }, 2025-12-04T09:35:33.8184145Z { 2025-12-04T09:35:33.8184502Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8184991Z "size": 312293471, 2025-12-04T09:35:33.8185486Z "digest": "sha256:fa519d18c39d8f297109c056017ebce7efc322d058afd27fdac5880d6c8d35b0" 2025-12-04T09:35:33.8186040Z }, 2025-12-04T09:35:33.8186240Z { 2025-12-04T09:35:33.8186608Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8187090Z "size": 3058012325, 2025-12-04T09:35:33.8187927Z "digest": "sha256:d172f25b97f78fce0f6c6701f0db794b1c994a9cdf8cff9ddc6bdd1a1bea835c" 2025-12-04T09:35:33.8188536Z }, 2025-12-04T09:35:33.8188754Z { 2025-12-04T09:35:33.8189131Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8189603Z "size": 129, 2025-12-04T09:35:33.8190080Z "digest": "sha256:fd60ab6b1c2c85a932e9894b5d0cf5c9e75fa21782e3028ea40d76017ecfbf85" 2025-12-04T09:35:33.8190635Z }, 2025-12-04T09:35:33.8190829Z { 2025-12-04T09:35:33.8191180Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8191778Z "size": 880, 2025-12-04T09:35:33.8192236Z "digest": "sha256:0afe45579c2c87002db8c1abf7b32a748e6cb3b9b57e9b391f91cad9f84df476" 2025-12-04T09:35:33.8192767Z }, 2025-12-04T09:35:33.8192970Z { 2025-12-04T09:35:33.8193330Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8193784Z "size": 724, 2025-12-04T09:35:33.8194235Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:35:33.8194758Z }, 2025-12-04T09:35:33.8194951Z { 2025-12-04T09:35:33.8195313Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8195778Z "size": 139, 2025-12-04T09:35:33.8196222Z "digest": "sha256:5884ffd6720b47274f651262d5f9224f55960f9ea717faafe332aa20afb0ffa4" 2025-12-04T09:35:33.8196755Z }, 2025-12-04T09:35:33.8196956Z { 2025-12-04T09:35:33.8197300Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8197767Z "size": 32, 2025-12-04T09:35:33.8198225Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8198770Z }, 2025-12-04T09:35:33.8198965Z { 2025-12-04T09:35:33.8199325Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8199793Z "size": 160, 2025-12-04T09:35:33.8200259Z "digest": "sha256:ab7a7c316fa7a9b7a96304ce96fafdffbc5cc6b960a4bb2def9131b36d9225c5" 2025-12-04T09:35:33.8200812Z }, 2025-12-04T09:35:33.8201010Z { 2025-12-04T09:35:33.8201359Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8201833Z "size": 1012, 2025-12-04T09:35:33.8202295Z "digest": "sha256:c7775ce5574bdde75b4c09a1db19f7d0dc027f1f4c1f961022fc55833133e616" 2025-12-04T09:35:33.8202822Z }, 2025-12-04T09:35:33.8203019Z { 2025-12-04T09:35:33.8203376Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8203832Z "size": 724, 2025-12-04T09:35:33.8204277Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:35:33.8204802Z }, 2025-12-04T09:35:33.8205001Z { 2025-12-04T09:35:33.8205350Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8205817Z "size": 134, 2025-12-04T09:35:33.8206270Z "digest": "sha256:81945c4fb228ca73f4bac38b6d8a1eca7139585d4a078219dfaa16ea13945949" 2025-12-04T09:35:33.8206800Z }, 2025-12-04T09:35:33.8207003Z { 2025-12-04T09:35:33.8207358Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8207820Z "size": 32, 2025-12-04T09:35:33.8208279Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8208818Z }, 2025-12-04T09:35:33.8209008Z { 2025-12-04T09:35:33.8209366Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8209830Z "size": 158, 2025-12-04T09:35:33.8210286Z "digest": "sha256:663cbe24d60bf42bc7a440cb4867e4287cacf54194dd3152406668e61d7e92e5" 2025-12-04T09:35:33.8210816Z }, 2025-12-04T09:35:33.8211016Z { 2025-12-04T09:35:33.8211372Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8211835Z "size": 603, 2025-12-04T09:35:33.8212269Z "digest": "sha256:43f216b027865c8ca16f855703465445f3a548614a4d7e29387337b9651ac25c" 2025-12-04T09:35:33.8212787Z }, 2025-12-04T09:35:33.8212981Z { 2025-12-04T09:35:33.8213450Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8213922Z "size": 724, 2025-12-04T09:35:33.8214360Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T09:35:33.8214891Z }, 2025-12-04T09:35:33.8215095Z { 2025-12-04T09:35:33.8215446Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8215918Z "size": 155, 2025-12-04T09:35:33.8216381Z "digest": "sha256:c47c3cfeb68763aa19727693ad52fe0c80561a98139adaa2ab5eccea35c2d1b4" 2025-12-04T09:35:33.8216993Z }, 2025-12-04T09:35:33.8217190Z { 2025-12-04T09:35:33.8217547Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8218019Z "size": 32, 2025-12-04T09:35:33.8218556Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8219091Z }, 2025-12-04T09:35:33.8219286Z { 2025-12-04T09:35:33.8219638Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8220100Z "size": 188, 2025-12-04T09:35:33.8220551Z "digest": "sha256:7d326b9e267322de9337ac2a71ddeac4cb61f28a018a6155863f83a164ad9437" 2025-12-04T09:35:33.8221072Z }, 2025-12-04T09:35:33.8221273Z { 2025-12-04T09:35:33.8221625Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8222082Z "size": 1370, 2025-12-04T09:35:33.8222536Z "digest": "sha256:7ec8f17141c8335192fa21b660dfe1fe0ad16b202bc234e7d4ef063b35124158" 2025-12-04T09:35:33.8223064Z }, 2025-12-04T09:35:33.8223265Z { 2025-12-04T09:35:33.8223609Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8224073Z "size": 32, 2025-12-04T09:35:33.8224539Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8225078Z }, 2025-12-04T09:35:33.8225291Z { 2025-12-04T09:35:33.8225660Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8226131Z "size": 136, 2025-12-04T09:35:33.8226602Z "digest": "sha256:26249ea175bf816b87c4c83e5efb78fd386a800fa10e819ba85b06858bcf877e" 2025-12-04T09:35:33.8227153Z }, 2025-12-04T09:35:33.8227356Z { 2025-12-04T09:35:33.8227729Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8228210Z "size": 529, 2025-12-04T09:35:33.8228683Z "digest": "sha256:5e8e9ccb36f30a8c3a7e6a5011ee5001152f36c9c749397f3e234b1822326dd0" 2025-12-04T09:35:33.8229223Z }, 2025-12-04T09:35:33.8229441Z { 2025-12-04T09:35:33.8229819Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8230286Z "size": 32, 2025-12-04T09:35:33.8230766Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8231327Z }, 2025-12-04T09:35:33.8231532Z { 2025-12-04T09:35:33.8231906Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8232394Z "size": 104, 2025-12-04T09:35:33.8232858Z "digest": "sha256:5bc72d4e1de83a1a254e8808f727118dd54cf048c14ff298a5299e015a116bfd" 2025-12-04T09:35:33.8233413Z }, 2025-12-04T09:35:33.8233632Z { 2025-12-04T09:35:33.8233992Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8234473Z "size": 436, 2025-12-04T09:35:33.8234944Z "digest": "sha256:83cddbd497794c27254e11c4c00105d1f61399e7fef9d208a0be250724efd2c0" 2025-12-04T09:35:33.8235495Z }, 2025-12-04T09:35:33.8235698Z { 2025-12-04T09:35:33.8236069Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8236551Z "size": 32, 2025-12-04T09:35:33.8237007Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8237561Z }, 2025-12-04T09:35:33.8237771Z { 2025-12-04T09:35:33.8238128Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8238612Z "size": 109, 2025-12-04T09:35:33.8239172Z "digest": "sha256:60c25d8c3dd2d78785f659204d0b1e64954ca581f89874b68ffe8fee23c6b661" 2025-12-04T09:35:33.8239710Z }, 2025-12-04T09:35:33.8239929Z { 2025-12-04T09:35:33.8240301Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8240773Z "size": 1896, 2025-12-04T09:35:33.8241260Z "digest": "sha256:a534dcf4b9a9e5fabed742c8a8fc43c9cfe7346ea88ab3c177c3b14fd3afe00a" 2025-12-04T09:35:33.8241827Z }, 2025-12-04T09:35:33.8242042Z { 2025-12-04T09:35:33.8242400Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8242949Z "size": 245582017, 2025-12-04T09:35:33.8243441Z "digest": "sha256:10138310c65c78d7de8375225ce37f5f7bfae7898e4e8bbcb90bd56a1bd05db4" 2025-12-04T09:35:33.8243973Z }, 2025-12-04T09:35:33.8244188Z { 2025-12-04T09:35:33.8244556Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8245024Z "size": 106, 2025-12-04T09:35:33.8245504Z "digest": "sha256:8487679f252b6fb703dc9398d73aaeec68df724bfc961579ec5bdae62ebe3a37" 2025-12-04T09:35:33.8246052Z }, 2025-12-04T09:35:33.8246252Z { 2025-12-04T09:35:33.8246624Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8247103Z "size": 162, 2025-12-04T09:35:33.8247577Z "digest": "sha256:52580ee2caa9ab69b0ac640315ee350e847cd0955c0a1eafa933a076669e87ad" 2025-12-04T09:35:33.8248114Z }, 2025-12-04T09:35:33.8248330Z { 2025-12-04T09:35:33.8248699Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8249170Z "size": 7944, 2025-12-04T09:35:33.8249660Z "digest": "sha256:741c215cb2ffb295ab6a07fab3f0dfdde029463779ff9c0bbff4add26a340cfb" 2025-12-04T09:35:33.8250221Z }, 2025-12-04T09:35:33.8250423Z { 2025-12-04T09:35:33.8250795Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8251273Z "size": 8070, 2025-12-04T09:35:33.8251722Z "digest": "sha256:d17f5aba17a608d1c7851cb3940a25d43f063385813051127074f693d0ede19b" 2025-12-04T09:35:33.8252270Z }, 2025-12-04T09:35:33.8252485Z { 2025-12-04T09:35:33.8252845Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8253326Z "size": 304, 2025-12-04T09:35:33.8253806Z "digest": "sha256:bc08246bb4ba18c3ec5bc69e16b6b4e929c5bd0f3fae10eeb0b1a622a63d6fa2" 2025-12-04T09:35:33.8254366Z }, 2025-12-04T09:35:33.8254568Z { 2025-12-04T09:35:33.8254938Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8255424Z "size": 23755574, 2025-12-04T09:35:33.8255893Z "digest": "sha256:7323bf084bf98f915db061b178c56525a0f95bd34d211b381c7527ad242c5a58" 2025-12-04T09:35:33.8256442Z }, 2025-12-04T09:35:33.8256655Z { 2025-12-04T09:35:33.8257012Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8257488Z "size": 108, 2025-12-04T09:35:33.8257973Z "digest": "sha256:d344ecc97fd77c7d12fd68ddb67aeb6cc3dd2e723de5ad1ca2c80b45c8d6bd77" 2025-12-04T09:35:33.8258608Z }, 2025-12-04T09:35:33.8258825Z { 2025-12-04T09:35:33.8259199Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8259668Z "size": 54145663, 2025-12-04T09:35:33.8260157Z "digest": "sha256:fb60b2d2147ff57c218f449f5b680132af8f7f8032ed69f422b48a3c3c1424f4" 2025-12-04T09:35:33.8260704Z }, 2025-12-04T09:35:33.8260923Z { 2025-12-04T09:35:33.8261284Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T09:35:33.8261766Z "size": 32, 2025-12-04T09:35:33.8262243Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T09:35:33.8262790Z } 2025-12-04T09:35:33.8263006Z ] 2025-12-04T09:35:33.8263220Z } 2025-12-04T09:35:33.8293131Z ##[group]Run set -eux 2025-12-04T09:35:33.8293453Z set -eux 2025-12-04T09:35:33.8293924Z # It's ok if this steps fails, it would then be an anonymous user like what we used to have 2025-12-04T09:35:33.8295427Z aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token | jq --raw-output '.SecretString' | jq -r .docker_hub_readonly_token | docker login --username pytorchbot --password-stdin || true 2025-12-04T09:35:33.8303183Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:33.8303628Z env: 2025-12-04T09:35:33.8303862Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:33.8304164Z ##[endgroup] 2025-12-04T09:35:33.8334401Z + aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token 2025-12-04T09:35:33.8335197Z + jq --raw-output .SecretString 2025-12-04T09:35:33.8336571Z + jq -r .docker_hub_readonly_token 2025-12-04T09:35:33.8337563Z + docker login --username pytorchbot --password-stdin 2025-12-04T09:35:34.4451819Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-12-04T09:35:34.4452557Z Configure a credential helper to remove this warning. See 2025-12-04T09:35:34.4453316Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-12-04T09:35:34.4453835Z 2025-12-04T09:35:34.4454003Z Login Succeeded 2025-12-04T09:35:34.4544467Z ##[group]Run tag=${ECR_DOCKER_IMAGE##*:} 2025-12-04T09:35:34.4544907Z tag=${ECR_DOCKER_IMAGE##*:} 2025-12-04T09:35:34.4545377Z echo "docker pull ghcr.io/pytorch/ci-image:${tag/:/-}" 2025-12-04T09:35:34.4552205Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:34.4552636Z env: 2025-12-04T09:35:34.4552883Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:34.4553878Z ECR_DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:34.4554918Z ##[endgroup] 2025-12-04T09:35:34.4584384Z docker pull ghcr.io/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:34.4634580Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2025-12-04T09:35:34.4635079Z with: 2025-12-04T09:35:34.4635993Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:34.4637129Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:34.4637583Z env: 2025-12-04T09:35:34.4637815Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:34.4638116Z ##[endgroup] 2025-12-04T09:35:34.4654165Z ##[group]Run set -x 2025-12-04T09:35:34.4654477Z set -x 2025-12-04T09:35:34.4654724Z set +e 2025-12-04T09:35:34.4654976Z  2025-12-04T09:35:34.4655236Z login() { 2025-12-04T09:35:34.4655781Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T09:35:34.4656395Z } 2025-12-04T09:35:34.4656634Z  2025-12-04T09:35:34.4656908Z retry () { 2025-12-04T09:35:34.4657209Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T09:35:34.4657578Z } 2025-12-04T09:35:34.4657812Z  2025-12-04T09:35:34.4658077Z retry login "${DOCKER_REGISTRY}" 2025-12-04T09:35:34.4658544Z  2025-12-04T09:35:34.4659120Z IMAGE_SIZE=$(docker manifest inspect "${DOCKER_IMAGE}" | jq '[.layers[].size, .config.size] | add / 1024 / 1024') 2025-12-04T09:35:34.4659911Z echo "Compressed size of image in MB: ${IMAGE_SIZE}" 2025-12-04T09:35:34.4660336Z  2025-12-04T09:35:34.4660572Z set -e 2025-12-04T09:35:34.4660966Z # ignore output since only exit code is used for conditional 2025-12-04T09:35:34.4661550Z # only pull docker image if it's not available locally 2025-12-04T09:35:34.4662181Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2025-12-04T09:35:34.4662780Z  retry docker pull "${DOCKER_IMAGE}" 2025-12-04T09:35:34.4663161Z fi 2025-12-04T09:35:34.4669167Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:35:34.4669606Z env: 2025-12-04T09:35:34.4669852Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:35:34.4670822Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:34.4671948Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:34.4672404Z ##[endgroup] 2025-12-04T09:35:34.4697923Z + set +e 2025-12-04T09:35:34.4698403Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:34.4698919Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:34.4701797Z + aws ecr get-login-password --region us-east-1 2025-12-04T09:35:34.4703283Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T09:35:35.0507110Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-12-04T09:35:35.0507849Z Configure a credential helper to remove this warning. See 2025-12-04T09:35:35.0508676Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-12-04T09:35:35.0509152Z 2025-12-04T09:35:35.0509275Z Login Succeeded 2025-12-04T09:35:35.0528089Z ++ docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:35.0529244Z ++ jq '[.layers[].size, .config.size] | add / 1024 / 1024' 2025-12-04T09:35:35.2932147Z + IMAGE_SIZE=13438.219573020935 2025-12-04T09:35:35.2932731Z + echo 'Compressed size of image in MB: 13438.219573020935' 2025-12-04T09:35:35.2933214Z + set -e 2025-12-04T09:35:35.2934197Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:35.2935648Z Compressed size of image in MB: 13438.219573020935 2025-12-04T09:35:35.3062648Z + retry docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:35.3064363Z + docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:35:35.5360231Z pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a: Pulling from pytorch/ci-image 2025-12-04T09:35:35.5361192Z 63e5bc7682b8: Pulling fs layer 2025-12-04T09:35:35.5361592Z 835841cca3b7: Pulling fs layer 2025-12-04T09:35:35.5362028Z 1bf1bb125dea: Pulling fs layer 2025-12-04T09:35:35.5362422Z b21856d1bf42: Pulling fs layer 2025-12-04T09:35:35.5362797Z 848ba2c095e2: Pulling fs layer 2025-12-04T09:35:35.5363203Z 029495b23122: Pulling fs layer 2025-12-04T09:35:35.5363537Z 073bb82063cf: Pulling fs layer 2025-12-04T09:35:35.5363911Z 59b639308833: Pulling fs layer 2025-12-04T09:35:35.5364234Z 1c6177b2970d: Pulling fs layer 2025-12-04T09:35:35.5364626Z fabe466dd5f3: Pulling fs layer 2025-12-04T09:35:35.5364939Z 2b5a11b41761: Pulling fs layer 2025-12-04T09:35:35.5365324Z 9681563a88ff: Pulling fs layer 2025-12-04T09:35:35.5365648Z dc0780902fca: Pulling fs layer 2025-12-04T09:35:35.5365960Z 5b09a2b135c8: Pulling fs layer 2025-12-04T09:35:35.5366347Z 4f4fb700ef54: Pulling fs layer 2025-12-04T09:35:35.5366676Z 5bfdaeb5578d: Pulling fs layer 2025-12-04T09:35:35.5367055Z 0ef42867f370: Pulling fs layer 2025-12-04T09:35:35.5367370Z 446083e497f3: Pulling fs layer 2025-12-04T09:35:35.5367750Z d8a170bef0f4: Pulling fs layer 2025-12-04T09:35:35.5368069Z e2b6cd6a5bd0: Pulling fs layer 2025-12-04T09:35:35.5368452Z 93efc0181a22: Pulling fs layer 2025-12-04T09:35:35.5368778Z 7454c938f174: Pulling fs layer 2025-12-04T09:35:35.5369109Z 4d57ff55f6d4: Pulling fs layer 2025-12-04T09:35:35.5369513Z b0301534b4a5: Pulling fs layer 2025-12-04T09:35:35.5369832Z 1969e15d0c13: Pulling fs layer 2025-12-04T09:35:35.5370219Z 73180a0f2d5a: Pulling fs layer 2025-12-04T09:35:35.5370530Z ad81b25cb69f: Pulling fs layer 2025-12-04T09:35:35.5370918Z 8165374f8dcc: Pulling fs layer 2025-12-04T09:35:35.5371243Z 7779c0bb9be2: Pulling fs layer 2025-12-04T09:35:35.5371598Z 4d0a1c027262: Pulling fs layer 2025-12-04T09:35:35.5371946Z a51e0dab2d59: Pulling fs layer 2025-12-04T09:35:35.5372277Z 3eb6d4ff040b: Pulling fs layer 2025-12-04T09:35:35.5372648Z b168858b8537: Pulling fs layer 2025-12-04T09:35:35.5372964Z d77a39278026: Pulling fs layer 2025-12-04T09:35:35.5373347Z 36fbd357280b: Pulling fs layer 2025-12-04T09:35:35.5373862Z 4e3b10a5dd6a: Pulling fs layer 2025-12-04T09:35:35.5374592Z 3092fab73b59: Pulling fs layer 2025-12-04T09:35:35.5374915Z 20020dd28a15: Pulling fs layer 2025-12-04T09:35:35.5375236Z ae5280ce969d: Pulling fs layer 2025-12-04T09:35:35.5375607Z 026e4484b749: Pulling fs layer 2025-12-04T09:35:35.5375933Z 1be9da2ce53d: Pulling fs layer 2025-12-04T09:35:35.5376273Z 6481b7a1d9fb: Pulling fs layer 2025-12-04T09:35:35.5376589Z fa519d18c39d: Pulling fs layer 2025-12-04T09:35:35.5376918Z d172f25b97f7: Pulling fs layer 2025-12-04T09:35:35.5377247Z fd60ab6b1c2c: Pulling fs layer 2025-12-04T09:35:35.5377613Z 0afe45579c2c: Pulling fs layer 2025-12-04T09:35:35.5377922Z 5884ffd6720b: Pulling fs layer 2025-12-04T09:35:35.5378319Z ab7a7c316fa7: Pulling fs layer 2025-12-04T09:35:35.5378647Z c7775ce5574b: Pulling fs layer 2025-12-04T09:35:35.5378955Z 81945c4fb228: Pulling fs layer 2025-12-04T09:35:35.5379281Z 663cbe24d60b: Pulling fs layer 2025-12-04T09:35:35.5379606Z 43f216b02786: Pulling fs layer 2025-12-04T09:35:35.5379922Z c47c3cfeb687: Pulling fs layer 2025-12-04T09:35:35.5391661Z 7d326b9e2673: Pulling fs layer 2025-12-04T09:35:35.5392161Z 7ec8f17141c8: Pulling fs layer 2025-12-04T09:35:35.5392485Z 26249ea175bf: Pulling fs layer 2025-12-04T09:35:35.5392817Z 5e8e9ccb36f3: Pulling fs layer 2025-12-04T09:35:35.5393152Z 5bc72d4e1de8: Pulling fs layer 2025-12-04T09:35:35.5393681Z 83cddbd49779: Pulling fs layer 2025-12-04T09:35:35.5394015Z 60c25d8c3dd2: Pulling fs layer 2025-12-04T09:35:35.5394347Z a534dcf4b9a9: Pulling fs layer 2025-12-04T09:35:35.5394659Z 10138310c65c: Pulling fs layer 2025-12-04T09:35:35.5394982Z 8487679f252b: Pulling fs layer 2025-12-04T09:35:35.5395306Z 52580ee2caa9: Pulling fs layer 2025-12-04T09:35:35.5395630Z 741c215cb2ff: Pulling fs layer 2025-12-04T09:35:35.5395944Z d17f5aba17a6: Pulling fs layer 2025-12-04T09:35:35.5396275Z bc08246bb4ba: Pulling fs layer 2025-12-04T09:35:35.5396600Z 7323bf084bf9: Pulling fs layer 2025-12-04T09:35:35.5396900Z 0ef42867f370: Waiting 2025-12-04T09:35:35.5397181Z d8a170bef0f4: Waiting 2025-12-04T09:35:35.5397457Z 073bb82063cf: Waiting 2025-12-04T09:35:35.5397725Z 93efc0181a22: Waiting 2025-12-04T09:35:35.5398015Z d344ecc97fd7: Pulling fs layer 2025-12-04T09:35:35.5398333Z e2b6cd6a5bd0: Waiting 2025-12-04T09:35:35.5398596Z 4d57ff55f6d4: Waiting 2025-12-04T09:35:35.5398887Z fb60b2d2147f: Pulling fs layer 2025-12-04T09:35:35.5399204Z b21856d1bf42: Waiting 2025-12-04T09:35:35.5399461Z 029495b23122: Waiting 2025-12-04T09:35:35.5399725Z 1969e15d0c13: Waiting 2025-12-04T09:35:35.5399990Z 2b5a11b41761: Waiting 2025-12-04T09:35:35.5400244Z 73180a0f2d5a: Waiting 2025-12-04T09:35:35.5400508Z 9681563a88ff: Waiting 2025-12-04T09:35:35.5400773Z 8165374f8dcc: Waiting 2025-12-04T09:35:35.5401030Z ad81b25cb69f: Waiting 2025-12-04T09:35:35.5401300Z 7779c0bb9be2: Waiting 2025-12-04T09:35:35.5401572Z 5bfdaeb5578d: Waiting 2025-12-04T09:35:35.5401845Z 848ba2c095e2: Waiting 2025-12-04T09:35:35.5402099Z 446083e497f3: Waiting 2025-12-04T09:35:35.5402367Z dc0780902fca: Waiting 2025-12-04T09:35:35.5402640Z 3eb6d4ff040b: Waiting 2025-12-04T09:35:35.5402898Z 5b09a2b135c8: Waiting 2025-12-04T09:35:35.5403175Z 4f4fb700ef54: Waiting 2025-12-04T09:35:35.5403448Z fabe466dd5f3: Waiting 2025-12-04T09:35:35.5403697Z 7454c938f174: Waiting 2025-12-04T09:35:35.5403964Z 59b639308833: Waiting 2025-12-04T09:35:35.5404232Z d77a39278026: Waiting 2025-12-04T09:35:35.5404494Z 36fbd357280b: Waiting 2025-12-04T09:35:35.5404766Z 20020dd28a15: Waiting 2025-12-04T09:35:35.5405036Z ae5280ce969d: Waiting 2025-12-04T09:35:35.5405295Z 1be9da2ce53d: Waiting 2025-12-04T09:35:35.5405568Z 1c6177b2970d: Waiting 2025-12-04T09:35:35.5405837Z fa519d18c39d: Waiting 2025-12-04T09:35:35.5406096Z fd60ab6b1c2c: Waiting 2025-12-04T09:35:35.5406369Z 6481b7a1d9fb: Waiting 2025-12-04T09:35:35.5406640Z 0afe45579c2c: Waiting 2025-12-04T09:35:35.5406894Z 026e4484b749: Waiting 2025-12-04T09:35:35.5407162Z 5884ffd6720b: Waiting 2025-12-04T09:35:35.5407433Z ab7a7c316fa7: Waiting 2025-12-04T09:35:35.5407774Z b168858b8537: Waiting 2025-12-04T09:35:35.5408190Z c7775ce5574b: Waiting 2025-12-04T09:35:35.5408599Z 43f216b02786: Waiting 2025-12-04T09:35:35.5409167Z 81945c4fb228: Waiting 2025-12-04T09:35:35.5409424Z 7ec8f17141c8: Waiting 2025-12-04T09:35:35.5409697Z 7d326b9e2673: Waiting 2025-12-04T09:35:35.5409969Z c47c3cfeb687: Waiting 2025-12-04T09:35:35.5410225Z 26249ea175bf: Waiting 2025-12-04T09:35:35.5410511Z 60c25d8c3dd2: Waiting 2025-12-04T09:35:35.5410788Z a534dcf4b9a9: Waiting 2025-12-04T09:35:35.5411047Z 663cbe24d60b: Waiting 2025-12-04T09:35:35.5411323Z bc08246bb4ba: Waiting 2025-12-04T09:35:35.5411595Z 52580ee2caa9: Waiting 2025-12-04T09:35:35.5411855Z d17f5aba17a6: Waiting 2025-12-04T09:35:35.5412126Z 8487679f252b: Waiting 2025-12-04T09:35:35.5412389Z 7323bf084bf9: Waiting 2025-12-04T09:35:35.5412643Z 10138310c65c: Waiting 2025-12-04T09:35:35.5412907Z d344ecc97fd7: Waiting 2025-12-04T09:35:35.5413179Z fb60b2d2147f: Waiting 2025-12-04T09:35:35.5413435Z 5bc72d4e1de8: Waiting 2025-12-04T09:35:35.5413712Z 83cddbd49779: Waiting 2025-12-04T09:35:35.5413985Z 741c215cb2ff: Waiting 2025-12-04T09:35:35.5414242Z 5e8e9ccb36f3: Waiting 2025-12-04T09:35:35.5414518Z 3092fab73b59: Waiting 2025-12-04T09:35:35.5414791Z 4e3b10a5dd6a: Waiting 2025-12-04T09:35:35.5415066Z d172f25b97f7: Waiting 2025-12-04T09:35:35.5415341Z 4d0a1c027262: Waiting 2025-12-04T09:35:35.5415612Z a51e0dab2d59: Waiting 2025-12-04T09:35:35.5415980Z b0301534b4a5: Waiting 2025-12-04T09:35:35.6304524Z 835841cca3b7: Download complete 2025-12-04T09:35:35.7265112Z b21856d1bf42: Download complete 2025-12-04T09:35:35.8161995Z 848ba2c095e2: Verifying Checksum 2025-12-04T09:35:35.8162743Z 848ba2c095e2: Download complete 2025-12-04T09:35:35.8881388Z 63e5bc7682b8: Download complete 2025-12-04T09:35:35.9046891Z 029495b23122: Verifying Checksum 2025-12-04T09:35:35.9047560Z 029495b23122: Download complete 2025-12-04T09:35:35.9875209Z 073bb82063cf: Verifying Checksum 2025-12-04T09:35:35.9875633Z 073bb82063cf: Download complete 2025-12-04T09:35:36.0327517Z 59b639308833: Verifying Checksum 2025-12-04T09:35:36.0327915Z 59b639308833: Download complete 2025-12-04T09:35:36.1557390Z fabe466dd5f3: Verifying Checksum 2025-12-04T09:35:36.1557916Z fabe466dd5f3: Download complete 2025-12-04T09:35:36.2263640Z 2b5a11b41761: Verifying Checksum 2025-12-04T09:35:36.2264070Z 2b5a11b41761: Download complete 2025-12-04T09:35:36.3186443Z 9681563a88ff: Download complete 2025-12-04T09:35:36.3956875Z dc0780902fca: Verifying Checksum 2025-12-04T09:35:36.3957595Z dc0780902fca: Download complete 2025-12-04T09:35:36.8311553Z 63e5bc7682b8: Pull complete 2025-12-04T09:35:36.8525516Z 835841cca3b7: Pull complete 2025-12-04T09:35:37.7899065Z 1c6177b2970d: Verifying Checksum 2025-12-04T09:35:37.7899547Z 1c6177b2970d: Download complete 2025-12-04T09:35:37.7978264Z 4f4fb700ef54: Download complete 2025-12-04T09:35:37.9749572Z 5bfdaeb5578d: Verifying Checksum 2025-12-04T09:35:38.0588604Z 5bfdaeb5578d: Download complete 2025-12-04T09:35:38.0589116Z 0ef42867f370: Download complete 2025-12-04T09:35:38.1627416Z 446083e497f3: Verifying Checksum 2025-12-04T09:35:38.1627831Z 446083e497f3: Download complete 2025-12-04T09:35:38.4627717Z d8a170bef0f4: Download complete 2025-12-04T09:35:38.5773171Z e2b6cd6a5bd0: Verifying Checksum 2025-12-04T09:35:38.5773824Z e2b6cd6a5bd0: Download complete 2025-12-04T09:35:38.6715675Z 93efc0181a22: Download complete 2025-12-04T09:35:38.7590000Z 7454c938f174: Download complete 2025-12-04T09:35:38.8237541Z 4d57ff55f6d4: Verifying Checksum 2025-12-04T09:35:38.8237965Z 4d57ff55f6d4: Download complete 2025-12-04T09:35:40.8309054Z 1bf1bb125dea: Verifying Checksum 2025-12-04T09:35:40.8309699Z 1bf1bb125dea: Download complete 2025-12-04T09:35:40.9244273Z 1969e15d0c13: Download complete 2025-12-04T09:35:41.4012678Z 73180a0f2d5a: Verifying Checksum 2025-12-04T09:35:41.4013103Z 73180a0f2d5a: Download complete 2025-12-04T09:35:41.5227335Z ad81b25cb69f: Verifying Checksum 2025-12-04T09:35:41.5227824Z ad81b25cb69f: Download complete 2025-12-04T09:35:41.6150159Z 8165374f8dcc: Verifying Checksum 2025-12-04T09:35:41.6150575Z 8165374f8dcc: Download complete 2025-12-04T09:35:49.3075241Z 7779c0bb9be2: Verifying Checksum 2025-12-04T09:35:49.3076259Z 7779c0bb9be2: Download complete 2025-12-04T09:35:49.4228157Z 4d0a1c027262: Verifying Checksum 2025-12-04T09:35:49.4228588Z 4d0a1c027262: Download complete 2025-12-04T09:35:49.5226667Z a51e0dab2d59: Verifying Checksum 2025-12-04T09:35:49.5227082Z a51e0dab2d59: Download complete 2025-12-04T09:35:49.6205455Z 3eb6d4ff040b: Download complete 2025-12-04T09:35:49.6251528Z 1bf1bb125dea: Pull complete 2025-12-04T09:35:49.7343926Z b168858b8537: Verifying Checksum 2025-12-04T09:35:49.7344314Z b168858b8537: Download complete 2025-12-04T09:35:49.8325937Z b21856d1bf42: Pull complete 2025-12-04T09:35:49.9735781Z 848ba2c095e2: Pull complete 2025-12-04T09:35:50.1185787Z 029495b23122: Pull complete 2025-12-04T09:35:50.1349336Z d77a39278026: Verifying Checksum 2025-12-04T09:35:50.1349712Z d77a39278026: Download complete 2025-12-04T09:35:50.2363781Z 36fbd357280b: Verifying Checksum 2025-12-04T09:35:50.2364264Z 36fbd357280b: Download complete 2025-12-04T09:35:50.2878964Z 073bb82063cf: Pull complete 2025-12-04T09:35:50.3347767Z 4e3b10a5dd6a: Download complete 2025-12-04T09:35:50.4224791Z 3092fab73b59: Download complete 2025-12-04T09:35:50.4678773Z 59b639308833: Pull complete 2025-12-04T09:35:50.5099664Z 20020dd28a15: Verifying Checksum 2025-12-04T09:35:50.5100328Z 20020dd28a15: Download complete 2025-12-04T09:35:50.6178722Z ae5280ce969d: Verifying Checksum 2025-12-04T09:35:50.6179136Z ae5280ce969d: Download complete 2025-12-04T09:35:50.6923631Z 026e4484b749: Verifying Checksum 2025-12-04T09:35:50.6924054Z 026e4484b749: Download complete 2025-12-04T09:35:50.7838599Z 1be9da2ce53d: Verifying Checksum 2025-12-04T09:35:50.7839028Z 1be9da2ce53d: Download complete 2025-12-04T09:35:50.8630719Z 6481b7a1d9fb: Verifying Checksum 2025-12-04T09:35:50.8631196Z 6481b7a1d9fb: Download complete 2025-12-04T09:35:53.1641008Z 1c6177b2970d: Pull complete 2025-12-04T09:35:53.3823985Z fabe466dd5f3: Pull complete 2025-12-04T09:35:53.5913634Z 2b5a11b41761: Pull complete 2025-12-04T09:35:53.8132199Z 9681563a88ff: Pull complete 2025-12-04T09:35:54.0333771Z dc0780902fca: Pull complete 2025-12-04T09:35:55.7696088Z fa519d18c39d: Verifying Checksum 2025-12-04T09:35:55.7696521Z fa519d18c39d: Download complete 2025-12-04T09:36:25.4925533Z 5b09a2b135c8: Verifying Checksum 2025-12-04T09:36:25.4926003Z 5b09a2b135c8: Download complete 2025-12-04T09:36:25.5916508Z fd60ab6b1c2c: Verifying Checksum 2025-12-04T09:36:25.5916938Z fd60ab6b1c2c: Download complete 2025-12-04T09:36:25.6860779Z 0afe45579c2c: Verifying Checksum 2025-12-04T09:36:25.6861218Z 0afe45579c2c: Download complete 2025-12-04T09:36:25.7795741Z 5884ffd6720b: Verifying Checksum 2025-12-04T09:36:25.7796379Z 5884ffd6720b: Download complete 2025-12-04T09:36:25.8568429Z ab7a7c316fa7: Download complete 2025-12-04T09:36:25.9670803Z c7775ce5574b: Verifying Checksum 2025-12-04T09:36:25.9671248Z c7775ce5574b: Download complete 2025-12-04T09:36:26.0626744Z 81945c4fb228: Verifying Checksum 2025-12-04T09:36:26.0627180Z 81945c4fb228: Download complete 2025-12-04T09:36:26.1509788Z 663cbe24d60b: Verifying Checksum 2025-12-04T09:36:26.1510231Z 663cbe24d60b: Download complete 2025-12-04T09:36:26.2401212Z 43f216b02786: Download complete 2025-12-04T09:36:26.3439491Z c47c3cfeb687: Verifying Checksum 2025-12-04T09:36:26.3440032Z c47c3cfeb687: Download complete 2025-12-04T09:36:26.4267913Z 7d326b9e2673: Verifying Checksum 2025-12-04T09:36:26.4268632Z 7d326b9e2673: Download complete 2025-12-04T09:36:26.5247274Z 7ec8f17141c8: Verifying Checksum 2025-12-04T09:36:26.5247891Z 7ec8f17141c8: Download complete 2025-12-04T09:36:26.6229426Z 26249ea175bf: Verifying Checksum 2025-12-04T09:36:26.6230070Z 26249ea175bf: Download complete 2025-12-04T09:36:26.6831327Z 5e8e9ccb36f3: Verifying Checksum 2025-12-04T09:36:26.6833766Z 5e8e9ccb36f3: Download complete 2025-12-04T09:36:26.8162791Z 5bc72d4e1de8: Verifying Checksum 2025-12-04T09:36:26.8163315Z 5bc72d4e1de8: Download complete 2025-12-04T09:36:26.9027470Z 83cddbd49779: Verifying Checksum 2025-12-04T09:36:26.9027920Z 83cddbd49779: Download complete 2025-12-04T09:36:26.9755675Z 60c25d8c3dd2: Verifying Checksum 2025-12-04T09:36:26.9756342Z 60c25d8c3dd2: Download complete 2025-12-04T09:36:27.0918970Z a534dcf4b9a9: Download complete 2025-12-04T09:36:30.6489970Z 10138310c65c: Verifying Checksum 2025-12-04T09:36:30.7395996Z 8487679f252b: Verifying Checksum 2025-12-04T09:36:30.7396602Z 8487679f252b: Download complete 2025-12-04T09:36:30.8027054Z 52580ee2caa9: Verifying Checksum 2025-12-04T09:36:30.8027661Z 52580ee2caa9: Download complete 2025-12-04T09:36:30.9516687Z 741c215cb2ff: Verifying Checksum 2025-12-04T09:36:30.9517300Z 741c215cb2ff: Download complete 2025-12-04T09:36:31.0472078Z d17f5aba17a6: Download complete 2025-12-04T09:36:31.1304060Z bc08246bb4ba: Verifying Checksum 2025-12-04T09:36:31.1304475Z bc08246bb4ba: Download complete 2025-12-04T09:36:31.5527456Z 7323bf084bf9: Verifying Checksum 2025-12-04T09:36:31.5528156Z 7323bf084bf9: Download complete 2025-12-04T09:36:31.6601095Z d344ecc97fd7: Download complete 2025-12-04T09:36:32.5890107Z fb60b2d2147f: Verifying Checksum 2025-12-04T09:36:32.5890577Z fb60b2d2147f: Download complete 2025-12-04T09:36:46.7203514Z d172f25b97f7: Verifying Checksum 2025-12-04T09:36:46.7204082Z d172f25b97f7: Download complete 2025-12-04T09:37:17.3534475Z 5b09a2b135c8: Pull complete 2025-12-04T09:37:17.5779011Z 4f4fb700ef54: Pull complete 2025-12-04T09:37:17.8061939Z 5bfdaeb5578d: Pull complete 2025-12-04T09:37:18.0596463Z 0ef42867f370: Pull complete 2025-12-04T09:37:18.2759624Z 446083e497f3: Pull complete 2025-12-04T09:37:18.5548018Z d8a170bef0f4: Pull complete 2025-12-04T09:37:18.7689501Z e2b6cd6a5bd0: Pull complete 2025-12-04T09:37:18.9896485Z 93efc0181a22: Pull complete 2025-12-04T09:37:19.2161289Z 7454c938f174: Pull complete 2025-12-04T09:37:19.4334599Z 4d57ff55f6d4: Pull complete 2025-12-04T09:37:20.4975368Z b0301534b4a5: Verifying Checksum 2025-12-04T09:37:20.4975788Z b0301534b4a5: Download complete 2025-12-04T09:38:37.7474568Z b0301534b4a5: Pull complete 2025-12-04T09:38:37.9685577Z 1969e15d0c13: Pull complete 2025-12-04T09:38:38.7146947Z 73180a0f2d5a: Pull complete 2025-12-04T09:38:38.9463644Z ad81b25cb69f: Pull complete 2025-12-04T09:38:39.1542230Z 8165374f8dcc: Pull complete 2025-12-04T09:38:47.3463998Z 7779c0bb9be2: Pull complete 2025-12-04T09:38:47.5340044Z 4d0a1c027262: Pull complete 2025-12-04T09:38:47.7018470Z a51e0dab2d59: Pull complete 2025-12-04T09:38:48.1141314Z 3eb6d4ff040b: Pull complete 2025-12-04T09:38:48.3448492Z b168858b8537: Pull complete 2025-12-04T09:38:48.8087091Z d77a39278026: Pull complete 2025-12-04T09:38:49.0277302Z 36fbd357280b: Pull complete 2025-12-04T09:38:49.2557790Z 4e3b10a5dd6a: Pull complete 2025-12-04T09:38:49.6667700Z 3092fab73b59: Pull complete 2025-12-04T09:38:49.8942704Z 20020dd28a15: Pull complete 2025-12-04T09:38:50.1078753Z ae5280ce969d: Pull complete 2025-12-04T09:38:50.5154723Z 026e4484b749: Pull complete 2025-12-04T09:38:50.7266956Z 1be9da2ce53d: Pull complete 2025-12-04T09:38:51.1258993Z 6481b7a1d9fb: Pull complete 2025-12-04T09:38:52.9869152Z fa519d18c39d: Pull complete 2025-12-04T09:39:52.5518342Z d172f25b97f7: Pull complete 2025-12-04T09:39:52.7733952Z fd60ab6b1c2c: Pull complete 2025-12-04T09:39:52.9954191Z 0afe45579c2c: Pull complete 2025-12-04T09:39:53.3512115Z 5884ffd6720b: Pull complete 2025-12-04T09:39:53.7328956Z ab7a7c316fa7: Pull complete 2025-12-04T09:39:53.9479062Z c7775ce5574b: Pull complete 2025-12-04T09:39:54.3647223Z 81945c4fb228: Pull complete 2025-12-04T09:39:54.7774724Z 663cbe24d60b: Pull complete 2025-12-04T09:39:55.0099453Z 43f216b02786: Pull complete 2025-12-04T09:39:55.4235538Z c47c3cfeb687: Pull complete 2025-12-04T09:39:55.6715562Z 7d326b9e2673: Pull complete 2025-12-04T09:39:55.7794517Z 7ec8f17141c8: Pull complete 2025-12-04T09:39:56.0138938Z 26249ea175bf: Pull complete 2025-12-04T09:39:56.1125743Z 5e8e9ccb36f3: Pull complete 2025-12-04T09:39:56.4010089Z 5bc72d4e1de8: Pull complete 2025-12-04T09:39:56.5470164Z 83cddbd49779: Pull complete 2025-12-04T09:39:56.8899844Z 60c25d8c3dd2: Pull complete 2025-12-04T09:39:56.9263751Z a534dcf4b9a9: Pull complete 2025-12-04T09:40:03.4159886Z 10138310c65c: Pull complete 2025-12-04T09:40:03.6394256Z 8487679f252b: Pull complete 2025-12-04T09:40:03.8610022Z 52580ee2caa9: Pull complete 2025-12-04T09:40:04.0608072Z 741c215cb2ff: Pull complete 2025-12-04T09:40:04.2225738Z d17f5aba17a6: Pull complete 2025-12-04T09:40:04.3161316Z bc08246bb4ba: Pull complete 2025-12-04T09:40:05.8295409Z 7323bf084bf9: Pull complete 2025-12-04T09:40:05.9651143Z d344ecc97fd7: Pull complete 2025-12-04T09:40:07.8619460Z fb60b2d2147f: Pull complete 2025-12-04T09:40:08.2027395Z Digest: sha256:ae30f11a5b50741bd652aa0c94ad89ef791c4e50157eff642748620825cf7940 2025-12-04T09:40:08.2531577Z Status: Downloaded newer image for 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:40:08.2777434Z 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:40:08.2837991Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:40:08.2839178Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:40:08.2849383Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:40:08.2849830Z env: 2025-12-04T09:40:08.2850073Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:40:08.2850379Z ##[endgroup] 2025-12-04T09:40:08.3045027Z ##[group]Run pytorch/test-infra/.github/actions/setup-nvidia@main 2025-12-04T09:40:08.3045519Z with: 2025-12-04T09:40:08.3045777Z driver-version: 525.105.17 2025-12-04T09:40:08.3046082Z env: 2025-12-04T09:40:08.3046338Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:40:08.3046638Z ##[endgroup] 2025-12-04T09:40:08.3132098Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:40:08.3133205Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T09:40:08.3140627Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:40:08.3141068Z env: 2025-12-04T09:40:08.3141317Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:40:08.3141611Z ##[endgroup] 2025-12-04T09:40:08.3251698Z ##[group]Run set -euo pipefail 2025-12-04T09:40:08.3252087Z set -euo pipefail 2025-12-04T09:40:08.3252443Z  2025-12-04T09:40:08.3252680Z has_gpu=false 2025-12-04T09:40:08.3252976Z devices="" 2025-12-04T09:40:08.3253251Z  2025-12-04T09:40:08.3253561Z if command -v nvidia-smi >/dev/null 2>&1; then 2025-12-04T09:40:08.3254102Z  if nvidia-smi -L >/tmp/nvidia_devices 2>/dev/null; then 2025-12-04T09:40:08.3254563Z  has_gpu=true 2025-12-04T09:40:08.3254912Z  devices=$(cat /tmp/nvidia_devices) 2025-12-04T09:40:08.3255276Z  fi 2025-12-04T09:40:08.3255538Z fi 2025-12-04T09:40:08.3255781Z  2025-12-04T09:40:08.3256028Z if [ "$has_gpu" = false ]; then 2025-12-04T09:40:08.3256490Z  if ls /dev/nvidia* >/tmp/nvidia_devices 2>/dev/null; then 2025-12-04T09:40:08.3256946Z  has_gpu=true 2025-12-04T09:40:08.3257291Z  devices=$(cat /tmp/nvidia_devices) 2025-12-04T09:40:08.3257651Z  fi 2025-12-04T09:40:08.3257897Z fi 2025-12-04T09:40:08.3258139Z  2025-12-04T09:40:08.3258598Z if [ "$has_gpu" = false ] && command -v lspci >/dev/null 2>&1; then 2025-12-04T09:40:08.3259208Z  if lspci | grep -i 'nvidia' >/tmp/nvidia_devices 2>/dev/null; then 2025-12-04T09:40:08.3259702Z  has_gpu=true 2025-12-04T09:40:08.3260034Z  devices=$(cat /tmp/nvidia_devices) 2025-12-04T09:40:08.3260408Z  fi 2025-12-04T09:40:08.3260654Z fi 2025-12-04T09:40:08.3260876Z  2025-12-04T09:40:08.3261394Z printf 'HAS_NVIDIA=%s\n' "$has_gpu" >> "$GITHUB_OUTPUT" 2025-12-04T09:40:08.3262031Z printf 'DETECTED_DEVICES<> "$GITHUB_OUTPUT" 2025-12-04T09:40:08.3268272Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:40:08.3268704Z env: 2025-12-04T09:40:08.3268957Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:40:08.3269262Z ##[endgroup] 2025-12-04T09:40:09.8786718Z ##[group]Run if [ "${HAS_NVIDIA}" = "true" ]; then 2025-12-04T09:40:09.8787224Z if [ "${HAS_NVIDIA}" = "true" ]; then 2025-12-04T09:40:09.8787683Z  echo "HAS_NVIDIA_GPU=true" >> "${GITHUB_ENV}" 2025-12-04T09:40:09.8788497Z  echo "GPU_FLAG=--gpus all -e NVIDIA_DRIVER_CAPABILITIES=all" >> "${GITHUB_ENV}" 2025-12-04T09:40:09.8789050Z else 2025-12-04T09:40:09.8789366Z  echo "HAS_NVIDIA_GPU=false" >> "${GITHUB_ENV}" 2025-12-04T09:40:09.8789775Z fi 2025-12-04T09:40:09.8797220Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:40:09.8797669Z env: 2025-12-04T09:40:09.8797923Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:40:09.8798234Z HAS_NVIDIA: true 2025-12-04T09:40:09.8798486Z ##[endgroup] 2025-12-04T09:40:09.8896517Z ##[group]Run nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482 2025-12-04T09:40:09.8897004Z with: 2025-12-04T09:40:09.8897251Z timeout_minutes: 10 2025-12-04T09:40:09.8897542Z max_attempts: 3 2025-12-04T09:40:09.8930448Z command: # Is it disgusting to have a full shell script here in this github action? Sure # But is it the best way to make it so that this action relies on nothing else? Absolutely set -eou pipefail DISTRIBUTION=$(. /etc/os-release;echo $ID$VERSION_ID) DRIVER_FN="NVIDIA-Linux-x86_64-${DRIVER_VERSION}.run" install_nvidia_docker2_amzn2() { ( set -x # Needed for yum-config-manager sudo yum install -y yum-utils if [[ "${DISTRIBUTION}" == "amzn2023" ]] ; then YUM_REPO_URL="https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo" else # Amazon Linux 2 YUM_REPO_URL="https://nvidia.github.io/nvidia-docker/${DISTRIBUTION}/nvidia-docker.repo" fi sudo yum-config-manager --add-repo "${YUM_REPO_URL}" sudo yum install -y \ nvidia-container-toolkit-1.17.8 \ libnvidia-container-tools-1.17.8 \ libnvidia-container1-1.17.8 \ nvidia-container-toolkit-base-1.17.8 sudo systemctl restart docker ) } install_nvidia_docker2_ubuntu20() { ( set -x # Install nvidia-driver package if not installed status="$(dpkg-query -W --showformat='${db:Status-Status}' nvidia-docker2 2>&1)" if [ ! $? = 0 ] || [ ! "$status" = installed ]; then sudo apt-get install -y nvidia-container-toolkit-1.17.8 sudo systemctl restart docker fi ) } pre_install_nvidia_driver_amzn2() { ( # Purge any nvidia driver installed from RHEL repo sudo yum remove -y nvidia-driver-latest-dkms ) } install_nvidia_driver_common() { ( # Try to gather more information about the runner and its existing NVIDIA driver if any echo "Before installing NVIDIA driver" lspci lsmod modinfo nvidia || true HAS_NVIDIA_DRIVER=0 # Check if NVIDIA driver has already been installed if [ -x "$(command -v nvidia-smi)" ]; then set +e # The driver exists, check its version next. Also check only the first GPU if there are more than one of them # so that the same driver version is not print over multiple lines INSTALLED_DRIVER_VERSION=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0) NVIDIA_SMI_STATUS=$? if [ "$NVIDIA_SMI_STATUS" -ne 0 ] && [ "$NVIDIA_SMI_STATUS" -ne 14 ]; then echo "Failed to get NVIDIA driver version ($INSTALLED_DRIVER_VERSION). Continuing" elif [ "$INSTALLED_DRIVER_VERSION" != "$DRIVER_VERSION" ]; then echo "NVIDIA driver ($INSTALLED_DRIVER_VERSION) has been installed, but we expect to have $DRIVER_VERSION instead. Continuing" # Turn off persistent mode so that the installation script can unload the kernel module sudo killall nvidia-persistenced || true else HAS_NVIDIA_DRIVER=1 echo "NVIDIA driver ($INSTALLED_DRIVER_VERSION) has already been installed. Skipping NVIDIA driver installation" fi set -e fi if [ "$HAS_NVIDIA_DRIVER" -eq 0 ]; then # CAUTION: this may need to be updated in future if [ "${DISTRIBUTION}" != ubuntu20.04 ]; then sudo yum groupinstall -y "Development Tools" # ensure our kernel install is the same as our underlying kernel, # groupinstall "Development Tools" has a habit of mismatching kernel headers sudo yum install -y "kernel-devel-uname-r == $(uname -r)" sudo modprobe backlight fi sudo curl -fsL -o /tmp/nvidia_driver "https://s3.amazonaws.com/ossci-linux/nvidia_driver/$DRIVER_FN" set +e sudo /bin/bash /tmp/nvidia_driver -s --no-drm NVIDIA_INSTALLATION_STATUS=$? RESET_GPU=0 if [ "$NVIDIA_INSTALLATION_STATUS" -ne 0 ]; then sudo cat /var/log/nvidia-installer.log # Fail to install NVIDIA driver, try to reset the GPU RESET_GPU=1 elif [ -x "$(command -v nvidia-smi)" ]; then # Check again if nvidia-smi works even if the driver installation completes successfully INSTALLED_DRIVER_VERSION=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0) NVIDIA_SMI_STATUS=$? if [ "$NVIDIA_SMI_STATUS" -ne 0 ] && [ "$NVIDIA_SMI_STATUS" -ne 14 ]; then RESET_GPU=1 fi fi if [ "$RESET_GPU" -eq 1 ]; then NVIDIA_DEVICES=$(lspci -D | grep -i NVIDIA | cut -d' ' -f1) # The GPU can get stuck in a failure state if somehow the test crashs the GPU microcode. When this # happens, we'll try to reset all NVIDIA devices https://github.com/pytorch/pytorch/issues/88388 for PCI_ID in $NVIDIA_DEVICES; do DEVICE_ENABLED=$(cat /sys/bus/pci/devices/$PCI_ID/enable) echo "Reseting $PCI_ID (enabled state: $DEVICE_ENABLED)" # This requires sudo permission of course echo "1" | sudo tee /sys/bus/pci/devices/$PCI_ID/reset sleep 1 done fi sudo rm -fv /tmp/nvidia_driver set -e fi ) } post_install_nvidia_driver_common() { ( sudo modprobe nvidia || true echo "After installing NVIDIA driver" lspci lsmod modinfo nvidia || true ( set +e nvidia-smi # NB: Annoyingly, nvidia-smi command returns successfully with return code 0 even in # the case where the driver has already crashed as it still can get the driver version # and some basic information like the bus ID. However, the rest of the information # would be missing (ERR!), for example: # # +-----------------------------------------------------------------------------+ # | NVIDIA-SMI 525.89.02 Driver Version: 525.89.02 CUDA Version: 12.0 | # |-------------------------------+----------------------+----------------------+ # | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | # | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | # | | | MIG M. | # |===============================+======================+======================| # | 0 ERR! Off | 00000000:00:1E.0 Off | ERR! | # |ERR! ERR! ERR! ERR! / ERR! | 4184MiB / 23028MiB | ERR! Default | # | | | ERR! | # +-------------------------------+----------------------+----------------------+ # # +-----------------------------------------------------------------------------+ # | Processes: | # | GPU GI CI PID Type Process name GPU Memory | # | ID ID Usage | # |=============================================================================| # +-----------------------------------------------------------------------------+ # # This should be reported as a failure instead as it will guarantee to fail when # Docker tries to run with --gpus all # # So, the correct check here is to query one of the missing piece of info like # GPU name, so that the command can fail accordingly nvidia-smi --query-gpu=gpu_name --format=csv,noheader --id=0 NVIDIA_SMI_STATUS=$? # Allowable exit statuses for nvidia-smi, see: https://github.com/NVIDIA/gpu-operator/issues/285 if [ "$NVIDIA_SMI_STATUS" -eq 0 ] || [ "$NVIDIA_SMI_STATUS" -eq 14 ]; then echo "INFO: Ignoring allowed status ${NVIDIA_SMI_STATUS}" else echo "ERROR: nvidia-smi exited with unresolved status ${NVIDIA_SMI_STATUS}" exit ${NVIDIA_SMI_STATUS} fi set -e ) ) } install_nvidia_driver_amzn2() { ( set -x pre_install_nvidia_driver_amzn2 install_nvidia_driver_common post_install_nvidia_driver_common ) } install_nvidia_driver_ubuntu20() { ( set -x install_nvidia_driver_common post_install_nvidia_driver_common ) } echo "== Installing nvidia driver ${DRIVER_FN} ==" case "${DISTRIBUTION}" in amzn*) install_nvidia_driver_amzn2 ;; ubuntu20.04) install_nvidia_driver_ubuntu20 ;; *) echo "ERROR: Unknown distribution ${DISTRIBUTION}" exit 1 ;; esac # Install container toolkit based on distribution echo "== Installing nvidia container toolkit for ${DISTRIBUTION} ==" case "${DISTRIBUTION}" in amzn*) install_nvidia_docker2_amzn2 ;; ubuntu20.04) install_nvidia_docker2_ubuntu20 ;; *) echo "ERROR: Unknown distribution ${DISTRIBUTION}" exit 1 ;; esac # Fix https://github.com/NVIDIA/nvidia-docker/issues/1648 on runners with # more than one GPUs. This just needs to be run once. The command fails # on subsequent runs and complains that the mode is already on, but that's # ok sudo nvidia-persistenced || true # This should show persistence mode ON nvidia-smi # check if the container-toolkit is correctly installed and CUDA is available inside a container docker run --rm -t --gpus=all public.ecr.aws/docker/library/python:3.13 nvidia-smi 2025-12-04T09:40:09.8963609Z retry_wait_seconds: 10 2025-12-04T09:40:09.8963928Z polling_interval_seconds: 1 2025-12-04T09:40:09.8964249Z warning_on_retry: true 2025-12-04T09:40:09.8964561Z continue_on_error: false 2025-12-04T09:40:09.8964860Z env: 2025-12-04T09:40:09.8965087Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:40:09.8965389Z HAS_NVIDIA_GPU: true 2025-12-04T09:40:09.8965752Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:40:09.8966173Z DRIVER_VERSION: 525.105.17 2025-12-04T09:40:09.8966482Z ##[endgroup] 2025-12-04T09:40:10.0445820Z == Installing nvidia driver NVIDIA-Linux-x86_64-525.105.17.run == 2025-12-04T09:40:10.0446598Z + pre_install_nvidia_driver_amzn2 2025-12-04T09:40:10.0447725Z + sudo yum remove -y nvidia-driver-latest-dkms 2025-12-04T09:40:10.7816898Z No match for argument: nvidia-driver-latest-dkms 2025-12-04T09:40:10.7817394Z No packages marked for removal. 2025-12-04T09:40:10.7887155Z Dependencies resolved. 2025-12-04T09:40:10.7898489Z Nothing to do. 2025-12-04T09:40:10.7899279Z Complete! 2025-12-04T09:40:10.8921292Z + install_nvidia_driver_common 2025-12-04T09:40:10.8924127Z + echo 'Before installing NVIDIA driver' 2025-12-04T09:40:10.8926363Z Before installing NVIDIA driver 2025-12-04T09:40:10.8929062Z + lspci 2025-12-04T09:40:10.9924166Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-12-04T09:40:10.9924800Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-12-04T09:40:10.9925522Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-12-04T09:40:10.9926201Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 2025-12-04T09:40:10.9926849Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller 2025-12-04T09:40:10.9927516Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-12-04T09:40:10.9928155Z 00:1e.0 3D controller: NVIDIA Corporation TU104GL [Tesla T4] (rev a1) 2025-12-04T09:40:10.9928791Z 00:1f.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe SSD Controller 2025-12-04T09:40:10.9929293Z + lsmod 2025-12-04T09:40:10.9968512Z Module Size Used by 2025-12-04T09:40:10.9969138Z nvidia_uvm 1925120 0 2025-12-04T09:40:10.9969482Z nvidia 14286848 1 nvidia_uvm 2025-12-04T09:40:10.9969835Z drm 602112 1 nvidia 2025-12-04T09:40:10.9970222Z drm_panel_orientation_quirks 32768 1 drm 2025-12-04T09:40:10.9970612Z backlight 24576 1 drm 2025-12-04T09:40:10.9970952Z i2c_core 110592 2 nvidia,drm 2025-12-04T09:40:10.9971312Z xt_conntrack 16384 1 2025-12-04T09:40:10.9971632Z nft_chain_nat 16384 3 2025-12-04T09:40:10.9971953Z xt_MASQUERADE 20480 1 2025-12-04T09:40:10.9972470Z nf_nat 57344 2 nft_chain_nat,xt_MASQUERADE 2025-12-04T09:40:10.9972973Z nf_conntrack_netlink 57344 0 2025-12-04T09:40:10.9973477Z nf_conntrack 184320 4 xt_conntrack,nf_nat,nf_conntrack_netlink,xt_MASQUERADE 2025-12-04T09:40:10.9974022Z nf_defrag_ipv6 24576 1 nf_conntrack 2025-12-04T09:40:10.9974418Z nf_defrag_ipv4 16384 1 nf_conntrack 2025-12-04T09:40:10.9974835Z xfrm_user 57344 1 2025-12-04T09:40:10.9975169Z xfrm_algo 16384 1 xfrm_user 2025-12-04T09:40:10.9975523Z xt_addrtype 16384 2 2025-12-04T09:40:10.9975851Z nft_compat 20480 4 2025-12-04T09:40:10.9976234Z nf_tables 311296 57 nft_compat,nft_chain_nat 2025-12-04T09:40:10.9976747Z nfnetlink 20480 4 nft_compat,nf_conntrack_netlink,nf_tables 2025-12-04T09:40:10.9977229Z br_netfilter 36864 0 2025-12-04T09:40:10.9977578Z bridge 323584 1 br_netfilter 2025-12-04T09:40:10.9977948Z stp 16384 1 bridge 2025-12-04T09:40:10.9978380Z llc 16384 2 bridge,stp 2025-12-04T09:40:10.9978738Z overlay 167936 0 2025-12-04T09:40:10.9979051Z tls 139264 0 2025-12-04T09:40:10.9979347Z nls_ascii 16384 1 2025-12-04T09:40:10.9979662Z nls_cp437 20480 1 2025-12-04T09:40:10.9979974Z vfat 24576 1 2025-12-04T09:40:10.9980280Z fat 86016 1 vfat 2025-12-04T09:40:10.9980617Z sunrpc 700416 1 2025-12-04T09:40:10.9980918Z i8042 45056 0 2025-12-04T09:40:10.9981205Z ena 184320 0 2025-12-04T09:40:10.9981514Z skx_edac_common 28672 0 2025-12-04T09:40:10.9981831Z serio 28672 3 i8042 2025-12-04T09:40:10.9982161Z ghash_clmulni_intel 16384 0 2025-12-04T09:40:10.9982481Z button 24576 0 2025-12-04T09:40:10.9982796Z sch_fq_codel 20480 17 2025-12-04T09:40:10.9983279Z dm_mod 188416 0 2025-12-04T09:40:10.9983589Z fuse 184320 1 2025-12-04T09:40:10.9983902Z loop 36864 0 2025-12-04T09:40:10.9984217Z configfs 57344 1 2025-12-04T09:40:10.9984518Z dmi_sysfs 20480 0 2025-12-04T09:40:10.9984837Z crc32_pclmul 16384 0 2025-12-04T09:40:10.9985150Z crc32c_intel 24576 0 2025-12-04T09:40:10.9985455Z efivarfs 24576 1 2025-12-04T09:40:10.9985775Z + modinfo nvidia 2025-12-04T09:40:10.9987658Z filename: /lib/modules/6.1.150-174.273.amzn2023.x86_64/kernel/drivers/video/nvidia.ko 2025-12-04T09:40:10.9988631Z import_ns: DMA_BUF 2025-12-04T09:40:10.9989049Z alias: char-major-195-* 2025-12-04T09:40:10.9989522Z version: 580.82.07 2025-12-04T09:40:10.9989946Z supported: external 2025-12-04T09:40:10.9990473Z license: Dual MIT/GPL 2025-12-04T09:40:10.9991011Z firmware: nvidia/580.82.07/gsp_tu10x.bin 2025-12-04T09:40:10.9991684Z firmware: nvidia/580.82.07/gsp_ga10x.bin 2025-12-04T09:40:10.9992280Z srcversion: BA7240A71DCF7DC6FE88C1D 2025-12-04T09:40:10.9992893Z alias: of:N*T*Cnvidia,tegra264-displayC* 2025-12-04T09:40:10.9993572Z alias: of:N*T*Cnvidia,tegra264-display 2025-12-04T09:40:10.9994270Z alias: of:N*T*Cnvidia,tegra234-displayC* 2025-12-04T09:40:10.9995001Z alias: of:N*T*Cnvidia,tegra234-display 2025-12-04T09:40:10.9996010Z alias: pci:v000010DEd*sv*sd*bc06sc80i00* 2025-12-04T09:40:10.9996740Z alias: pci:v000010DEd*sv*sd*bc03sc02i00* 2025-12-04T09:40:10.9997448Z alias: pci:v000010DEd*sv*sd*bc03sc00i00* 2025-12-04T09:40:10.9998097Z depends: i2c-core,drm 2025-12-04T09:40:10.9998587Z retpoline: Y 2025-12-04T09:40:10.9998993Z name: nvidia 2025-12-04T09:40:10.9999731Z vermagic: 6.1.150-174.273.amzn2023.x86_64 SMP preempt mod_unload modversions 2025-12-04T09:40:11.0000774Z parm: NvSwitchRegDwords:NvSwitch regkey (charp) 2025-12-04T09:40:11.0001747Z parm: NvSwitchBlacklist:NvSwitchBlacklist=uuid[,uuid...] (charp) 2025-12-04T09:40:11.0002695Z parm: NVreg_ResmanDebugLevel:int 2025-12-04T09:40:11.0003305Z parm: NVreg_RmLogonRC:int 2025-12-04T09:40:11.0003937Z parm: NVreg_ModifyDeviceFiles:int 2025-12-04T09:40:11.0004519Z parm: NVreg_DeviceFileUID:int 2025-12-04T09:40:11.0005148Z parm: NVreg_DeviceFileGID:int 2025-12-04T09:40:11.0005805Z parm: NVreg_DeviceFileMode:int 2025-12-04T09:40:11.0006485Z parm: NVreg_InitializeSystemMemoryAllocations:int 2025-12-04T09:40:11.0007293Z parm: NVreg_UsePageAttributeTable:int 2025-12-04T09:40:11.0007953Z parm: NVreg_EnablePCIeGen3:int 2025-12-04T09:40:11.0008571Z parm: NVreg_EnableMSI:int 2025-12-04T09:40:11.0009060Z parm: NVreg_EnableStreamMemOPs:int 2025-12-04T09:40:11.0009508Z parm: NVreg_RestrictProfilingToAdminUsers:int 2025-12-04T09:40:11.0010011Z parm: NVreg_PreserveVideoMemoryAllocations:int 2025-12-04T09:40:11.0010465Z parm: NVreg_EnableS0ixPowerManagement:int 2025-12-04T09:40:11.0010970Z parm: NVreg_S0ixPowerManagementVideoMemoryThreshold:int 2025-12-04T09:40:11.0011473Z parm: NVreg_DynamicPowerManagement:int 2025-12-04T09:40:11.0011974Z parm: NVreg_DynamicPowerManagementVideoMemoryThreshold:int 2025-12-04T09:40:11.0012481Z parm: NVreg_EnableGpuFirmware:int 2025-12-04T09:40:11.0012904Z parm: NVreg_EnableGpuFirmwareLogs:int 2025-12-04T09:40:11.0013345Z parm: NVreg_OpenRmEnableUnsupportedGpus:int 2025-12-04T09:40:11.0013805Z parm: NVreg_EnableUserNUMAManagement:int 2025-12-04T09:40:11.0014233Z parm: NVreg_MemoryPoolSize:int 2025-12-04T09:40:11.0014636Z parm: NVreg_KMallocHeapMaxSize:int 2025-12-04T09:40:11.0015037Z parm: NVreg_VMallocHeapMaxSize:int 2025-12-04T09:40:11.0015440Z parm: NVreg_IgnoreMMIOCheck:int 2025-12-04T09:40:11.0016042Z parm: NVreg_NvLinkDisable:int 2025-12-04T09:40:11.0016456Z parm: NVreg_EnablePCIERelaxedOrderingMode:int 2025-12-04T09:40:11.0016905Z parm: NVreg_RegisterPCIDriver:int 2025-12-04T09:40:11.0017343Z parm: NVreg_RegisterPlatformDeviceDriver:int 2025-12-04T09:40:11.0017776Z parm: NVreg_EnableResizableBar:int 2025-12-04T09:40:11.0018299Z parm: NVreg_EnableDbgBreakpoint:int 2025-12-04T09:40:11.0018742Z parm: NVreg_EnableNonblockingOpen:int 2025-12-04T09:40:11.0019173Z parm: NVreg_CoherentGPUMemoryMode:charp 2025-12-04T09:40:11.0019603Z parm: NVreg_RegistryDwords:charp 2025-12-04T09:40:11.0020031Z parm: NVreg_RegistryDwordsPerDevice:charp 2025-12-04T09:40:11.0020457Z parm: NVreg_RmMsg:charp 2025-12-04T09:40:11.0020809Z parm: NVreg_GpuBlacklist:charp 2025-12-04T09:40:11.0021216Z parm: NVreg_TemporaryFilePath:charp 2025-12-04T09:40:11.0021626Z parm: NVreg_ExcludedGpus:charp 2025-12-04T09:40:11.0022013Z parm: NVreg_DmaRemapPeerMmio:int 2025-12-04T09:40:11.0022425Z parm: NVreg_RmNvlinkBandwidth:charp 2025-12-04T09:40:11.0022873Z parm: NVreg_RmNvlinkBandwidthLinkCount:int 2025-12-04T09:40:11.0023298Z parm: NVreg_ImexChannelCount:int 2025-12-04T09:40:11.0023709Z parm: NVreg_CreateImexChannel0:int 2025-12-04T09:40:11.0024147Z parm: NVreg_GrdmaPciTopoCheckOverride:int 2025-12-04T09:40:11.0024662Z parm: rm_firmware_active:charp 2025-12-04T09:40:11.0025016Z + HAS_NVIDIA_DRIVER=0 2025-12-04T09:40:11.0025337Z ++ command -v nvidia-smi 2025-12-04T09:40:11.0025659Z + '[' -x /usr/bin/nvidia-smi ']' 2025-12-04T09:40:11.0025968Z + set +e 2025-12-04T09:40:11.0026357Z ++ nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0 2025-12-04T09:40:12.5134406Z + INSTALLED_DRIVER_VERSION=580.82.07 2025-12-04T09:40:12.5134844Z + NVIDIA_SMI_STATUS=0 2025-12-04T09:40:12.5135469Z + '[' 0 -ne 0 ']' 2025-12-04T09:40:12.5135755Z + '[' 580.82.07 '!=' 525.105.17 ']' 2025-12-04T09:40:12.5136379Z + echo 'NVIDIA driver (580.82.07) has been installed, but we expect to have 525.105.17 instead. Continuing' 2025-12-04T09:40:12.5137037Z + sudo killall nvidia-persistenced 2025-12-04T09:40:12.5137628Z NVIDIA driver (580.82.07) has been installed, but we expect to have 525.105.17 instead. Continuing 2025-12-04T09:40:12.7334455Z nvidia-persistenced: no process found 2025-12-04T09:40:12.7352573Z + true 2025-12-04T09:40:12.7352866Z + set -e 2025-12-04T09:40:12.7353112Z + '[' 0 -eq 0 ']' 2025-12-04T09:40:12.7353380Z + '[' amzn2023 '!=' ubuntu20.04 ']' 2025-12-04T09:40:12.7353775Z + sudo yum groupinstall -y 'Development Tools' 2025-12-04T09:40:13.2665991Z Last metadata expiration check: 0:23:12 ago on Thu Dec 4 09:17:01 2025. 2025-12-04T09:40:13.3103931Z No match for group package "system-rpm-config" 2025-12-04T09:40:13.3124530Z No match for group package "rcs" 2025-12-04T09:40:13.3150553Z No match for group package "pkgconfig" 2025-12-04T09:40:13.3729690Z Dependencies resolved. 2025-12-04T09:40:13.4056188Z ================================================================================ 2025-12-04T09:40:13.4057184Z Package Architecture Version Repository Size 2025-12-04T09:40:13.4057699Z ================================================================================ 2025-12-04T09:40:13.4058092Z Installing Groups: 2025-12-04T09:40:13.4058555Z Development Tools 2025-12-04T09:40:13.4058904Z 2025-12-04T09:40:13.4059024Z Transaction Summary 2025-12-04T09:40:13.4059319Z ================================================================================ 2025-12-04T09:40:13.4059602Z 2025-12-04T09:40:14.2797223Z ================================================================================ 2025-12-04T09:40:14.2798002Z WARNING: 2025-12-04T09:40:14.2798291Z A newer release of "Amazon Linux" is available. 2025-12-04T09:40:14.2798596Z 2025-12-04T09:40:14.2799730Z Available Versions: 2025-12-04T09:40:14.2799914Z 2025-12-04T09:40:14.2800041Z Version 2023.9.20250929: 2025-12-04T09:40:14.2800431Z Run the following command to upgrade to 2023.9.20250929: 2025-12-04T09:40:14.2800754Z 2025-12-04T09:40:14.2800905Z dnf upgrade --releasever=2023.9.20250929 2025-12-04T09:40:14.2801184Z 2025-12-04T09:40:14.2801288Z Release notes: 2025-12-04T09:40:14.2801819Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20250929.html 2025-12-04T09:40:14.2802294Z 2025-12-04T09:40:14.2802400Z Version 2023.9.20251014: 2025-12-04T09:40:14.2802785Z Run the following command to upgrade to 2023.9.20251014: 2025-12-04T09:40:14.2803115Z 2025-12-04T09:40:14.2803255Z dnf upgrade --releasever=2023.9.20251014 2025-12-04T09:40:14.2803527Z 2025-12-04T09:40:14.2803698Z Release notes: 2025-12-04T09:40:14.2804633Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251014.html 2025-12-04T09:40:14.2805145Z 2025-12-04T09:40:14.2805250Z Version 2023.9.20251020: 2025-12-04T09:40:14.2805634Z Run the following command to upgrade to 2023.9.20251020: 2025-12-04T09:40:14.2805951Z 2025-12-04T09:40:14.2806087Z dnf upgrade --releasever=2023.9.20251020 2025-12-04T09:40:14.2806363Z 2025-12-04T09:40:14.2806464Z Release notes: 2025-12-04T09:40:14.2806959Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251020.html 2025-12-04T09:40:14.2807429Z 2025-12-04T09:40:14.2807821Z Version 2023.9.20251027: 2025-12-04T09:40:14.2808195Z Run the following command to upgrade to 2023.9.20251027: 2025-12-04T09:40:14.2808526Z 2025-12-04T09:40:14.2808662Z dnf upgrade --releasever=2023.9.20251027 2025-12-04T09:40:14.2808937Z 2025-12-04T09:40:14.2809036Z Release notes: 2025-12-04T09:40:14.2809539Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251027.html 2025-12-04T09:40:14.2810016Z 2025-12-04T09:40:14.2810121Z Version 2023.9.20251105: 2025-12-04T09:40:14.2810508Z Run the following command to upgrade to 2023.9.20251105: 2025-12-04T09:40:14.2810820Z 2025-12-04T09:40:14.2810967Z dnf upgrade --releasever=2023.9.20251105 2025-12-04T09:40:14.2811227Z 2025-12-04T09:40:14.2811327Z Release notes: 2025-12-04T09:40:14.2811821Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251105.html 2025-12-04T09:40:14.2812301Z 2025-12-04T09:40:14.2812406Z Version 2023.9.20251110: 2025-12-04T09:40:14.2812790Z Run the following command to upgrade to 2023.9.20251110: 2025-12-04T09:40:14.2813103Z 2025-12-04T09:40:14.2813239Z dnf upgrade --releasever=2023.9.20251110 2025-12-04T09:40:14.2813510Z 2025-12-04T09:40:14.2813611Z Release notes: 2025-12-04T09:40:14.2814098Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251110.html 2025-12-04T09:40:14.2814565Z 2025-12-04T09:40:14.2814682Z Version 2023.9.20251117: 2025-12-04T09:40:14.2815049Z Run the following command to upgrade to 2023.9.20251117: 2025-12-04T09:40:14.2815382Z 2025-12-04T09:40:14.2815518Z dnf upgrade --releasever=2023.9.20251117 2025-12-04T09:40:14.2815777Z 2025-12-04T09:40:14.2815889Z Release notes: 2025-12-04T09:40:14.2816367Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251117.html 2025-12-04T09:40:14.2816848Z 2025-12-04T09:40:14.2816978Z ================================================================================ 2025-12-04T09:40:14.2817373Z Complete! 2025-12-04T09:40:14.3625909Z ++ uname -r 2025-12-04T09:40:14.3636958Z + sudo yum install -y 'kernel-devel-uname-r == 6.1.150-174.273.amzn2023.x86_64' 2025-12-04T09:40:14.8883657Z Last metadata expiration check: 0:23:13 ago on Thu Dec 4 09:17:01 2025. 2025-12-04T09:40:14.9180676Z Using '==' operator in reldeps can result in an undefined behavior. It is deprecated and the support will be dropped in future versions. Use '=' operator instead. 2025-12-04T09:40:14.9303804Z Package kernel-devel-1:6.1.150-174.273.amzn2023.x86_64 is already installed. 2025-12-04T09:40:14.9914003Z Dependencies resolved. 2025-12-04T09:40:15.0243467Z Nothing to do. 2025-12-04T09:40:15.0243841Z Complete! 2025-12-04T09:40:15.1252754Z + sudo modprobe backlight 2025-12-04T09:40:15.3276071Z + sudo curl -fsL -o /tmp/nvidia_driver https://s3.amazonaws.com/ossci-linux/nvidia_driver/NVIDIA-Linux-x86_64-525.105.17.run 2025-12-04T09:40:19.6931264Z + set +e 2025-12-04T09:40:19.6931642Z + sudo /bin/bash /tmp/nvidia_driver -s --no-drm 2025-12-04T09:40:21.2049829Z Verifying archive integrity... OK 2025-12-04T09:40:48.3752338Z Uncompressing NVIDIA Accelerated Graphics Driver for Linux-x86_64 525.105.17................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 2025-12-04T09:40:49.0347481Z 2025-12-04T09:40:49.0348391Z WARNING: The nvidia-drm module will not be installed. As a result, DRM-KMS will not function with this installation of the NVIDIA driver. 2025-12-04T09:40:49.0349115Z 2025-12-04T09:41:15.0842857Z 2025-12-04T09:41:15.0844780Z WARNING: nvidia-installer was forced to guess the X library path '/usr/lib64' and X module path '/usr/lib64/xorg/modules'; these paths were not queryable from the system. If X fails to find the NVIDIA X driver module, please install the `pkg-config` utility and the X.Org SDK/development package for your distribution and reinstall the driver. 2025-12-04T09:41:15.0846493Z 2025-12-04T09:41:15.0863588Z 2025-12-04T09:41:15.0865520Z WARNING: This NVIDIA driver package includes Vulkan components, but no Vulkan ICD loader was detected on this system. The NVIDIA Vulkan ICD will not function without the loader. Most distributions package the Vulkan loader; try installing the "vulkan-loader", "vulkan-icd-loader", or "libvulkan1" package. 2025-12-04T09:41:15.0867020Z 2025-12-04T09:41:26.5905655Z + NVIDIA_INSTALLATION_STATUS=0 2025-12-04T09:41:26.5906056Z + RESET_GPU=0 2025-12-04T09:41:26.5906320Z + '[' 0 -ne 0 ']' 2025-12-04T09:41:26.5906634Z ++ command -v nvidia-smi 2025-12-04T09:41:26.5909788Z + '[' -x /usr/bin/nvidia-smi ']' 2025-12-04T09:41:26.5914885Z ++ nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0 2025-12-04T09:41:29.1421291Z + INSTALLED_DRIVER_VERSION=525.105.17 2025-12-04T09:41:29.1421728Z + NVIDIA_SMI_STATUS=0 2025-12-04T09:41:29.1422030Z + '[' 0 -ne 0 ']' 2025-12-04T09:41:29.1422279Z + '[' 0 -eq 1 ']' 2025-12-04T09:41:29.1422566Z + sudo rm -fv /tmp/nvidia_driver 2025-12-04T09:41:29.3287462Z removed '/tmp/nvidia_driver' 2025-12-04T09:41:29.3305071Z + set -e 2025-12-04T09:41:29.3307683Z + post_install_nvidia_driver_common 2025-12-04T09:41:29.3311366Z + sudo modprobe nvidia 2025-12-04T09:41:29.5267327Z + echo 'After installing NVIDIA driver' 2025-12-04T09:41:29.5267846Z + lspci 2025-12-04T09:41:29.5268276Z After installing NVIDIA driver 2025-12-04T09:41:29.5400751Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-12-04T09:41:29.5401406Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-12-04T09:41:29.5402106Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-12-04T09:41:29.5403080Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 2025-12-04T09:41:29.5403698Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller 2025-12-04T09:41:29.5404378Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-12-04T09:41:29.5405002Z 00:1e.0 3D controller: NVIDIA Corporation TU104GL [Tesla T4] (rev a1) 2025-12-04T09:41:29.5405636Z 00:1f.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe SSD Controller 2025-12-04T09:41:29.5406162Z + lsmod 2025-12-04T09:41:29.5432513Z Module Size Used by 2025-12-04T09:41:29.5432859Z nvidia 56537088 0 2025-12-04T09:41:29.5433182Z drm 602112 1 nvidia 2025-12-04T09:41:29.5433568Z drm_panel_orientation_quirks 32768 1 drm 2025-12-04T09:41:29.5433943Z backlight 24576 1 drm 2025-12-04T09:41:29.5434299Z i2c_core 110592 2 nvidia,drm 2025-12-04T09:41:29.5434659Z xt_conntrack 16384 1 2025-12-04T09:41:29.5434987Z nft_chain_nat 16384 3 2025-12-04T09:41:29.5435312Z xt_MASQUERADE 20480 1 2025-12-04T09:41:29.5435682Z nf_nat 57344 2 nft_chain_nat,xt_MASQUERADE 2025-12-04T09:41:29.5436087Z nf_conntrack_netlink 57344 0 2025-12-04T09:41:29.5436594Z nf_conntrack 184320 4 xt_conntrack,nf_nat,nf_conntrack_netlink,xt_MASQUERADE 2025-12-04T09:41:29.5437151Z nf_defrag_ipv6 24576 1 nf_conntrack 2025-12-04T09:41:29.5437714Z nf_defrag_ipv4 16384 1 nf_conntrack 2025-12-04T09:41:29.5438072Z xfrm_user 57344 1 2025-12-04T09:41:29.5438405Z xfrm_algo 16384 1 xfrm_user 2025-12-04T09:41:29.5438765Z xt_addrtype 16384 2 2025-12-04T09:41:29.5439073Z nft_compat 20480 4 2025-12-04T09:41:29.5439453Z nf_tables 311296 57 nft_compat,nft_chain_nat 2025-12-04T09:41:29.5439973Z nfnetlink 20480 4 nft_compat,nf_conntrack_netlink,nf_tables 2025-12-04T09:41:29.5440437Z br_netfilter 36864 0 2025-12-04T09:41:29.5440789Z bridge 323584 1 br_netfilter 2025-12-04T09:41:29.5441160Z stp 16384 1 bridge 2025-12-04T09:41:29.5441516Z llc 16384 2 bridge,stp 2025-12-04T09:41:29.5441876Z overlay 167936 0 2025-12-04T09:41:29.5442190Z tls 139264 0 2025-12-04T09:41:29.5442501Z nls_ascii 16384 1 2025-12-04T09:41:29.5442800Z nls_cp437 20480 1 2025-12-04T09:41:29.5443116Z vfat 24576 1 2025-12-04T09:41:29.5443427Z fat 86016 1 vfat 2025-12-04T09:41:29.5443788Z sunrpc 700416 1 2025-12-04T09:41:29.5444097Z i8042 45056 0 2025-12-04T09:41:29.5444401Z ena 184320 0 2025-12-04T09:41:29.5444702Z skx_edac_common 28672 0 2025-12-04T09:41:29.5445028Z serio 28672 3 i8042 2025-12-04T09:41:29.5445373Z ghash_clmulni_intel 16384 0 2025-12-04T09:41:29.5445698Z button 24576 0 2025-12-04T09:41:29.5446008Z sch_fq_codel 20480 17 2025-12-04T09:41:29.5446332Z dm_mod 188416 0 2025-12-04T09:41:29.5446644Z fuse 184320 1 2025-12-04T09:41:29.5446939Z loop 36864 0 2025-12-04T09:41:29.5447255Z configfs 57344 1 2025-12-04T09:41:29.5447569Z dmi_sysfs 20480 0 2025-12-04T09:41:29.5447869Z crc32_pclmul 16384 0 2025-12-04T09:41:29.5448183Z crc32c_intel 24576 0 2025-12-04T09:41:29.5448501Z efivarfs 24576 1 2025-12-04T09:41:29.5448795Z + modinfo nvidia 2025-12-04T09:41:29.5449554Z filename: /lib/modules/6.1.150-174.273.amzn2023.x86_64/kernel/drivers/video/nvidia.ko 2025-12-04T09:41:29.5450164Z firmware: nvidia/525.105.17/gsp_tu10x.bin 2025-12-04T09:41:29.5450595Z firmware: nvidia/525.105.17/gsp_ad10x.bin 2025-12-04T09:41:29.5450982Z alias: char-major-195-* 2025-12-04T09:41:29.5451319Z version: 525.105.17 2025-12-04T09:41:29.5451623Z supported: external 2025-12-04T09:41:29.5452007Z license: NVIDIA 2025-12-04T09:41:29.5452314Z srcversion: 98F82D76E0EF3952EEE57A7 2025-12-04T09:41:29.5452716Z alias: pci:v000010DEd*sv*sd*bc06sc80i00* 2025-12-04T09:41:29.5453131Z alias: pci:v000010DEd*sv*sd*bc03sc02i00* 2025-12-04T09:41:29.5453552Z alias: pci:v000010DEd*sv*sd*bc03sc00i00* 2025-12-04T09:41:29.5453943Z depends: i2c-core,drm 2025-12-04T09:41:29.5454265Z retpoline: Y 2025-12-04T09:41:29.5454524Z name: nvidia 2025-12-04T09:41:29.5454970Z vermagic: 6.1.150-174.273.amzn2023.x86_64 SMP preempt mod_unload modversions 2025-12-04T09:41:29.5455565Z parm: NvSwitchRegDwords:NvSwitch regkey (charp) 2025-12-04T09:41:29.5456113Z parm: NvSwitchBlacklist:NvSwitchBlacklist=uuid[,uuid...] (charp) 2025-12-04T09:41:29.5456640Z parm: NVreg_ResmanDebugLevel:int 2025-12-04T09:41:29.5457026Z parm: NVreg_RmLogonRC:int 2025-12-04T09:41:29.5457388Z parm: NVreg_ModifyDeviceFiles:int 2025-12-04T09:41:29.5457787Z parm: NVreg_DeviceFileUID:int 2025-12-04T09:41:29.5458254Z parm: NVreg_DeviceFileGID:int 2025-12-04T09:41:29.5458649Z parm: NVreg_DeviceFileMode:int 2025-12-04T09:41:29.5459086Z parm: NVreg_InitializeSystemMemoryAllocations:int 2025-12-04T09:41:29.5459571Z parm: NVreg_UsePageAttributeTable:int 2025-12-04T09:41:29.5459995Z parm: NVreg_EnablePCIeGen3:int 2025-12-04T09:41:29.5460458Z parm: NVreg_EnableMSI:int 2025-12-04T09:41:29.5460827Z parm: NVreg_TCEBypassMode:int 2025-12-04T09:41:29.5461229Z parm: NVreg_EnableStreamMemOPs:int 2025-12-04T09:41:29.5461681Z parm: NVreg_RestrictProfilingToAdminUsers:int 2025-12-04T09:41:29.5462160Z parm: NVreg_PreserveVideoMemoryAllocations:int 2025-12-04T09:41:29.5462633Z parm: NVreg_EnableS0ixPowerManagement:int 2025-12-04T09:41:29.5463145Z parm: NVreg_S0ixPowerManagementVideoMemoryThreshold:int 2025-12-04T09:41:29.5463642Z parm: NVreg_DynamicPowerManagement:int 2025-12-04T09:41:29.5464157Z parm: NVreg_DynamicPowerManagementVideoMemoryThreshold:int 2025-12-04T09:41:29.5464667Z parm: NVreg_EnableGpuFirmware:int 2025-12-04T09:41:29.5465076Z parm: NVreg_EnableGpuFirmwareLogs:int 2025-12-04T09:41:29.5465533Z parm: NVreg_OpenRmEnableUnsupportedGpus:int 2025-12-04T09:41:29.5465999Z parm: NVreg_EnableUserNUMAManagement:int 2025-12-04T09:41:29.5466431Z parm: NVreg_MemoryPoolSize:int 2025-12-04T09:41:29.5466818Z parm: NVreg_KMallocHeapMaxSize:int 2025-12-04T09:41:29.5467230Z parm: NVreg_VMallocHeapMaxSize:int 2025-12-04T09:41:29.5467640Z parm: NVreg_IgnoreMMIOCheck:int 2025-12-04T09:41:29.5468015Z parm: NVreg_NvLinkDisable:int 2025-12-04T09:41:29.5468442Z parm: NVreg_EnablePCIERelaxedOrderingMode:int 2025-12-04T09:41:29.5468885Z parm: NVreg_RegisterPCIDriver:int 2025-12-04T09:41:29.5469283Z parm: NVreg_EnableDbgBreakpoint:int 2025-12-04T09:41:29.5469702Z parm: NVreg_RegistryDwords:charp 2025-12-04T09:41:29.5470125Z parm: NVreg_RegistryDwordsPerDevice:charp 2025-12-04T09:41:29.5470525Z parm: NVreg_RmMsg:charp 2025-12-04T09:41:29.5470883Z parm: NVreg_GpuBlacklist:charp 2025-12-04T09:41:29.5471286Z parm: NVreg_TemporaryFilePath:charp 2025-12-04T09:41:29.5471698Z parm: NVreg_ExcludedGpus:charp 2025-12-04T09:41:29.5472083Z parm: NVreg_DmaRemapPeerMmio:int 2025-12-04T09:41:29.5472476Z parm: rm_firmware_active:charp 2025-12-04T09:41:29.5472828Z + set +e 2025-12-04T09:41:29.5473052Z + nvidia-smi 2025-12-04T09:41:31.5043179Z Thu Dec 4 09:41:31 2025 2025-12-04T09:41:31.5043959Z +-----------------------------------------------------------------------------+ 2025-12-04T09:41:31.5044940Z | NVIDIA-SMI 525.105.17 Driver Version: 525.105.17 CUDA Version: 12.0 | 2025-12-04T09:41:31.5045761Z |-------------------------------+----------------------+----------------------+ 2025-12-04T09:41:31.5046971Z | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | 2025-12-04T09:41:31.5047915Z | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | 2025-12-04T09:41:31.5048697Z | | | MIG M. | 2025-12-04T09:41:31.5049279Z |===============================+======================+======================| 2025-12-04T09:41:31.5121402Z | 0 Tesla T4 Off | 00000000:00:1E.0 Off | 0 | 2025-12-04T09:41:31.5122286Z | N/A 25C P0 25W / 70W | 2MiB / 15360MiB | 4% Default | 2025-12-04T09:41:31.5122935Z | | | N/A | 2025-12-04T09:41:31.5123701Z +-------------------------------+----------------------+----------------------+ 2025-12-04T09:41:31.5124500Z 2025-12-04T09:41:31.5125240Z +-----------------------------------------------------------------------------+ 2025-12-04T09:41:31.5125961Z | Processes: | 2025-12-04T09:41:31.5126793Z | GPU GI CI PID Type Process name GPU Memory | 2025-12-04T09:41:31.5127482Z | ID ID Usage | 2025-12-04T09:41:31.5128385Z |=============================================================================| 2025-12-04T09:41:31.5129119Z | No running processes found | 2025-12-04T09:41:31.5129890Z +-----------------------------------------------------------------------------+ 2025-12-04T09:41:31.9517122Z + nvidia-smi --query-gpu=gpu_name --format=csv,noheader --id=0 2025-12-04T09:41:33.9142534Z Tesla T4 2025-12-04T09:41:34.3147377Z + NVIDIA_SMI_STATUS=0 2025-12-04T09:41:34.3147871Z + '[' 0 -eq 0 ']' 2025-12-04T09:41:34.3148393Z + echo 'INFO: Ignoring allowed status 0' 2025-12-04T09:41:34.3148786Z + set -e 2025-12-04T09:41:34.3149029Z INFO: Ignoring allowed status 0 2025-12-04T09:41:34.3155489Z == Installing nvidia container toolkit for amzn2023 == 2025-12-04T09:41:34.3159191Z + sudo yum install -y yum-utils 2025-12-04T09:41:34.8626794Z Last metadata expiration check: 0:24:33 ago on Thu Dec 4 09:17:01 2025. 2025-12-04T09:41:34.8947376Z Package dnf-utils-4.3.0-13.amzn2023.0.5.noarch is already installed. 2025-12-04T09:41:34.9553814Z Dependencies resolved. 2025-12-04T09:41:34.9889747Z Nothing to do. 2025-12-04T09:41:34.9890129Z Complete! 2025-12-04T09:41:35.0876736Z + [[ amzn2023 == \a\m\z\n\2\0\2\3 ]] 2025-12-04T09:41:35.0877566Z + YUM_REPO_URL=https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo 2025-12-04T09:41:35.0878750Z + sudo yum-config-manager --add-repo https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo 2025-12-04T09:41:35.4694170Z Adding repo from: https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo 2025-12-04T09:41:35.5244386Z + sudo yum install -y nvidia-container-toolkit-1.17.8 libnvidia-container-tools-1.17.8 libnvidia-container1-1.17.8 nvidia-container-toolkit-base-1.17.8 2025-12-04T09:41:36.0818965Z nvidia-container-toolkit 20 kB/s | 833 B 00:00 2025-12-04T09:41:36.1817292Z Dependencies resolved. 2025-12-04T09:41:36.2135942Z ================================================================================ 2025-12-04T09:41:36.2136444Z Package Arch Version Repository Size 2025-12-04T09:41:36.2136925Z ================================================================================ 2025-12-04T09:41:36.2137307Z Downgrading: 2025-12-04T09:41:36.2137767Z libnvidia-container-tools x86_64 1.17.8-1 nvidia-container-toolkit 40 k 2025-12-04T09:41:36.2138531Z libnvidia-container1 x86_64 1.17.8-1 nvidia-container-toolkit 1.0 M 2025-12-04T09:41:36.2139476Z nvidia-container-toolkit x86_64 1.17.8-1 nvidia-container-toolkit 1.2 M 2025-12-04T09:41:36.2140225Z nvidia-container-toolkit-base x86_64 1.17.8-1 nvidia-container-toolkit 5.8 M 2025-12-04T09:41:36.2140677Z 2025-12-04T09:41:36.2140786Z Transaction Summary 2025-12-04T09:41:36.2141091Z ================================================================================ 2025-12-04T09:41:36.2141569Z Downgrade 4 Packages 2025-12-04T09:41:36.2141750Z 2025-12-04T09:41:36.2141893Z Total download size: 8.0 M 2025-12-04T09:41:36.2142586Z Downloading Packages: 2025-12-04T09:41:36.2609789Z (1/4): libnvidia-container-tools-1.17.8-1.x86_6 892 kB/s | 40 kB 00:00 2025-12-04T09:41:36.3010257Z (2/4): libnvidia-container1-1.17.8-1.x86_64.rpm 11 MB/s | 1.0 MB 00:00 2025-12-04T09:41:36.3521694Z (3/4): nvidia-container-toolkit-1.17.8-1.x86_64 9.1 MB/s | 1.2 MB 00:00 2025-12-04T09:41:36.4971859Z (4/4): nvidia-container-toolkit-base-1.17.8-1.x 24 MB/s | 5.8 MB 00:00 2025-12-04T09:41:36.4982082Z -------------------------------------------------------------------------------- 2025-12-04T09:41:36.4986405Z Total 28 MB/s | 8.0 MB 00:00 2025-12-04T09:41:36.4989946Z Running transaction check 2025-12-04T09:41:36.5146919Z Transaction check succeeded. 2025-12-04T09:41:36.5147634Z Running transaction test 2025-12-04T09:41:36.5693360Z Transaction test succeeded. 2025-12-04T09:41:36.5696549Z Running transaction 2025-12-04T09:41:37.5815056Z Preparing : 1/1 2025-12-04T09:41:37.7360091Z Downgrading : nvidia-container-toolkit-base-1.17.8-1.x86_64 1/8 2025-12-04T09:41:37.7685304Z Downgrading : libnvidia-container1-1.17.8-1.x86_64 2/8 2025-12-04T09:41:37.8369432Z Running scriptlet: libnvidia-container1-1.17.8-1.x86_64 2/8 2025-12-04T09:41:37.9948555Z Downgrading : libnvidia-container-tools-1.17.8-1.x86_64 3/8 2025-12-04T09:41:38.0217540Z Downgrading : nvidia-container-toolkit-1.17.8-1.x86_64 4/8 2025-12-04T09:41:38.0924440Z Running scriptlet: nvidia-container-toolkit-1.17.8-1.x86_64 4/8 2025-12-04T09:41:38.0983150Z Running scriptlet: nvidia-container-toolkit-1.18.1-1.x86_64 5/8 2025-12-04T09:41:38.0984391Z Cleanup : nvidia-container-toolkit-1.18.1-1.x86_64 5/8 2025-12-04T09:41:38.1302841Z Running scriptlet: nvidia-container-toolkit-1.18.1-1.x86_64 5/8 2025-12-04T09:41:38.1351596Z Running scriptlet: libnvidia-container-tools-1.18.1-1.x86_64 6/8 2025-12-04T09:41:38.1352961Z Cleanup : libnvidia-container-tools-1.18.1-1.x86_64 6/8 2025-12-04T09:41:38.1750768Z Running scriptlet: libnvidia-container-tools-1.18.1-1.x86_64 6/8 2025-12-04T09:41:38.1805173Z Running scriptlet: libnvidia-container1-1.18.1-1.x86_64 7/8 2025-12-04T09:41:38.1806406Z Cleanup : libnvidia-container1-1.18.1-1.x86_64 7/8 2025-12-04T09:41:38.2173827Z Running scriptlet: libnvidia-container1-1.18.1-1.x86_64 7/8 2025-12-04T09:41:38.2225588Z Running scriptlet: nvidia-container-toolkit-base-1.18.1-1.x86_64 8/8 2025-12-04T09:41:38.2226870Z Cleanup : nvidia-container-toolkit-base-1.18.1-1.x86_64 8/8 2025-12-04T09:41:38.2610455Z Running scriptlet: nvidia-container-toolkit-base-1.18.1-1.x86_64 8/8 2025-12-04T09:41:38.3258556Z Running scriptlet: nvidia-container-toolkit-1.17.8-1.x86_64 8/8 2025-12-04T09:41:39.7549562Z Running scriptlet: nvidia-container-toolkit-base-1.18.1-1.x86_64 8/8 2025-12-04T09:41:39.7550350Z Verifying : libnvidia-container-tools-1.17.8-1.x86_64 1/8 2025-12-04T09:41:39.7551041Z Verifying : libnvidia-container-tools-1.18.1-1.x86_64 2/8 2025-12-04T09:41:39.7551705Z Verifying : libnvidia-container1-1.17.8-1.x86_64 3/8 2025-12-04T09:41:39.7552641Z Verifying : libnvidia-container1-1.18.1-1.x86_64 4/8 2025-12-04T09:41:39.7553312Z Verifying : nvidia-container-toolkit-1.17.8-1.x86_64 5/8 2025-12-04T09:41:39.7553981Z Verifying : nvidia-container-toolkit-1.18.1-1.x86_64 6/8 2025-12-04T09:41:39.7554645Z Verifying : nvidia-container-toolkit-base-1.17.8-1.x86_64 7/8 2025-12-04T09:41:39.9142892Z Verifying : nvidia-container-toolkit-base-1.18.1-1.x86_64 8/8================================================================================ 2025-12-04T09:41:39.9143629Z WARNING: 2025-12-04T09:41:39.9143928Z A newer release of "Amazon Linux" is available. 2025-12-04T09:41:39.9144219Z 2025-12-04T09:41:39.9144344Z Available Versions: 2025-12-04T09:41:39.9144522Z 2025-12-04T09:41:39.9144629Z Version 2023.9.20250929: 2025-12-04T09:41:39.9145015Z Run the following command to upgrade to 2023.9.20250929: 2025-12-04T09:41:39.9145347Z 2025-12-04T09:41:39.9145507Z dnf upgrade --releasever=2023.9.20250929 2025-12-04T09:41:39.9145771Z 2025-12-04T09:41:39.9145871Z Release notes: 2025-12-04T09:41:39.9146390Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20250929.html 2025-12-04T09:41:39.9146875Z 2025-12-04T09:41:39.9146980Z Version 2023.9.20251014: 2025-12-04T09:41:39.9147574Z Run the following command to upgrade to 2023.9.20251014: 2025-12-04T09:41:39.9147895Z 2025-12-04T09:41:39.9148035Z dnf upgrade --releasever=2023.9.20251014 2025-12-04T09:41:39.9148310Z 2025-12-04T09:41:39.9148412Z Release notes: 2025-12-04T09:41:39.9148912Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251014.html 2025-12-04T09:41:39.9149386Z 2025-12-04T09:41:39.9149510Z Version 2023.9.20251020: 2025-12-04T09:41:39.9149884Z Run the following command to upgrade to 2023.9.20251020: 2025-12-04T09:41:39.9150217Z 2025-12-04T09:41:39.9150354Z dnf upgrade --releasever=2023.9.20251020 2025-12-04T09:41:39.9150617Z 2025-12-04T09:41:39.9150732Z Release notes: 2025-12-04T09:41:39.9151213Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251020.html 2025-12-04T09:41:39.9151696Z 2025-12-04T09:41:39.9151801Z Version 2023.9.20251027: 2025-12-04T09:41:39.9152186Z Run the following command to upgrade to 2023.9.20251027: 2025-12-04T09:41:39.9152500Z 2025-12-04T09:41:39.9152653Z dnf upgrade --releasever=2023.9.20251027 2025-12-04T09:41:39.9152915Z 2025-12-04T09:41:39.9153018Z Release notes: 2025-12-04T09:41:39.9153512Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251027.html 2025-12-04T09:41:39.9153985Z 2025-12-04T09:41:39.9154103Z Version 2023.9.20251105: 2025-12-04T09:41:39.9154472Z Run the following command to upgrade to 2023.9.20251105: 2025-12-04T09:41:39.9154802Z 2025-12-04T09:41:39.9154938Z dnf upgrade --releasever=2023.9.20251105 2025-12-04T09:41:39.9155219Z 2025-12-04T09:41:39.9155318Z Release notes: 2025-12-04T09:41:39.9155810Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251105.html 2025-12-04T09:41:39.9156281Z 2025-12-04T09:41:39.9156387Z Version 2023.9.20251110: 2025-12-04T09:41:39.9156769Z Run the following command to upgrade to 2023.9.20251110: 2025-12-04T09:41:39.9157084Z 2025-12-04T09:41:39.9157231Z dnf upgrade --releasever=2023.9.20251110 2025-12-04T09:41:39.9157496Z 2025-12-04T09:41:39.9157609Z Release notes: 2025-12-04T09:41:39.9158087Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251110.html 2025-12-04T09:41:39.9158571Z 2025-12-04T09:41:39.9158673Z Version 2023.9.20251117: 2025-12-04T09:41:39.9159055Z Run the following command to upgrade to 2023.9.20251117: 2025-12-04T09:41:39.9159369Z 2025-12-04T09:41:39.9159506Z dnf upgrade --releasever=2023.9.20251117 2025-12-04T09:41:39.9159778Z 2025-12-04T09:41:39.9159995Z Release notes: 2025-12-04T09:41:39.9160492Z https://docs.aws.amazon.com/linux/al2023/release-notes/relnotes-2023.9.20251117.html 2025-12-04T09:41:39.9160961Z 2025-12-04T09:41:39.9161107Z ================================================================================ 2025-12-04T09:41:39.9791985Z 2025-12-04T09:41:39.9792159Z 2025-12-04T09:41:39.9792259Z Downgraded: 2025-12-04T09:41:39.9792719Z libnvidia-container-tools-1.17.8-1.x86_64 2025-12-04T09:41:39.9793441Z libnvidia-container1-1.17.8-1.x86_64 2025-12-04T09:41:39.9794129Z nvidia-container-toolkit-1.17.8-1.x86_64 2025-12-04T09:41:39.9794858Z nvidia-container-toolkit-base-1.17.8-1.x86_64 2025-12-04T09:41:39.9795293Z 2025-12-04T09:41:39.9795404Z Complete! 2025-12-04T09:41:40.0346162Z + sudo systemctl restart docker 2025-12-04T09:41:46.3904823Z Thu Dec 4 09:41:46 2025 2025-12-04T09:41:46.3905363Z +-----------------------------------------------------------------------------+ 2025-12-04T09:41:46.3905973Z | NVIDIA-SMI 525.105.17 Driver Version: 525.105.17 CUDA Version: 12.0 | 2025-12-04T09:41:46.3906554Z |-------------------------------+----------------------+----------------------+ 2025-12-04T09:41:46.3907162Z | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | 2025-12-04T09:41:46.3908073Z | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | 2025-12-04T09:41:46.3908595Z | | | MIG M. | 2025-12-04T09:41:46.3909011Z |===============================+======================+======================| 2025-12-04T09:41:46.4007446Z | 0 Tesla T4 On | 00000000:00:1E.0 Off | 0 | 2025-12-04T09:41:46.4007976Z | N/A 25C P0 25W / 70W | 2MiB / 15360MiB | 7% Default | 2025-12-04T09:41:46.4008428Z | | | N/A | 2025-12-04T09:41:46.4008908Z +-------------------------------+----------------------+----------------------+ 2025-12-04T09:41:46.4009383Z 2025-12-04T09:41:46.4009847Z +-----------------------------------------------------------------------------+ 2025-12-04T09:41:46.4010343Z | Processes: | 2025-12-04T09:41:46.4010871Z | GPU GI CI PID Type Process name GPU Memory | 2025-12-04T09:41:46.4011361Z | ID ID Usage | 2025-12-04T09:41:46.4011768Z |=============================================================================| 2025-12-04T09:41:46.4012286Z | No running processes found | 2025-12-04T09:41:46.4012847Z +-----------------------------------------------------------------------------+ 2025-12-04T09:41:46.4737053Z Unable to find image 'public.ecr.aws/docker/library/python:3.13' locally 2025-12-04T09:41:46.6273421Z 3.13: Pulling from docker/library/python 2025-12-04T09:41:46.7056079Z 53c88f1dfeb7: Pulling fs layer 2025-12-04T09:41:46.7056470Z eae668646f44: Pulling fs layer 2025-12-04T09:41:46.7056865Z ff2e6e687b6c: Pulling fs layer 2025-12-04T09:41:46.7057211Z 7c40a3faff76: Pulling fs layer 2025-12-04T09:41:46.7057549Z 967a3b1c8fef: Pulling fs layer 2025-12-04T09:41:46.7057960Z a64e1a44f22a: Pulling fs layer 2025-12-04T09:41:46.7058412Z 52655f8a5bcc: Pulling fs layer 2025-12-04T09:41:46.7058749Z a64e1a44f22a: Waiting 2025-12-04T09:41:46.7059033Z 967a3b1c8fef: Waiting 2025-12-04T09:41:46.7059317Z 7c40a3faff76: Waiting 2025-12-04T09:41:46.7059583Z 52655f8a5bcc: Waiting 2025-12-04T09:41:46.8998500Z eae668646f44: Verifying Checksum 2025-12-04T09:41:46.8999218Z eae668646f44: Download complete 2025-12-04T09:41:46.9957399Z 53c88f1dfeb7: Verifying Checksum 2025-12-04T09:41:46.9958106Z 53c88f1dfeb7: Download complete 2025-12-04T09:41:47.0813777Z 967a3b1c8fef: Verifying Checksum 2025-12-04T09:41:47.0814577Z 967a3b1c8fef: Download complete 2025-12-04T09:41:47.1042155Z ff2e6e687b6c: Verifying Checksum 2025-12-04T09:41:47.1042587Z ff2e6e687b6c: Download complete 2025-12-04T09:41:47.1575599Z 52655f8a5bcc: Verifying Checksum 2025-12-04T09:41:47.1576568Z 52655f8a5bcc: Download complete 2025-12-04T09:41:47.2731042Z a64e1a44f22a: Verifying Checksum 2025-12-04T09:41:47.2731512Z a64e1a44f22a: Download complete 2025-12-04T09:41:48.0919538Z 7c40a3faff76: Verifying Checksum 2025-12-04T09:41:48.0919976Z 7c40a3faff76: Download complete 2025-12-04T09:41:48.4676639Z 53c88f1dfeb7: Pull complete 2025-12-04T09:41:49.0682579Z eae668646f44: Pull complete 2025-12-04T09:41:51.0504742Z ff2e6e687b6c: Pull complete 2025-12-04T09:41:56.7725722Z 7c40a3faff76: Pull complete 2025-12-04T09:41:56.9967959Z 967a3b1c8fef: Pull complete 2025-12-04T09:41:57.6427337Z a64e1a44f22a: Pull complete 2025-12-04T09:41:57.6636449Z 52655f8a5bcc: Pull complete 2025-12-04T09:41:57.6767711Z Digest: sha256:3f986299a7b8b44b0d8cf9bda2b22361ce5c3058ef5d7cb17fb7452506680ab0 2025-12-04T09:41:57.6807926Z Status: Downloaded newer image for public.ecr.aws/docker/library/python:3.13 2025-12-04T09:42:05.1580247Z Thu Dec 4 09:42:05 2025 2025-12-04T09:42:05.1580737Z +-----------------------------------------------------------------------------+ 2025-12-04T09:42:05.1581620Z | NVIDIA-SMI 525.105.17 Driver Version: 525.105.17 CUDA Version: 12.0 | 2025-12-04T09:42:05.1582218Z |-------------------------------+----------------------+----------------------+ 2025-12-04T09:42:05.1582818Z | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | 2025-12-04T09:42:05.1583469Z | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | 2025-12-04T09:42:05.1583998Z | | | MIG M. | 2025-12-04T09:42:05.1584393Z |===============================+======================+======================| 2025-12-04T09:42:05.1734212Z | 0 Tesla T4 On | 00000000:00:1E.0 Off | 0 | 2025-12-04T09:42:05.1734746Z | N/A 25C P8 9W / 70W | 2MiB / 15360MiB | 0% Default | 2025-12-04T09:42:05.1735191Z | | | N/A | 2025-12-04T09:42:05.1735665Z +-------------------------------+----------------------+----------------------+ 2025-12-04T09:42:05.1736149Z 2025-12-04T09:42:05.1736621Z +-----------------------------------------------------------------------------+ 2025-12-04T09:42:05.1737123Z | Processes: | 2025-12-04T09:42:05.1737651Z | GPU GI CI PID Type Process name GPU Memory | 2025-12-04T09:42:05.1738141Z | ID ID Usage | 2025-12-04T09:42:05.1738648Z |=============================================================================| 2025-12-04T09:42:05.1739157Z | No running processes found | 2025-12-04T09:42:05.1739727Z +-----------------------------------------------------------------------------+ 2025-12-04T09:42:06.0834898Z Command completed after 1 attempt(s). 2025-12-04T09:42:06.0945264Z Prepare all required actions 2025-12-04T09:42:06.0979364Z ##[group]Run ./.github/actions/get-workflow-job-id 2025-12-04T09:42:06.0979759Z with: 2025-12-04T09:42:06.0980433Z github-token: *** 2025-12-04T09:42:06.0980702Z env: 2025-12-04T09:42:06.0980940Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:06.0981255Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:06.0981622Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:06.0982044Z ##[endgroup] 2025-12-04T09:42:06.0998099Z ##[group]Run set -eux 2025-12-04T09:42:06.0998396Z set -eux 2025-12-04T09:42:06.0998916Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2025-12-04T09:42:06.1010479Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:06.1010932Z env: 2025-12-04T09:42:06.1011181Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:06.1011478Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:06.1011898Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:06.1012508Z GITHUB_TOKEN: *** 2025-12-04T09:42:06.1012765Z ##[endgroup] 2025-12-04T09:42:06.1049736Z + python3 .github/scripts/get_workflow_job_id.py 19922826259 i-0144021b78afd54ea 2025-12-04T09:42:08.1614006Z Setting output job-id=57119749259 2025-12-04T09:42:08.1614958Z Setting output job-name=linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:08.1744846Z ##[group]Run python3 -m pip install psutil==5.9.8 dataclasses_json==0.6.7 nvidia-ml-py==11.525.84 2025-12-04T09:42:08.1745741Z python3 -m pip install psutil==5.9.8 dataclasses_json==0.6.7 nvidia-ml-py==11.525.84 2025-12-04T09:42:08.1746902Z python3 -m tools.stats.monitor --log-interval "$MONITOR_LOG_INTERVAL" --data-collect-interval "$MONITOR_DATA_COLLECT_INTERVAL" > usage_log.txt 2>&1 & 2025-12-04T09:42:08.1747923Z echo "monitor-script-pid=${!}" >> "${GITHUB_OUTPUT}" 2025-12-04T09:42:08.1756947Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:08.1757387Z env: 2025-12-04T09:42:08.1757642Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:08.1757957Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:08.1758311Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:08.1758732Z JOB_ID: 57119749259 2025-12-04T09:42:08.1759501Z JOB_NAME: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:08.1760304Z WORKFLOW_NAME: periodic 2025-12-04T09:42:08.1760618Z WORKFLOW_RUN_ID: 19922826259 2025-12-04T09:42:08.1760955Z MONITOR_LOG_INTERVAL: 5 2025-12-04T09:42:08.1761258Z MONITOR_DATA_COLLECT_INTERVAL: 1 2025-12-04T09:42:08.1761598Z ##[endgroup] 2025-12-04T09:42:08.4855444Z Defaulting to user installation because normal site-packages is not writeable 2025-12-04T09:42:08.8961421Z Collecting psutil==5.9.8 2025-12-04T09:42:08.9149001Z Downloading psutil-5.9.8-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (288 kB) 2025-12-04T09:42:08.9966187Z Collecting dataclasses_json==0.6.7 2025-12-04T09:42:09.0002356Z Downloading dataclasses_json-0.6.7-py3-none-any.whl (28 kB) 2025-12-04T09:42:09.0308145Z Collecting nvidia-ml-py==11.525.84 2025-12-04T09:42:09.0344784Z Downloading nvidia_ml_py-11.525.84-py3-none-any.whl (34 kB) 2025-12-04T09:42:09.0699112Z Collecting typing-inspect<1,>=0.4.0 2025-12-04T09:42:09.0734049Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-12-04T09:42:09.1937735Z Collecting marshmallow<4.0.0,>=3.18.0 2025-12-04T09:42:09.1976181Z Downloading marshmallow-3.26.1-py3-none-any.whl (50 kB) 2025-12-04T09:42:09.2599237Z Collecting packaging>=17.0 2025-12-04T09:42:09.2634811Z Downloading packaging-25.0-py3-none-any.whl (66 kB) 2025-12-04T09:42:09.3220909Z Collecting typing-extensions>=3.7.4 2025-12-04T09:42:09.3259085Z Downloading typing_extensions-4.15.0-py3-none-any.whl (44 kB) 2025-12-04T09:42:09.3472327Z Collecting mypy-extensions>=0.3.0 2025-12-04T09:42:09.3510181Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-12-04T09:42:09.4497185Z Installing collected packages: typing-extensions, packaging, mypy-extensions, typing-inspect, marshmallow, psutil, nvidia-ml-py, dataclasses-json 2025-12-04T09:42:09.7572189Z Successfully installed dataclasses-json-0.6.7 marshmallow-3.26.1 mypy-extensions-1.1.0 nvidia-ml-py-11.525.84 packaging-25.0 psutil-5.9.8 typing-extensions-4.15.0 typing-inspect-0.9.0 2025-12-04T09:42:09.9578254Z Prepare all required actions 2025-12-04T09:42:09.9578743Z Getting action download info 2025-12-04T09:42:10.1388211Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2025-12-04T09:42:10.3906417Z Download action repository 'actions/download-artifact@v4' (SHA:d3f86a106a0bac45b974a628896c90dbdf5c8093) 2025-12-04T09:42:10.7532345Z ##[group]Run ./.github/actions/download-build-artifacts 2025-12-04T09:42:10.7532781Z with: 2025-12-04T09:42:10.7533063Z name: linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T09:42:10.7533451Z s3-bucket: gha-artifacts 2025-12-04T09:42:10.7533736Z env: 2025-12-04T09:42:10.7533980Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:10.7534290Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:10.7534643Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:10.7535063Z ##[endgroup] 2025-12-04T09:42:10.7567838Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:42:10.7568224Z with: 2025-12-04T09:42:10.7568559Z name: linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T09:42:10.7568950Z s3-bucket: gha-artifacts 2025-12-04T09:42:10.7569259Z region: us-east-1 2025-12-04T09:42:10.7569534Z env: 2025-12-04T09:42:10.7569778Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:10.7570073Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:10.7570439Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:10.7570861Z ##[endgroup] 2025-12-04T09:42:11.2686849Z (node:68738) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:42:11.2687441Z 2025-12-04T09:42:11.2687663Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:42:11.2688545Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:42:11.2689217Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:42:11.5603971Z Found 1 objects with prefix pytorch/pytorch/19922826259/linux-jammy-cuda12.4-py3.10-gcc11/ 2025-12-04T09:42:11.5604912Z Starting download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:42:18.2958334Z Finished download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:42:18.2964810Z Artifact download has finished successfully 2025-12-04T09:42:18.3168027Z ##[group]Run unzip -o artifacts.zip 2025-12-04T09:42:18.3168412Z unzip -o artifacts.zip 2025-12-04T09:42:18.3175782Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:18.3176232Z env: 2025-12-04T09:42:18.3176500Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:18.3176820Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:18.3177191Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:18.3177597Z ##[endgroup] 2025-12-04T09:42:18.3253110Z Archive: artifacts.zip 2025-12-04T09:42:18.3254618Z creating: dist/ 2025-12-04T09:42:20.3356362Z inflating: dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:42:20.3499712Z inflating: dist/.ninja_log 2025-12-04T09:42:20.3500500Z creating: build/custom_test_artifacts/ 2025-12-04T09:42:20.3501048Z creating: build/custom_test_artifacts/custom-op-build/ 2025-12-04T09:42:20.3501637Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2025-12-04T09:42:20.3502347Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:42:20.3509902Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:42:20.3510717Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/ 2025-12-04T09:42:20.3511519Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:42:20.3512375Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:42:20.3513190Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:42:20.3514438Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:42:20.3515743Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:42:20.3516655Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:42:20.3517546Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:42:20.3518425Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:42:20.3520046Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:42:20.3521516Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:42:20.3522583Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:42:20.3524429Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:42:20.3526142Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:42:20.3527117Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/ 2025-12-04T09:42:20.3527981Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/ 2025-12-04T09:42:20.3589956Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp4.ii 2025-12-04T09:42:20.3654017Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.cpp 2025-12-04T09:42:20.3655317Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.module_id 2025-12-04T09:42:20.3722806Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp1.ii 2025-12-04T09:42:20.3724058Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.c 2025-12-04T09:42:20.3725351Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.gpu 2025-12-04T09:42:20.3726643Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.stub.c 2025-12-04T09:42:20.3727908Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.ptx 2025-12-04T09:42:20.3729136Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.sm_52.cubin 2025-12-04T09:42:20.3730390Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin 2025-12-04T09:42:20.3731622Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin.c 2025-12-04T09:42:20.3732830Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.o 2025-12-04T09:42:20.3733973Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.sm_52.cubin 2025-12-04T09:42:20.3735076Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.reg.c 2025-12-04T09:42:20.3736157Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.fatbin 2025-12-04T09:42:20.3737243Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.fatbin.c 2025-12-04T09:42:20.3738382Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.o 2025-12-04T09:42:20.3739653Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/CMakeCUDACompilerId.cu 2025-12-04T09:42:20.3817064Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCUDA/a.out 2025-12-04T09:42:20.3818354Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCUDACompiler.cmake 2025-12-04T09:42:20.3899507Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CUDA.bin 2025-12-04T09:42:20.3900496Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:42:20.3901260Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:42:20.3902045Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2025-12-04T09:42:20.3902865Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2025-12-04T09:42:20.3903803Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2025-12-04T09:42:20.3904851Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2025-12-04T09:42:20.3905861Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2025-12-04T09:42:20.3906792Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2025-12-04T09:42:20.3907746Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2025-12-04T09:42:20.3908718Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2025-12-04T09:42:20.3909690Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2025-12-04T09:42:20.3910648Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2025-12-04T09:42:20.3911611Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2025-12-04T09:42:20.3929041Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2025-12-04T09:42:20.4147461Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2025-12-04T09:42:20.4148355Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2025-12-04T09:42:20.4149335Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2025-12-04T09:42:20.4150439Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2025-12-04T09:42:20.4151515Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2025-12-04T09:42:20.4152492Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2025-12-04T09:42:20.4153517Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2025-12-04T09:42:20.4154544Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2025-12-04T09:42:20.4155566Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2025-12-04T09:42:20.4156591Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2025-12-04T09:42:20.4157605Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2025-12-04T09:42:20.4176009Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2025-12-04T09:42:20.4265289Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2025-12-04T09:42:20.4266581Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:42:20.4267541Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:42:20.4268525Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2025-12-04T09:42:20.4269328Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2025-12-04T09:42:20.4270114Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2025-12-04T09:42:20.4270897Z inflating: build/custom_test_artifacts/custom-op-build/detect_cuda_version.cc 2025-12-04T09:42:20.4273093Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2025-12-04T09:42:20.4273978Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2025-12-04T09:42:20.4274859Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2025-12-04T09:42:20.4464679Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2025-12-04T09:42:20.4527346Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2025-12-04T09:42:20.4527961Z creating: build/custom_test_artifacts/jit-hook-build/ 2025-12-04T09:42:20.4528519Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2025-12-04T09:42:20.4529206Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:42:20.4536736Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:42:20.4537528Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/ 2025-12-04T09:42:20.4538349Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:42:20.4539234Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:42:20.4540047Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:42:20.4541111Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:42:20.4542639Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:42:20.4543536Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:42:20.4544400Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:42:20.4545238Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:42:20.4546814Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:42:20.4548328Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:42:20.4549370Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:42:20.4551018Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:42:20.4552955Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:42:20.4553916Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/ 2025-12-04T09:42:20.4554773Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/ 2025-12-04T09:42:20.4616751Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp4.ii 2025-12-04T09:42:20.4731118Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.cpp 2025-12-04T09:42:20.4732551Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.module_id 2025-12-04T09:42:20.4750448Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp1.ii 2025-12-04T09:42:20.4751707Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.c 2025-12-04T09:42:20.4753082Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.gpu 2025-12-04T09:42:20.4754373Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.stub.c 2025-12-04T09:42:20.4755611Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.ptx 2025-12-04T09:42:20.4756832Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.sm_52.cubin 2025-12-04T09:42:20.4758060Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin 2025-12-04T09:42:20.4759294Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin.c 2025-12-04T09:42:20.4760471Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.o 2025-12-04T09:42:20.4761603Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.sm_52.cubin 2025-12-04T09:42:20.4762696Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.reg.c 2025-12-04T09:42:20.4763766Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.fatbin 2025-12-04T09:42:20.4764831Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.fatbin.c 2025-12-04T09:42:20.4765876Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.o 2025-12-04T09:42:20.4766942Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/CMakeCUDACompilerId.cu 2025-12-04T09:42:20.4844704Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCUDA/a.out 2025-12-04T09:42:20.4846162Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCUDACompiler.cmake 2025-12-04T09:42:20.4927332Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CUDA.bin 2025-12-04T09:42:20.4928260Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:42:20.4928986Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:42:20.4929748Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2025-12-04T09:42:20.4930538Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2025-12-04T09:42:20.4931465Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2025-12-04T09:42:20.4932519Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2025-12-04T09:42:20.4933509Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2025-12-04T09:42:20.4934433Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2025-12-04T09:42:20.4935399Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2025-12-04T09:42:20.4936372Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2025-12-04T09:42:20.4937325Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2025-12-04T09:42:20.4938344Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2025-12-04T09:42:20.4939433Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2025-12-04T09:42:20.4957186Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2025-12-04T09:42:20.5026816Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2025-12-04T09:42:20.5027849Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:42:20.5028784Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:42:20.5029619Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2025-12-04T09:42:20.5030392Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2025-12-04T09:42:20.5031277Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2025-12-04T09:42:20.5032049Z inflating: build/custom_test_artifacts/jit-hook-build/detect_cuda_version.cc 2025-12-04T09:42:20.5034659Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2025-12-04T09:42:20.5035525Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2025-12-04T09:42:20.5036204Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2025-12-04T09:42:20.5078852Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2025-12-04T09:42:20.5079492Z creating: build/custom_test_artifacts/custom-backend-build/ 2025-12-04T09:42:20.5080125Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2025-12-04T09:42:20.5080863Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:42:20.5088451Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:42:20.5089451Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/ 2025-12-04T09:42:20.5090307Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:42:20.5091215Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:42:20.5092111Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:42:20.5093133Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:42:20.5094290Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:42:20.5095248Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:42:20.5096208Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:42:20.5097128Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:42:20.5098578Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:42:20.5100086Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:42:20.5101162Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:42:20.5102767Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:42:20.5104702Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:42:20.5105743Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/ 2025-12-04T09:42:20.5106656Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/ 2025-12-04T09:42:20.5168223Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp4.ii 2025-12-04T09:42:20.5232538Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.cpp 2025-12-04T09:42:20.5234009Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.module_id 2025-12-04T09:42:20.5301333Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp1.ii 2025-12-04T09:42:20.5302653Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.c 2025-12-04T09:42:20.5304001Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.gpu 2025-12-04T09:42:20.5305373Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.stub.c 2025-12-04T09:42:20.5306712Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.ptx 2025-12-04T09:42:20.5308007Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.sm_52.cubin 2025-12-04T09:42:20.5309322Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin 2025-12-04T09:42:20.5310619Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin.c 2025-12-04T09:42:20.5311888Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/CMakeCUDACompilerId.o 2025-12-04T09:42:20.5313079Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.sm_52.cubin 2025-12-04T09:42:20.5314255Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.reg.c 2025-12-04T09:42:20.5315407Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.fatbin 2025-12-04T09:42:20.5316566Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.fatbin.c 2025-12-04T09:42:20.5317677Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/tmp/a_dlink.o 2025-12-04T09:42:20.5318822Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/CMakeCUDACompilerId.cu 2025-12-04T09:42:20.5395722Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCUDA/a.out 2025-12-04T09:42:20.5397092Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCUDACompiler.cmake 2025-12-04T09:42:20.5477666Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CUDA.bin 2025-12-04T09:42:20.5478687Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:42:20.5479500Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:42:20.5480330Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2025-12-04T09:42:20.5481210Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2025-12-04T09:42:20.5482195Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2025-12-04T09:42:20.5483336Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2025-12-04T09:42:20.5484434Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2025-12-04T09:42:20.5485449Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2025-12-04T09:42:20.5486860Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2025-12-04T09:42:20.5488190Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2025-12-04T09:42:20.5489260Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2025-12-04T09:42:20.5490322Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2025-12-04T09:42:20.5491353Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2025-12-04T09:42:20.5492472Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2025-12-04T09:42:20.5621442Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2025-12-04T09:42:20.5622498Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2025-12-04T09:42:20.5623542Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2025-12-04T09:42:20.5624736Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2025-12-04T09:42:20.5625885Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2025-12-04T09:42:20.5626957Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2025-12-04T09:42:20.5628044Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2025-12-04T09:42:20.5629165Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2025-12-04T09:42:20.5630285Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2025-12-04T09:42:20.5631403Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2025-12-04T09:42:20.5632493Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2025-12-04T09:42:20.5649964Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2025-12-04T09:42:20.5710709Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2025-12-04T09:42:20.5711877Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:42:20.5712891Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:42:20.5713789Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2025-12-04T09:42:20.5714633Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2025-12-04T09:42:20.5715461Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2025-12-04T09:42:20.5716296Z inflating: build/custom_test_artifacts/custom-backend-build/detect_cuda_version.cc 2025-12-04T09:42:20.5718414Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2025-12-04T09:42:20.5719328Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2025-12-04T09:42:20.5720212Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2025-12-04T09:42:20.5832495Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2025-12-04T09:42:20.5876592Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2025-12-04T09:42:20.5877178Z creating: build/lib/ 2025-12-04T09:42:20.5967518Z inflating: build/lib/libprotobuf-lite.a 2025-12-04T09:42:20.6454231Z inflating: build/lib/libprotobuf.a 2025-12-04T09:42:20.6999876Z inflating: build/lib/libprotoc.a 2025-12-04T09:42:20.7010465Z inflating: build/lib/libpthreadpool.a 2025-12-04T09:42:20.7019551Z inflating: build/lib/libcpuinfo.a 2025-12-04T09:42:20.7028286Z inflating: build/lib/libcpuinfo_internals.a 2025-12-04T09:42:20.7029243Z inflating: build/lib/libclog.a 2025-12-04T09:42:20.7050390Z inflating: build/lib/libpytorch_qnnpack.a 2025-12-04T09:42:20.7053054Z inflating: build/lib/libnnpack_reference_layers.a 2025-12-04T09:42:20.7072745Z inflating: build/lib/libnnpack.a 2025-12-04T09:42:20.7278189Z inflating: build/lib/libmicrokernels-prod.a 2025-12-04T09:42:20.8247525Z inflating: build/lib/libmicrokernels-all.a 2025-12-04T09:42:20.8324569Z inflating: build/lib/libgtest.a 2025-12-04T09:42:20.8343560Z inflating: build/lib/libgmock.a 2025-12-04T09:42:20.8344313Z inflating: build/lib/libgtest_main.a 2025-12-04T09:42:20.8345177Z inflating: build/lib/libgmock_main.a 2025-12-04T09:42:20.8445308Z inflating: build/lib/libXNNPACK.a 2025-12-04T09:42:20.8529006Z inflating: build/lib/libbenchmark.a 2025-12-04T09:42:20.8529889Z inflating: build/lib/libbenchmark_main.a 2025-12-04T09:42:20.8538764Z inflating: build/lib/libittnotify.a 2025-12-04T09:42:20.8612109Z inflating: build/lib/libasmjit.a 2025-12-04T09:42:20.8613099Z inflating: build/lib/libjitprofiling.a 2025-12-04T09:42:20.9891696Z inflating: build/lib/libfbgemm.a 2025-12-04T09:42:20.9925587Z inflating: build/lib/libtensorpipe_uv.a 2025-12-04T09:42:21.0520580Z inflating: build/lib/libtensorpipe.a 2025-12-04T09:42:21.0788337Z inflating: build/lib/libtensorpipe_cuda.a 2025-12-04T09:42:21.0936905Z inflating: build/lib/libgloo.a 2025-12-04T09:42:21.0988303Z inflating: build/lib/libonnx_proto.a 2025-12-04T09:42:21.1457787Z inflating: build/lib/libgloo_cuda.a 2025-12-04T09:42:21.2241194Z inflating: build/lib/libonnx.a 2025-12-04T09:42:22.3317643Z inflating: build/lib/libdnnl.a 2025-12-04T09:42:22.3339304Z inflating: build/lib/libfmt.a 2025-12-04T09:42:22.3867186Z inflating: build/lib/libkineto.a 2025-12-04T09:42:22.3996180Z inflating: build/lib/libc10.so 2025-12-04T09:42:22.4050989Z inflating: build/lib/libc10_cuda.so 2025-12-04T09:42:22.4052578Z inflating: build/lib/libtorch_global_deps.so 2025-12-04T09:42:22.4054577Z inflating: build/lib/libcaffe2_nvrtc.so 2025-12-04T09:42:25.8185383Z inflating: build/lib/libtorch_cpu.so 2025-12-04T09:42:27.6162297Z inflating: build/lib/libtorch_cuda.so 2025-12-04T09:42:27.6167406Z inflating: build/lib/libshm.so 2025-12-04T09:42:27.6168819Z inflating: build/lib/libtorch.so 2025-12-04T09:42:27.6222678Z inflating: build/lib/libtorch_cuda_linalg.so 2025-12-04T09:42:27.6225429Z inflating: build/lib/libc10d_cuda_test.so 2025-12-04T09:42:27.6303975Z inflating: build/lib/libtorchbind_test.so 2025-12-04T09:42:27.6325168Z inflating: build/lib/libjitbackend_test.so 2025-12-04T09:42:27.6351670Z inflating: build/lib/libbackend_with_compiler.so 2025-12-04T09:42:27.6380506Z inflating: build/lib/libaoti_custom_ops.so 2025-12-04T09:42:27.9012742Z inflating: build/lib/libtorch_python.so 2025-12-04T09:42:27.9052621Z inflating: build/lib/libnnapi_backend.so 2025-12-04T09:42:27.9053020Z creating: build/bin/ 2025-12-04T09:42:27.9561349Z inflating: build/bin/protoc-3.13.0.0 2025-12-04T09:42:28.0070187Z inflating: build/bin/protoc 2025-12-04T09:42:28.0136228Z inflating: build/bin/c10_AllocatorConfig_test 2025-12-04T09:42:28.0198056Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2025-12-04T09:42:28.0261472Z inflating: build/bin/c10_DeviceGuard_test 2025-12-04T09:42:28.0325251Z inflating: build/bin/c10_Device_test 2025-12-04T09:42:28.0398149Z inflating: build/bin/c10_DispatchKeySet_test 2025-12-04T09:42:28.0458777Z inflating: build/bin/c10_StreamGuard_test 2025-12-04T09:42:28.0525487Z inflating: build/bin/c10_Scalar_test 2025-12-04T09:42:28.0594430Z inflating: build/bin/c10_SymInt_test 2025-12-04T09:42:28.0663122Z inflating: build/bin/c10_InlineStreamGuard_test 2025-12-04T09:42:28.0729981Z inflating: build/bin/c10_InlineDeviceGuard_test 2025-12-04T09:42:28.0798870Z inflating: build/bin/c10_SizesAndStrides_test 2025-12-04T09:42:28.0860498Z inflating: build/bin/c10_ArrayRef_test 2025-12-04T09:42:28.0921278Z inflating: build/bin/c10_ConstexprCrc_test 2025-12-04T09:42:28.1006489Z inflating: build/bin/c10_cow_test 2025-12-04T09:42:28.1071447Z inflating: build/bin/c10_Bitset_test 2025-12-04T09:42:28.1132977Z inflating: build/bin/c10_DeadlockDetection_test 2025-12-04T09:42:28.1203034Z inflating: build/bin/c10_Enumerate_test 2025-12-04T09:42:28.1265954Z inflating: build/bin/c10_Half_test 2025-12-04T09:42:28.1331283Z inflating: build/bin/c10_IntrusiveList_test 2025-12-04T09:42:28.1396942Z inflating: build/bin/c10_NetworkFlow_test 2025-12-04T09:42:28.1465537Z inflating: build/bin/c10_LeftRight_test 2025-12-04T09:42:28.1527137Z inflating: build/bin/c10_Synchronized_test 2025-12-04T09:42:28.1589291Z inflating: build/bin/c10_Semaphore_test 2025-12-04T09:42:28.1657177Z inflating: build/bin/c10_ThreadLocal_test 2025-12-04T09:42:28.1721097Z inflating: build/bin/c10_TypeIndex_test 2025-12-04T09:42:28.1784713Z inflating: build/bin/c10_accumulate_test 2025-12-04T09:42:28.1853401Z inflating: build/bin/c10_bfloat16_test 2025-12-04T09:42:28.1923096Z inflating: build/bin/c10_complex_math_test 2025-12-04T09:42:28.1985333Z inflating: build/bin/c10_bit_cast_test 2025-12-04T09:42:28.2046679Z inflating: build/bin/c10_error_test 2025-12-04T09:42:28.2114565Z inflating: build/bin/c10_complex_test 2025-12-04T09:42:28.2179226Z inflating: build/bin/c10_exception_test 2025-12-04T09:42:28.2241240Z inflating: build/bin/c10_flags_test 2025-12-04T09:42:28.2303529Z inflating: build/bin/c10_generic_math_test 2025-12-04T09:42:28.2487834Z inflating: build/bin/c10_intrusive_ptr_test 2025-12-04T09:42:28.2550675Z inflating: build/bin/c10_irange_test 2025-12-04T09:42:28.2616621Z inflating: build/bin/c10_lazy_test 2025-12-04T09:42:28.2686658Z inflating: build/bin/c10_logging_test 2025-12-04T09:42:28.2748464Z inflating: build/bin/c10_nofatal_test 2025-12-04T09:42:28.2838829Z inflating: build/bin/c10_optional_test 2025-12-04T09:42:28.2914366Z inflating: build/bin/c10_ordered_preserving_dict_test 2025-12-04T09:42:28.2979689Z inflating: build/bin/c10_registry_test 2025-12-04T09:42:28.3158972Z inflating: build/bin/c10_small_vector_test 2025-12-04T09:42:28.3222608Z inflating: build/bin/c10_ssize_test 2025-12-04T09:42:28.3291786Z inflating: build/bin/c10_string_util_test 2025-12-04T09:42:28.3347080Z inflating: build/bin/c10_intrusive_ptr_benchmark 2025-12-04T09:42:28.3409042Z inflating: build/bin/c10_tempfile_test 2025-12-04T09:42:28.3469718Z inflating: build/bin/c10_string_view_test 2025-12-04T09:42:28.3538670Z inflating: build/bin/c10_typeid_test 2025-12-04T09:42:28.3603931Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_catches_thread_and_block_and_device 2025-12-04T09:42:28.3669084Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_multiple_writes_from_multiple_blocks 2025-12-04T09:42:28.3733436Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_from_2_processes 2025-12-04T09:42:28.3798926Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_multiple_writes_from_blocks_and_threads 2025-12-04T09:42:28.3859757Z inflating: build/bin/c10_cuda_CUDATest 2025-12-04T09:42:28.3925254Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_catches_stream 2025-12-04T09:42:28.3990379Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_1_var_test 2025-12-04T09:42:28.4055400Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_multiple_writes_from_same_block 2025-12-04T09:42:28.4722911Z inflating: build/bin/vec_test_all_types_DEFAULT 2025-12-04T09:42:28.5410090Z inflating: build/bin/vec_test_all_types_AVX512 2025-12-04T09:42:28.6107431Z inflating: build/bin/vec_test_all_types_AVX2 2025-12-04T09:42:28.6168520Z inflating: build/bin/test_vec_half_DEFAULT 2025-12-04T09:42:28.6285180Z inflating: build/bin/test_aoti_abi_check 2025-12-04T09:42:28.6346871Z inflating: build/bin/test_vec_half_AVX512 2025-12-04T09:42:28.6408486Z inflating: build/bin/test_vec_half_AVX2 2025-12-04T09:42:28.6473277Z inflating: build/bin/BackoffTest 2025-12-04T09:42:28.6538899Z inflating: build/bin/FileStoreTest 2025-12-04T09:42:28.6608364Z inflating: build/bin/TCPStoreTest 2025-12-04T09:42:28.6674307Z inflating: build/bin/HashStoreTest 2025-12-04T09:42:28.6690127Z inflating: build/bin/ProcessGroupMPITest 2025-12-04T09:42:28.6694320Z inflating: build/bin/torch_shm_manager 2025-12-04T09:42:28.6782733Z inflating: build/bin/Dict_test 2025-12-04T09:42:28.6847570Z inflating: build/bin/Dimname_test 2025-12-04T09:42:28.6926644Z inflating: build/bin/MaybeOwned_test 2025-12-04T09:42:28.6996430Z inflating: build/bin/NamedTensor_test 2025-12-04T09:42:28.7068150Z inflating: build/bin/apply_utils_test 2025-12-04T09:42:28.7140184Z inflating: build/bin/atest 2025-12-04T09:42:28.7217946Z inflating: build/bin/basic 2025-12-04T09:42:28.7284436Z inflating: build/bin/broadcast_test 2025-12-04T09:42:28.7347198Z inflating: build/bin/cpu_allocator_test 2025-12-04T09:42:28.7418302Z inflating: build/bin/cpu_generator_test 2025-12-04T09:42:28.7483325Z inflating: build/bin/cpu_profiling_allocator_test 2025-12-04T09:42:28.7593848Z inflating: build/bin/cpu_rng_test 2025-12-04T09:42:28.7656733Z inflating: build/bin/dlconvertor_test 2025-12-04T09:42:28.7727153Z inflating: build/bin/extension_backend_test 2025-12-04T09:42:28.7795494Z inflating: build/bin/half_test 2025-12-04T09:42:28.7911840Z inflating: build/bin/ivalue_test 2025-12-04T09:42:28.7973487Z inflating: build/bin/lazy_tensor_test 2025-12-04T09:42:28.8038933Z inflating: build/bin/math_kernel_test 2025-12-04T09:42:28.8104323Z inflating: build/bin/memory_format_test 2025-12-04T09:42:28.8169810Z inflating: build/bin/memory_overlapping_test 2025-12-04T09:42:28.8235576Z inflating: build/bin/mobile_memory_cleanup 2025-12-04T09:42:28.8304403Z inflating: build/bin/native_test 2025-12-04T09:42:28.8366915Z inflating: build/bin/operator_name_test 2025-12-04T09:42:28.8429562Z inflating: build/bin/operators_test 2025-12-04T09:42:28.8494012Z inflating: build/bin/packedtensoraccessor_test 2025-12-04T09:42:28.8576032Z inflating: build/bin/pow_test 2025-12-04T09:42:28.8645385Z inflating: build/bin/quantized_test 2025-12-04T09:42:28.8707216Z inflating: build/bin/reduce_ops_test 2025-12-04T09:42:28.8769913Z inflating: build/bin/reportMemoryUsage_test 2025-12-04T09:42:28.8838615Z inflating: build/bin/scalar_tensor_test 2025-12-04T09:42:28.8909097Z inflating: build/bin/scalar_test 2025-12-04T09:42:28.8972510Z inflating: build/bin/StorageUtils_test 2025-12-04T09:42:28.9036598Z inflating: build/bin/stride_properties_test 2025-12-04T09:42:28.9131212Z inflating: build/bin/tensor_iterator_test 2025-12-04T09:42:28.9197982Z inflating: build/bin/test_parallel 2025-12-04T09:42:28.9260312Z inflating: build/bin/thread_init_test 2025-12-04T09:42:28.9327882Z inflating: build/bin/type_ptr_test 2025-12-04T09:42:28.9400441Z inflating: build/bin/type_test 2025-12-04T09:42:28.9465116Z inflating: build/bin/undefined_tensor_test 2025-12-04T09:42:28.9526492Z inflating: build/bin/verify_api_visibility 2025-12-04T09:42:28.9612445Z inflating: build/bin/legacy_vmap_test 2025-12-04T09:42:28.9675466Z inflating: build/bin/weakref_test 2025-12-04T09:42:28.9738947Z inflating: build/bin/wrapdim_test 2025-12-04T09:42:28.9801930Z inflating: build/bin/xla_tensor_test 2025-12-04T09:42:28.9874593Z inflating: build/bin/IListRef_test 2025-12-04T09:42:28.9999976Z inflating: build/bin/List_test 2025-12-04T09:42:29.0080245Z inflating: build/bin/KernelFunction_test 2025-12-04T09:42:29.0221876Z inflating: build/bin/kernel_function_legacy_test 2025-12-04T09:42:29.0335464Z inflating: build/bin/kernel_function_test 2025-12-04T09:42:29.0483875Z inflating: build/bin/kernel_lambda_legacy_test 2025-12-04T09:42:29.0604581Z inflating: build/bin/kernel_lambda_test 2025-12-04T09:42:29.0677746Z inflating: build/bin/kernel_stackbased_test 2025-12-04T09:42:29.0791566Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2025-12-04T09:42:29.0854785Z inflating: build/bin/CppSignature_test 2025-12-04T09:42:29.0922368Z inflating: build/bin/backend_fallback_test 2025-12-04T09:42:29.0982856Z inflating: build/bin/op_allowlist_test 2025-12-04T09:42:29.1338278Z inflating: build/bin/op_registration_test 2025-12-04T09:42:29.1419312Z inflating: build/bin/inline_container_test 2025-12-04T09:42:29.1485089Z inflating: build/bin/cuda_allocator_test 2025-12-04T09:42:29.1550275Z inflating: build/bin/cuda_apply_test 2025-12-04T09:42:29.1623443Z inflating: build/bin/cuda_atomic_ops_test 2025-12-04T09:42:29.1692653Z inflating: build/bin/cuda_caching_host_allocator_test 2025-12-04T09:42:29.1776725Z inflating: build/bin/cuda_complex_math_test 2025-12-04T09:42:29.1849255Z inflating: build/bin/cuda_complex_test 2025-12-04T09:42:29.1921102Z inflating: build/bin/cuda_cub_test 2025-12-04T09:42:29.1986160Z inflating: build/bin/cuda_cublas_handle_pool_test 2025-12-04T09:42:29.2047555Z inflating: build/bin/cuda_device_test 2025-12-04T09:42:29.2126081Z inflating: build/bin/cuda_distributions_test 2025-12-04T09:42:29.2190105Z inflating: build/bin/cuda_dlconvertor_test 2025-12-04T09:42:29.2255630Z inflating: build/bin/cuda_event_test 2025-12-04T09:42:29.2316995Z inflating: build/bin/cuda_exchange_device_test 2025-12-04T09:42:29.2386189Z inflating: build/bin/cuda_generator_test 2025-12-04T09:42:29.2447662Z inflating: build/bin/cuda_half_test 2025-12-04T09:42:29.2510906Z inflating: build/bin/cuda_integer_divider_test 2025-12-04T09:42:29.2571928Z inflating: build/bin/cuda_optional_test 2025-12-04T09:42:29.2636248Z inflating: build/bin/cuda_packedtensoraccessor_test 2025-12-04T09:42:29.2701063Z inflating: build/bin/cuda_reportMemoryUsage_test 2025-12-04T09:42:29.2762457Z inflating: build/bin/cuda_allocatorTraceTracker_test 2025-12-04T09:42:29.2837199Z inflating: build/bin/cuda_stream_test 2025-12-04T09:42:29.2901995Z inflating: build/bin/cuda_vectorized_test 2025-12-04T09:42:29.2963934Z inflating: build/bin/cuda_cudnn_test 2025-12-04T09:42:29.3362429Z inflating: build/bin/test_lazy 2025-12-04T09:42:29.3444007Z inflating: build/bin/ProcessGroupGlooTest 2025-12-04T09:42:29.3513471Z inflating: build/bin/ProcessGroupGlooAsyncTest 2025-12-04T09:42:29.4765710Z inflating: build/bin/test_jit 2025-12-04T09:42:29.4843649Z inflating: build/bin/ProcessGroupNCCLTest 2025-12-04T09:42:29.4918755Z inflating: build/bin/ProcessGroupNCCLErrorsTest 2025-12-04T09:42:29.4922023Z inflating: build/bin/example_allreduce 2025-12-04T09:42:29.4989785Z inflating: build/bin/test_dist_autograd 2025-12-04T09:42:29.5072672Z inflating: build/bin/test_cpp_rpc 2025-12-04T09:42:29.5075488Z inflating: build/bin/parallel_benchmark 2025-12-04T09:42:29.6416542Z inflating: build/bin/test_api 2025-12-04T09:42:29.6416965Z creating: .additional_ci_files/ 2025-12-04T09:42:29.6488886Z inflating: .additional_ci_files/test-times.json 2025-12-04T09:42:29.6750671Z inflating: .additional_ci_files/test-class-times.json 2025-12-04T09:42:29.6788690Z ##[group]Run rm artifacts.zip 2025-12-04T09:42:29.6789035Z rm artifacts.zip 2025-12-04T09:42:29.6796025Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:29.6796472Z env: 2025-12-04T09:42:29.6796709Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:29.6797020Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:29.6797551Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:29.6797961Z ##[endgroup] 2025-12-04T09:42:29.9886861Z ##[group]Run df -H 2025-12-04T09:42:29.9887311Z df -H 2025-12-04T09:42:29.9894027Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:29.9894471Z env: 2025-12-04T09:42:29.9894724Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:29.9895034Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:29.9895382Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:29.9895799Z ##[endgroup] 2025-12-04T09:42:29.9944216Z Filesystem Size Used Avail Use% Mounted on 2025-12-04T09:42:29.9944680Z devtmpfs 4.2M 0 4.2M 0% /dev 2025-12-04T09:42:29.9945079Z tmpfs 34G 0 34G 0% /dev/shm 2025-12-04T09:42:29.9945469Z tmpfs 14G 562k 14G 1% /run 2025-12-04T09:42:29.9945848Z /dev/nvme0n1p1 161G 51G 111G 32% / 2025-12-04T09:42:29.9946228Z tmpfs 34G 17k 34G 1% /tmp 2025-12-04T09:42:29.9946964Z /dev/nvme0n1p128 11M 1.4M 9.2M 13% /boot/efi 2025-12-04T09:42:29.9947393Z tmpfs 6.7G 0 6.7G 0% /run/user/0 2025-12-04T09:42:29.9985001Z Prepare all required actions 2025-12-04T09:42:29.9985857Z Getting action download info 2025-12-04T09:42:30.1418942Z ##[group]Run ./.github/actions/download-td-artifacts 2025-12-04T09:42:30.1419361Z with: 2025-12-04T09:42:30.1419599Z env: 2025-12-04T09:42:30.1419829Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:30.1420140Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:30.1420506Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:30.1420912Z ##[endgroup] 2025-12-04T09:42:30.1508048Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:42:30.1508456Z with: 2025-12-04T09:42:30.1508685Z name: td_results 2025-12-04T09:42:30.1508967Z s3-bucket: gha-artifacts 2025-12-04T09:42:30.1509279Z region: us-east-1 2025-12-04T09:42:30.1509525Z env: 2025-12-04T09:42:30.1509771Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:30.1510079Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:30.1510435Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:30.1510946Z ##[endgroup] 2025-12-04T09:42:30.8521669Z (node:68763) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:42:30.8522258Z 2025-12-04T09:42:30.8522487Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:42:30.8523132Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:42:30.8523793Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:42:30.9533295Z Found 1 objects with prefix pytorch/pytorch/19922826259/td_results/ 2025-12-04T09:42:30.9534141Z Starting download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:42:31.0106010Z Finished download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:42:31.0112661Z Artifact download has finished successfully 2025-12-04T09:42:31.0289823Z ##[group]Run mkdir -p .additional_ci_files 2025-12-04T09:42:31.0290271Z mkdir -p .additional_ci_files 2025-12-04T09:42:31.0290794Z mv td_results.json .additional_ci_files/td_results.json || true 2025-12-04T09:42:31.0297756Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:31.0298284Z env: 2025-12-04T09:42:31.0298539Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:31.0298852Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:31.0299209Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:31.0299630Z ##[endgroup] 2025-12-04T09:42:31.0421017Z ##[group]Run .github/scripts/parse_ref.py 2025-12-04T09:42:31.0421466Z .github/scripts/parse_ref.py 2025-12-04T09:42:31.0427557Z shell: /usr/bin/bash -e {0} 2025-12-04T09:42:31.0427859Z env: 2025-12-04T09:42:31.0428108Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:31.0428424Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:31.0428781Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:31.0429203Z ##[endgroup] 2025-12-04T09:42:31.0668939Z Setting output branch=main 2025-12-04T09:42:31.0813357Z Prepare all required actions 2025-12-04T09:42:31.0813967Z Getting action download info 2025-12-04T09:42:31.2319498Z ##[group]Run ./.github/actions/filter-test-configs 2025-12-04T09:42:31.2319899Z with: 2025-12-04T09:42:31.2320342Z github-token: *** 2025-12-04T09:42:31.2327397Z test-matrix: {"include": [{"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:42:31.2335151Z job-name: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:31.2335927Z env: 2025-12-04T09:42:31.2336171Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:31.2336467Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:31.2336830Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:31.2337242Z ##[endgroup] 2025-12-04T09:42:31.2376571Z ##[group]Run nick-fields/retry@v3.0.0 2025-12-04T09:42:31.2376916Z with: 2025-12-04T09:42:31.2377153Z shell: bash 2025-12-04T09:42:31.2377414Z timeout_minutes: 10 2025-12-04T09:42:31.2377693Z max_attempts: 5 2025-12-04T09:42:31.2377951Z retry_wait_seconds: 30 2025-12-04T09:42:31.2379007Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:42:31.2380006Z polling_interval_seconds: 1 2025-12-04T09:42:31.2380354Z warning_on_retry: true 2025-12-04T09:42:31.2380662Z continue_on_error: false 2025-12-04T09:42:31.2380944Z env: 2025-12-04T09:42:31.2381187Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:31.2381497Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:31.2381860Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:31.2382444Z GITHUB_TOKEN: *** 2025-12-04T09:42:31.2382714Z ##[endgroup] 2025-12-04T09:42:31.3449363Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:42:31.6139040Z Defaulting to user installation because normal site-packages is not writeable 2025-12-04T09:42:31.7411654Z Collecting requests==2.27.1 2025-12-04T09:42:31.7600149Z Downloading requests-2.27.1-py2.py3-none-any.whl (63 kB) 2025-12-04T09:42:31.9626107Z Collecting pyyaml==6.0.2 2025-12-04T09:42:31.9669606Z Downloading PyYAML-6.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (737 kB) 2025-12-04T09:42:31.9946450Z Requirement already satisfied: idna<4,>=2.5 in /usr/lib/python3.9/site-packages (from requests==2.27.1) (2.10) 2025-12-04T09:42:32.4408662Z Collecting charset-normalizer~=2.0.0 2025-12-04T09:42:32.4451029Z Downloading charset_normalizer-2.0.12-py3-none-any.whl (39 kB) 2025-12-04T09:42:32.5030361Z Collecting certifi>=2017.4.17 2025-12-04T09:42:32.5074117Z Downloading certifi-2025.11.12-py3-none-any.whl (159 kB) 2025-12-04T09:42:32.5142111Z Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/lib/python3.9/site-packages (from requests==2.27.1) (1.25.10) 2025-12-04T09:42:32.6097263Z Installing collected packages: charset-normalizer, certifi, requests, pyyaml 2025-12-04T09:42:32.7415228Z Successfully installed certifi-2025.11.12 charset-normalizer-2.0.12 pyyaml-6.0.2 requests-2.27.1 2025-12-04T09:42:33.3251806Z Command completed after 1 attempt(s). 2025-12-04T09:42:33.3312221Z ##[group]Run set -x 2025-12-04T09:42:33.3312537Z set -x 2025-12-04T09:42:33.3312820Z  2025-12-04T09:42:33.3313319Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:42:33.3313913Z # in runner workspace 2025-12-04T09:42:33.3314394Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2025-12-04T09:42:33.3321729Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:33.3322191Z env: 2025-12-04T09:42:33.3322443Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:33.3322776Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:33.3323158Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:33.3323574Z ##[endgroup] 2025-12-04T09:42:33.3355150Z + python3 /home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2025-12-04T09:42:33.3574274Z Setting output branch=main 2025-12-04T09:42:33.3637695Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:42:33.3638202Z echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:42:33.3638666Z echo "Job name: ${JOB_NAME}" 2025-12-04T09:42:33.3639003Z  2025-12-04T09:42:33.3639467Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:42:33.3640224Z # in runner workspace 2025-12-04T09:42:33.3640736Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2025-12-04T09:42:33.3641297Z  --workflow "${GITHUB_WORKFLOW}" \ 2025-12-04T09:42:33.3641699Z  --job-name "${JOB_NAME}" \ 2025-12-04T09:42:33.3649119Z  --test-matrix "{"include": [{"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]}" \ 2025-12-04T09:42:33.3656655Z  --selected-test-configs "" \ 2025-12-04T09:42:33.3657069Z  --pr-number "${PR_NUMBER}" \ 2025-12-04T09:42:33.3657451Z  --tag "${TAG}" \ 2025-12-04T09:42:33.3657810Z  --event-name "${EVENT_NAME}" \ 2025-12-04T09:42:33.3658201Z  --schedule "${SCHEDULE}" \ 2025-12-04T09:42:33.3658684Z  --branch "${HEAD_BRANCH}" 2025-12-04T09:42:33.3665253Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:33.3665712Z env: 2025-12-04T09:42:33.3665963Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:33.3666293Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:33.3666681Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:33.3667374Z GITHUB_TOKEN: *** 2025-12-04T09:42:33.3668108Z JOB_NAME: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:33.3668918Z PR_NUMBER: 2025-12-04T09:42:33.3669179Z TAG: 2025-12-04T09:42:33.3669414Z EVENT_NAME: schedule 2025-12-04T09:42:33.3669711Z SCHEDULE: 29 8 * * * 2025-12-04T09:42:33.3670003Z HEAD_BRANCH: main 2025-12-04T09:42:33.3670260Z ##[endgroup] 2025-12-04T09:42:33.3697036Z Workflow: periodic 2025-12-04T09:42:33.3697782Z Job name: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:33.5611743Z Setting output keep-going=True 2025-12-04T09:42:33.5612199Z Setting output ci-verbose-test-logs=False 2025-12-04T09:42:33.5612620Z Setting output ci-test-showlocals=False 2025-12-04T09:42:33.5613005Z Setting output ci-no-test-timeout=False 2025-12-04T09:42:33.5613391Z Setting output ci-no-td=False 2025-12-04T09:42:33.5613780Z Setting output ci-td-distributed=False 2025-12-04T09:42:33.5614156Z Setting output is-unstable=True 2025-12-04T09:42:33.5614508Z Setting output reenabled-issues= 2025-12-04T09:42:33.5630533Z Setting output test-matrix={"include": [{"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:42:33.5646506Z Setting output is-test-matrix-empty=False 2025-12-04T09:42:33.5747176Z ##[group]Run echo "Filtered matrix:" 2025-12-04T09:42:33.5747624Z echo "Filtered matrix:" 2025-12-04T09:42:33.5763334Z echo "{"include": [{"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 1, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 2, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 3, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 4, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "legacy_nvidia_driver", "shard": 5, "num_shards": 5, "runner": "linux.g4dn.4xlarge.nvidia.gpu", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]}" 2025-12-04T09:42:33.5779354Z  2025-12-04T09:42:33.5779593Z echo 2025-12-04T09:42:33.5779904Z echo "Is the current job unstable? True" 2025-12-04T09:42:33.5780283Z  2025-12-04T09:42:33.5780517Z echo 2025-12-04T09:42:33.5780815Z echo "Is keep-going label set? True" 2025-12-04T09:42:33.5781177Z  2025-12-04T09:42:33.5781409Z echo 2025-12-04T09:42:33.5781681Z echo "Reenabled issues? " 2025-12-04T09:42:33.5788432Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:33.5788880Z env: 2025-12-04T09:42:33.5789133Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:33.5789431Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:33.5789798Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:33.5790215Z ##[endgroup] 2025-12-04T09:42:33.5815562Z Filtered matrix: 2025-12-04T09:42:33.5834881Z {include: [{config: legacy_nvidia_driver, shard: 1, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable}, {config: legacy_nvidia_driver, shard: 1, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: legacy_nvidia_driver, shard: 1, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: legacy_nvidia_driver, shard: 1, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: legacy_nvidia_driver, shard: 2, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable}, {config: legacy_nvidia_driver, shard: 2, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: legacy_nvidia_driver, shard: 2, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: legacy_nvidia_driver, shard: 2, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: legacy_nvidia_driver, shard: 3, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable}, {config: legacy_nvidia_driver, shard: 3, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: legacy_nvidia_driver, shard: 3, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: legacy_nvidia_driver, shard: 3, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: legacy_nvidia_driver, shard: 4, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable}, {config: legacy_nvidia_driver, shard: 4, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: legacy_nvidia_driver, shard: 4, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: legacy_nvidia_driver, shard: 4, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: legacy_nvidia_driver, shard: 5, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable}, {config: legacy_nvidia_driver, shard: 5, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: legacy_nvidia_driver, shard: 5, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: legacy_nvidia_driver, shard: 5, num_shards: 5, runner: linux.g4dn.4xlarge.nvidia.gpu, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}]} 2025-12-04T09:42:33.5850633Z 2025-12-04T09:42:33.5850768Z Is the current job unstable? True 2025-12-04T09:42:33.5851008Z 2025-12-04T09:42:33.5851127Z Is keep-going label set? True 2025-12-04T09:42:33.5851338Z 2025-12-04T09:42:33.5851452Z Reenabled issues? 2025-12-04T09:42:33.5885467Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:42:33.5886088Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:42:33.5892584Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:33.5893031Z env: 2025-12-04T09:42:33.5893284Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:33.5893583Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:33.5893961Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:33.5894385Z JOB_TIMEOUT: 600 2025-12-04T09:42:33.5894640Z ##[endgroup] 2025-12-04T09:42:33.5945146Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:42:33.5945757Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:42:33.5946299Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:42:33.5952282Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:42:33.5952725Z env: 2025-12-04T09:42:33.5952976Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:33.5953282Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:33.5953631Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:33.5954045Z ##[endgroup] 2025-12-04T09:42:33.6066600Z ##[group]Run set -x 2025-12-04T09:42:33.6066980Z set -x 2025-12-04T09:42:33.6067233Z  2025-12-04T09:42:33.6067524Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2025-12-04T09:42:33.6068104Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2025-12-04T09:42:33.6068577Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2025-12-04T09:42:33.6069005Z  TEST_COMMAND=.ci/onnx/test.sh 2025-12-04T09:42:33.6069366Z else 2025-12-04T09:42:33.6069641Z  TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:42:33.6070005Z fi 2025-12-04T09:42:33.6070241Z  2025-12-04T09:42:33.6070531Z # Leaving 1GB for the runner and other things 2025-12-04T09:42:33.6071221Z TOTAL_AVAILABLE_MEMORY_IN_GB=$(awk '/MemTotal/ { printf "%.3f \n", $2/1024/1024 - 1 }' /proc/meminfo) 2025-12-04T09:42:33.6072270Z # https://docs.docker.com/engine/containers/resource_constraints/#--memory-swap-details, the 3GB swap 2025-12-04T09:42:33.6073099Z # comes from https://github.com/pytorch/test-infra/pull/6058 2025-12-04T09:42:33.6073726Z TOTAL_MEMORY_WITH_SWAP=$(("${TOTAL_AVAILABLE_MEMORY_IN_GB%.*}" + 3)) 2025-12-04T09:42:33.6074223Z  2025-12-04T09:42:33.6074532Z if [[ ${BUILD_ENVIRONMENT} == *"s390x"* ]]; then 2025-12-04T09:42:33.6074925Z  SHM_OPTS= 2025-12-04T09:42:33.6075211Z  JENKINS_USER= 2025-12-04T09:42:33.6075622Z  # ensure that docker container cleanly exits in 12 hours 2025-12-04T09:42:33.6076174Z  # if for some reason cleanup action doesn't stop container 2025-12-04T09:42:33.6076644Z  # when job is cancelled 2025-12-04T09:42:33.6077007Z  DOCKER_SHELL_CMD="sleep 12h" 2025-12-04T09:42:33.6077383Z  USED_IMAGE="${DOCKER_IMAGE_S390X}" 2025-12-04T09:42:33.6077748Z else 2025-12-04T09:42:33.6078040Z  SHM_OPTS="--shm-size=${SHM_SIZE}" 2025-12-04T09:42:33.6078438Z  JENKINS_USER="--user jenkins" 2025-12-04T09:42:33.6078790Z  DOCKER_SHELL_CMD= 2025-12-04T09:42:33.6079122Z  USED_IMAGE="${DOCKER_IMAGE}" 2025-12-04T09:42:33.6079470Z fi 2025-12-04T09:42:33.6079696Z  2025-12-04T09:42:33.6080087Z # detached container should get cleaned up by teardown_ec2_linux 2025-12-04T09:42:33.6080718Z # TODO: Stop building test binaries as part of the build phase 2025-12-04T09:42:33.6081443Z # Used for GPU_FLAG, SHM_OPTS, JENKINS_USER and DOCKER_SHELL_CMD since that doesn't play nice 2025-12-04T09:42:33.6082066Z # shellcheck disable=SC2086,SC2090 2025-12-04T09:42:33.6082464Z container_name=$(docker run \ 2025-12-04T09:42:33.6082832Z  ${GPU_FLAG:-} \ 2025-12-04T09:42:33.6083173Z  ${SCCACHE_SERVER_PORT_DOCKER_FLAG:-} \ 2025-12-04T09:42:33.6083583Z  -e BUILD_ENVIRONMENT \ 2025-12-04T09:42:33.6083935Z  -e PR_NUMBER \ 2025-12-04T09:42:33.6084247Z  -e GITHUB_ACTIONS \ 2025-12-04T09:42:33.6084587Z  -e GITHUB_REPOSITORY \ 2025-12-04T09:42:33.6084943Z  -e GITHUB_WORKFLOW \ 2025-12-04T09:42:33.6085286Z  -e GITHUB_JOB \ 2025-12-04T09:42:33.6085590Z  -e GITHUB_RUN_ID \ 2025-12-04T09:42:33.6085916Z  -e GITHUB_RUN_NUMBER \ 2025-12-04T09:42:33.6086264Z  -e GITHUB_RUN_ATTEMPT \ 2025-12-04T09:42:33.6086590Z  -e JOB_ID \ 2025-12-04T09:42:33.6086880Z  -e JOB_NAME \ 2025-12-04T09:42:33.6087181Z  -e BASE_SHA \ 2025-12-04T09:42:33.6087462Z  -e BRANCH \ 2025-12-04T09:42:33.6087994Z  -e SHA1 \ 2025-12-04T09:42:33.6088298Z  -e AWS_DEFAULT_REGION \ 2025-12-04T09:42:33.6088631Z  -e IN_WHEEL_TEST \ 2025-12-04T09:42:33.6088959Z  -e SHARD_NUMBER \ 2025-12-04T09:42:33.6089281Z  -e TEST_CONFIG \ 2025-12-04T09:42:33.6089604Z  -e NUM_TEST_SHARDS \ 2025-12-04T09:42:33.6090091Z  -e REENABLED_ISSUES \ 2025-12-04T09:42:33.6090451Z  -e CONTINUE_THROUGH_ERROR \ 2025-12-04T09:42:33.6090820Z  -e VERBOSE_TEST_LOGS \ 2025-12-04T09:42:33.6091243Z  -e TEST_SHOWLOCALS \ 2025-12-04T09:42:33.6091579Z  -e NO_TEST_TIMEOUT \ 2025-12-04T09:42:33.6091900Z  -e NO_TD \ 2025-12-04T09:42:33.6092178Z  -e TD_DISTRIBUTED \ 2025-12-04T09:42:33.6092507Z  -e PR_LABELS \ 2025-12-04T09:42:33.6092850Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2025-12-04T09:42:33.6093244Z  -e SCCACHE_BUCKET \ 2025-12-04T09:42:33.6093559Z  -e SCCACHE_REGION \ 2025-12-04T09:42:33.6093883Z  -e XLA_CUDA \ 2025-12-04T09:42:33.6094219Z  -e XLA_CLANG_CACHE_S3_BUCKET_NAME \ 2025-12-04T09:42:33.6094629Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2025-12-04T09:42:33.6095064Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2025-12-04T09:42:33.6095504Z  -e SKIP_SCCACHE_INITIALIZATION=1 \ 2025-12-04T09:42:33.6095893Z  -e HUGGING_FACE_HUB_TOKEN \ 2025-12-04T09:42:33.6096278Z  -e VLLM_TEST_HUGGING_FACE_TOKEN \ 2025-12-04T09:42:33.6096684Z  -e SCRIBE_GRAPHQL_ACCESS_TOKEN \ 2025-12-04T09:42:33.6097062Z  -e DASHBOARD_TAG \ 2025-12-04T09:42:33.6097377Z  -e ARTIFACTS_FILE_SUFFIX \ 2025-12-04T09:42:33.6097798Z  --memory="${TOTAL_AVAILABLE_MEMORY_IN_GB%.*}g" \ 2025-12-04T09:42:33.6098350Z  --memory-swap="${TOTAL_MEMORY_WITH_SWAP}g" \ 2025-12-04T09:42:33.6098829Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:42:33.6099293Z  --security-opt seccomp=unconfined \ 2025-12-04T09:42:33.6099699Z  --cap-add=SYS_PTRACE \ 2025-12-04T09:42:33.6100046Z  --ipc=host \ 2025-12-04T09:42:33.6100332Z  ${SHM_OPTS} \ 2025-12-04T09:42:33.6100627Z  --tty \ 2025-12-04T09:42:33.6100898Z  --detach \ 2025-12-04T09:42:33.6101195Z  --name="${container_name}" \ 2025-12-04T09:42:33.6101561Z  ${JENKINS_USER} \ 2025-12-04T09:42:33.6101961Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2025-12-04T09:42:33.6102422Z  -w /var/lib/jenkins/workspace \ 2025-12-04T09:42:33.6102794Z  "${USED_IMAGE}" \ 2025-12-04T09:42:33.6103111Z  ${DOCKER_SHELL_CMD} 2025-12-04T09:42:33.6103406Z ) 2025-12-04T09:42:33.6103791Z echo "DOCKER_CONTAINER_ID=${container_name}" >> "${GITHUB_ENV}" 2025-12-04T09:42:33.6104275Z  2025-12-04T09:42:33.6104581Z if [[ ${BUILD_ENVIRONMENT} == *"s390x"* ]]; then 2025-12-04T09:42:33.6105254Z  docker exec -t "${container_name}" sh -c "python3 -m pip install -r .ci/docker/requirements-ci.txt" 2025-12-04T09:42:33.6105877Z fi 2025-12-04T09:42:33.6106117Z  2025-12-04T09:42:33.6106689Z docker exec -t "${container_name}" sh -c "python3 -m pip install $(echo dist/*.whl)[opt-einsum] && ${TEST_COMMAND}" 2025-12-04T09:42:33.6112981Z shell: /usr/bin/bash -e {0} 2025-12-04T09:42:33.6113301Z env: 2025-12-04T09:42:33.6113545Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:42:33.6113848Z HAS_NVIDIA_GPU: true 2025-12-04T09:42:33.6114213Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:33.6114719Z BUILD_ENVIRONMENT: linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T09:42:33.6115120Z PR_NUMBER: 2025-12-04T09:42:33.6115399Z GITHUB_REPOSITORY: pytorch/pytorch 2025-12-04T09:42:33.6115758Z GITHUB_WORKFLOW: periodic 2025-12-04T09:42:33.6116050Z GITHUB_JOB: test 2025-12-04T09:42:33.6116323Z GITHUB_RUN_ID: 19922826259 2025-12-04T09:42:33.6116634Z GITHUB_RUN_NUMBER: 19107 2025-12-04T09:42:33.6116923Z GITHUB_RUN_ATTEMPT: 1 2025-12-04T09:42:33.6117208Z JOB_ID: 57119749259 2025-12-04T09:42:33.6117924Z JOB_NAME: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:33.6118809Z BRANCH: main 2025-12-04T09:42:33.6119109Z SHA1: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:33.6119567Z BASE_SHA: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:33.6120052Z TEST_CONFIG: legacy_nvidia_driver 2025-12-04T09:42:33.6120376Z SHARD_NUMBER: 2 2025-12-04T09:42:33.6120638Z NUM_TEST_SHARDS: 5 2025-12-04T09:42:33.6120910Z EXTRA_FLAGS: 2025-12-04T09:42:33.6121156Z OP_BENCHMARK_TESTS: 2025-12-04T09:42:33.6121438Z REENABLED_ISSUES: 2025-12-04T09:42:33.6121726Z CONTINUE_THROUGH_ERROR: True 2025-12-04T09:42:33.6122052Z VERBOSE_TEST_LOGS: False 2025-12-04T09:42:33.6122341Z TEST_SHOWLOCALS: False 2025-12-04T09:42:33.6122637Z NO_TEST_TIMEOUT: False 2025-12-04T09:42:33.6122900Z NO_TD: False 2025-12-04T09:42:33.6123145Z TD_DISTRIBUTED: False 2025-12-04T09:42:33.6123497Z SCCACHE_BUCKET: ossci-compiler-cache-circleci-v2 2025-12-04T09:42:33.6123897Z SCCACHE_REGION: us-east-1 2025-12-04T09:42:33.6124186Z SHM_SIZE: 2g 2025-12-04T09:42:33.6125105Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:42:33.6126792Z DOCKER_IMAGE_S390X: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:42:33.6127798Z XLA_CUDA: 2025-12-04T09:42:33.6128193Z XLA_CLANG_CACHE_S3_BUCKET_NAME: ossci-compiler-clang-cache-circleci-xla 2025-12-04T09:42:33.6128710Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 1 2025-12-04T09:42:33.6129067Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2025-12-04T09:42:33.6129398Z DASHBOARD_TAG: 2025-12-04T09:42:33.6129886Z VLLM_TEST_HUGGING_FACE_TOKEN: *** 2025-12-04T09:42:33.6130344Z HUGGING_FACE_HUB_TOKEN: *** 2025-12-04T09:42:33.6130815Z SCRIBE_GRAPHQL_ACCESS_TOKEN: *** 2025-12-04T09:42:33.6131417Z ARTIFACTS_FILE_SUFFIX: test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T09:42:33.6132040Z ##[endgroup] 2025-12-04T09:42:33.6156716Z + [[ legacy_nvidia_driver == \m\u\l\t\i\g\p\u ]] 2025-12-04T09:42:33.6157165Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *onnx* ]] 2025-12-04T09:42:33.6157580Z + TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:42:33.6160235Z ++ awk '/MemTotal/ { printf "%.3f \n", $2/1024/1024 - 1 }' /proc/meminfo 2025-12-04T09:42:33.6180892Z + TOTAL_AVAILABLE_MEMORY_IN_GB='61.094 ' 2025-12-04T09:42:33.6181269Z + TOTAL_MEMORY_WITH_SWAP=64 2025-12-04T09:42:33.6181649Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *\s\3\9\0\x* ]] 2025-12-04T09:42:33.6182067Z + SHM_OPTS=--shm-size=2g 2025-12-04T09:42:33.6182360Z + JENKINS_USER='--user jenkins' 2025-12-04T09:42:33.6182659Z + DOCKER_SHELL_CMD= 2025-12-04T09:42:33.6183581Z + USED_IMAGE=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:42:33.6190236Z +++ nproc --ignore=2 2025-12-04T09:42:33.6218001Z ++ docker run --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e TD_DISTRIBUTED -e PR_LABELS -e MAX_JOBS=14 -e SCCACHE_BUCKET -e SCCACHE_REGION -e XLA_CUDA -e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e SKIP_SCCACHE_INITIALIZATION=1 -e HUGGING_FACE_HUB_TOKEN -e VLLM_TEST_HUGGING_FACE_TOKEN -e SCRIBE_GRAPHQL_ACCESS_TOKEN -e DASHBOARD_TAG -e ARTIFACTS_FILE_SUFFIX --memory=61g --memory-swap=64g --env-file=/tmp/github_env_19922826259 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --ipc=host --shm-size=2g --tty --detach --name= --user jenkins -v /home/ec2-user/actions-runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:42:41.5837720Z + container_name=fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T09:42:41.5838599Z + echo DOCKER_CONTAINER_ID=fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T09:42:41.5839335Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *\s\3\9\0\x* ]] 2025-12-04T09:42:41.5844188Z ++ echo dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:42:41.5846246Z + docker exec -t fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 sh -c 'python3 -m pip install dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl[opt-einsum] && .ci/pytorch/test.sh' 2025-12-04T09:42:42.0966612Z Processing ./dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl (from torch==2.10.0a0+gitffd9b0f) 2025-12-04T09:42:42.9713352Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (3.18.0) 2025-12-04T09:42:42.9717864Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (4.12.2) 2025-12-04T09:42:42.9723249Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (1.13.3) 2025-12-04T09:42:42.9728319Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (2.8.8) 2025-12-04T09:42:42.9732631Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (3.1.6) 2025-12-04T09:42:42.9737797Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (2025.10.0) 2025-12-04T09:42:42.9753242Z Requirement already satisfied: opt-einsum>=3.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (3.3.0) 2025-12-04T09:42:43.0182841Z Requirement already satisfied: numpy>=1.7 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from opt-einsum>=3.3->torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (1.22.4) 2025-12-04T09:42:43.0205668Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (1.3.0) 2025-12-04T09:42:43.0272010Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch==2.10.0a0+gitffd9b0f->torch==2.10.0a0+gitffd9b0f) (3.0.3) 2025-12-04T09:42:43.4522142Z Installing collected packages: torch 2025-12-04T09:42:55.7771447Z Successfully installed torch-2.10.0a0+gitffd9b0f 2025-12-04T09:42:55.8390736Z + export TERM=vt100 2025-12-04T09:42:55.8391046Z + TERM=vt100 2025-12-04T09:42:55.8393663Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:42:55.8402318Z + source .ci/pytorch/common.sh 2025-12-04T09:42:55.8405904Z +++ dirname .ci/pytorch/common.sh 2025-12-04T09:42:55.8413043Z ++ source .ci/pytorch/common_utils.sh 2025-12-04T09:42:55.8414758Z +++ declare -f -t trap_add 2025-12-04T09:42:55.8420332Z ++ set -ex -o pipefail 2025-12-04T09:42:55.8420693Z ++ [[ linux-jammy-cuda12.4-py3.10-gcc11 == *rocm* ]] 2025-12-04T09:42:55.8421115Z ++ BUILD_TEST_LIBTORCH=0 2025-12-04T09:42:55.8424098Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:42:55.8431687Z + source .ci/pytorch/common-build.sh 2025-12-04T09:42:55.8433490Z ++ [[ linux-jammy-cuda12.4-py3.10-gcc11 != *win-* ]] 2025-12-04T09:42:55.8439629Z ++++ dirname .ci/pytorch/common-build.sh 2025-12-04T09:42:55.8448084Z +++ cd .ci/pytorch 2025-12-04T09:42:55.8448388Z +++ pwd -P 2025-12-04T09:42:55.8450513Z ++ script_dir=/var/lib/jenkins/workspace/.ci/pytorch 2025-12-04T09:42:55.8451151Z ++ [[ linux-jammy-cuda12.4-py3.10-gcc11 == *-pch* ]] 2025-12-04T09:42:55.8451554Z ++ which sccache 2025-12-04T09:42:55.8468272Z ++ [[ -z ossci-compiler-cache-circleci-v2 ]] 2025-12-04T09:42:55.8468670Z ++ sccache --stop-server 2025-12-04T09:42:55.8494773Z ++ true 2025-12-04T09:42:55.8495063Z ++ rm -f /var/lib/jenkins/sccache_error.log 2025-12-04T09:42:55.8504444Z ++ trap_add sccache_epilogue EXIT 2025-12-04T09:42:55.8504804Z ++ trap_add_cmd=sccache_epilogue 2025-12-04T09:42:55.8505126Z ++ shift 2025-12-04T09:42:55.8505380Z ++ for trap_add_name in "$@" 2025-12-04T09:42:55.8512500Z ++++ trap -p EXIT 2025-12-04T09:42:55.8514656Z +++ eval 'extract_trap_cmd ' 2025-12-04T09:42:55.8515117Z ++++ extract_trap_cmd 2025-12-04T09:42:55.8515397Z ++++ printf '%s\n' '' 2025-12-04T09:42:55.8515723Z +++ printf '%s\n' sccache_epilogue 2025-12-04T09:42:55.8517036Z ++ trap -- ' 2025-12-04T09:42:55.8517370Z sccache_epilogue' EXIT 2025-12-04T09:42:55.8517775Z ++ [[ -n 1 ]] 2025-12-04T09:42:55.8518229Z ++ echo 'Skipping sccache server initialization, setting environment variables' 2025-12-04T09:42:55.8518964Z Skipping sccache server initialization, setting environment variables 2025-12-04T09:42:55.8519481Z ++ export SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:42:55.8519822Z ++ SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:42:55.8520225Z ++ export SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:42:55.8520747Z ++ SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:42:55.8527917Z ++ export RUST_LOG=sccache::server=error 2025-12-04T09:42:55.8528356Z ++ RUST_LOG=sccache::server=error 2025-12-04T09:42:55.8528719Z ++ sccache --zero-stats 2025-12-04T09:42:55.9881997Z Statistics zeroed. 2025-12-04T09:42:55.9884110Z ++ which ccache 2025-12-04T09:42:55.9907248Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 != *rocm* ]] 2025-12-04T09:42:55.9907771Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 != *s390x* ]] 2025-12-04T09:42:55.9908198Z + [[ -d /var/lib/jenkins/workspace ]] 2025-12-04T09:42:55.9910312Z ++ stat -c %u /var/lib/jenkins/workspace 2025-12-04T09:42:55.9924749Z + WORKSPACE_ORIGINAL_OWNER_ID=1000 2025-12-04T09:42:55.9925152Z + trap_add cleanup_workspace EXIT 2025-12-04T09:42:55.9925521Z + trap_add_cmd=cleanup_workspace 2025-12-04T09:42:55.9925846Z + shift 2025-12-04T09:42:55.9926081Z + for trap_add_name in "$@" 2025-12-04T09:42:55.9932510Z +++ trap -p EXIT 2025-12-04T09:42:55.9934917Z ++ eval 'extract_trap_cmd trap -- '\'' 2025-12-04T09:42:55.9935342Z sccache_epilogue'\'' EXIT' 2025-12-04T09:42:55.9935654Z +++ extract_trap_cmd trap -- ' 2025-12-04T09:42:55.9936081Z sccache_epilogue' EXIT 2025-12-04T09:42:55.9936368Z +++ printf '%s\n' ' 2025-12-04T09:42:55.9936624Z sccache_epilogue' 2025-12-04T09:42:55.9936913Z ++ printf '%s\n' cleanup_workspace 2025-12-04T09:42:55.9938204Z + trap -- ' 2025-12-04T09:42:55.9938516Z sccache_epilogue 2025-12-04T09:42:55.9938824Z cleanup_workspace' EXIT 2025-12-04T09:42:55.9939187Z + sudo chown -R jenkins /var/lib/jenkins/workspace 2025-12-04T09:42:56.7108091Z + git config --global --add safe.directory /var/lib/jenkins/workspace 2025-12-04T09:42:56.7127186Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *cuda* ]] 2025-12-04T09:42:56.7130703Z ++ python -c 'import os;import numba.cuda; print(os.path.dirname(numba.cuda.__file__))' 2025-12-04T09:42:57.1816791Z + NUMBA_CUDA_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/numba/cuda 2025-12-04T09:42:57.1817527Z + '[' -n /opt/conda/envs/py_3.10/lib/python3.10/site-packages/numba/cuda ']' 2025-12-04T09:42:57.1822995Z +++ realpath .ci/pytorch/test.sh 2025-12-04T09:42:57.1832929Z ++ dirname /var/lib/jenkins/workspace/.ci/pytorch/test.sh 2025-12-04T09:42:57.1851543Z + NUMBA_PATCH=/var/lib/jenkins/workspace/.ci/pytorch/numba-cuda-13.patch 2025-12-04T09:42:57.1852220Z + pushd /opt/conda/envs/py_3.10/lib/python3.10/site-packages/numba/cuda 2025-12-04T09:42:57.1853220Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/numba/cuda ~/workspace 2025-12-04T09:42:57.1853772Z + patch -p4 2025-12-04T09:42:57.1866159Z patching file cudadrv/driver.py 2025-12-04T09:42:57.1866755Z Hunk #1 succeeded at 357 (offset -8 lines). 2025-12-04T09:42:57.1875985Z + popd 2025-12-04T09:42:57.1876232Z ~/workspace 2025-12-04T09:42:57.1876545Z + echo 'Environment variables:' 2025-12-04T09:42:57.1876879Z Environment variables: 2025-12-04T09:42:57.1877156Z + env 2025-12-04T09:42:57.1885911Z GITHUB_WORKSPACE=/home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-12-04T09:42:57.1886707Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:42:57.1887210Z BUILD_ENVIRONMENT=linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T09:42:57.1888145Z VLLM_TEST_HUGGING_FACE_TOKEN=*** 2025-12-04T09:42:57.1888537Z HOSTNAME=fffd24d16954 2025-12-04T09:42:57.1889232Z GITHUB_PATH=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/add_path_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.1889992Z GITHUB_ACTION=__run_3 2025-12-04T09:42:57.1890298Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 2025-12-04T09:42:57.1890647Z GITHUB_RUN_NUMBER=19107 2025-12-04T09:42:57.1890995Z TEST_CONFIG=legacy_nvidia_driver 2025-12-04T09:42:57.1891353Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:42:57.1891733Z TORCH_NVCC_FLAGS=-Xfatbin -compress-all 2025-12-04T09:42:57.1892088Z SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:42:57.1892545Z SCRIBE_GRAPHQL_ACCESS_TOKEN=*** 2025-12-04T09:42:57.1892888Z GITHUB_TRIGGERING_ACTOR=huydhn 2025-12-04T09:42:57.1893356Z GITHUB_REF_TYPE=branch 2025-12-04T09:42:57.1893706Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.1894104Z XLA_CUDA= 2025-12-04T09:42:57.1894354Z NCCL_LIB_DIR=/usr/local/cuda/lib64/ 2025-12-04T09:42:57.1894840Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:42:57.1895348Z *** 2025-12-04T09:42:57.1895600Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:42:57.1895912Z GITHUB_ACTIONS=true 2025-12-04T09:42:57.1896206Z NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:57.1896612Z SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:42:57.1897062Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.1897505Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.1898133Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/periodic.yml@refs/heads/main 2025-12-04T09:42:57.1898787Z UCC_HOME=/usr 2025-12-04T09:42:57.1899116Z VERBOSE_TEST_LOGS=False 2025-12-04T09:42:57.1899419Z GITHUB_REF=refs/heads/main 2025-12-04T09:42:57.1899723Z SHARD_NUMBER=2 2025-12-04T09:42:57.1899979Z GITHUB_REF_PROTECTED=true 2025-12-04T09:42:57.1900285Z HOME=/var/lib/jenkins 2025-12-04T09:42:57.1900606Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:42:57.1900980Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:42:57.1901379Z UCX_COMMIT=7836b165abdbe468a2f607e7254011c07d788152 2025-12-04T09:42:57.1901775Z USE_SYSTEM_NCCL=1 2025-12-04T09:42:57.1902024Z NUM_TEST_SHARDS=5 2025-12-04T09:42:57.1902280Z UCX_HOME=/usr 2025-12-04T09:42:57.1902958Z GITHUB_STATE=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/save_state_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.1904139Z JOB_NAME=linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:57.1905312Z GITHUB_ENV=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_env_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.1906276Z GITHUB_EVENT_PATH=/home/ec2-user/actions-runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:42:57.1906873Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:42:57.1907162Z DASHBOARD_TAG= 2025-12-04T09:42:57.1907427Z GITHUB_RUN_ID=19922826259 2025-12-04T09:42:57.1907726Z INSTALLED_OPENBLAS= 2025-12-04T09:42:57.1908445Z GITHUB_STEP_SUMMARY=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/step_summary_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.1909248Z GITHUB_ACTOR=huydhn 2025-12-04T09:42:57.1909517Z PR_NUMBER= 2025-12-04T09:42:57.1909757Z DESIRED_CUDA=12.4 2025-12-04T09:42:57.1910208Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:42:57.1910510Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:42:57.1910902Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:42:57.1911403Z TERM=vt100 2025-12-04T09:42:57.1911651Z INSTALLED_VISION=yes 2025-12-04T09:42:57.1911928Z BRANCH=main 2025-12-04T09:42:57.1912169Z SCCACHE_REGION=us-east-1 2025-12-04T09:42:57.1912482Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:42:57.1912811Z BUILD_AOT_INDUCTOR_TEST= 2025-12-04T09:42:57.1913101Z CUDA_PATH=/usr/local/cuda 2025-12-04T09:42:57.1913714Z GITHUB_ACTION_PATH=/home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-linux 2025-12-04T09:42:57.1914402Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:42:57.1914806Z UCC_COMMIT=430e241bf5d38cbc73fc7a6b89155397232e3f96 2025-12-04T09:42:57.1915209Z REENABLED_ISSUES= 2025-12-04T09:42:57.1915467Z DOCS= 2025-12-04T09:42:57.1915681Z SHLVL=1 2025-12-04T09:42:57.1915909Z MAX_JOBS=14 2025-12-04T09:42:57.1916156Z GITHUB_ACTOR_ID=475357 2025-12-04T09:42:57.1916557Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.1916999Z GITHUB_REF_NAME=main 2025-12-04T09:42:57.1917440Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2025-12-04T09:42:57.1917955Z GITHUB_JOB=test 2025-12-04T09:42:57.1918208Z NO_TEST_TIMEOUT=False 2025-12-04T09:42:57.1918499Z TD_DISTRIBUTED=False 2025-12-04T09:42:57.1918809Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:42:57.1919147Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:42:57.1919455Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:42:57.1919767Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:42:57.1920681Z PATH=/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:42:57.1921653Z GITHUB_BASE_REF= 2025-12-04T09:42:57.1921925Z INSTALLED_ACL= 2025-12-04T09:42:57.1922468Z ARTIFACTS_FILE_SUFFIX=test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T09:42:57.1923086Z CI=true 2025-12-04T09:42:57.1923347Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:42:57.1923734Z RUST_LOG=sccache::server=error 2025-12-04T09:42:57.1924043Z JOB_ID=57119749259 2025-12-04T09:42:57.1924311Z GITHUB_HEAD_REF= 2025-12-04T09:42:57.1924574Z GITHUB_ACTION_REF= 2025-12-04T09:42:57.1924891Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2025-12-04T09:42:57.1925302Z TEST_SHOWLOCALS=False 2025-12-04T09:42:57.1925592Z GITHUB_WORKFLOW=periodic 2025-12-04T09:42:57.1925899Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:42:57.1926638Z GITHUB_OUTPUT=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_output_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.1927402Z NO_TD=False 2025-12-04T09:42:57.1927655Z SKIP_SCCACHE_INITIALIZATION=1 2025-12-04T09:42:57.1928011Z NCCL_INCLUDE_DIR=/usr/local/cuda/include/ 2025-12-04T09:42:57.1928374Z _=/usr/bin/env 2025-12-04T09:42:57.1928787Z OLDPWD=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/numba/cuda 2025-12-04T09:42:57.1929406Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2025-12-04T09:42:57.2069456Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch 2025-12-04T09:42:57.2070406Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2025-12-04T09:42:57.2071091Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib 2025-12-04T09:42:57.2071810Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/test 2025-12-04T09:42:57.2072402Z + BUILD_DIR=build 2025-12-04T09:42:57.2072690Z + BUILD_RENAMED_DIR=build_renamed 2025-12-04T09:42:57.2073023Z + BUILD_BIN_DIR=build/bin 2025-12-04T09:42:57.2073318Z + SHARD_NUMBER=2 2025-12-04T09:42:57.2073583Z + NUM_TEST_SHARDS=5 2025-12-04T09:42:57.2073865Z + export TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:42:57.2074231Z + TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:42:57.2074553Z + export VALGRIND=ON 2025-12-04T09:42:57.2074815Z + VALGRIND=ON 2025-12-04T09:42:57.2075327Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *clang9* ]] 2025-12-04T09:42:57.2075803Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *xpu* ]] 2025-12-04T09:42:57.2076191Z + detect_cuda_arch 2025-12-04T09:42:57.2076624Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *cuda* ]] 2025-12-04T09:42:57.2077040Z + command -v nvidia-smi 2025-12-04T09:42:57.2077340Z /usr/bin/nvidia-smi 2025-12-04T09:42:57.2079530Z ++ nvidia-smi --query-gpu=compute_cap --format=csv 2025-12-04T09:42:57.2080553Z ++ tail -n 1 2025-12-04T09:42:57.2294735Z + TORCH_CUDA_ARCH_LIST=7.5 2025-12-04T09:42:57.2295098Z + export TORCH_CUDA_ARCH_LIST 2025-12-04T09:42:57.2295494Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *s390x* ]] 2025-12-04T09:42:57.2295899Z + [[ 0 == \1 ]] 2025-12-04T09:42:57.2296155Z + [[ True == \1 ]] 2025-12-04T09:42:57.2296468Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 != *bazel* ]] 2025-12-04T09:42:57.2299754Z ++ realpath build/custom_test_artifacts 2025-12-04T09:42:57.2317968Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/workspace/build/custom_test_artifacts 2025-12-04T09:42:57.2318546Z + [[ -n '' ]] 2025-12-04T09:42:57.2318817Z + echo 'Environment variables' 2025-12-04T09:42:57.2319151Z Environment variables 2025-12-04T09:42:57.2319422Z + env 2025-12-04T09:42:57.2336794Z GITHUB_WORKSPACE=/home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-12-04T09:42:57.2337404Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:42:57.2337965Z BUILD_ENVIRONMENT=linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T09:42:57.2339130Z VLLM_TEST_HUGGING_FACE_TOKEN=*** 2025-12-04T09:42:57.2339668Z HOSTNAME=fffd24d16954 2025-12-04T09:42:57.2340404Z GITHUB_PATH=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/add_path_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.2341164Z GITHUB_ACTION=__run_3 2025-12-04T09:42:57.2341605Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 2025-12-04T09:42:57.2341961Z GITHUB_RUN_NUMBER=19107 2025-12-04T09:42:57.2342271Z TEST_CONFIG=legacy_nvidia_driver 2025-12-04T09:42:57.2342612Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:42:57.2343002Z TORCH_NVCC_FLAGS=-Xfatbin -compress-all 2025-12-04T09:42:57.2343373Z SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:42:57.2343834Z SCRIBE_GRAPHQL_ACCESS_TOKEN=*** 2025-12-04T09:42:57.2344162Z GITHUB_TRIGGERING_ACTOR=huydhn 2025-12-04T09:42:57.2344497Z GITHUB_REF_TYPE=branch 2025-12-04T09:42:57.2344786Z TORCH_CUDA_ARCH_LIST=7.5 2025-12-04T09:42:57.2345126Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.2345516Z XLA_CUDA= 2025-12-04T09:42:57.2345774Z NCCL_LIB_DIR=/usr/local/cuda/lib64/ 2025-12-04T09:42:57.2346548Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:42:57.2346967Z *** 2025-12-04T09:42:57.2347258Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:42:57.2347661Z GITHUB_ACTIONS=true 2025-12-04T09:42:57.2347985Z NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T09:42:57.2348413Z SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:42:57.2349068Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.2349524Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.2350149Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/periodic.yml@refs/heads/main 2025-12-04T09:42:57.2350718Z UCC_HOME=/usr 2025-12-04T09:42:57.2351003Z TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:42:57.2351318Z VERBOSE_TEST_LOGS=False 2025-12-04T09:42:57.2351621Z GITHUB_REF=refs/heads/main 2025-12-04T09:42:57.2351922Z SHARD_NUMBER=2 2025-12-04T09:42:57.2352197Z GITHUB_REF_PROTECTED=true 2025-12-04T09:42:57.2352489Z HOME=/var/lib/jenkins 2025-12-04T09:42:57.2352816Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:42:57.2353206Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:42:57.2353597Z UCX_COMMIT=7836b165abdbe468a2f607e7254011c07d788152 2025-12-04T09:42:57.2354000Z USE_SYSTEM_NCCL=1 2025-12-04T09:42:57.2354269Z NUM_TEST_SHARDS=5 2025-12-04T09:42:57.2354518Z UCX_HOME=/usr 2025-12-04T09:42:57.2355193Z GITHUB_STATE=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/save_state_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.2356593Z JOB_NAME=linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T09:42:57.2357766Z GITHUB_ENV=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_env_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.2358826Z GITHUB_EVENT_PATH=/home/ec2-user/actions-runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:42:57.2359424Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:42:57.2359732Z DASHBOARD_TAG= 2025-12-04T09:42:57.2359989Z GITHUB_RUN_ID=19922826259 2025-12-04T09:42:57.2360295Z INSTALLED_OPENBLAS= 2025-12-04T09:42:57.2361029Z GITHUB_STEP_SUMMARY=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/step_summary_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.2361823Z GITHUB_ACTOR=huydhn 2025-12-04T09:42:57.2362088Z PR_NUMBER= 2025-12-04T09:42:57.2362328Z DESIRED_CUDA=12.4 2025-12-04T09:42:57.2362583Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:42:57.2362857Z VALGRIND=ON 2025-12-04T09:42:57.2363115Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:42:57.2363508Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:42:57.2363902Z TERM=vt100 2025-12-04T09:42:57.2364144Z INSTALLED_VISION=yes 2025-12-04T09:42:57.2364427Z BRANCH=main 2025-12-04T09:42:57.2364669Z SCCACHE_REGION=us-east-1 2025-12-04T09:42:57.2364981Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:42:57.2365311Z BUILD_AOT_INDUCTOR_TEST= 2025-12-04T09:42:57.2365599Z CUDA_PATH=/usr/local/cuda 2025-12-04T09:42:57.2366211Z GITHUB_ACTION_PATH=/home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-linux 2025-12-04T09:42:57.2366901Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:42:57.2367302Z UCC_COMMIT=430e241bf5d38cbc73fc7a6b89155397232e3f96 2025-12-04T09:42:57.2367705Z REENABLED_ISSUES= 2025-12-04T09:42:57.2367964Z DOCS= 2025-12-04T09:42:57.2368178Z SHLVL=1 2025-12-04T09:42:57.2368402Z MAX_JOBS=14 2025-12-04T09:42:57.2368650Z GITHUB_ACTOR_ID=475357 2025-12-04T09:42:57.2369031Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:42:57.2369490Z GITHUB_REF_NAME=main 2025-12-04T09:42:57.2369928Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2025-12-04T09:42:57.2370429Z GITHUB_JOB=test 2025-12-04T09:42:57.2370684Z NO_TEST_TIMEOUT=False 2025-12-04T09:42:57.2370972Z TD_DISTRIBUTED=False 2025-12-04T09:42:57.2371277Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:42:57.2371612Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:42:57.2371915Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:42:57.2372221Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:42:57.2373134Z PATH=/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:42:57.2374099Z GITHUB_BASE_REF= 2025-12-04T09:42:57.2374365Z INSTALLED_ACL= 2025-12-04T09:42:57.2374890Z ARTIFACTS_FILE_SUFFIX=test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T09:42:57.2375510Z CI=true 2025-12-04T09:42:57.2375768Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:42:57.2376155Z RUST_LOG=sccache::server=error 2025-12-04T09:42:57.2376456Z JOB_ID=57119749259 2025-12-04T09:42:57.2376720Z GITHUB_HEAD_REF= 2025-12-04T09:42:57.2376983Z GITHUB_ACTION_REF= 2025-12-04T09:42:57.2377307Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2025-12-04T09:42:57.2377721Z TEST_SHOWLOCALS=False 2025-12-04T09:42:57.2378020Z GITHUB_WORKFLOW=periodic 2025-12-04T09:42:57.2378326Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:42:57.2379172Z GITHUB_OUTPUT=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_output_751444db-9cfd-4cb1-be9d-c9f4352c3149 2025-12-04T09:42:57.2379937Z NO_TD=False 2025-12-04T09:42:57.2380189Z SKIP_SCCACHE_INITIALIZATION=1 2025-12-04T09:42:57.2380545Z NCCL_INCLUDE_DIR=/usr/local/cuda/include/ 2025-12-04T09:42:57.2381080Z OLDPWD=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/numba/cuda 2025-12-04T09:42:57.2381565Z _=/usr/bin/env 2025-12-04T09:42:57.2381822Z + echo 'Testing pytorch' 2025-12-04T09:42:57.2382117Z Testing pytorch 2025-12-04T09:42:57.2382501Z + export LANG=C.UTF-8 2025-12-04T09:42:57.2382769Z + LANG=C.UTF-8 2025-12-04T09:42:57.2383019Z + PR_NUMBER= 2025-12-04T09:42:57.2383301Z + [[ legacy_nvidia_driver == \d\e\f\a\u\l\t ]] 2025-12-04T09:42:57.2383790Z + [[ legacy_nvidia_driver == \d\i\s\t\r\i\b\u\t\e\d ]] 2025-12-04T09:42:57.2384207Z + [[ legacy_nvidia_driver == \s\l\o\w ]] 2025-12-04T09:42:57.2384667Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *slow-gradcheck* ]] 2025-12-04T09:42:57.2385172Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *cuda* ]] 2025-12-04T09:42:57.2385616Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:42:57.2386024Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:42:57.2386389Z + [[ legacy_nvidia_driver == *crossref* ]] 2025-12-04T09:42:57.2386806Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *rocm* ]] 2025-12-04T09:42:57.2387254Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *xpu* ]] 2025-12-04T09:42:57.2387925Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 != *-bazel-* ]] 2025-12-04T09:42:57.2388342Z + pip_install ninja==1.10.2 2025-12-04T09:42:57.2388773Z + pip_install_pkg='python3 -m pip install --progress-bar off' 2025-12-04T09:42:57.2389321Z + python3 -m pip install --progress-bar off ninja==1.10.2 2025-12-04T09:42:57.6618759Z Collecting ninja==1.10.2 2025-12-04T09:42:57.6867467Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2025-12-04T09:42:57.6975142Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2025-12-04T09:42:58.1222798Z Installing collected packages: ninja 2025-12-04T09:42:58.1223202Z Attempting uninstall: ninja 2025-12-04T09:42:58.1231828Z Found existing installation: ninja 1.11.1.4 2025-12-04T09:42:58.1255419Z Uninstalling ninja-1.11.1.4: 2025-12-04T09:42:58.1321975Z Successfully uninstalled ninja-1.11.1.4 2025-12-04T09:42:58.1695406Z Successfully installed ninja-1.10.2 2025-12-04T09:42:58.2288670Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:42:58.2290611Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:42:58.2291871Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *aarch64* ]] 2025-12-04T09:42:58.2292413Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *asan* ]] 2025-12-04T09:42:58.2292877Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *-debug* ]] 2025-12-04T09:42:58.2293347Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 != *-bazel-* ]] 2025-12-04T09:42:58.2294010Z + echo 'We are not in debug mode: linux-jammy-cuda12.4-py3.10-gcc11. Expect the assertion to pass' 2025-12-04T09:42:58.2294816Z We are not in debug mode: linux-jammy-cuda12.4-py3.10-gcc11. Expect the assertion to pass 2025-12-04T09:42:58.2295385Z + cd test 2025-12-04T09:42:58.2295794Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2025-12-04T09:42:59.9148747Z + [[ legacy_nvidia_driver == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2025-12-04T09:42:59.9149291Z + [[ legacy_nvidia_driver == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2025-12-04T09:42:59.9149921Z + [[ legacy_nvidia_driver == \l\e\g\a\c\y\_\n\v\i\d\i\a\_\d\r\i\v\e\r ]] 2025-12-04T09:42:59.9150663Z + cd test 2025-12-04T09:42:59.9151659Z + python -c 'import torch; torch.rand(2, 2, device='\''cuda'\'')' 2025-12-04T09:43:04.8414140Z + export USE_LEGACY_DRIVER=1 2025-12-04T09:43:04.8414537Z + USE_LEGACY_DRIVER=1 2025-12-04T09:43:04.8420444Z + DYNAMO_BENCHMARK_FLAGS=() 2025-12-04T09:43:04.8421322Z + [[ legacy_nvidia_driver == *pr_time_benchmarks* ]] 2025-12-04T09:43:04.8421816Z + [[ legacy_nvidia_driver == *dynamo_eager* ]] 2025-12-04T09:43:04.8422218Z + [[ legacy_nvidia_driver == *aot_eager* ]] 2025-12-04T09:43:04.8422618Z + [[ legacy_nvidia_driver == *aot_inductor* ]] 2025-12-04T09:43:04.8423057Z + [[ legacy_nvidia_driver == *max_autotune_inductor* ]] 2025-12-04T09:43:04.8423815Z + [[ legacy_nvidia_driver == *inductor* ]] 2025-12-04T09:43:04.8424192Z + [[ legacy_nvidia_driver == *dynamic* ]] 2025-12-04T09:43:04.8424579Z + [[ legacy_nvidia_driver == *cpu* ]] 2025-12-04T09:43:04.8425087Z + [[ legacy_nvidia_driver == *xpu* ]] 2025-12-04T09:43:04.8425468Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2025-12-04T09:43:04.8458659Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *libtorch* ]] 2025-12-04T09:43:04.8459165Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *-bazel-* ]] 2025-12-04T09:43:04.8462151Z + cd test 2025-12-04T09:43:04.8462645Z + python -c 'import torch; print(torch.__config__.show())' 2025-12-04T09:43:07.7379229Z PyTorch built with: 2025-12-04T09:43:07.7379532Z - GCC 11.4 2025-12-04T09:43:07.7379794Z - C++ Version: 201703 2025-12-04T09:43:07.7380462Z - Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:43:07.7381307Z - Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:43:07.7381850Z - OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-12-04T09:43:07.7382250Z - LAPACK is enabled (usually provided by MKL) 2025-12-04T09:43:07.7382631Z - NNPACK is enabled 2025-12-04T09:43:07.7382946Z - CPU capability usage: AVX512 2025-12-04T09:43:07.7383277Z - CUDA Runtime 12.4 2025-12-04T09:43:07.7383664Z - NVCC architecture flags: -gencode;arch=compute_75,code=sm_75 2025-12-04T09:43:07.7384120Z - CuDNN 90.1 2025-12-04T09:43:07.7390062Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, COMMIT_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32, CUDA_VERSION=12.4, CUDNN_VERSION=9.1.0, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOROCTRACER -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -DC10_NODEPRECATED -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=range-loop-construct -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -faligned-new -Werror -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, FORCE_FALLBACK_CUDA_MPI=1, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, TORCH_VERSION=2.10.0, USE_CUDA=ON, USE_CUDNN=ON, USE_CUSPARSELT=ON, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=ON, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=OFF, USE_ROCM_KERNEL_ASSERT=OFF, USE_XCCL=OFF, USE_XPU=OFF, 2025-12-04T09:43:07.7396015Z 2025-12-04T09:43:08.0924106Z + cd test 2025-12-04T09:43:08.0924609Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2025-12-04T09:43:09.4986894Z ATen/Parallel: 2025-12-04T09:43:09.4987266Z at::get_num_threads() : 8 2025-12-04T09:43:09.4987620Z at::get_num_interop_threads() : 8 2025-12-04T09:43:09.4988105Z OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-12-04T09:43:09.4988481Z omp_get_max_threads() : 8 2025-12-04T09:43:09.4989145Z Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:43:09.4989868Z mkl_get_max_threads() : 8 2025-12-04T09:43:09.4990307Z Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:43:09.4990837Z std::thread::hardware_concurrency() : 16 2025-12-04T09:43:09.4991212Z Environment variables: 2025-12-04T09:43:09.4991499Z OMP_NUM_THREADS : [not set] 2025-12-04T09:43:09.4991820Z MKL_NUM_THREADS : [not set] 2025-12-04T09:43:09.4992149Z ATen parallel backend: OpenMP 2025-12-04T09:43:09.4992362Z 2025-12-04T09:43:09.7764221Z + [[ legacy_nvidia_driver == *numpy_2* ]] 2025-12-04T09:43:09.7764799Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *aarch64* ]] 2025-12-04T09:43:09.7765554Z + [[ legacy_nvidia_driver == *backward* ]] 2025-12-04T09:43:09.7766472Z + [[ legacy_nvidia_driver == *libtorch_agnostic_targetting* ]] 2025-12-04T09:43:09.7767299Z + [[ legacy_nvidia_driver == *xla* ]] 2025-12-04T09:43:09.7767661Z + [[ legacy_nvidia_driver == *vllm* ]] 2025-12-04T09:43:09.7768045Z + [[ legacy_nvidia_driver == *executorch* ]] 2025-12-04T09:43:09.7768602Z + [[ legacy_nvidia_driver == \j\i\t\_\l\e\g\a\c\y ]] 2025-12-04T09:43:09.7769059Z + [[ legacy_nvidia_driver == \q\u\a\n\t\i\z\a\t\i\o\n ]] 2025-12-04T09:43:09.7769533Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *libtorch* ]] 2025-12-04T09:43:09.7769980Z + [[ legacy_nvidia_driver == distributed ]] 2025-12-04T09:43:09.7770408Z + [[ legacy_nvidia_driver == *operator_benchmark* ]] 2025-12-04T09:43:09.7771064Z + [[ legacy_nvidia_driver == *operator_microbenchmark* ]] 2025-12-04T09:43:09.7771978Z + [[ legacy_nvidia_driver == *attention_microbenchmark* ]] 2025-12-04T09:43:09.7772469Z + [[ legacy_nvidia_driver == *inductor_distributed* ]] 2025-12-04T09:43:09.7772920Z + [[ legacy_nvidia_driver == *inductor-halide* ]] 2025-12-04T09:43:09.7773348Z + [[ legacy_nvidia_driver == *inductor-pallas* ]] 2025-12-04T09:43:09.7773819Z + [[ legacy_nvidia_driver == *inductor-triton-cpu* ]] 2025-12-04T09:43:09.7774314Z + [[ legacy_nvidia_driver == *inductor-micro-benchmark* ]] 2025-12-04T09:43:09.7774838Z + [[ legacy_nvidia_driver == *aoti_cross_compile_for_windows* ]] 2025-12-04T09:43:09.7775328Z + [[ legacy_nvidia_driver == *huggingface* ]] 2025-12-04T09:43:09.7775725Z + [[ legacy_nvidia_driver == *timm* ]] 2025-12-04T09:43:09.7776086Z + [[ legacy_nvidia_driver == cachebench ]] 2025-12-04T09:43:09.7776494Z + [[ legacy_nvidia_driver == verify_cachebench ]] 2025-12-04T09:43:09.7776909Z + [[ legacy_nvidia_driver == *torchbench* ]] 2025-12-04T09:43:09.7777345Z + [[ legacy_nvidia_driver == *inductor_cpp_wrapper* ]] 2025-12-04T09:43:09.7777774Z + [[ legacy_nvidia_driver == *inductor_core* ]] 2025-12-04T09:43:09.7778178Z + [[ legacy_nvidia_driver == *inductor* ]] 2025-12-04T09:43:09.7778634Z + [[ legacy_nvidia_driver == *einops* ]] 2025-12-04T09:43:09.7779011Z + [[ legacy_nvidia_driver == *dynamo_core* ]] 2025-12-04T09:43:09.7779427Z + [[ legacy_nvidia_driver == *dynamo_wrapped* ]] 2025-12-04T09:43:09.7779886Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *rocm* ]] 2025-12-04T09:43:09.7780263Z + [[ 2 == 1 ]] 2025-12-04T09:43:09.7780508Z + [[ 2 == 2 ]] 2025-12-04T09:43:09.7780762Z + [[ 5 -gt 1 ]] 2025-12-04T09:43:09.7781009Z + install_torchvision 2025-12-04T09:43:09.7781294Z + local orig_preload 2025-12-04T09:43:09.7781563Z + local commit 2025-12-04T09:43:09.7781825Z ++ get_pinned_commit vision 2025-12-04T09:43:09.7782147Z ++ cat .github/ci_commit_pins/vision.txt 2025-12-04T09:43:09.7785207Z + commit=617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:43:09.7786069Z + orig_preload= 2025-12-04T09:43:09.7786545Z + '[' -n '' ']' 2025-12-04T09:43:09.7787150Z + [[ linux-jammy-cuda12.4-py3.10-gcc11 == *cuda* ]] 2025-12-04T09:43:09.7788098Z + export FORCE_CUDA=1 2025-12-04T09:43:09.7788411Z + FORCE_CUDA=1 2025-12-04T09:43:09.7788667Z + export WITH_CUDA=1 2025-12-04T09:43:09.7788941Z + WITH_CUDA=1 2025-12-04T09:43:09.7789610Z + pip_build_and_install git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e dist/vision 2025-12-04T09:43:09.7790662Z + local build_target=git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:43:09.7791342Z + local wheel_dir=dist/vision 2025-12-04T09:43:09.7791661Z + local found_whl=0 2025-12-04T09:43:09.7791932Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:43:09.7792277Z + [[ -f dist/vision/*.whl ]] 2025-12-04T09:43:09.7792583Z + '[' 0 == 0 ']' 2025-12-04T09:43:09.7793367Z + python3 -m pip wheel --no-build-isolation --no-deps -w dist/vision git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:43:10.1342344Z Collecting git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:43:10.1347560Z Cloning https://github.com/pytorch/vision.git (to revision 617079d944b0e72632311c30ae2bbdf1168b901e) to /tmp/pip-req-build-kid0nm2r 2025-12-04T09:43:10.1527880Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-kid0nm2r 2025-12-04T09:43:11.8278828Z Running command git rev-parse -q --verify 'sha^617079d944b0e72632311c30ae2bbdf1168b901e' 2025-12-04T09:43:11.8303115Z Running command git fetch -q https://github.com/pytorch/vision.git 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:43:11.9500148Z Resolved https://github.com/pytorch/vision.git to commit 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:43:15.5590310Z Preparing metadata (pyproject.toml) ... [?25l- \ | done 2025-12-04T09:43:15.5628794Z [?25hBuilding wheels for collected packages: torchvision 2025-12-04T09:44:44.6875114Z Building wheel for torchvision (pyproject.toml) ... [?25l- \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-12-04T09:44:44.6939493Z [?25h Created wheel for torchvision: filename=torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl size=1821268 sha256=23d10085ddea839b10fac3612b40e9129a16c2e29ede5517fa38c92d1587989b 2025-12-04T09:44:44.6942558Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/12/b2/29/1f82685c5b5173629e1f36a9b93989ce92ce563e5fb91d27ac 2025-12-04T09:44:44.6993777Z Successfully built torchvision 2025-12-04T09:44:44.7841780Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:44:44.7842513Z + pip_install_whl dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:44:44.7843429Z + args=('dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl') 2025-12-04T09:44:44.7844015Z + local args 2025-12-04T09:44:44.7844576Z + [[ dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl == *\ * ]] 2025-12-04T09:44:44.7845226Z + for path in "${args[@]}" 2025-12-04T09:44:44.7845800Z + echo 'Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl' 2025-12-04T09:44:44.7846762Z Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:44:44.7847785Z + python3 -mpip install --no-index --no-deps dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:44:45.1471785Z Processing ./dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:44:45.1610814Z Installing collected packages: torchvision 2025-12-04T09:44:45.6792071Z Successfully installed torchvision-0.25.0a0+617079d 2025-12-04T09:44:45.7179052Z + '[' -n '' ']' 2025-12-04T09:44:45.7179335Z + test_python_shard 2 2025-12-04T09:44:45.7179627Z + [[ -z 5 ]] 2025-12-04T09:44:45.7180530Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --exclude-quantization-tests --shard 2 5 --verbose --upload-artifacts-while-running 2025-12-04T09:44:52.5173066Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2025-12-04T09:44:52.5720865Z Ignoring disabled issues: [''] 2025-12-04T09:44:52.5837697Z Found test times from artifacts 2025-12-04T09:44:52.6282901Z Found test times from artifacts 2025-12-04T09:44:52.6298414Z Running all tests 2025-12-04T09:44:52.7200237Z Running parallel tests on 1 processes 2025-12-04T09:44:52.7210062Z Name: tests to run (est. time: 293.23min) 2025-12-04T09:44:52.7210808Z Serial tests (103): 2025-12-04T09:44:52.7211374Z inductor/test_aot_inductor 2/6 2025-12-04T09:44:52.7212062Z inductor/test_torchinductor 1/2 2025-12-04T09:44:52.7212825Z inductor/test_torchinductor_dynamic_shapes 3/5 2025-12-04T09:44:52.7213767Z inductor/test_torchinductor_codegen_dynamic_shapes 3/4 2025-12-04T09:44:52.7214644Z inductor/test_torchinductor_opinfo 4/17 2025-12-04T09:44:52.7215372Z inductor/test_torchinductor_opinfo 9/17 2025-12-04T09:44:52.7216018Z inductor/test_torchinductor_opinfo 14/17 2025-12-04T09:44:52.7216761Z dynamo/test_activation_checkpointing 1/1 2025-12-04T09:44:52.7217851Z inductor/test_flex_attention 1/1 2025-12-04T09:44:52.7218636Z inductor/test_cuda_select_algorithm 1/5 2025-12-04T09:44:52.7219360Z inductor/test_compile_subprocess 1/3 2025-12-04T09:44:52.7220083Z inductor/test_deterministic 3/8 2025-12-04T09:44:52.7220941Z inductor/test_deterministic 8/8 2025-12-04T09:44:52.7221614Z inductor/test_op_dtype_prop 1/2 2025-12-04T09:44:52.7222251Z dynamo/test_callback 1/1 2025-12-04T09:44:52.7222868Z dynamo/test_compiler_bisector 1/1 2025-12-04T09:44:52.7223692Z inductor/test_cudagraph_trees_expandable_segments 1/1 2025-12-04T09:44:52.7224567Z inductor/test_collective_autotuning 1/1 2025-12-04T09:44:52.7225305Z higher_order_ops/test_local_map 1/1 2025-12-04T09:44:52.7225941Z test_dataloader 1/2 2025-12-04T09:44:52.7226352Z test_decomp 4/17 2025-12-04T09:44:52.7226785Z test_decomp 9/17 2025-12-04T09:44:52.7227292Z test_decomp 14/17 2025-12-04T09:44:52.7227812Z test_ops_fwd_gradients 2/2 2025-12-04T09:44:52.7228389Z test_meta 3/5 2025-12-04T09:44:52.7228868Z test_ops_gradients 1/2 2025-12-04T09:44:52.7229330Z test_linalg 1/1 2025-12-04T09:44:52.7229736Z test_ops 4/11 2025-12-04T09:44:52.7230154Z test_ops 9/11 2025-12-04T09:44:52.7230572Z test_sparse_csr 2/2 2025-12-04T09:44:52.7231053Z functorch/test_ops 5/7 2025-12-04T09:44:52.7231590Z inductor/test_group_batch_fusion 1/1 2025-12-04T09:44:52.7232193Z inductor/test_cpu_repro 2/3 2025-12-04T09:44:52.7232699Z inductor/test_smoke 1/1 2025-12-04T09:44:52.7233219Z inductor/test_fused_attention 1/1 2025-12-04T09:44:52.7233803Z inductor/test_inductor_freezing 1/1 2025-12-04T09:44:52.7234487Z dynamo/test_higher_order_ops 1/1 2025-12-04T09:44:52.7235194Z inductor/test_aot_inductor_arrayref 1/2 2025-12-04T09:44:52.7235930Z inductor/test_triton_cpu_backend 1/1 2025-12-04T09:44:52.7236527Z inductor/test_autoheuristic 1/1 2025-12-04T09:44:52.7237064Z inductor/test_flex_flash 1/1 2025-12-04T09:44:52.7237421Z inductor/test_segmented_tree 1/1 2025-12-04T09:44:52.7237809Z inductor/test_kernel_optimization 1/1 2025-12-04T09:44:52.7238191Z inductor/test_metrics 1/1 2025-12-04T09:44:52.7238541Z export/test_unflatten_training_ir 1/1 2025-12-04T09:44:52.7238914Z inductor/test_fx_fusion 1/1 2025-12-04T09:44:52.7239310Z dynamo/test_inline_and_install 1/1 2025-12-04T09:44:52.7239900Z inductor/test_inductor_scheduler 1/1 2025-12-04T09:44:52.7240349Z export/test_export_strict 1/1 2025-12-04T09:44:52.7240703Z inductor/test_compiled_optimizers 3/3 2025-12-04T09:44:52.7241093Z dynamo/test_graph_region_tracker 1/1 2025-12-04T09:44:52.7241465Z dynamo/test_unittest 1/1 2025-12-04T09:44:52.7241782Z inductor/test_compile 1/1 2025-12-04T09:44:52.7242111Z dynamo/test_functions 1/1 2025-12-04T09:44:52.7242442Z inductor/test_ordered_set 1/1 2025-12-04T09:44:52.7242786Z inductor/test_pallas 1/1 2025-12-04T09:44:52.7243109Z dynamo/test_install_free_tensors 1/1 2025-12-04T09:44:52.7243577Z inductor/test_torchinductor_codegen_config_overrides 1/1 2025-12-04T09:44:52.7244034Z export/test_passes 1/1 2025-12-04T09:44:52.7244352Z dynamo/test_autograd_function 1/1 2025-12-04T09:44:52.7244715Z inductor/test_codecache 1/1 2025-12-04T09:44:52.7245072Z inductor/test_auto_functionalize 1/1 2025-12-04T09:44:52.7245455Z inductor/test_efficient_conv_bn_eval 1/1 2025-12-04T09:44:52.7245837Z inductor/test_torchbind 1/1 2025-12-04T09:44:52.7246168Z dynamo/test_flat_apply 1/1 2025-12-04T09:44:52.7246496Z dynamo/test_input_attr_tracking 1/1 2025-12-04T09:44:52.7246874Z dynamo/test_graph_deduplication 1/1 2025-12-04T09:44:52.7247263Z inductor/test_distributed_patterns 1/1 2025-12-04T09:44:52.7247633Z dynamo/test_bytecode_utils 1/1 2025-12-04T09:44:52.7247983Z export/test_nativert 1/1 2025-12-04T09:44:52.7248301Z export/test_hop 1/1 2025-12-04T09:44:52.7248597Z export/test_export 1/1 2025-12-04T09:44:52.7248908Z test_utils_config_module 1/1 2025-12-04T09:44:52.7249395Z test_hop_infra 1/1 2025-12-04T09:44:52.7249715Z test_appending_byte_serializer 1/1 2025-12-04T09:44:52.7250055Z test_license 1/1 2025-12-04T09:44:52.7250341Z test_ao_sparsity 1/1 2025-12-04T09:44:52.7250868Z test_proxy_tensor 1/1 2025-12-04T09:44:52.7251162Z torch_np/test_dtype 1/1 2025-12-04T09:44:52.7251473Z test_file_check 1/1 2025-12-04T09:44:52.7251766Z test_flop_counter 1/1 2025-12-04T09:44:52.7252065Z backends/xeon/test_launch 1/1 2025-12-04T09:44:52.7252400Z test_show_pickle 1/1 2025-12-04T09:44:52.7252693Z test_openmp 1/1 2025-12-04T09:44:52.7252960Z test_expanded_weights 1/1 2025-12-04T09:44:52.7253301Z functorch/test_ac_knapsack 1/1 2025-12-04T09:44:52.7253656Z torch_np/test_nep50_examples 1/1 2025-12-04T09:44:52.7253985Z test_torch 1/1 2025-12-04T09:44:52.7254287Z higher_order_ops/test_with_effects 1/1 2025-12-04T09:44:52.7254647Z test_weak 1/1 2025-12-04T09:44:52.7254911Z test_complex 1/1 2025-12-04T09:44:52.7255175Z test_optim 1/1 2025-12-04T09:44:52.7255448Z xpu/test_conv 1/1 2025-12-04T09:44:52.7255742Z nn/test_packed_sequence 1/1 2025-12-04T09:44:52.7256058Z test_numa_binding 1/1 2025-12-04T09:44:52.7256368Z test_pruning_op 1/1 2025-12-04T09:44:52.7256662Z test_jit_fuser_te 1/1 2025-12-04T09:44:52.7256951Z test_sympy_utils 1/1 2025-12-04T09:44:52.7257255Z test_mkldnn_fusion 1/1 2025-12-04T09:44:52.7257565Z test_schema_check 1/1 2025-12-04T09:44:52.7257846Z test_mkldnn 1/1 2025-12-04T09:44:52.7258193Z test_unary_ufuncs 1/1 2025-12-04T09:44:52.7258507Z test_autoload_enable 1/1 2025-12-04T09:44:52.7258812Z Parallel tests (0): 2025-12-04T09:44:52.7259117Z Name: excluded (est. time: 0.0min) 2025-12-04T09:44:52.7259458Z Serial tests (0): 2025-12-04T09:44:52.7259721Z Parallel tests (0): 2025-12-04T09:44:52.7260205Z Running inductor/test_aot_inductor 2/6 ... [2025-12-04 09:44:52.722606][1896.843825767] 2025-12-04T09:44:52.7260786Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T09:44:52.7262056Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '--shard-id=2', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:44:52.723216] 2025-12-04T09:55:32.5517436Z 2025-12-04T09:55:32.5520574Z PRINTING LOG FILE of inductor/test_aot_inductor 2/6 (test/test-reports/inductor.test_aot_inductor_2.6_195f61f6b7f24fa5_.log) 2025-12-04T09:55:32.5522024Z W1204 09:45:05.035000 1721 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.5523413Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-7ff7771e3c72f123.xml 2025-12-04T09:55:32.5524663Z ============================= test session starts ============================== 2025-12-04T09:55:32.5525758Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.5526665Z cachedir: .pytest_cache 2025-12-04T09:55:32.5527463Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.5528366Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.5529080Z configfile: pytest.ini 2025-12-04T09:55:32.5530271Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.5531586Z collecting ... collected 934 items 2025-12-04T09:55:32.5532291Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T09:55:32.5627149Z Running 154 items in this shard: test/inductor/test_aot_inductor.py::AOTInductorLoggingTest::test_shape_env_reuse, test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_no_compile_standalone, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_fp8_dtype_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_sym_inputs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_user_defined_triton_kernel_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_3_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_cpu_predicate_cuda_operands_max_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_consecutive_compiles_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_conv3d_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dynamic_cat_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dynamic_scalar_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_empty_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fill__fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fp8_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_dynamic_dim_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_libtorch_free_so_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misaligned_input_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_missing_output_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_model_modified_weights_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_none_args_aot_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_profile_benchmark_harness_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_proxy_executor_hann_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeat_interleave_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_same_backing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scaled_dot_product_efficient_attention_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sdpa_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sdpa_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_size_with_unbacked_add_expr_transitive_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sym_i64_input_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_autotuning_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_dynamic_launcher_grid_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_constant_buffer_simple_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_user_managed_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_nested_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_unbacked_symbols_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_runtime_asserts_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_bmm_multiple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_bool_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_boolean_indexing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_codegen_int_array_var_fix_memory_leak_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_mismatched_branch_output_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_symint_input_disable_one_pass_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_unbacked_symint_closure_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_convolution_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_custom_op_in_subgraph_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dup_unbacked_sym_decl_with_refinement_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dynamic_scalar_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_embedding_bag_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fallback_mem_leak_fix_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fft_c2c_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fp8_view_of_param_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_int_list_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_issue_140766_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_mmaped_weights_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_missing_cubin_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_mixed_device_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_model_modified_weights_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_non_contiguous_output_alias_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_output_path_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_fp8_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_large_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scatter_reduce_fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sdpa_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_shifted_constraint_ranges_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_embed_kernel_binary_True_max_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_so_without_weight_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sym_i64_input_codegen_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_symfloat_item_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_autotuning_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_bool_param_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_dynamic_grid_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_equal_to_1_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_update_constant_buffer_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_nested_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_with_offset_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_grid_with_unbacked_symbols_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_constant_tensor_name_collision_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printer_sym_inputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_profiler_enable_kernel_profile_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_autotune_int64_user_defined_triton_kernel_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_bool_input_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_boolean_indexing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_and_force_mmap_weights_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_cpu_predicate_cuda_operands_max_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_mismatched_branch_output_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_non_tensor_predicates_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_conv3d_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_conv_freezing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_duplicate_constant_folding_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_duplicated_params_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_empty_cat_dtype_promotion_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_freezing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_int_list_input_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_linear_dynamic_maxautotune_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misaligned_input_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misaligned_input_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misc_1_max_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_missing_cubin_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_model_modified_weights_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_multiple_output_alias_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_non_default_gpu_device_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_on_gpu_device1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_profile_benchmark_harness_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_return_constant_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_reuse_kernel_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_with_unbacked_add_expr_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_dynamic_launcher_grid_infer_from_tensor_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_dynamic_grid_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_equal_to_1_float_arg_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_extern_kernel_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_sympy_fn_like_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_2_use_static_size_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_conv_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_outer_code_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_sym_expr_cond_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_with_offset_mps 2025-12-04T09:55:32.5714581Z 2025-12-04T09:55:32.5715542Z inductor/test_aot_inductor.py::AOTInductorLoggingTest::test_shape_env_reuse <- ../../../../opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/logging_utils.py PASSED [9.8618s] [ 0%] 2025-12-04T09:55:32.5717041Z inductor/test_aot_inductor.py::TestAOTInductorConfig::test_no_compile_standalone PASSED [0.0032s] [ 1%] 2025-12-04T09:55:32.5718330Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cpu SKIPPED [0.0029s] (requires GPU) [ 1%] 2025-12-04T09:55:32.5720019Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_1_cpu SKIPPED [0.0027s] (requires Intel GPU) [ 2%] 2025-12-04T09:55:32.5721734Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_codegen_cpu <- test/inductor/test_torchinductor.py PASSED [11.7885s] [ 3%] 2025-12-04T09:55:32.5723424Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_fp8_dtype_cpu SKIPPED [0.0003s] (FP8 is only supported on H100+, SM 8.9 and MI300+ devices) [ 3%] 2025-12-04T09:55:32.5725181Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_sym_inputs_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (requires GPU) [ 4%] 2025-12-04T09:55:32.5727044Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printer_user_defined_triton_kernel_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0028s] (requires GPU) [ 5%] 2025-12-04T09:55:32.5728577Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_3_cpu PASSED [11.1105s] [ 5%] 2025-12-04T09:55:32.5729959Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_cpu_predicate_cuda_operands_max_autotune_False_cpu SKIPPED [0.0032s] (requires CUDA) [ 6%] 2025-12-04T09:55:32.5731582Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cpu SKIPPED [0.0028s] (requires CUDA) [ 7%] 2025-12-04T09:55:32.5733160Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_consecutive_compiles_cpu <- test/inductor/test_torchinductor.py PASSED [5.9491s] [ 7%] 2025-12-04T09:55:32.5734648Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_conv3d_cpu SKIPPED [0.0032s] (requires modern GPU to run max-autotune) [ 8%] 2025-12-04T09:55:32.5736092Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dynamic_cat_cpu <- test/inductor/test_torchinductor.py PASSED [4.9898s] [ 9%] 2025-12-04T09:55:32.5737575Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dynamic_scalar_cpu <- test/inductor/test_torchinductor.py PASSED [5.3367s] [ 9%] 2025-12-04T09:55:32.5739190Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_empty_constant_folding_cpu <- test/inductor/test_torchinductor.py PASSED [4.8614s] [ 10%] 2025-12-04T09:55:32.5740720Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fill__fallback_cpu <- test/inductor/test_torchinductor.py PASSED [4.9254s] [ 11%] 2025-12-04T09:55:32.5742207Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fp8_cpu SKIPPED [0.0003s] (FP8 is only supported on H100+, SM 8.9 and MI300+ devices) [ 11%] 2025-12-04T09:55:32.5743534Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_dynamic_dim_cpu PASSED [4.9910s] [ 12%] 2025-12-04T09:55:32.5744993Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_libtorch_free_so_cpu <- test/inductor/test_torchinductor.py PASSED [12.0132s] [ 12%] 2025-12-04T09:55:32.5746606Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misaligned_input_2_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (GPU test only) [ 13%] 2025-12-04T09:55:32.5748290Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_missing_output_cpu <- test/inductor/test_torchinductor.py PASSED [4.9324s] [ 14%] 2025-12-04T09:55:32.5749814Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_model_modified_weights_cpu <- test/inductor/test_torchinductor.py PASSED [5.8871s] [ 14%] 2025-12-04T09:55:32.5751544Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_none_args_aot_codegen_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (requires GPU) [ 15%] 2025-12-04T09:55:32.5753127Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_1_cpu <- test/inductor/test_torchinductor.py PASSED [4.9890s] [ 16%] 2025-12-04T09:55:32.5754661Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_profile_benchmark_harness_cpu <- test/inductor/test_torchinductor.py PASSED [5.4508s] [ 16%] 2025-12-04T09:55:32.5756222Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_proxy_executor_hann_cpu <- test/inductor/test_torchinductor.py PASSED [4.8166s] [ 17%] 2025-12-04T09:55:32.5757751Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeat_interleave_cpu <- test/inductor/test_torchinductor.py PASSED [4.8081s] [ 18%] 2025-12-04T09:55:32.5759267Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_constant_cpu <- test/inductor/test_torchinductor.py PASSED [4.8291s] [ 18%] 2025-12-04T09:55:32.5760750Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_cpu <- test/inductor/test_torchinductor.py PASSED [4.9293s] [ 19%] 2025-12-04T09:55:32.5762203Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_same_backing_cpu <- test/inductor/test_torchinductor.py PASSED [5.0146s] [ 20%] 2025-12-04T09:55:32.5763876Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scaled_dot_product_efficient_attention_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (requires GPU) [ 20%] 2025-12-04T09:55:32.5765639Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sdpa_2_cpu SKIPPED [0.0002s] (bfloat16 only supported in sm80+ or XPU) [ 21%] 2025-12-04T09:55:32.5767171Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sdpa_cpu SKIPPED [0.0002s] (bfloat16 only supported in sm80+ or XPU) [ 22%] 2025-12-04T09:55:32.5768613Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_size_with_unbacked_add_expr_transitive_cpu SKIPPED [0.0028s] (requires GPU) [ 22%] 2025-12-04T09:55:32.5770000Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sym_i64_input_codegen_cpu SKIPPED [0.0027s] (requires GPU) [ 23%] 2025-12-04T09:55:32.5771310Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_autotuning_cpu SKIPPED [0.0031s] (requires GPU) [ 24%] 2025-12-04T09:55:32.5772660Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_dynamic_launcher_grid_cpu SKIPPED [0.0033s] (requires GPU) [ 24%] 2025-12-04T09:55:32.5774217Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_False_cpu SKIPPED [0.0027s] (requires GPU) [ 25%] 2025-12-04T09:55:32.5775903Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_cpu SKIPPED [0.0027s] (requires GPU) [ 25%] 2025-12-04T09:55:32.5777617Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_cpu SKIPPED [0.0027s] (requires GPU) [ 26%] 2025-12-04T09:55:32.5779508Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_cpu SKIPPED [0.0027s] (requires GPU) [ 27%] 2025-12-04T09:55:32.5781285Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_True_cpu SKIPPED [0.0029s] (requires GPU) [ 27%] 2025-12-04T09:55:32.5783264Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_cpu SKIPPED [0.0027s] (Need triton for user-defined triton kernel) [ 28%] 2025-12-04T09:55:32.5785038Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_constant_buffer_simple_cpu <- test/inductor/test_torchinductor.py PASSED [4.8559s] [ 29%] 2025-12-04T09:55:32.5786771Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_user_managed_buffer_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (requires CUDA/XPU) [ 29%] 2025-12-04T09:55:32.5788569Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_nested_cpu <- test/inductor/test_torchinductor.py PASSED [6.0257s] [ 30%] 2025-12-04T09:55:32.5790154Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_unbacked_symbols_cpu <- test/inductor/test_torchinductor.py PASSED [5.1227s] [ 31%] 2025-12-04T09:55:32.5791769Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda ('RERUN', {'yellow': True}) [0.0359s] [ 31%] 2025-12-04T09:55:32.5793373Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda ('RERUN', {'yellow': True}) [0.0050s] [ 31%] 2025-12-04T09:55:32.5794894Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda FAILED [0.0049s] [ 31%] 2025-12-04T09:55:32.5795676Z 2025-12-04T09:55:32.5795837Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.5796567Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.5797252Z Traceback (most recent call last): 2025-12-04T09:55:32.5797979Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.5798729Z self.check_model(model, (a,)) 2025-12-04T09:55:32.5799386Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.5800089Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.5800611Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.5801144Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.5801653Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.5802206Z state = deepcopy(state, memo) 2025-12-04T09:55:32.5802712Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.5803225Z y = copier(x, memo) 2025-12-04T09:55:32.5803713Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.5804317Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.5804879Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5805376Z y = copier(memo) 2025-12-04T09:55:32.5805965Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.5806695Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.5807386Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.5808211Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.5808907Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5809422Z y = copier(memo) 2025-12-04T09:55:32.5810132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.5810810Z new_storage = self.clone() 2025-12-04T09:55:32.5811487Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.5812221Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.5812816Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.5813834Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.5815093Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.5815891Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.5816449Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.5816817Z 2025-12-04T09:55:32.5816828Z 2025-12-04T09:55:32.5817057Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.5818206Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.5819211Z 2025-12-04T09:55:32.5819480Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.5820318Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.5821017Z Traceback (most recent call last): 2025-12-04T09:55:32.5821730Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.5822479Z self.check_model(model, (a,)) 2025-12-04T09:55:32.5823148Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.5823858Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.5824365Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.5824900Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.5825428Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.5825970Z state = deepcopy(state, memo) 2025-12-04T09:55:32.5826471Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.5826988Z y = copier(x, memo) 2025-12-04T09:55:32.5827473Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.5828063Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.5828624Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5829133Z y = copier(memo) 2025-12-04T09:55:32.5829710Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.5830446Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.5831145Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.5831974Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.5832654Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5833169Z y = copier(memo) 2025-12-04T09:55:32.5833755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.5834419Z new_storage = self.clone() 2025-12-04T09:55:32.5835009Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.5835750Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.5836337Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.5837416Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.5838680Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.5839554Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.5840097Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.5840475Z 2025-12-04T09:55:32.5840480Z 2025-12-04T09:55:32.5840696Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.5841841Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.5842760Z 2025-12-04T09:55:32.5843043Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.5843612Z =================================== FAILURES =================================== 2025-12-04T09:55:32.5844325Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.5845033Z Traceback (most recent call last): 2025-12-04T09:55:32.5845756Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.5846480Z self.check_model(model, (a,)) 2025-12-04T09:55:32.5847144Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.5847846Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.5848368Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.5848889Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.5849414Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.5849967Z state = deepcopy(state, memo) 2025-12-04T09:55:32.5850457Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.5850972Z y = copier(x, memo) 2025-12-04T09:55:32.5851458Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.5852051Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.5852607Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5853117Z y = copier(memo) 2025-12-04T09:55:32.5853706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.5854421Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.5855120Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.5855948Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.5856648Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5857146Z y = copier(memo) 2025-12-04T09:55:32.5857738Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.5858495Z new_storage = self.clone() 2025-12-04T09:55:32.5859078Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.5859829Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.5860421Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.5861409Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.5862673Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.5863480Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.5864127Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.5864497Z 2025-12-04T09:55:32.5864502Z 2025-12-04T09:55:32.5864718Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.5865938Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.5866870Z 2025-12-04T09:55:32.5867138Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.5868309Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-7ff7771e3c72f123.xml - 2025-12-04T09:55:32.5869378Z =========================== short test summary info ============================ 2025-12-04T09:55:32.5870605Z FAILED [0.0049s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda - torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.5872287Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.5873552Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.5874355Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.5874895Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.5875276Z 2025-12-04T09:55:32.5875281Z 2025-12-04T09:55:32.5875496Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.5876637Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.5877556Z 2025-12-04T09:55:32.5877840Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.5878422Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.5878980Z ======== 1 failed, 23 passed, 25 skipped, 2 rerun in 137.78s (0:02:17) ========= 2025-12-04T09:55:32.5879464Z Got exit code 1 2025-12-04T09:55:32.5879725Z Retrying single test... 2025-12-04T09:55:32.5880355Z W1204 09:47:34.901000 4494 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.5881529Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b312d84b484956a1.xml 2025-12-04T09:55:32.5882412Z ============================= test session starts ============================== 2025-12-04T09:55:32.5883072Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.5883683Z cachedir: .pytest_cache 2025-12-04T09:55:32.5884407Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.5885191Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.5885552Z configfile: pytest.ini 2025-12-04T09:55:32.5886296Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.5887213Z collecting ... collected 934 items / 153 deselected / 781 selected 2025-12-04T09:55:32.5888600Z stepcurrent: skipping 48 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.5889725Z Running 1 items in this shard 2025-12-04T09:55:32.5889936Z 2025-12-04T09:55:32.5891219Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda [W1204 09:47:36.907971615 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T09:55:32.5892498Z 2025-12-04T09:55:32.5892646Z ('RERUN', {'yellow': True}) [16.0686s] [100%] 2025-12-04T09:55:32.5894050Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda [W1204 09:47:52.984494728 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T09:55:32.5895394Z 2025-12-04T09:55:32.5895528Z ('RERUN', {'yellow': True}) [0.0066s] [100%] 2025-12-04T09:55:32.5897004Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda [W1204 09:47:52.991460104 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T09:55:32.5898277Z 2025-12-04T09:55:32.5898467Z FAILED [0.0050s] [100%] 2025-12-04T09:55:32.5898650Z 2025-12-04T09:55:32.5898815Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.5899524Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.5900231Z Traceback (most recent call last): 2025-12-04T09:55:32.5900958Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.5901707Z self.check_model(model, (a,)) 2025-12-04T09:55:32.5902359Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.5903069Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.5903584Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.5904105Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.5904626Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.5905180Z state = deepcopy(state, memo) 2025-12-04T09:55:32.5905687Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.5906195Z y = copier(x, memo) 2025-12-04T09:55:32.5906684Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.5907294Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.5907837Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5908352Z y = copier(memo) 2025-12-04T09:55:32.5908947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.5909676Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.5910368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.5911196Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.5911891Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.5912391Z y = copier(memo) 2025-12-04T09:55:32.5912976Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.5913652Z new_storage = self.clone() 2025-12-04T09:55:32.5914226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.5914973Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.5915557Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.5916562Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.5917813Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.5918625Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.5919292Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.5919665Z 2025-12-04T09:55:32.5920296Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.5921218Z C++ CapturedTraceback: 2025-12-04T09:55:32.5922741Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.5924684Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.5925983Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.5927162Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.5927936Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.5928666Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.5929335Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.5931859Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.5934759Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.5935727Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.5936327Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.5936967Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.5937801Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.5938852Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5939794Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5940733Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5941672Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5942598Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5943539Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5944472Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5945405Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5946324Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5947251Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5948257Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5949181Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5950112Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5951115Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5952046Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5952967Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5953907Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5954846Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5955635Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5956413Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5957360Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5958296Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5959227Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5960141Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5960961Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.5961671Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5962439Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5963266Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.5963969Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5964756Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5965562Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.5966267Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5967052Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5967991Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5968919Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5969704Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5970494Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5971430Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5972359Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5973296Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5974233Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5975159Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5976091Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5977032Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5978039Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5978922Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.5979701Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5980480Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5981359Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.5982157Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.5982918Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.5983674Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.5984532Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.5985467Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5986404Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5987335Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5988262Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5989044Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5989984Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5990921Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5991847Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5992789Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5993675Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.5994486Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.5995231Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.5995945Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.5996623Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.5997393Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.5998331Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.5999267Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6000199Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6001120Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6001902Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6002682Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6003619Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6004630Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6005575Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6006629Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6007422Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6008296Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6009235Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6010179Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6011149Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6012101Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6012992Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6013824Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6014583Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6015343Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6016224Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6017173Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6017955Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6018843Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6019792Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6020744Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6021678Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6022623Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6023518Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6024327Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6025089Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6025854Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6026730Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6027663Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6028615Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6029561Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6030510Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6031441Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6032240Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6033034Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6033998Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6034933Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6035966Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6036917Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6037871Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6038697Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6039471Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6040233Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6041099Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6042046Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6043001Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6043953Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6044900Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6045849Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6046798Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6047745Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6048676Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6049626Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6050457Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6051202Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6051946Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6052652Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6053445Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6054266Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6055037Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6055751Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6056435Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6057044Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6057489Z #153 _start from ??:0 2025-12-04T09:55:32.6057795Z #154 from ??:0 2025-12-04T09:55:32.6058030Z 2025-12-04T09:55:32.6058040Z 2025-12-04T09:55:32.6058259Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6059486Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6060419Z 2025-12-04T09:55:32.6060690Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6061333Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.6062818Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py:257: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T09:55:32.6064407Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6065208Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.6065982Z Traceback (most recent call last): 2025-12-04T09:55:32.6066690Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.6067433Z self.check_model(model, (a,)) 2025-12-04T09:55:32.6068102Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.6068808Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.6069315Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.6069847Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.6070371Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.6070921Z state = deepcopy(state, memo) 2025-12-04T09:55:32.6071421Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.6071939Z y = copier(x, memo) 2025-12-04T09:55:32.6072422Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.6073026Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.6073584Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6074099Z y = copier(memo) 2025-12-04T09:55:32.6074683Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.6075417Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.6076126Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.6076939Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.6077639Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6078154Z y = copier(memo) 2025-12-04T09:55:32.6078740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.6079412Z new_storage = self.clone() 2025-12-04T09:55:32.6080003Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.6080761Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6081337Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.6082343Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.6083607Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.6084422Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.6084972Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.6085353Z 2025-12-04T09:55:32.6085966Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.6086843Z C++ CapturedTraceback: 2025-12-04T09:55:32.6088518Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.6090465Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.6091870Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.6093057Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.6093924Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.6094648Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.6095307Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6097833Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6100807Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6101792Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6102380Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.6103030Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.6103879Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.6104874Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6105805Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6106751Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6107686Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6108629Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6109551Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6127186Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6128303Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6129245Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6130187Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6131145Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6132086Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6133019Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6133952Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6134886Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6135821Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6136743Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6137675Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6138703Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6139485Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6140501Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6141436Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6142370Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6143293Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6144118Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6144828Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6145614Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6146423Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6147127Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6147913Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6148715Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6149415Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6150197Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6151133Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6152050Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6152841Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6153622Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6154559Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6155483Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6156416Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6157345Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6158280Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6159205Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6160141Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6161074Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6161886Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6162587Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6163371Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6164247Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6165043Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6165801Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6166554Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6167493Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6168413Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6169414Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6170338Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6171109Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6171889Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6172819Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6173749Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6174672Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6175596Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6176476Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6177290Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6178032Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6178836Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.6179519Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6180303Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6181230Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6182164Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6183097Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6184021Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6184807Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6185586Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6186519Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6187432Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6188542Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6189474Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6190264Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6191038Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6191978Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6192917Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6193849Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6194779Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6195668Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6196626Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6197384Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6198233Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6199111Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6200060Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6200846Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6201642Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6202590Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6203541Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6204468Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6205416Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6206301Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6207119Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6207870Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6208628Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6209503Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6210437Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6211379Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6212326Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6213268Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6214201Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6214994Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6215790Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6216739Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6217674Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6218708Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6219663Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6220556Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6221363Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6222126Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6222893Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6223750Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6224695Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6225809Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6226758Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6227755Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6228698Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6229648Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6230590Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6231519Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6232468Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6233295Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6234047Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6234777Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6235482Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6236272Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6237092Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6237870Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6238586Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6239284Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6239886Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6240331Z #153 _start from ??:0 2025-12-04T09:55:32.6240641Z #154 from ??:0 2025-12-04T09:55:32.6240884Z 2025-12-04T09:55:32.6240890Z 2025-12-04T09:55:32.6241111Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6242275Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6243208Z 2025-12-04T09:55:32.6243479Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6244119Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.6245616Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py:257: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T09:55:32.6247123Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6247639Z =================================== FAILURES =================================== 2025-12-04T09:55:32.6248376Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.6249066Z Traceback (most recent call last): 2025-12-04T09:55:32.6249790Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.6250536Z self.check_model(model, (a,)) 2025-12-04T09:55:32.6251195Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.6251884Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.6252402Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.6253000Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.6253517Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.6254071Z state = deepcopy(state, memo) 2025-12-04T09:55:32.6254635Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.6255155Z y = copier(x, memo) 2025-12-04T09:55:32.6255630Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.6256237Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.6256799Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6257301Z y = copier(memo) 2025-12-04T09:55:32.6257894Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.6258704Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.6259416Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.6260233Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.6260930Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6261452Z y = copier(memo) 2025-12-04T09:55:32.6262029Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.6262707Z new_storage = self.clone() 2025-12-04T09:55:32.6263295Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.6264691Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6265275Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.6266297Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.6267571Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.6268369Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.6268931Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.6269313Z 2025-12-04T09:55:32.6269927Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.6270795Z C++ CapturedTraceback: 2025-12-04T09:55:32.6272302Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.6274242Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.6275535Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.6276721Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.6277505Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.6278210Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.6278879Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6281486Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6284455Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6285431Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6286016Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.6286660Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.6287506Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.6288658Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6289604Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6290542Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6291479Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6292400Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6293331Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6294264Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6295196Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6296120Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6297053Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6297983Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6298990Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6299913Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6300846Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6301780Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6302710Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6303633Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6304567Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6305355Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6306131Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6307065Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6307996Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6308925Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6309843Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6310663Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6311495Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6312291Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6313208Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6313914Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6314698Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6315527Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6316217Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6317000Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6317934Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6318858Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6319638Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6320420Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6321356Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6322277Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6323206Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6324131Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6325064Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6325983Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6326914Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6327847Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6328667Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6329361Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6330145Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6331023Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6331823Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6332581Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6333328Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6334196Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6335119Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6336050Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6336980Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6337763Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6338610Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6339549Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6340570Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6341504Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6342488Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6343360Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6344170Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6344909Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6345620Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.6346301Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6347082Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6348002Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6348934Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6349862Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6350793Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6351560Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6352336Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6353269Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6354192Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6355127Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6356062Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6356843Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6357613Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6358546Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6359476Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6360404Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6361338Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6362226Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6363047Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6363816Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6364562Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6365436Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6366388Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6367174Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6367973Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6368989Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6369945Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6370949Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6371898Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6372791Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6373613Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6374365Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6375125Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6376005Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6376955Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6377892Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6378914Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6379860Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6380794Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6381597Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6382397Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6383357Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6384288Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6385245Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6386193Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6387084Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6388048Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6388816Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6389571Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6390450Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6391379Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6392326Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6393272Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6394206Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6395155Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6396103Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6397047Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6398099Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6399048Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6399959Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6400711Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6401429Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6402139Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6402928Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6403756Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6404513Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6405235Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6405910Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6406502Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6406929Z #153 _start from ??:0 2025-12-04T09:55:32.6407223Z #154 from ??:0 2025-12-04T09:55:32.6407454Z 2025-12-04T09:55:32.6407459Z 2025-12-04T09:55:32.6407683Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6408813Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6409731Z 2025-12-04T09:55:32.6409992Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6410614Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.6412102Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py:257: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T09:55:32.6413573Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6414693Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b312d84b484956a1.xml - 2025-12-04T09:55:32.6415757Z =========================== short test summary info ============================ 2025-12-04T09:55:32.6417000Z FAILED [0.0050s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda - torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.6418744Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.6420018Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.6420823Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.6421387Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.6421756Z 2025-12-04T09:55:32.6422367Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.6423228Z C++ CapturedTraceback: 2025-12-04T09:55:32.6424729Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.6426732Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.6428037Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.6429292Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.6430076Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.6430798Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.6431454Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6433964Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6436846Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6437805Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6438387Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.6439004Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.6439828Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.6440803Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6441726Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6442637Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6443556Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6444475Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6445387Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6446298Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6447212Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6448131Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6449041Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6449944Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6450860Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6451777Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6452678Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6453595Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6454511Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6455495Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6456404Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6457242Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6458033Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6459030Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6459950Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6460879Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6461807Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6462627Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6463325Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6464109Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6464934Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6465621Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6466400Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6467216Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6467913Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6468679Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6469622Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6470555Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6471336Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6472102Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6473029Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6473958Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6474891Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6475805Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6476733Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6477653Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6478567Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6479492Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6480307Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6481005Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6481768Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6482638Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6483443Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6484269Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6485006Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6485931Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6486865Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6487922Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6488855Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6489638Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6490416Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6491335Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6492264Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6493192Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6494115Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6494969Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6495773Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6496520Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6497235Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.6497899Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6498759Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6499695Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6500620Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6501553Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6502481Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6503259Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6504026Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6504951Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6505881Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6506807Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6507727Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6508505Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6509281Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6510210Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6511126Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6512050Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6513101Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6513986Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6514890Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6515648Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6516408Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6517272Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6518213Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6519001Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6519795Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6520722Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6521661Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6522603Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6523540Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6524418Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6525236Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6525996Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6526737Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6527618Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6528561Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6529509Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6530438Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6531386Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6532324Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6533115Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6533894Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6534843Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6535786Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6536735Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6537665Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6538623Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6539455Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6540213Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6540977Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6541958Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6542912Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6543913Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6544868Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6545822Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6546773Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6547708Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6548658Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6549616Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6550562Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6551384Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6552142Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6552882Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6553573Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6554367Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6555197Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6555975Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6556683Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6557371Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6557992Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6558423Z #153 _start from ??:0 2025-12-04T09:55:32.6558731Z #154 from ??:0 2025-12-04T09:55:32.6558980Z 2025-12-04T09:55:32.6558985Z 2025-12-04T09:55:32.6559204Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6560356Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6561276Z 2025-12-04T09:55:32.6561546Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6562150Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.6562689Z ================= 1 failed, 153 deselected, 2 rerun in 16.17s ================== 2025-12-04T09:55:32.6563141Z Got exit code 1 2025-12-04T09:55:32.6563410Z Retrying single test... 2025-12-04T09:55:32.6564048Z W1204 09:48:02.887000 4615 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.6565211Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-9ee61a34966ac1ed.xml 2025-12-04T09:55:32.6566090Z ============================= test session starts ============================== 2025-12-04T09:55:32.6566754Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.6567357Z cachedir: .pytest_cache 2025-12-04T09:55:32.6568079Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.6568931Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.6569290Z configfile: pytest.ini 2025-12-04T09:55:32.6570032Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.6571013Z collecting ... collected 934 items / 153 deselected / 781 selected 2025-12-04T09:55:32.6572248Z stepcurrent: skipping 48 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6573369Z Running 1 items in this shard 2025-12-04T09:55:32.6573580Z 2025-12-04T09:55:32.6574740Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda [W1204 09:48:04.889148054 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T09:55:32.6576011Z 2025-12-04T09:55:32.6576164Z ('RERUN', {'yellow': True}) [15.1626s] [100%] 2025-12-04T09:55:32.6577556Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda [W1204 09:48:19.059849040 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T09:55:32.6578902Z 2025-12-04T09:55:32.6579040Z ('RERUN', {'yellow': True}) [0.0066s] [100%] 2025-12-04T09:55:32.6580441Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda [W1204 09:48:19.066892811 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T09:55:32.6581720Z 2025-12-04T09:55:32.6581825Z FAILED [0.0050s] [100%] 2025-12-04T09:55:32.6582003Z 2025-12-04T09:55:32.6582165Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.6582889Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.6583595Z Traceback (most recent call last): 2025-12-04T09:55:32.6584320Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.6585056Z self.check_model(model, (a,)) 2025-12-04T09:55:32.6585729Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.6586433Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.6586959Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.6587484Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.6588153Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.6588714Z state = deepcopy(state, memo) 2025-12-04T09:55:32.6589210Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.6589738Z y = copier(x, memo) 2025-12-04T09:55:32.6590231Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.6590845Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.6591399Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6591918Z y = copier(memo) 2025-12-04T09:55:32.6592512Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.6593232Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.6593935Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.6594754Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.6595455Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6595956Z y = copier(memo) 2025-12-04T09:55:32.6596699Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.6597381Z new_storage = self.clone() 2025-12-04T09:55:32.6597958Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.6598794Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6599387Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.6600400Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.6601654Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.6602466Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.6603026Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.6603396Z 2025-12-04T09:55:32.6604033Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.6604889Z C++ CapturedTraceback: 2025-12-04T09:55:32.6606404Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.6608341Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.6609644Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.6610812Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.6611600Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.6612324Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.6613001Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6615495Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6618494Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6619483Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6620092Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.6620739Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.6621568Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.6622562Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6623506Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6624433Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6625380Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6626399Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6627336Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6628319Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6629252Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6630190Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6631122Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6632041Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6632975Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6633908Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6634834Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6635759Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6636691Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6637626Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6638559Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6639331Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6640118Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6641049Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6641959Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6642888Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6643821Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6644643Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6645339Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6646122Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6646948Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6647654Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6648421Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6649245Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6649944Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6650711Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6651651Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6652583Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6653365Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6654134Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6655152Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6656086Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6657159Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6658077Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6659086Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6660023Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6660960Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6661887Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6662707Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6663412Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6664183Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6665059Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6665870Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6666622Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6667361Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6668228Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6669166Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6670099Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6671020Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6671804Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6672586Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6673517Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6674431Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6675358Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6676291Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6677153Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6677964Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6678711Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6679421Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.6680083Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6680861Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6681798Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6682734Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6683728Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6684666Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6685514Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6686278Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6687214Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6688316Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6689249Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6690167Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6690961Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6691743Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6692678Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6693598Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6694529Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6695473Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6696364Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6697173Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6697948Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6698783Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6699650Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6700610Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6701414Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6702218Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6703156Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6704106Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6705063Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6706014Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6706894Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6707724Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6708491Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6709260Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6710125Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6711080Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6712029Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6713084Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6714031Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6715060Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6715862Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6716647Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6717597Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6718544Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6719493Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6720430Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6721318Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6722139Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6722895Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6723641Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6724512Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6725461Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6726396Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6727341Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6728285Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6729235Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6730170Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6731111Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6732056Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6733000Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6733813Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6734571Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6735309Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6736020Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6736801Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6737636Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6738493Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6739206Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6739903Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6740522Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6740974Z #153 _start from ??:0 2025-12-04T09:55:32.6741349Z #154 from ??:0 2025-12-04T09:55:32.6741605Z 2025-12-04T09:55:32.6741611Z 2025-12-04T09:55:32.6741835Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6743103Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6744023Z 2025-12-04T09:55:32.6744307Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6744938Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.6746437Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py:257: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T09:55:32.6747933Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6748740Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.6749426Z Traceback (most recent call last): 2025-12-04T09:55:32.6750156Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.6750897Z self.check_model(model, (a,)) 2025-12-04T09:55:32.6751550Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.6752255Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.6752772Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.6753303Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.6753814Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.6754367Z state = deepcopy(state, memo) 2025-12-04T09:55:32.6754865Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.6755374Z y = copier(x, memo) 2025-12-04T09:55:32.6755860Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.6756467Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.6757011Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6757521Z y = copier(memo) 2025-12-04T09:55:32.6758112Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.6758839Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.6759528Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.6760350Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.6761044Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6761550Z y = copier(memo) 2025-12-04T09:55:32.6762138Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.6762807Z new_storage = self.clone() 2025-12-04T09:55:32.6763403Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.6764141Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6764732Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.6765742Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.6767003Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.6767799Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.6768354Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.6768807Z 2025-12-04T09:55:32.6769439Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.6770362Z C++ CapturedTraceback: 2025-12-04T09:55:32.6771881Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.6773815Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.6775120Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.6776305Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.6777085Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.6777817Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.6778564Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6781081Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6783963Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6784929Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6785535Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.6786176Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.6787019Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.6788153Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6789104Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6790046Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6790978Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6791917Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6792859Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6793796Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6794710Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6795647Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6796585Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6797517Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6798579Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6799518Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6800530Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6801463Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6802376Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6803301Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6804231Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6805020Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6805795Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6806730Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6807672Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6808596Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6809526Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6810348Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6811055Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6811823Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6812650Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6813351Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6814138Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6814949Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6815649Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6816428Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6817345Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6818271Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6819172Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6819961Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6820882Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6821824Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6822763Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6823697Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6824620Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6825552Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6826479Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6827481Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6828288Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6829057Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6829841Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6830701Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6831510Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6832262Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6833009Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6833859Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6834794Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6835722Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6836661Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6837430Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6838212Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6839142Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6840060Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6840984Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6841917Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6842793Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6843593Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6844341Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6845052Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.6845727Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6846494Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6847427Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6848361Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6849299Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6850213Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6851002Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6851783Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6852698Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6853626Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6854556Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6855553Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6856327Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6857101Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6869776Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6870737Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6871684Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6872638Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6873538Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6874347Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6875133Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6875902Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6876795Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6877736Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6878538Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6879333Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6880282Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6881215Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6882163Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6883106Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6883991Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6884811Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6885581Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6886339Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6887204Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6888361Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6889314Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6890258Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6891192Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6892141Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6892935Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6893729Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6894662Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6895606Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6896734Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6897666Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6898769Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6899596Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6900360Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6901107Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6901985Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6902933Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6903886Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6904823Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6905770Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6906723Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6907668Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6908599Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6909549Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6910487Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6911318Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6912058Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6912793Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6913502Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6914280Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6915109Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6915880Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6916595Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6917270Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6917884Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6918334Z #153 _start from ??:0 2025-12-04T09:55:32.6918629Z #154 from ??:0 2025-12-04T09:55:32.6918878Z 2025-12-04T09:55:32.6918884Z 2025-12-04T09:55:32.6919102Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6920263Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6921189Z 2025-12-04T09:55:32.6921469Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6922098Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.6923593Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py:257: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T09:55:32.6925169Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6925686Z =================================== FAILURES =================================== 2025-12-04T09:55:32.6926400Z _ AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda _ 2025-12-04T09:55:32.6927163Z Traceback (most recent call last): 2025-12-04T09:55:32.6927889Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 6893, in test__weight_int4pack_mm 2025-12-04T09:55:32.6928633Z self.check_model(model, (a,)) 2025-12-04T09:55:32.6929291Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 247, in check_model 2025-12-04T09:55:32.6929996Z ref_model = copy.deepcopy(model) 2025-12-04T09:55:32.6930519Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 172, in deepcopy 2025-12-04T09:55:32.6931042Z y = _reconstruct(x, memo, *rv) 2025-12-04T09:55:32.6931348Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 271, in _reconstruct 2025-12-04T09:55:32.6931473Z state = deepcopy(state, memo) 2025-12-04T09:55:32.6931751Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 146, in deepcopy 2025-12-04T09:55:32.6931865Z y = copier(x, memo) 2025-12-04T09:55:32.6932164Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 231, in _deepcopy_dict 2025-12-04T09:55:32.6932335Z y[deepcopy(key, memo)] = deepcopy(value, memo) 2025-12-04T09:55:32.6932597Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6932699Z y = copier(memo) 2025-12-04T09:55:32.6933127Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_tensor.py", line 180, in __deepcopy__ 2025-12-04T09:55:32.6933300Z new_storage = self._typed_storage()._deepcopy(memo) 2025-12-04T09:55:32.6933708Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 1139, in _deepcopy 2025-12-04T09:55:32.6933994Z return self._new_wrapped_storage(copy.deepcopy(self._untyped_storage, memo)) 2025-12-04T09:55:32.6934259Z File "/opt/conda/envs/py_3.10/lib/python3.10/copy.py", line 153, in deepcopy 2025-12-04T09:55:32.6934373Z y = copier(memo) 2025-12-04T09:55:32.6934785Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 243, in __deepcopy__ 2025-12-04T09:55:32.6934904Z new_storage = self.clone() 2025-12-04T09:55:32.6935283Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py", line 257, in clone 2025-12-04T09:55:32.6935510Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.6935745Z torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.6936397Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.6936880Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.6937083Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.6937321Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.6937327Z 2025-12-04T09:55:32.6937957Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.6938070Z C++ CapturedTraceback: 2025-12-04T09:55:32.6939483Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.6939997Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.6940735Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.6941123Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.6941458Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.6941767Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.6941999Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6944147Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6944776Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.6945001Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.6945250Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.6945518Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.6945945Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.6946370Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6946749Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6947163Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6947550Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6947962Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6948347Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6948752Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6949126Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6949544Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6949916Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6950342Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6950714Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6951124Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6951507Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6951913Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6952296Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6952701Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6953073Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6953426Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6953800Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6954264Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6954646Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6955052Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6955437Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6955732Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6955994Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6956381Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6956677Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6956950Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6957319Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6957609Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6957876Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6958248Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6958673Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6959045Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6959306Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6959689Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6960096Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6960466Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6960882Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6961251Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6961669Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6962039Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6962448Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6962831Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6963127Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.6963398Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6963769Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6964121Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6964441Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6964736Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6965105Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6965530Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6965902Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6966384Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6966755Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6967013Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6967395Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6967800Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6968191Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6968599Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6968971Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6969338Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6969645Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6969951Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6970220Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.6970479Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6970865Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6971277Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6971649Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6972073Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6972444Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6972717Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6973089Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6973499Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6973883Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6974294Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6974681Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6974944Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6975315Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6975736Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6976108Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6976526Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6976910Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6977334Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6977663Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6977962Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6978332Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6978841Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6979225Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6979509Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6979890Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6980305Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6980707Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6981122Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6981523Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6981883Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6982195Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6982505Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6982813Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6983242Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6983623Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6984040Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6984433Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6984848Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6985226Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6985510Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.6985888Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6986315Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6986696Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6987111Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6987505Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6988038Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.6988365Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.6988667Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.6988975Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.6989403Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.6989945Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6990376Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6990834Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6991250Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6991644Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6992058Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6992435Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6992860Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.6993242Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.6993547Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.6993859Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.6994130Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.6994429Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.6994784Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.6995122Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.6995416Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.6995689Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.6995975Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.6996174Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.6996277Z #153 _start from ??:0 2025-12-04T09:55:32.6996406Z #154 from ??:0 2025-12-04T09:55:32.6996412Z 2025-12-04T09:55:32.6996417Z 2025-12-04T09:55:32.6996634Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.6997439Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.6997446Z 2025-12-04T09:55:32.6997713Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.6997939Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.6999079Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/storage.py:257: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T09:55:32.6999311Z return type(self)(self.nbytes(), device=self.device).copy_(self) 2025-12-04T09:55:32.7000079Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-9ee61a34966ac1ed.xml - 2025-12-04T09:55:32.7000257Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7001183Z FAILED [0.0050s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda - torch.AcceleratorError: CUDA error: invalid device function 2025-12-04T09:55:32.7001829Z Search for `cudaErrorInvalidDeviceFunction' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information. 2025-12-04T09:55:32.7002402Z CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. 2025-12-04T09:55:32.7002605Z For debugging consider passing CUDA_LAUNCH_BLOCKING=1 2025-12-04T09:55:32.7002845Z Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. 2025-12-04T09:55:32.7002909Z 2025-12-04T09:55:32.7003534Z Exception raised from copy_device_to_device at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/Copy.cu:337 (most recent call first): 2025-12-04T09:55:32.7003653Z C++ CapturedTraceback: 2025-12-04T09:55:32.7004972Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T09:55:32.7005480Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T09:55:32.7006150Z #6 c10::AcceleratorError::AcceleratorError(c10::SourceLocation, int, std::__cxx11::basic_string, std::allocator > const&) from :0 2025-12-04T09:55:32.7006540Z #7 c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, unsigned int, bool) from ??:0 2025-12-04T09:55:32.7006808Z #8 at::native::copy_device_to_device(at::TensorIterator&, bool, bool) from ??:0 2025-12-04T09:55:32.7007118Z #9 at::native::copy_impl(at::Tensor&, at::Tensor const&, bool) [clone .isra.0] from Copy.cpp:0 2025-12-04T09:55:32.7007348Z #10 at::native::copy_(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.7009512Z #11 c10::impl::wrap_kernel_functor_unboxed_, at::Tensor&, c10::guts::typelist::typelist >, at::Tensor& (c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.7010137Z #12 torch::autograd::VariableType::(anonymous namespace)::copy_(c10::DispatchKeySet, at::Tensor&, at::Tensor const&, bool) from VariableTypeManual.cpp:0 2025-12-04T09:55:32.7010359Z #13 at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T09:55:32.7010605Z #14 at::storage_copy(c10::Storage&, c10::Storage const&, bool) from ??:0 2025-12-04T09:55:32.7010870Z #15 THPStorage_copy_(_object*, _object*, _object*) from StorageMethods.cpp:0 2025-12-04T09:55:32.7011302Z #16 method_vectorcall_VARARGS_KEYWORDS from /usr/local/src/conda/python-3.10.14/Objects/descrobject.c:344 2025-12-04T09:55:32.7011728Z #17 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7012112Z #18 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7012517Z #19 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7012904Z #20 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7013316Z #21 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7013701Z #22 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7014108Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7014485Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7014901Z #25 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7015332Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7015753Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7016259Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7016667Z #29 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7017056Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7017464Z #31 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7017850Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7018256Z #33 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7018710Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7018992Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7019366Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7019792Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7020164Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7020570Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7020955Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7021255Z #41 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.7021518Z #42 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7021908Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7022203Z #44 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.7022480Z #45 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7022850Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7023144Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.7023413Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7023783Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7024202Z #50 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7024578Z #51 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7024837Z #52 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7025221Z #53 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7025629Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7026000Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7026420Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7026789Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7027206Z #58 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7027646Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7028052Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7028490Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7028782Z #62 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T09:55:32.7029053Z #63 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7029425Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7029775Z #65 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.7030091Z #66 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.7030386Z #67 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.7030706Z #68 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.7031113Z #69 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.7031491Z #70 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7031907Z #71 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7032282Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7032541Z #73 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7032924Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7033330Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7033719Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7034124Z #77 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7034499Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7034858Z #79 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.7035166Z #80 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.7035473Z #81 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.7035744Z #82 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T09:55:32.7036003Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7036387Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7036798Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7037181Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7037590Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7037961Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7038229Z #89 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7038600Z #90 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7039006Z #91 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7039398Z #92 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7039862Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7040250Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7040585Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7040957Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7041376Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7041748Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7042153Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7042548Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7042911Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.7043238Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.7043543Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.7043851Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.7044276Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.7044656Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7044935Z #107 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7045316Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7045735Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7046126Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7046543Z #111 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7046934Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7047293Z #113 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.7047603Z #114 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.7047917Z #115 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.7048225Z #116 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.7048638Z #117 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.7049035Z #118 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7049449Z #119 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7049846Z #120 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7050259Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7050637Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7050914Z #123 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T09:55:32.7051293Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7051718Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7052156Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7052569Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7053020Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7053377Z #129 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T09:55:32.7053701Z #130 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T09:55:32.7054000Z #131 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T09:55:32.7054307Z #132 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T09:55:32.7054735Z #133 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T09:55:32.7055120Z #134 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7055533Z #135 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7055930Z #136 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7056341Z #137 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7056731Z #138 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7057143Z #139 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7057520Z #140 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7057944Z #141 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T09:55:32.7058326Z #142 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T09:55:32.7058704Z #143 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T09:55:32.7059024Z #144 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T09:55:32.7059296Z #145 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T09:55:32.7059596Z #146 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T09:55:32.7059950Z #147 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T09:55:32.7060281Z #148 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T09:55:32.7060588Z #149 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T09:55:32.7060859Z #150 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T09:55:32.7061147Z #151 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T09:55:32.7061344Z #152 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T09:55:32.7061450Z #153 _start from ??:0 2025-12-04T09:55:32.7061587Z #154 from ??:0 2025-12-04T09:55:32.7061593Z 2025-12-04T09:55:32.7061598Z 2025-12-04T09:55:32.7061822Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7062631Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.7062637Z 2025-12-04T09:55:32.7062904Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7063087Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7063301Z ================= 1 failed, 153 deselected, 2 rerun in 15.26s ================== 2025-12-04T09:55:32.7063472Z Got exit code 1 2025-12-04T09:55:32.7064193Z FAILED CONSISTENTLY: test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda 2025-12-04T09:55:32.7064670Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T09:55:32.7065114Z W1204 09:48:30.007000 4736 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7065694Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-31f94497420761eb.xml 2025-12-04T09:55:32.7065862Z ============================= test session starts ============================== 2025-12-04T09:55:32.7066229Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7066343Z cachedir: .pytest_cache 2025-12-04T09:55:32.7066875Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7067014Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7067127Z configfile: pytest.ini 2025-12-04T09:55:32.7067673Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7067916Z collecting ... collected 934 items / 49 deselected / 885 selected 2025-12-04T09:55:32.7068060Z stepcurrent: skipping 49 already run items. 2025-12-04T09:55:32.7068188Z Running 105 items in this shard 2025-12-04T09:55:32.7068194Z 2025-12-04T09:55:32.7069046Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_2_cuda SKIPPED [0.0038s] (requires Intel GPU) [ 0%] 2025-12-04T09:55:32.7070388Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_runtime_asserts_cuda <- test/inductor/test_torchinductor.py W1204 09:48:32.039000 4736 site-packages/torch/fx/experimental/symbolic_shapes.py:7357] propagate_real_tensors evaluate_expr(Ne(u1 - 3, 7)) -> False 2025-12-04T09:55:32.7070659Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] 2025-12-04T09:55:32.7071144Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] ################################################################################################### 2025-12-04T09:55:32.7071808Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] WARNING: 2 issue(s) found during export, and it was not able to soundly produce a graph. 2025-12-04T09:55:32.7072380Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] To view the report of failures in an html page, please run the command: 2025-12-04T09:55:32.7073019Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] `tlparse /tmp/export_jenkins/dedicated_log_torch_trace_jiadzds_.log --export` 2025-12-04T09:55:32.7073591Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] Or, you can view the errors in python by inspecting `print(ep._report)`. 2025-12-04T09:55:32.7073848Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] 2025-12-04T09:55:32.7074475Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] While tracing we found 1 operator(s) which do not have a fake kernel registered. 2025-12-04T09:55:32.7075153Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] If you intend to retrace the exported graph or run it with fake tensors, please run it under the 2025-12-04T09:55:32.7075797Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] following context manager, which will register a fake kernel for those operators. 2025-12-04T09:55:32.7076061Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] ``` 2025-12-04T09:55:32.7076781Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] with torch._library.fake_profile.unsafe_generate_fake_kernels(ep._report.op_profiles): 2025-12-04T09:55:32.7077174Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] # run with fake tensors 2025-12-04T09:55:32.7077492Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] ``` 2025-12-04T09:55:32.7077976Z W1204 09:48:32.069000 4736 site-packages/torch/export/_draft_export.py:524] ################################################################################################# 2025-12-04T09:55:32.7078105Z Error: Expected u5 >= 10 but received 2 2025-12-04T09:55:32.7078210Z PASSED [5.9722s] [ 1%] 2025-12-04T09:55:32.7079053Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_bmm_multiple_dynamic_cuda SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 2%] 2025-12-04T09:55:32.7079721Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_bool_input_cuda <- test/inductor/test_torchinductor.py PASSED [12.3691s] [ 3%] 2025-12-04T09:55:32.7080428Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_boolean_indexing_cuda <- test/inductor/test_torchinductor.py PASSED [6.7813s] [ 4%] 2025-12-04T09:55:32.7081639Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_codegen_int_array_var_fix_memory_leak_cuda <- test/inductor/test_torchinductor.py W1204 09:48:58.249000 4736 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7081747Z PASSED [14.5736s] [ 5%] 2025-12-04T09:55:32.7082425Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cuda PASSED [6.4242s] [ 6%] 2025-12-04T09:55:32.7083700Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_mismatched_branch_output_dynamic_True_cuda W1204 09:49:18.119000 4736 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7084373Z W1204 09:49:18.119000 4736 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7085039Z W1204 09:49:18.120000 4736 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7085142Z PASSED [7.2802s] [ 7%] 2025-12-04T09:55:32.7085928Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_symint_input_disable_one_pass_cuda <- test/inductor/test_torchinductor.py PASSED [6.2724s] [ 8%] 2025-12-04T09:55:32.7087187Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_unbacked_symint_closure_dynamic_True_cuda W1204 09:49:31.584000 4736 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7088021Z W1204 09:49:31.585000 4736 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7088683Z W1204 09:49:31.585000 4736 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7088804Z PASSED [6.3877s] [ 9%] 2025-12-04T09:55:32.7089271Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_constant_cuda PASSED [5.4556s] [ 10%] 2025-12-04T09:55:32.7090067Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_convolution_cuda SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 11%] 2025-12-04T09:55:32.7090794Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_custom_op_in_subgraph_cuda <- test/inductor/test_torchinductor.py PASSED [5.1375s] [ 12%] 2025-12-04T09:55:32.7091676Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dup_unbacked_sym_decl_with_refinement_cuda <- test/inductor/test_torchinductor.py PASSED [5.9217s] [ 13%] 2025-12-04T09:55:32.7092374Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dynamic_scalar_cuda <- test/inductor/test_torchinductor.py PASSED [5.9814s] [ 14%] 2025-12-04T09:55:32.7093137Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_embedding_bag_cuda <- test/inductor/test_torchinductor.py PASSED [5.0913s] [ 15%] 2025-12-04T09:55:32.7093655Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fallback_mem_leak_fix_cuda PASSED [5.7568s] [ 16%] 2025-12-04T09:55:32.7094310Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fft_c2c_cuda <- test/inductor/test_torchinductor.py PASSED [5.2626s] [ 17%] 2025-12-04T09:55:32.7095064Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fp8_view_of_param_cuda SKIPPED [0.0003s] (FP8 is only supported on H100+, SM 8.9 and MI300+ devices) [ 18%] 2025-12-04T09:55:32.7095757Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_int_list_input_cuda <- test/inductor/test_torchinductor.py PASSED [5.3469s] [ 19%] 2025-12-04T09:55:32.7096241Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_issue_140766_cuda PASSED [10.7913s] [ 20%] 2025-12-04T09:55:32.7096970Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_large_mmaped_weights_cuda <- test/inductor/test_torchinductor.py PASSED [13.7087s] [ 20%] 2025-12-04T09:55:32.7097646Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_missing_cubin_cuda <- test/inductor/test_torchinductor.py PASSED [24.6213s] [ 21%] 2025-12-04T09:55:32.7098788Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_mixed_device_1_cuda <- test/inductor/test_torchinductor.py W1204 09:51:11.241000 4736 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T09:55:32.7098911Z PASSED [5.9982s] [ 22%] 2025-12-04T09:55:32.7099640Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_model_modified_weights_cuda <- test/inductor/test_torchinductor.py PASSED [6.1637s] [ 23%] 2025-12-04T09:55:32.7100395Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_non_contiguous_output_alias_cuda <- test/inductor/test_torchinductor.py PASSED [5.4240s] [ 24%] 2025-12-04T09:55:32.7101064Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_output_path_1_cuda <- test/inductor/test_torchinductor.py PASSED [5.5909s] [ 25%] 2025-12-04T09:55:32.7101828Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_fp8_cuda SKIPPED [0.0003s] (FP8 is only supported on H100+, SM 8.9 and MI300+ devices) [ 26%] 2025-12-04T09:55:32.7102637Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_large_cuda SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 27%] 2025-12-04T09:55:32.7103368Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scatter_reduce_fallback_cuda <- test/inductor/test_torchinductor.py PASSED [5.4247s] [ 28%] 2025-12-04T09:55:32.7104027Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sdpa_2_cuda SKIPPED [0.0003s] (bfloat16 only supported in sm80+ or XPU) [ 29%] 2025-12-04T09:55:32.7104768Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_shifted_constraint_ranges_cuda <- test/inductor/test_torchinductor.py PASSED [6.1918s] [ 30%] 2025-12-04T09:55:32.7105441Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_embed_kernel_binary_True_max_autotune_False_cuda PASSED [6.6190s] [ 31%] 2025-12-04T09:55:32.7106271Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.6127s] [ 32%] 2025-12-04T09:55:32.7107151Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.5069s] [ 32%] 2025-12-04T09:55:32.7107895Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py FAILED [0.5069s] [ 32%] 2025-12-04T09:55:32.7107960Z 2025-12-04T09:55:32.7108107Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.7108453Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7108578Z Traceback (most recent call last): 2025-12-04T09:55:32.7108987Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7109107Z return value(self) 2025-12-04T09:55:32.7109625Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7109865Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7110304Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7110425Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7110832Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7110972Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7111379Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7111596Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7112128Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7112269Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7112820Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7112912Z raise e 2025-12-04T09:55:32.7113469Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7113571Z return func( 2025-12-04T09:55:32.7114121Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7114362Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7114821Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7114949Z return compile_fx_aot( 2025-12-04T09:55:32.7115445Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7115568Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7116055Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7116161Z return compile_fx( 2025-12-04T09:55:32.7116629Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7116784Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7117362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7117490Z return _compile_fx_main( 2025-12-04T09:55:32.7117997Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7118197Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7118737Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7118892Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7119465Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7119583Z return compile_fx_forward( 2025-12-04T09:55:32.7120163Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7120285Z return inner_compile( 2025-12-04T09:55:32.7120568Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7120679Z return func(*args, **kwds) 2025-12-04T09:55:32.7121190Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7121455Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7121963Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7122143Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7122646Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7122858Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7123363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7123520Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7124059Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7124382Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7124917Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7125043Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7125600Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7125735Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7126229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7126384Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7126640Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7126647Z 2025-12-04T09:55:32.7126863Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7127539Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7127545Z 2025-12-04T09:55:32.7127811Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7128051Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7128156Z unimplemented [] 2025-12-04T09:55:32.7128314Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7128429Z inductor [] 2025-12-04T09:55:32.7128529Z graph_break [] 2025-12-04T09:55:32.7128749Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7129587Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7129704Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7130449Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7130550Z warnings.warn( 2025-12-04T09:55:32.7130887Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7131081Z Traceback (most recent call last): 2025-12-04T09:55:32.7131491Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7131656Z return value(self) 2025-12-04T09:55:32.7132185Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7132409Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7132855Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7132976Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7133362Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7133513Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7133918Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7134123Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7134660Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7134794Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7135345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7135437Z raise e 2025-12-04T09:55:32.7135974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7136082Z return func( 2025-12-04T09:55:32.7136634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7136878Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7137337Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7137450Z return compile_fx_aot( 2025-12-04T09:55:32.7137959Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7138086Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7138637Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7138759Z return compile_fx( 2025-12-04T09:55:32.7139231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7139380Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7139957Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7140071Z return _compile_fx_main( 2025-12-04T09:55:32.7140599Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7140801Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7141345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7141493Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7141996Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7142122Z return compile_fx_forward( 2025-12-04T09:55:32.7142639Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7142748Z return inner_compile( 2025-12-04T09:55:32.7143038Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7143238Z return func(*args, **kwds) 2025-12-04T09:55:32.7143753Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7144077Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7144572Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7144757Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7145261Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7145453Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7145967Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7146113Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7146664Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7146986Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7147509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7147648Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7148197Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7148332Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7148818Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7148955Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7149224Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7149234Z 2025-12-04T09:55:32.7149451Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7150116Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7150142Z 2025-12-04T09:55:32.7150412Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7150635Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7150750Z unimplemented [] 2025-12-04T09:55:32.7150908Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7151005Z inductor [] 2025-12-04T09:55:32.7151114Z graph_break [] 2025-12-04T09:55:32.7151333Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7152174Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7152290Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7153022Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7153142Z warnings.warn( 2025-12-04T09:55:32.7153359Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7153462Z unimplemented [] 2025-12-04T09:55:32.7153629Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7153723Z inductor [] 2025-12-04T09:55:32.7153820Z graph_break [] 2025-12-04T09:55:32.7154046Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7154862Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7155050Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7155781Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7155940Z warnings.warn( 2025-12-04T09:55:32.7156100Z =================================== FAILURES =================================== 2025-12-04T09:55:32.7156428Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7156563Z Traceback (most recent call last): 2025-12-04T09:55:32.7156970Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7157074Z return value(self) 2025-12-04T09:55:32.7157605Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7157828Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7158265Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7158404Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7158792Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7158943Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7159346Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7159548Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7160085Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7160214Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7160756Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7160857Z raise e 2025-12-04T09:55:32.7161403Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7161509Z return func( 2025-12-04T09:55:32.7162062Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7162296Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7162767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7162881Z return compile_fx_aot( 2025-12-04T09:55:32.7163386Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7163508Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7163982Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7164099Z return compile_fx( 2025-12-04T09:55:32.7164567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7164706Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7165293Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7165407Z return _compile_fx_main( 2025-12-04T09:55:32.7165918Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7166114Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7166637Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7166798Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7167361Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7167478Z return compile_fx_forward( 2025-12-04T09:55:32.7168005Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7168174Z return inner_compile( 2025-12-04T09:55:32.7168463Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7168574Z return func(*args, **kwds) 2025-12-04T09:55:32.7169074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7169350Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7169845Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7170034Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7170536Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7170728Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7171241Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7171383Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7171917Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7172247Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7172763Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7172900Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7173448Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7173569Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7174069Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7174204Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7174472Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7174478Z 2025-12-04T09:55:32.7174692Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7175352Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7175358Z 2025-12-04T09:55:32.7175636Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7175858Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7175972Z unimplemented [] 2025-12-04T09:55:32.7176126Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7176218Z inductor [] 2025-12-04T09:55:32.7176330Z graph_break [] 2025-12-04T09:55:32.7176549Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7177367Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7177492Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7178219Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7178330Z warnings.warn( 2025-12-04T09:55:32.7178620Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7178719Z unimplemented [] 2025-12-04T09:55:32.7178956Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7179054Z inductor [] 2025-12-04T09:55:32.7179148Z graph_break [] 2025-12-04T09:55:32.7179434Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7180247Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7180357Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7181096Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7181194Z warnings.warn( 2025-12-04T09:55:32.7181420Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7181522Z unimplemented [] 2025-12-04T09:55:32.7181681Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7181787Z inductor [] 2025-12-04T09:55:32.7181882Z graph_break [] 2025-12-04T09:55:32.7182094Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7182920Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7183034Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7183771Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7183872Z warnings.warn( 2025-12-04T09:55:32.7184616Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-31f94497420761eb.xml - 2025-12-04T09:55:32.7184797Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7185658Z FAILED [0.5069s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7185669Z 2025-12-04T09:55:32.7185894Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7186557Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7186563Z 2025-12-04T09:55:32.7186829Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7187019Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7187270Z = 1 failed, 26 passed, 7 skipped, 49 deselected, 2 rerun in 202.32s (0:03:22) == 2025-12-04T09:55:32.7187375Z Got exit code 1 2025-12-04T09:55:32.7187482Z Retrying single test... 2025-12-04T09:55:32.7188094Z W1204 09:52:05.440000 10543 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7188674Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b252d51c09f06fd1.xml 2025-12-04T09:55:32.7188845Z ============================= test session starts ============================== 2025-12-04T09:55:32.7189211Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7189319Z cachedir: .pytest_cache 2025-12-04T09:55:32.7189849Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7189987Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7190099Z configfile: pytest.ini 2025-12-04T09:55:32.7190645Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7191018Z collecting ... collected 934 items / 153 deselected / 781 selected 2025-12-04T09:55:32.7191773Z stepcurrent: skipping 82 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7191976Z Running 1 items in this shard 2025-12-04T09:55:32.7191981Z 2025-12-04T09:55:32.7192814Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [1.4401s] [100%] 2025-12-04T09:55:32.7193629Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.4826s] [100%] 2025-12-04T09:55:32.7194372Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py FAILED [0.4792s] [100%] 2025-12-04T09:55:32.7194378Z 2025-12-04T09:55:32.7194527Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.7194874Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7195004Z Traceback (most recent call last): 2025-12-04T09:55:32.7195410Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7195525Z return value(self) 2025-12-04T09:55:32.7196038Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7196277Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7196708Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7196829Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7197228Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7197370Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7197775Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7197989Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7198516Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7198657Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7199201Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7199293Z raise e 2025-12-04T09:55:32.7199840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7199934Z return func( 2025-12-04T09:55:32.7200490Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7200731Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7201189Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7201317Z return compile_fx_aot( 2025-12-04T09:55:32.7201808Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7201928Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7202412Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7202516Z return compile_fx( 2025-12-04T09:55:32.7203005Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7203139Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7203770Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7203898Z return _compile_fx_main( 2025-12-04T09:55:32.7204458Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7204655Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7205191Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7205335Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7205854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7205971Z return compile_fx_forward( 2025-12-04T09:55:32.7206489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7206613Z return inner_compile( 2025-12-04T09:55:32.7206893Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7207010Z return func(*args, **kwds) 2025-12-04T09:55:32.7207521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7207788Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7208291Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7208462Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7208964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7209168Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7209673Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7209824Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7210361Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7210683Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7211210Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7211335Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7211895Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7212016Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7212504Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7212651Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7212907Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7212917Z 2025-12-04T09:55:32.7213135Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7213808Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7213815Z 2025-12-04T09:55:32.7214080Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7214308Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7214410Z unimplemented [] 2025-12-04T09:55:32.7214566Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7214669Z inductor [] 2025-12-04T09:55:32.7214764Z graph_break [] 2025-12-04T09:55:32.7215114Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7215966Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7216145Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7216893Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7216997Z warnings.warn( 2025-12-04T09:55:32.7217330Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7217468Z Traceback (most recent call last): 2025-12-04T09:55:32.7217872Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7217994Z return value(self) 2025-12-04T09:55:32.7218619Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7218850Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7219300Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7219429Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7219816Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7219974Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7220385Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7220604Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7221131Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7221264Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7221829Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7221925Z raise e 2025-12-04T09:55:32.7222469Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7222586Z return func( 2025-12-04T09:55:32.7223141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7223389Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7223853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7223968Z return compile_fx_aot( 2025-12-04T09:55:32.7224482Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7224612Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7225086Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7225213Z return compile_fx( 2025-12-04T09:55:32.7225685Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7225836Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7226417Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7226534Z return _compile_fx_main( 2025-12-04T09:55:32.7227057Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7227261Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7227867Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7228021Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7228527Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7228731Z return compile_fx_forward( 2025-12-04T09:55:32.7229252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7229365Z return inner_compile( 2025-12-04T09:55:32.7229662Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7229777Z return func(*args, **kwds) 2025-12-04T09:55:32.7230289Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7230555Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7231056Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7231243Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7231751Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7231959Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7232462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7232611Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7233164Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7233484Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7234010Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7234150Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7234701Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7234839Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7235323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7235463Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7235730Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7235735Z 2025-12-04T09:55:32.7235952Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7236634Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7236639Z 2025-12-04T09:55:32.7236910Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7237133Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7237254Z unimplemented [] 2025-12-04T09:55:32.7237412Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7237509Z inductor [] 2025-12-04T09:55:32.7237620Z graph_break [] 2025-12-04T09:55:32.7237839Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7238673Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7238788Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7239521Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7239697Z warnings.warn( 2025-12-04T09:55:32.7239916Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7240020Z unimplemented [] 2025-12-04T09:55:32.7240249Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7240346Z inductor [] 2025-12-04T09:55:32.7240460Z graph_break [] 2025-12-04T09:55:32.7240676Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7241488Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7241617Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7242346Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7242449Z warnings.warn( 2025-12-04T09:55:32.7242614Z =================================== FAILURES =================================== 2025-12-04T09:55:32.7242945Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7243086Z Traceback (most recent call last): 2025-12-04T09:55:32.7243492Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7243598Z return value(self) 2025-12-04T09:55:32.7244132Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7244358Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7244792Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7244928Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7245316Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7245473Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7245884Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7246085Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7246631Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7246763Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7247320Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7247413Z raise e 2025-12-04T09:55:32.7247953Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7248063Z return func( 2025-12-04T09:55:32.7248620Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7248855Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7249333Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7249451Z return compile_fx_aot( 2025-12-04T09:55:32.7249962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7250087Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7250558Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7250679Z return compile_fx( 2025-12-04T09:55:32.7251150Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7251285Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7251936Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7252051Z return _compile_fx_main( 2025-12-04T09:55:32.7252632Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7252834Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7253359Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7253521Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7254026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7254155Z return compile_fx_forward( 2025-12-04T09:55:32.7254670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7254785Z return inner_compile( 2025-12-04T09:55:32.7255079Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7255192Z return func(*args, **kwds) 2025-12-04T09:55:32.7255694Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7255972Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7256468Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7256654Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7257157Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7257348Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7257873Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7258020Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7258637Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7258985Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7259508Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7259652Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7260205Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7260328Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7260829Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7260977Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7261249Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7261260Z 2025-12-04T09:55:32.7261478Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7262141Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7262147Z 2025-12-04T09:55:32.7262432Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7262657Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7262774Z unimplemented [] 2025-12-04T09:55:32.7262931Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7263030Z inductor [] 2025-12-04T09:55:32.7263139Z graph_break [] 2025-12-04T09:55:32.7263421Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7264246Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7264431Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7265171Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7265284Z warnings.warn( 2025-12-04T09:55:32.7265503Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7265604Z unimplemented [] 2025-12-04T09:55:32.7265775Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7265870Z inductor [] 2025-12-04T09:55:32.7265966Z graph_break [] 2025-12-04T09:55:32.7266193Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7267010Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7267143Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7267876Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7267977Z warnings.warn( 2025-12-04T09:55:32.7268208Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7268311Z unimplemented [] 2025-12-04T09:55:32.7268468Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7268577Z inductor [] 2025-12-04T09:55:32.7268677Z graph_break [] 2025-12-04T09:55:32.7268894Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7269761Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7269927Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7270717Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7270825Z warnings.warn( 2025-12-04T09:55:32.7271586Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b252d51c09f06fd1.xml - 2025-12-04T09:55:32.7271773Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7272627Z FAILED [0.4792s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7272633Z 2025-12-04T09:55:32.7272871Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7273539Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7273550Z 2025-12-04T09:55:32.7273818Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7274095Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7274396Z ================== 1 failed, 153 deselected, 2 rerun in 2.48s ================== 2025-12-04T09:55:32.7274588Z Got exit code 1 2025-12-04T09:55:32.7274811Z Retrying single test... 2025-12-04T09:55:32.7275602Z W1204 09:52:24.739000 10712 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7276767Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-446d02039a2c0737.xml 2025-12-04T09:55:32.7277258Z ============================= test session starts ============================== 2025-12-04T09:55:32.7278043Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7278406Z cachedir: .pytest_cache 2025-12-04T09:55:32.7279567Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7279851Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7280081Z configfile: pytest.ini 2025-12-04T09:55:32.7281248Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7281763Z collecting ... collected 934 items / 153 deselected / 781 selected 2025-12-04T09:55:32.7283403Z stepcurrent: skipping 82 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7283669Z Running 1 items in this shard 2025-12-04T09:55:32.7283690Z 2025-12-04T09:55:32.7285559Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [1.4313s] [100%] 2025-12-04T09:55:32.7287451Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.4846s] [100%] 2025-12-04T09:55:32.7289269Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda <- test/inductor/test_torchinductor.py FAILED [0.4823s] [100%] 2025-12-04T09:55:32.7289282Z 2025-12-04T09:55:32.7289579Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.7290290Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7290542Z Traceback (most recent call last): 2025-12-04T09:55:32.7291441Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7291678Z return value(self) 2025-12-04T09:55:32.7292829Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7293329Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7294268Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7294522Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7295388Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7295684Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7296581Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7297016Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7298151Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7298503Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7299712Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7299902Z raise e 2025-12-04T09:55:32.7301122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7301326Z return func( 2025-12-04T09:55:32.7302559Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7303052Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7304021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7304463Z return compile_fx_aot( 2025-12-04T09:55:32.7305553Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7305949Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7307003Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7307220Z return compile_fx( 2025-12-04T09:55:32.7308282Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7308565Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7309728Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7309949Z return _compile_fx_main( 2025-12-04T09:55:32.7311068Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7311491Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7312664Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7312985Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7314120Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7314365Z return compile_fx_forward( 2025-12-04T09:55:32.7315526Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7315783Z return inner_compile( 2025-12-04T09:55:32.7316350Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7316587Z return func(*args, **kwds) 2025-12-04T09:55:32.7317696Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7318271Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7319385Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7319754Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7320869Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7321287Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7322401Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7322708Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7323899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7324599Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7325784Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7326054Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7327290Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7327546Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7328620Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7328922Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7329442Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7329456Z 2025-12-04T09:55:32.7329919Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7331519Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7331627Z 2025-12-04T09:55:32.7332202Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7332687Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7332904Z unimplemented [] 2025-12-04T09:55:32.7333230Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7333447Z inductor [] 2025-12-04T09:55:32.7333650Z graph_break [] 2025-12-04T09:55:32.7334108Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7335898Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7336136Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7337774Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7337990Z warnings.warn( 2025-12-04T09:55:32.7338794Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7339070Z Traceback (most recent call last): 2025-12-04T09:55:32.7339938Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7340168Z return value(self) 2025-12-04T09:55:32.7341286Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7341756Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7342721Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7342969Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7343827Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7344134Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7345032Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7345468Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7346611Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7346881Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7348091Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7348287Z raise e 2025-12-04T09:55:32.7349471Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7349689Z return func( 2025-12-04T09:55:32.7350892Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7351414Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7352421Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7352662Z return compile_fx_aot( 2025-12-04T09:55:32.7353773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7354033Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7354915Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7355138Z return compile_fx( 2025-12-04T09:55:32.7356425Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7356740Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7358074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7358425Z return _compile_fx_main( 2025-12-04T09:55:32.7359564Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7359991Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7361168Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7361481Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7362603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7362859Z return compile_fx_forward( 2025-12-04T09:55:32.7363997Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7364230Z return inner_compile( 2025-12-04T09:55:32.7364869Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7365105Z return func(*args, **kwds) 2025-12-04T09:55:32.7366231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7366800Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7367907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7368288Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7369420Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7369837Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7370921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7371233Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7372442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7373147Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7374309Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7374587Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7375806Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7376084Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7377136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7377432Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7378005Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7378018Z 2025-12-04T09:55:32.7378580Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7380074Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7380088Z 2025-12-04T09:55:32.7380673Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7381148Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7381389Z unimplemented [] 2025-12-04T09:55:32.7381719Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7382067Z inductor [] 2025-12-04T09:55:32.7382287Z graph_break [] 2025-12-04T09:55:32.7382744Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7384635Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7384880Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7386501Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7386729Z warnings.warn( 2025-12-04T09:55:32.7387188Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7387405Z unimplemented [] 2025-12-04T09:55:32.7387915Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7388128Z inductor [] 2025-12-04T09:55:32.7388356Z graph_break [] 2025-12-04T09:55:32.7388819Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7390537Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7390796Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7392413Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7392627Z warnings.warn( 2025-12-04T09:55:32.7392944Z =================================== FAILURES =================================== 2025-12-04T09:55:32.7393672Z ____ AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda _____ 2025-12-04T09:55:32.7393949Z Traceback (most recent call last): 2025-12-04T09:55:32.7394832Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7395051Z return value(self) 2025-12-04T09:55:32.7396189Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 1916, in test_size_with_unbacked_add_expr 2025-12-04T09:55:32.7396642Z self.check_model(Repro(), example_inputs, dynamic_shapes=spec) 2025-12-04T09:55:32.7397611Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 252, in check_model 2025-12-04T09:55:32.7397864Z actual = AOTIRunnerUtil.run( 2025-12-04T09:55:32.7398717Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 184, in run 2025-12-04T09:55:32.7399029Z package_path = AOTIRunnerUtil.compile( 2025-12-04T09:55:32.7399931Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 172, in compile 2025-12-04T09:55:32.7400352Z package_path = torch._inductor.aoti_compile_and_package( 2025-12-04T09:55:32.7401537Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 151, in aoti_compile_and_package 2025-12-04T09:55:32.7401809Z return aot_inductor_minifier_wrapper( 2025-12-04T09:55:32.7403011Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1336, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7403209Z raise e 2025-12-04T09:55:32.7404405Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/debug.py", line 1306, in aot_inductor_minifier_wrapper 2025-12-04T09:55:32.7404623Z return func( 2025-12-04T09:55:32.7405836Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 195, in _aoti_compile_and_package_inner 2025-12-04T09:55:32.7406331Z aoti_files = aot_compile(gm, args, kwargs, options=inductor_configs) 2025-12-04T09:55:32.7407369Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7407605Z return compile_fx_aot( 2025-12-04T09:55:32.7408902Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7409140Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7410312Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7410551Z return compile_fx( 2025-12-04T09:55:32.7411578Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7411859Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7413152Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7413390Z return _compile_fx_main( 2025-12-04T09:55:32.7414523Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7414954Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7416090Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7416423Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7417537Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7417795Z return compile_fx_forward( 2025-12-04T09:55:32.7419026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7419252Z return inner_compile( 2025-12-04T09:55:32.7419875Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7420113Z return func(*args, **kwds) 2025-12-04T09:55:32.7421220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7421815Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7422913Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7423298Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7424417Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7424821Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7425948Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7426251Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7427455Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7428165Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7429317Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7429605Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7430824Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7431073Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7432159Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7432453Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7433008Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7433021Z 2025-12-04T09:55:32.7433480Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7434817Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7434827Z 2025-12-04T09:55:32.7435393Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7435727Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7435896Z unimplemented [] 2025-12-04T09:55:32.7436149Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7436307Z inductor [] 2025-12-04T09:55:32.7436468Z graph_break [] 2025-12-04T09:55:32.7436869Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7438343Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7438565Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7439893Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7440097Z warnings.warn( 2025-12-04T09:55:32.7440498Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7440691Z unimplemented [] 2025-12-04T09:55:32.7440982Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7441157Z inductor [] 2025-12-04T09:55:32.7441322Z graph_break [] 2025-12-04T09:55:32.7441742Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7443082Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7443282Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7444701Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7444859Z warnings.warn( 2025-12-04T09:55:32.7445238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7445407Z unimplemented [] 2025-12-04T09:55:32.7445651Z stats [('calls_captured', 9), ('unique_graphs', 1)] 2025-12-04T09:55:32.7445803Z inductor [] 2025-12-04T09:55:32.7445957Z graph_break [] 2025-12-04T09:55:32.7446353Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7447881Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7448078Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7449511Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7449709Z warnings.warn( 2025-12-04T09:55:32.7451114Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-446d02039a2c0737.xml - 2025-12-04T09:55:32.7451460Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7453016Z FAILED [0.4823s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7453030Z 2025-12-04T09:55:32.7453453Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7454711Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7454721Z 2025-12-04T09:55:32.7455097Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7455413Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7455620Z ================== 1 failed, 153 deselected, 2 rerun in 2.48s ================== 2025-12-04T09:55:32.7455805Z Got exit code 1 2025-12-04T09:55:32.7456400Z FAILED CONSISTENTLY: test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda 2025-12-04T09:55:32.7456815Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T09:55:32.7457280Z W1204 09:52:43.825000 10881 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7457851Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b3aa6989e65545bb.xml 2025-12-04T09:55:32.7458032Z ============================= test session starts ============================== 2025-12-04T09:55:32.7458497Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7458612Z cachedir: .pytest_cache 2025-12-04T09:55:32.7459155Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7459289Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7459402Z configfile: pytest.ini 2025-12-04T09:55:32.7459968Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7460196Z collecting ... collected 934 items / 83 deselected / 851 selected 2025-12-04T09:55:32.7460355Z stepcurrent: skipping 83 already run items. 2025-12-04T09:55:32.7460472Z Running 71 items in this shard 2025-12-04T09:55:32.7460478Z 2025-12-04T09:55:32.7461184Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_so_without_weight_cuda <- test/inductor/test_torchinductor.py PASSED [14.2085s] [ 1%] 2025-12-04T09:55:32.7461719Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sym_i64_input_codegen_cuda PASSED [11.7136s] [ 2%] 2025-12-04T09:55:32.7462393Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_symfloat_item_cuda <- test/inductor/test_torchinductor.py PASSED [4.8868s] [ 4%] 2025-12-04T09:55:32.7463259Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_autotuning_cuda W1204 09:53:16.528000 10881 site-packages/torch/_export/__init__.py:71] +============================+ 2025-12-04T09:55:32.7463621Z W1204 09:53:16.529000 10881 site-packages/torch/_export/__init__.py:72] | !!! WARNING !!! | 2025-12-04T09:55:32.7463974Z W1204 09:53:16.529000 10881 site-packages/torch/_export/__init__.py:73] +============================+ 2025-12-04T09:55:32.7465225Z W1204 09:53:16.529000 10881 site-packages/torch/_export/__init__.py:74] torch._export.aot_compile()/torch._export.aot_load() is being deprecated, please switch to directly calling torch._inductor.aoti_compile_and_package(torch.export.export())/torch._inductor.aoti_load_package() instead. 2025-12-04T09:55:32.7465332Z PASSED [9.9238s] [ 5%] 2025-12-04T09:55:32.7466076Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_bool_param_cuda <- test/inductor/test_torchinductor.py PASSED [6.1652s] [ 7%] 2025-12-04T09:55:32.7466628Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_dynamic_grid_cuda PASSED [6.7326s] [ 8%] 2025-12-04T09:55:32.7467378Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_equal_to_1_arg_cuda <- test/inductor/test_torchinductor.py PASSED [5.9283s] [ 9%] 2025-12-04T09:55:32.7468092Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_cuda PASSED [6.2507s] [ 11%] 2025-12-04T09:55:32.7468895Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_cuda PASSED [6.0381s] [ 12%] 2025-12-04T09:55:32.7469620Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_cuda PASSED [5.7256s] [ 14%] 2025-12-04T09:55:32.7470665Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_cuda SKIPPED [0.0031s] (requires triton.tools.experimental_descriptor TMA support) [ 15%] 2025-12-04T09:55:32.7471649Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_new_cuda SKIPPED [0.0028s] (requires triton.tools.tensor_descriptor TMA support) [ 16%] 2025-12-04T09:55:32.7472349Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_cuda PASSED [7.3886s] [ 18%] 2025-12-04T09:55:32.7473590Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py W1204 09:54:10.869000 10881 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7473733Z ('RERUN', {'yellow': True}) [0.2042s] [ 19%] 2025-12-04T09:55:32.7474594Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1599s] [ 19%] 2025-12-04T09:55:32.7475370Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py FAILED [0.1563s] [ 19%] 2025-12-04T09:55:32.7475376Z 2025-12-04T09:55:32.7475523Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.7475873Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7475999Z Traceback (most recent call last): 2025-12-04T09:55:32.7476411Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7476528Z return value(self) 2025-12-04T09:55:32.7477054Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7477200Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7477711Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7477824Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7478282Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7478654Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7479117Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7479247Z return compile_fx_aot( 2025-12-04T09:55:32.7479746Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7479887Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7480363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7480468Z return compile_fx( 2025-12-04T09:55:32.7480948Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7481083Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7481663Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7481788Z return _compile_fx_main( 2025-12-04T09:55:32.7482364Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7482583Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7483108Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7483316Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7483834Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7483951Z return compile_fx_forward( 2025-12-04T09:55:32.7484480Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7484590Z return inner_compile( 2025-12-04T09:55:32.7484871Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7484995Z return func(*args, **kwds) 2025-12-04T09:55:32.7485501Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7485766Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7486277Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7486456Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7486975Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7487167Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7487669Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7488023Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7488567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7488905Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7489424Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7489557Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7490120Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7490241Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7490723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7490878Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7491139Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7491145Z 2025-12-04T09:55:32.7491379Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7492064Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7492075Z 2025-12-04T09:55:32.7492349Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7492591Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7492698Z unimplemented [] 2025-12-04T09:55:32.7492871Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7492969Z inductor [] 2025-12-04T09:55:32.7493068Z graph_break [] 2025-12-04T09:55:32.7493224Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7493443Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7494382Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7494512Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7495251Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7495440Z warnings.warn( 2025-12-04T09:55:32.7495782Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7495909Z Traceback (most recent call last): 2025-12-04T09:55:32.7496327Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7496436Z return value(self) 2025-12-04T09:55:32.7496960Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7497102Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7497611Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7497738Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7498184Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7498654Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7499132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7499247Z return compile_fx_aot( 2025-12-04T09:55:32.7499743Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7499883Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7500357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7500488Z return compile_fx( 2025-12-04T09:55:32.7500964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7501099Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7501699Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7501817Z return _compile_fx_main( 2025-12-04T09:55:32.7502338Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7502541Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7503068Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7503232Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7503737Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7503860Z return compile_fx_forward( 2025-12-04T09:55:32.7504396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7504514Z return inner_compile( 2025-12-04T09:55:32.7504812Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7504930Z return func(*args, **kwds) 2025-12-04T09:55:32.7505431Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7505713Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7506209Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7506385Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7507036Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7507230Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7507747Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7507969Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7508506Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7508848Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7509372Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7509513Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7510069Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7510194Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7510694Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7510842Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7511102Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7511122Z 2025-12-04T09:55:32.7511341Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7512020Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7512027Z 2025-12-04T09:55:32.7512312Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7512535Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7512642Z unimplemented [] 2025-12-04T09:55:32.7512821Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7512919Z inductor [] 2025-12-04T09:55:32.7513035Z graph_break [] 2025-12-04T09:55:32.7513178Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7513401Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7514233Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7514352Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7515083Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7515196Z warnings.warn( 2025-12-04T09:55:32.7515413Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7515528Z unimplemented [] 2025-12-04T09:55:32.7515688Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7515783Z inductor [] 2025-12-04T09:55:32.7515893Z graph_break [] 2025-12-04T09:55:32.7516035Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7516255Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7517081Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7517197Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7517943Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7518045Z warnings.warn( 2025-12-04T09:55:32.7518192Z =================================== FAILURES =================================== 2025-12-04T09:55:32.7518598Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7518723Z Traceback (most recent call last): 2025-12-04T09:55:32.7519133Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7519308Z return value(self) 2025-12-04T09:55:32.7519834Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7519978Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7520481Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7520593Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7521053Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7521435Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7521901Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7522030Z return compile_fx_aot( 2025-12-04T09:55:32.7522531Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7522667Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7523138Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7523247Z return compile_fx( 2025-12-04T09:55:32.7523733Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7523868Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7524457Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7524578Z return _compile_fx_main( 2025-12-04T09:55:32.7525083Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7525301Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7525826Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7525975Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7526494Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7526612Z return compile_fx_forward( 2025-12-04T09:55:32.7527147Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7527256Z return inner_compile( 2025-12-04T09:55:32.7527537Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7527667Z return func(*args, **kwds) 2025-12-04T09:55:32.7528166Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7528435Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7528941Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7529115Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7529632Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7529828Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7530331Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7530493Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7531082Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7531427Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7532006Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7532134Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7532698Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7532820Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7533306Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7533464Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7533723Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7533729Z 2025-12-04T09:55:32.7533959Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7534634Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7534644Z 2025-12-04T09:55:32.7534912Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7535145Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7535252Z unimplemented [] 2025-12-04T09:55:32.7535422Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7535519Z inductor [] 2025-12-04T09:55:32.7535618Z graph_break [] 2025-12-04T09:55:32.7535775Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7535993Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7536817Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7536948Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7537679Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7537794Z warnings.warn( 2025-12-04T09:55:32.7538013Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7538115Z unimplemented [] 2025-12-04T09:55:32.7538285Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7538474Z inductor [] 2025-12-04T09:55:32.7538578Z graph_break [] 2025-12-04T09:55:32.7538736Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7538954Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7539797Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7539917Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7540650Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7540764Z warnings.warn( 2025-12-04T09:55:32.7540984Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7541087Z unimplemented [] 2025-12-04T09:55:32.7541262Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7541358Z inductor [] 2025-12-04T09:55:32.7541456Z graph_break [] 2025-12-04T09:55:32.7541610Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7541826Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7542717Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7542889Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7543619Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7543734Z warnings.warn( 2025-12-04T09:55:32.7544487Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b3aa6989e65545bb.xml - 2025-12-04T09:55:32.7544672Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7545553Z FAILED [0.1563s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7545559Z 2025-12-04T09:55:32.7545783Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7546470Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7546480Z 2025-12-04T09:55:32.7546751Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7546946Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7547198Z == 1 failed, 11 passed, 2 skipped, 83 deselected, 2 rerun in 85.59s (0:01:25) == 2025-12-04T09:55:32.7547298Z Got exit code 1 2025-12-04T09:55:32.7547419Z Retrying single test... 2025-12-04T09:55:32.7547876Z W1204 09:54:22.170000 13393 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7548474Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-dc71db8b83a58b10.xml 2025-12-04T09:55:32.7548641Z ============================= test session starts ============================== 2025-12-04T09:55:32.7548999Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7549131Z cachedir: .pytest_cache 2025-12-04T09:55:32.7549656Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7549788Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7549911Z configfile: pytest.ini 2025-12-04T09:55:32.7550462Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7550707Z collecting ... collected 934 items / 153 deselected / 781 selected 2025-12-04T09:55:32.7551471Z stepcurrent: skipping 96 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7551591Z Running 1 items in this shard 2025-12-04T09:55:32.7551596Z 2025-12-04T09:55:32.7552830Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py W1204 09:54:25.006000 13393 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7552971Z ('RERUN', {'yellow': True}) [2.9117s] [100%] 2025-12-04T09:55:32.7553842Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1615s] [100%] 2025-12-04T09:55:32.7554600Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py FAILED [0.1592s] [100%] 2025-12-04T09:55:32.7554605Z 2025-12-04T09:55:32.7554751Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.7555158Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7555285Z Traceback (most recent call last): 2025-12-04T09:55:32.7555776Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7555881Z return value(self) 2025-12-04T09:55:32.7556406Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7556550Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7557060Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7557187Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7557633Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7558014Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7558491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7558609Z return compile_fx_aot( 2025-12-04T09:55:32.7559104Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7566394Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7566927Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7567055Z return compile_fx( 2025-12-04T09:55:32.7567536Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7567673Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7568289Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7568409Z return _compile_fx_main( 2025-12-04T09:55:32.7568918Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7569145Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7569674Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7569838Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7570341Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7570459Z return compile_fx_forward( 2025-12-04T09:55:32.7570991Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7571099Z return inner_compile( 2025-12-04T09:55:32.7571398Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7571514Z return func(*args, **kwds) 2025-12-04T09:55:32.7572015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7572301Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7572797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7572969Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7573485Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7573679Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7574195Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7574467Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7575009Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7575425Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7575952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7576094Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7576646Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7576771Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7577269Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7577412Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7577675Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7577696Z 2025-12-04T09:55:32.7577914Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7578702Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7578710Z 2025-12-04T09:55:32.7578999Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7579228Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7579335Z unimplemented [] 2025-12-04T09:55:32.7579509Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7579607Z inductor [] 2025-12-04T09:55:32.7579716Z graph_break [] 2025-12-04T09:55:32.7579863Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7580083Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7580928Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7581049Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7581780Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7581890Z warnings.warn( 2025-12-04T09:55:32.7582226Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7582362Z Traceback (most recent call last): 2025-12-04T09:55:32.7582772Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7582878Z return value(self) 2025-12-04T09:55:32.7583418Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7583553Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7584059Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7584186Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7584630Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7585013Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7585477Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7585589Z return compile_fx_aot( 2025-12-04T09:55:32.7586094Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7586212Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7586770Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7586878Z return compile_fx( 2025-12-04T09:55:32.7587346Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7587551Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7588315Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7588433Z return _compile_fx_main( 2025-12-04T09:55:32.7588950Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7589151Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7589687Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7589842Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7590348Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7590481Z return compile_fx_forward( 2025-12-04T09:55:32.7591001Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7591125Z return inner_compile( 2025-12-04T09:55:32.7591408Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7591524Z return func(*args, **kwds) 2025-12-04T09:55:32.7592039Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7592309Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7592808Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7592994Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7593499Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7593709Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7594212Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7594356Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7594906Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7595229Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7595751Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7595896Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7596448Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7596588Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7597075Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7597218Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7597487Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7597493Z 2025-12-04T09:55:32.7597710Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7598400Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7598406Z 2025-12-04T09:55:32.7598783Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7599011Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7599133Z unimplemented [] 2025-12-04T09:55:32.7599368Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7599464Z inductor [] 2025-12-04T09:55:32.7599572Z graph_break [] 2025-12-04T09:55:32.7599717Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7599949Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7600767Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7600885Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7601631Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7601739Z warnings.warn( 2025-12-04T09:55:32.7601973Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7602076Z unimplemented [] 2025-12-04T09:55:32.7602237Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7602346Z inductor [] 2025-12-04T09:55:32.7602443Z graph_break [] 2025-12-04T09:55:32.7602584Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7602813Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7603626Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7603741Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7604485Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7604590Z warnings.warn( 2025-12-04T09:55:32.7604752Z =================================== FAILURES =================================== 2025-12-04T09:55:32.7605084Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7605210Z Traceback (most recent call last): 2025-12-04T09:55:32.7605624Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7605728Z return value(self) 2025-12-04T09:55:32.7606253Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7606392Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7606897Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7607022Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7607469Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7607845Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7608324Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7608434Z return compile_fx_aot( 2025-12-04T09:55:32.7608938Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7609061Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7609531Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7609646Z return compile_fx( 2025-12-04T09:55:32.7610116Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7610251Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7610925Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7611043Z return _compile_fx_main( 2025-12-04T09:55:32.7611628Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7611829Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7612358Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7612520Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7613026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7613159Z return compile_fx_forward( 2025-12-04T09:55:32.7613686Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7613796Z return inner_compile( 2025-12-04T09:55:32.7614089Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7614207Z return func(*args, **kwds) 2025-12-04T09:55:32.7614706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7614984Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7615481Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7615670Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7616172Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7616363Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7616880Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7617025Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7617568Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7617899Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7618515Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7618660Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7619208Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7619330Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7619833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7619973Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7620241Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7620252Z 2025-12-04T09:55:32.7620469Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7621145Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7621151Z 2025-12-04T09:55:32.7621433Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7621655Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7621770Z unimplemented [] 2025-12-04T09:55:32.7621930Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7622026Z inductor [] 2025-12-04T09:55:32.7622136Z graph_break [] 2025-12-04T09:55:32.7622347Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7622568Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7623401Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7623576Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7624321Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7624426Z warnings.warn( 2025-12-04T09:55:32.7624646Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7624765Z unimplemented [] 2025-12-04T09:55:32.7624924Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7625020Z inductor [] 2025-12-04T09:55:32.7625130Z graph_break [] 2025-12-04T09:55:32.7625277Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7625492Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7626319Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7626436Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7627177Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7627278Z warnings.warn( 2025-12-04T09:55:32.7627496Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7627610Z unimplemented [] 2025-12-04T09:55:32.7627767Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7627861Z inductor [] 2025-12-04T09:55:32.7627967Z graph_break [] 2025-12-04T09:55:32.7628112Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7628338Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7629147Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7629266Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7630002Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7630102Z warnings.warn( 2025-12-04T09:55:32.7630856Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-dc71db8b83a58b10.xml - 2025-12-04T09:55:32.7631040Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7631920Z FAILED [0.1592s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7631930Z 2025-12-04T09:55:32.7632160Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7632834Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7632840Z 2025-12-04T09:55:32.7633117Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7633298Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7633498Z ================== 1 failed, 153 deselected, 2 rerun in 3.32s ================== 2025-12-04T09:55:32.7633610Z Got exit code 1 2025-12-04T09:55:32.7633717Z Retrying single test... 2025-12-04T09:55:32.7634225Z W1204 09:54:40.900000 13562 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7634809Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-d1d71be98185ea4a.xml 2025-12-04T09:55:32.7635031Z ============================= test session starts ============================== 2025-12-04T09:55:32.7635396Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7635508Z cachedir: .pytest_cache 2025-12-04T09:55:32.7636033Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7636170Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7636276Z configfile: pytest.ini 2025-12-04T09:55:32.7636821Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7637068Z collecting ... collected 934 items / 153 deselected / 781 selected 2025-12-04T09:55:32.7637830Z stepcurrent: skipping 96 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7637965Z Running 1 items in this shard 2025-12-04T09:55:32.7637971Z 2025-12-04T09:55:32.7639189Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py W1204 09:54:43.708000 13562 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7639335Z ('RERUN', {'yellow': True}) [2.8955s] [100%] 2025-12-04T09:55:32.7640185Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1616s] [100%] 2025-12-04T09:55:32.7640946Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda <- test/inductor/test_torchinductor.py FAILED [0.1593s] [100%] 2025-12-04T09:55:32.7640952Z 2025-12-04T09:55:32.7641105Z ==================================== RERUNS ==================================== 2025-12-04T09:55:32.7641445Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7641580Z Traceback (most recent call last): 2025-12-04T09:55:32.7641985Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7642090Z return value(self) 2025-12-04T09:55:32.7642628Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7642756Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7643268Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7643392Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7643841Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7644226Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7644695Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7644806Z return compile_fx_aot( 2025-12-04T09:55:32.7645313Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7645435Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7645921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7646026Z return compile_fx( 2025-12-04T09:55:32.7646497Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7646705Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7647287Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7647459Z return _compile_fx_main( 2025-12-04T09:55:32.7647977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7648179Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7648717Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7648865Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7649368Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7649495Z return compile_fx_forward( 2025-12-04T09:55:32.7650015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7650125Z return inner_compile( 2025-12-04T09:55:32.7650418Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7650537Z return func(*args, **kwds) 2025-12-04T09:55:32.7651049Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7651319Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7651816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7652006Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7652515Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7652723Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7653224Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7653373Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7653921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7654241Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7654763Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7654905Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7655455Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7655587Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7656074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7656215Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7656488Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7656494Z 2025-12-04T09:55:32.7656711Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7657399Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7657405Z 2025-12-04T09:55:32.7657672Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7657891Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7658005Z unimplemented [] 2025-12-04T09:55:32.7658160Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7658257Z inductor [] 2025-12-04T09:55:32.7658531Z graph_break [] 2025-12-04T09:55:32.7658680Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7658913Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7659793Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7659909Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7660655Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7660756Z warnings.warn( 2025-12-04T09:55:32.7661092Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7661226Z Traceback (most recent call last): 2025-12-04T09:55:32.7661627Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7661746Z return value(self) 2025-12-04T09:55:32.7662271Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7662403Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7662920Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7663034Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7663476Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7663858Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7664319Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7664443Z return compile_fx_aot( 2025-12-04T09:55:32.7664941Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7665068Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7665551Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7665660Z return compile_fx( 2025-12-04T09:55:32.7666141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7666273Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7666850Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7666975Z return _compile_fx_main( 2025-12-04T09:55:32.7667481Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7667685Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7668219Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7668372Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7668893Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7669007Z return compile_fx_forward( 2025-12-04T09:55:32.7669529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7669654Z return inner_compile( 2025-12-04T09:55:32.7669933Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7670044Z return func(*args, **kwds) 2025-12-04T09:55:32.7670553Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7670878Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7671388Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7671620Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7672122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7672322Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7672828Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7672986Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7673519Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7673846Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7674380Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7674512Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7675083Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7675205Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7675688Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7675840Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7676096Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7676101Z 2025-12-04T09:55:32.7676320Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7677010Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7677016Z 2025-12-04T09:55:32.7677287Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7677524Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7677629Z unimplemented [] 2025-12-04T09:55:32.7677786Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7677892Z inductor [] 2025-12-04T09:55:32.7677989Z graph_break [] 2025-12-04T09:55:32.7678131Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7678360Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7679184Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7679310Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7680047Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7680152Z warnings.warn( 2025-12-04T09:55:32.7680384Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7680487Z unimplemented [] 2025-12-04T09:55:32.7680642Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7680748Z inductor [] 2025-12-04T09:55:32.7680842Z graph_break [] 2025-12-04T09:55:32.7680999Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7681215Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7682031Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7682160Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7682967Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7683128Z warnings.warn( 2025-12-04T09:55:32.7683286Z =================================== FAILURES =================================== 2025-12-04T09:55:32.7683619Z ____ AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda ____ 2025-12-04T09:55:32.7683757Z Traceback (most recent call last): 2025-12-04T09:55:32.7684162Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T09:55:32.7684265Z return value(self) 2025-12-04T09:55:32.7684804Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor.py", line 7702, in test_unbounded_expr_substitutions 2025-12-04T09:55:32.7684931Z so_path, code = run_and_get_cpp_code( 2025-12-04T09:55:32.7685441Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 3199, in run_and_get_cpp_code 2025-12-04T09:55:32.7685570Z result = fn(*args, **kwargs) 2025-12-04T09:55:32.7686014Z File "/var/lib/jenkins/workspace/test/inductor/test_aot_inductor_utils.py", line 76, in legacy_compile 2025-12-04T09:55:32.7686405Z so_path = torch._inductor.aot_compile(gm, example_inputs, options=options) # type: ignore[arg-type] 2025-12-04T09:55:32.7686870Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T09:55:32.7686984Z return compile_fx_aot( 2025-12-04T09:55:32.7687493Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T09:55:32.7687615Z compiled_artifacts = compile_fx( 2025-12-04T09:55:32.7688347Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T09:55:32.7688455Z return compile_fx( 2025-12-04T09:55:32.7688932Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T09:55:32.7689082Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T09:55:32.7689666Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T09:55:32.7689779Z return _compile_fx_main( 2025-12-04T09:55:32.7690296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T09:55:32.7690497Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T09:55:32.7691030Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T09:55:32.7691180Z return self.compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7691688Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T09:55:32.7691814Z return compile_fx_forward( 2025-12-04T09:55:32.7692335Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T09:55:32.7692460Z return inner_compile( 2025-12-04T09:55:32.7692741Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T09:55:32.7692853Z return func(*args, **kwds) 2025-12-04T09:55:32.7693360Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T09:55:32.7693626Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T09:55:32.7694122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T09:55:32.7694305Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T09:55:32.7694925Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T09:55:32.7695132Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T09:55:32.7695801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T09:55:32.7695946Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T09:55:32.7696497Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T09:55:32.7696821Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T09:55:32.7697355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T09:55:32.7697481Z _check_triton_bf16_support(graph) 2025-12-04T09:55:32.7698034Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T09:55:32.7698168Z warn_and_skip(node.get_device()) 2025-12-04T09:55:32.7698755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T09:55:32.7698901Z raise SkipFrame("BF16 is not supported") 2025-12-04T09:55:32.7699172Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7699177Z 2025-12-04T09:55:32.7699395Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7700084Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7700089Z 2025-12-04T09:55:32.7700357Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7700581Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7700696Z unimplemented [] 2025-12-04T09:55:32.7700858Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7700954Z inductor [] 2025-12-04T09:55:32.7701066Z graph_break [] 2025-12-04T09:55:32.7701213Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7701442Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7702265Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7702381Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7703128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7703241Z warnings.warn( 2025-12-04T09:55:32.7703462Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7703566Z unimplemented [] 2025-12-04T09:55:32.7703746Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7703842Z inductor [] 2025-12-04T09:55:32.7703939Z graph_break [] 2025-12-04T09:55:32.7704100Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7704315Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7705131Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7705261Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7705990Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7706106Z warnings.warn( 2025-12-04T09:55:32.7706323Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T09:55:32.7706493Z unimplemented [] 2025-12-04T09:55:32.7706664Z stats [('calls_captured', 5), ('unique_graphs', 1)] 2025-12-04T09:55:32.7706759Z inductor [] 2025-12-04T09:55:32.7706855Z graph_break [] 2025-12-04T09:55:32.7707066Z aten_mm_info [('aten.mm_Max(15, u0)_16_64', 1)] 2025-12-04T09:55:32.7707279Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T09:55:32.7708101Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T09:55:32.7708217Z return cls.__new__(cls, *args) 2025-12-04T09:55:32.7708945Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T09:55:32.7709059Z warnings.warn( 2025-12-04T09:55:32.7709816Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-d1d71be98185ea4a.xml - 2025-12-04T09:55:32.7710005Z =========================== short test summary info ============================ 2025-12-04T09:55:32.7710885Z FAILED [0.1593s] inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T09:55:32.7710896Z 2025-12-04T09:55:32.7711114Z To execute this test, run the following from the base repo dir: 2025-12-04T09:55:32.7711802Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_aot_inductor.py AOTInductorTestABICompatibleGpu.test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7711808Z 2025-12-04T09:55:32.7712075Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T09:55:32.7712267Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T09:55:32.7712472Z ================== 1 failed, 153 deselected, 2 rerun in 3.30s ================== 2025-12-04T09:55:32.7712574Z Got exit code 1 2025-12-04T09:55:32.7713177Z FAILED CONSISTENTLY: test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda 2025-12-04T09:55:32.7713594Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T09:55:32.7714040Z W1204 09:54:59.896000 13731 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7714622Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-ec873e76fad9978d.xml 2025-12-04T09:55:32.7714785Z ============================= test session starts ============================== 2025-12-04T09:55:32.7715155Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T09:55:32.7715264Z cachedir: .pytest_cache 2025-12-04T09:55:32.7715792Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T09:55:32.7715930Z rootdir: /var/lib/jenkins/workspace 2025-12-04T09:55:32.7716037Z configfile: pytest.ini 2025-12-04T09:55:32.7716600Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T09:55:32.7716826Z collecting ... collected 934 items / 97 deselected / 837 selected 2025-12-04T09:55:32.7716968Z stepcurrent: skipping 97 already run items. 2025-12-04T09:55:32.7717096Z Running 57 items in this shard 2025-12-04T09:55:32.7717101Z 2025-12-04T09:55:32.7717832Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_update_constant_buffer_cuda <- test/inductor/test_torchinductor.py PASSED [9.1854s] [ 1%] 2025-12-04T09:55:32.7719260Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_nested_cuda <- test/inductor/test_torchinductor.py W1204 09:55:11.292000 13731 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7719928Z W1204 09:55:11.292000 13731 site-packages/torch/export/dynamic_shapes.py:923] Using None as a dynamic shape dimension is deprecated. Please use Dim.STATIC instead 2025-12-04T09:55:32.7720094Z PASSED [7.3175s] [ 3%] 2025-12-04T09:55:32.7721219Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_with_offset_cuda <- test/inductor/test_torchinductor.py W1204 09:55:18.683000 13731 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T09:55:32.7721321Z PASSED [6.0019s] [ 5%] 2025-12-04T09:55:32.7722089Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_grid_with_unbacked_symbols_cuda <- test/inductor/test_torchinductor.py PASSED [6.3327s] [ 7%] 2025-12-04T09:55:32.7723003Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_constant_tensor_name_collision_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0006s] (No MPS backend available) [ 8%] 2025-12-04T09:55:32.7723873Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printer_sym_inputs_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 10%] 2025-12-04T09:55:32.7724631Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_profiler_enable_kernel_profile_True_mps SKIPPED [0.0002s] (No MPS backend available) [ 12%] 2025-12-04T09:55:32.7725375Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_autotune_int64_user_defined_triton_kernel_mps SKIPPED [0.0002s] (No MPS backend available) [ 14%] 2025-12-04T09:55:32.7726169Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_bool_input_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 15%] 2025-12-04T09:55:32.7726986Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_boolean_indexing_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 17%] 2025-12-04T09:55:32.7727723Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_and_force_mmap_weights_mps SKIPPED [0.0002s] (No MPS backend available) [ 19%] 2025-12-04T09:55:32.7728513Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_cpu_predicate_cuda_operands_max_autotune_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 21%] 2025-12-04T09:55:32.7729264Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_mismatched_branch_output_dynamic_False_mps SKIPPED [0.0004s] (No MPS backend available) [ 22%] 2025-12-04T09:55:32.7730007Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_non_tensor_predicates_dynamic_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 24%] 2025-12-04T09:55:32.7730586Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_conv3d_mps SKIPPED [0.0002s] (No MPS backend available) [ 26%] 2025-12-04T09:55:32.7731393Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_conv_freezing_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 28%] 2025-12-04T09:55:32.7732268Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_duplicate_constant_folding_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 29%] 2025-12-04T09:55:32.7733098Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_duplicated_params_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0004s] (No MPS backend available) [ 31%] 2025-12-04T09:55:32.7733950Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_empty_cat_dtype_promotion_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 33%] 2025-12-04T09:55:32.7734783Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_freezing_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 35%] 2025-12-04T09:55:32.7735590Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_int_list_input_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 36%] 2025-12-04T09:55:32.7736329Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_linear_dynamic_maxautotune_mps SKIPPED [0.0002s] (No MPS backend available) [ 38%] 2025-12-04T09:55:32.7737164Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misaligned_input_1_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 40%] 2025-12-04T09:55:32.7737985Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misaligned_input_2_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 42%] 2025-12-04T09:55:32.7738757Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misc_1_max_autotune_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 43%] 2025-12-04T09:55:32.7739556Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_missing_cubin_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 45%] 2025-12-04T09:55:32.7740403Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_model_modified_weights_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 47%] 2025-12-04T09:55:32.7741250Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_multiple_output_alias_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 49%] 2025-12-04T09:55:32.7741902Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_non_default_gpu_device_mps SKIPPED [0.0002s] (No MPS backend available) [ 50%] 2025-12-04T09:55:32.7742522Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_on_gpu_device1_mps SKIPPED [0.0002s] (No MPS backend available) [ 52%] 2025-12-04T09:55:32.7743393Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_profile_benchmark_harness_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 54%] 2025-12-04T09:55:32.7744250Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_mps SKIPPED [0.0012s] (No MPS backend available) [ 56%] 2025-12-04T09:55:32.7745059Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_return_constant_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 57%] 2025-12-04T09:55:32.7745850Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_reuse_kernel_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 59%] 2025-12-04T09:55:32.7746475Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_mps SKIPPED [0.0002s] (No MPS backend available) [ 61%] 2025-12-04T09:55:32.7747340Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_with_unbacked_add_expr_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 63%] 2025-12-04T09:55:32.7748125Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_dynamic_launcher_grid_infer_from_tensor_mps SKIPPED [0.0002s] (No MPS backend available) [ 64%] 2025-12-04T09:55:32.7748796Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_dynamic_grid_mps SKIPPED [0.0002s] (No MPS backend available) [ 66%] 2025-12-04T09:55:32.7749569Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_equal_to_1_float_arg_dynamic_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 68%] 2025-12-04T09:55:32.7750262Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_extern_kernel_arg_mps SKIPPED [0.0002s] (No MPS backend available) [ 70%] 2025-12-04T09:55:32.7751157Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 71%] 2025-12-04T09:55:32.7752119Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 73%] 2025-12-04T09:55:32.7752944Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_mps SKIPPED [0.0004s] (No MPS backend available) [ 75%] 2025-12-04T09:55:32.7753791Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 77%] 2025-12-04T09:55:32.7754626Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 78%] 2025-12-04T09:55:32.7755466Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_True_mps SKIPPED [0.0002s] (No MPS backend available) [ 80%] 2025-12-04T09:55:32.7756303Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_mps SKIPPED [0.0002s] (No MPS backend available) [ 82%] 2025-12-04T09:55:32.7757175Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_sympy_fn_like_arg_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0004s] (No MPS backend available) [ 84%] 2025-12-04T09:55:32.7758018Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_mps SKIPPED [0.0002s] (No MPS backend available) [ 85%] 2025-12-04T09:55:32.7758851Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_mps SKIPPED [0.0002s] (No MPS backend available) [ 87%] 2025-12-04T09:55:32.7759728Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_mps SKIPPED [0.0002s] (No MPS backend available) [ 89%] 2025-12-04T09:55:32.7760588Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 91%] 2025-12-04T09:55:32.7761417Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_2_use_static_size_False_mps SKIPPED [0.0004s] (No MPS backend available) [ 92%] 2025-12-04T09:55:32.7762112Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_conv_dynamic_True_mps SKIPPED [0.0002s] (No MPS backend available) [ 94%] 2025-12-04T09:55:32.7762962Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_outer_code_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [ 96%] 2025-12-04T09:55:32.7763715Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_sym_expr_cond_dynamic_False_mps SKIPPED [0.0002s] (No MPS backend available) [ 98%] 2025-12-04T09:55:32.7764505Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_with_offset_mps <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (No MPS backend available) [100%] 2025-12-04T09:55:32.7764512Z 2025-12-04T09:55:32.7765272Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-ec873e76fad9978d.xml - 2025-12-04T09:55:32.7765486Z ================ 4 passed, 53 skipped, 97 deselected in 29.00s ================= 2025-12-04T09:55:32.7767456Z The following tests failed consistently: ['test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_cuda', 'test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_cuda', 'test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbounded_expr_substitutions_cuda'] 2025-12-04T09:55:32.7767517Z 2025-12-04T09:55:32.7768078Z FINISHED PRINTING LOG FILE of inductor/test_aot_inductor 2/6 (test/test-reports/inductor.test_aot_inductor_2.6_195f61f6b7f24fa5_.log) 2025-12-04T09:55:32.7768084Z 2025-12-04T09:55:32.7768455Z Finished inductor/test_aot_inductor 2/6 ... [2025-12-04 09:55:32.554836][2536.676052059], took 10.66min 2025-12-04T09:55:32.7769265Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-7ff7771e3c72f123.xml 2025-12-04T09:55:32.8700566Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b312d84b484956a1.xml 2025-12-04T09:55:32.9040272Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-9ee61a34966ac1ed.xml 2025-12-04T09:55:32.9358529Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-31f94497420761eb.xml 2025-12-04T09:55:32.9692957Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b252d51c09f06fd1.xml 2025-12-04T09:55:32.9977768Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-446d02039a2c0737.xml 2025-12-04T09:55:33.0284926Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b3aa6989e65545bb.xml 2025-12-04T09:55:33.0603704Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-dc71db8b83a58b10.xml 2025-12-04T09:55:33.1047113Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-d1d71be98185ea4a.xml 2025-12-04T09:55:33.1376889Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-ec873e76fad9978d.xml 2025-12-04T09:55:33.4110175Z Uploading logs for 57119749259 to S3 2025-12-04T09:55:33.4450266Z Uploading artifacts took 0.27 seconds 2025-12-04T09:55:33.4450688Z inductor/test_aot_inductor 2/6 failed! 2025-12-04T09:55:33.4455757Z Running inductor/test_torchinductor 1/2 ... [2025-12-04 09:55:33.445403][2537.566624017] 2025-12-04T09:55:33.4456349Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T09:55:33.4460538Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:55:33.445814] 2025-12-04T10:04:28.0710625Z 2025-12-04T10:04:28.0711587Z PRINTING LOG FILE of inductor/test_torchinductor 1/2 (test/test-reports/inductor.test_torchinductor_1.2_9aedcc93d70c8453_.log) 2025-12-04T10:04:28.0713022Z W1204 09:55:42.410000 14751 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.0714381Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-bf110f953f5c5355.xml 2025-12-04T10:04:28.0715567Z ============================= test session starts ============================== 2025-12-04T10:04:28.0716629Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.0717561Z cachedir: .pytest_cache 2025-12-04T10:04:28.0720300Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.0721485Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.0722225Z configfile: pytest.ini 2025-12-04T10:04:28.0723566Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.0724685Z collecting ... collected 981 items 2025-12-04T10:04:28.0725155Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:04:28.0910854Z Running 463 items in this shard: test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_int, test/inductor/test_torchinductor.py::GPUTests::test_AllenaiLongformerBase_repro_cuda, test/inductor/test_torchinductor.py::GPUTests::test__dyn_quant_pack_4bit_weight_bf16_cuda, test/inductor/test_torchinductor.py::GPUTests::test__unsafe_masked_index_put_accumulate_cuda, test/inductor/test_torchinductor.py::GPUTests::test_abs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool1d_argmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool_with_output_size_0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_pool_errors_with_long_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex10_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex9_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_const_float_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_inplace_permuted_cuda, test/inductor/test_torchinductor.py::GPUTests::test_addmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_alexnet_prefix_cuda, test/inductor/test_torchinductor.py::GPUTests::test_allow_reuse_active_if_under_peak_cuda, test/inductor/test_torchinductor.py::GPUTests::test_angle_cuda, test/inductor/test_torchinductor.py::GPUTests::test_any_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_with_persistent_cache_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_with_scalar_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_as_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_as_strided_on_views_cuda, test/inductor/test_torchinductor.py::GPUTests::test_assert_alignment_op_name_fail_cuda, test/inductor/test_torchinductor.py::GPUTests::test_assert_size_stride_op_name_fail_cuda, test/inductor/test_torchinductor.py::GPUTests::test_assert_size_stride_op_name_pass_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_batch_norm_2d_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_batch_norm_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bitwise2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bmm2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_both_scalars_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_add_autotune_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_computed_offsets_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_default_kwargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_neg_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_pos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_empty_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_single_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_empty_1d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cauchy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_chunk_recompiles_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clamp_type_promotion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clamp_type_promotion_non_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clone_cuda, test/inductor/test_torchinductor.py::GPUTests::test_compar_cuda, test/inductor/test_torchinductor.py::GPUTests::test_complex_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_complex_from_real_imag_cuda, test/inductor/test_torchinductor.py::GPUTests::test_complex_memory_overlap_cuda, test/inductor/test_torchinductor.py::GPUTests::test_computed_buffer_inlining_cuda, test/inductor/test_torchinductor.py::GPUTests::test_concat_add_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_config_option_dont_assume_alignment_cudagraphs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_config_option_dont_assume_alignment_recompiles_cuda, test/inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumprod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_const_int32_to_float_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_2d_strides_nonpositive_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_3d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_fill_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_nd_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv2d_backward_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv2d_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv3d_channels_last_use_block_ptr_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv3d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_shape_check_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_copy_with_scalar_src_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_cpu_scalar_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_cpu_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_gpu_tensor_cpp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cpu_tensor_with_cpu_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cudnn_rnn_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cummin_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumprod_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_inf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_pattern_matcher_issue_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_unbacked_symints_cuda, test/inductor/test_torchinductor.py::GPUTests::test_data_type_propogation_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dense_mask_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_deterministic_codegen_on_graph_break_cuda, test/inductor/test_torchinductor.py::GPUTests::test_deterministic_codegen_with_suffix_cuda, test/inductor/test_torchinductor.py::GPUTests::test_device_assert_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_by_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_precision_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_presicion_accuracy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dont_constant_fold_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_trivial_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtype_mismatch_issue_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_fusion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_elu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_bag_byte_unpack_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_bag_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_sparse_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_exp2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_exp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expand_as_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expand_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expanded_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_basic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_list_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_with_return_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fft_real_input_real_output_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fill1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float16_to_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float32_to_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float_index_expression_type_promotion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_floordiv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fmod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fmod_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_boolean_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_like_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_like_sliced_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_truncation_cuda, test/inductor/test_torchinductor.py::GPUTests::test_functionalize_rng_wrappers_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fuse_large_params_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fusing_write_into_disjoint_read_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gelu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_generate_rand_fp8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_generated_code_has_alignment_assert_cuda, test/inductor/test_torchinductor.py::GPUTests::test_generated_code_has_size_stride_assert_cuda, test/inductor/test_torchinductor.py::GPUTests::test_glu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gpu_scalar_with_gpu_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_arange2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_argmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_both_scalars_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_constant_tensor1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_mutation_real_name_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_pad_dynamic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_refcount_cuda, test/inductor/test_torchinductor.py::GPUTests::test_graph_partition_scalar_inputs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_grid_sampler_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_grid_sampler_expand_preserves_view_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardtanh_cuda, test/inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_float_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_flip_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_floordiv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_remainder_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_as_masked_fill_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_failed_reinplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_fallback1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_fallback2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_indirect_load_broadcast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inductor_assert_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inductor_layout_optimization_input_mutations_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inductor_triton_bucketize_respects_masking_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_mixed_dtype_ops_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_resize_as_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_where_pointwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_int8_weight_only_quant_cuda, test/inductor/test_torchinductor.py::GPUTests::test_int_input_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_invalid_operand_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_isin_tensor_scalar_cuda, test/inductor/test_torchinductor.py::GPUTests::test_isinf2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_block_sizes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_grid_use_block_ptr_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_offset_pointwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_layer_norm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lerp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear_dynamic_maxautotune_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_list_clearing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_dynamic_shape_assertion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_mode_not_decompose_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lite_regional_compile_invoke_subgraph_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log1p_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logsumexp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_long_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mark_dynamic_with_hint_override_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mark_unbacked_with_hint_override_cuda, test/inductor/test_torchinductor.py::GPUTests::test_masked_fill_cuda, test/inductor/test_torchinductor.py::GPUTests::test_masked_fill_promotion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d6_dilation_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d6_dilation_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mean_cuda, test/inductor/test_torchinductor.py::GPUTests::test_min_max_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_misaligned_address_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_move_arange_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_threading_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_any_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_prime_size_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_sum_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mutations_loop_fusion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_assert_inside_triton_kernel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_True_descending_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_True_descending_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_narrow_cuda, test/inductor/test_torchinductor.py::GPUTests::test_needs_contiguous_strides_cuda, test/inductor/test_torchinductor.py::GPUTests::test_new_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_new_empty_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_new_ones_cuda, test/inductor/test_torchinductor.py::GPUTests::test_one_hot_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pad_cast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pad_single_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pad_view_cuda, test/inductor/test_torchinductor.py::GPUTests::test_philox_rand_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pixel_shuffle_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_airy_ai_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_y0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_y1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_t_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erfcx_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erfinv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_exp2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaln_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_h_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_laguerre_polynomial_l_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_legendre_polynomial_p_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_log1p_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_log_ndtr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_i0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_i1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_k1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_multigammaln_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_ndtri_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_psi_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_round_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_scaled_modified_bessel_k0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_v_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_w_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_spherical_bessel_j0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_zeta_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow_by_natural_log2_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_prod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rand_like_deterministic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_kernel_count_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randn_with_dtype_and_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reflection_pad2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remainder_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_clone_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_copy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_slice1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_as_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_decomposition_has_clamp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_roi_align_cuda, test/inductor/test_torchinductor.py::GPUTests::test_round_correctness_cuda, test/inductor/test_torchinductor.py::GPUTests::test_round_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rsqrt_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scalar_cpu_tensor_arg_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scalar_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_efficient_attention_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_add2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_bf16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scheduler_vertical_fusion1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_cuda, test/inductor/test_torchinductor.py::GPUTests::test_searchsorted_broadcast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_setitem_with_int_parameter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sgn_cuda, test/inductor/test_torchinductor.py::GPUTests::test_shape_prop_torch_ones_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sign_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_simplify_loops_cuda, test/inductor/test_torchinductor.py::GPUTests::test_single_elem_indirect_cuda, test/inductor/test_torchinductor.py::GPUTests::test_size_asserts_for_multi_output_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sizehint_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_view_with_graph_break_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_one_kernel_persist_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sort_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sort_stable_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumprod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_failed_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_reduction_with_int64_size_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_list_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_sizes_with_unbacked_symints_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_unbacked_symints_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sqrt_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_stack_cuda, test/inductor/test_torchinductor.py::GPUTests::test_strided_inputs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum_keepdims_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tan_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tanh_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor_index_put_slice_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_topk_cuda, test/inductor/test_torchinductor.py::GPUTests::test_torch_device_split_cuda, test/inductor/test_torchinductor.py::GPUTests::test_transpose_add_cuda, test/inductor/test_torchinductor.py::GPUTests::test_transpose_cuda, test/inductor/test_torchinductor.py::GPUTests::test_triton_kernel_bool_param_cuda, test/inductor/test_torchinductor.py::GPUTests::test_uint_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unbacked_float_item_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unroll_small_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unsqueeze_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unsqueeze_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_bicubic2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_cat_conv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_var_mean_div_by_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vdd_clamp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vertical_fusion1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_as_real_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_weight_norm_conv2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_where_broadcast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_where_with_logical_op_cuda, test/inductor/test_torchinductor.py::GPUTests::test_zeros_cuda, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_bandwidth_profiler, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_bf16_atomic_add, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_codegen_config_option_dont_assume_alignment, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_comment_graph_fragment, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_computed_indirect_mask, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_constant_folding_deallocation, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_divisible_by_16_covers_numel_args, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_indirect_device_assert, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_detach_view_backend_aot_eager, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_kernel_names_descriptive, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_layer_norm_inplaces_after_matmul, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_non_blocking_copy_codegen, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_numpy_autograd, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_red_followed_by_transposed_pointwise, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_respect_scaled_grouped_mm_layout_tag, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_rope_fusion, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_skip_l1_cache, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_split_op_with_sym, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_triton_attrs_dict_constexpr_signature, test/inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_fail 2025-12-04T10:04:28.1089569Z 2025-12-04T10:04:28.1090050Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast2 PASSED [3.1248s] [ 0%] 2025-12-04T10:04:28.1091092Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_strided PASSED [0.1941s] [ 0%] 2025-12-04T10:04:28.1092124Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_transposed PASSED [0.1880s] [ 0%] 2025-12-04T10:04:28.1093164Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast2 PASSED [0.1671s] [ 0%] 2025-12-04T10:04:28.1094179Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_double PASSED [0.1759s] [ 1%] 2025-12-04T10:04:28.1095182Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_int PASSED [0.1880s] [ 1%] 2025-12-04T10:04:28.1096194Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_transposed PASSED [0.1747s] [ 1%] 2025-12-04T10:04:28.1097215Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_dense PASSED [0.1729s] [ 1%] 2025-12-04T10:04:28.1098226Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_int PASSED [0.1664s] [ 1%] 2025-12-04T10:04:28.1099215Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast1 PASSED [0.1717s] [ 2%] 2025-12-04T10:04:28.1100203Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast3 PASSED [0.1678s] [ 2%] 2025-12-04T10:04:28.1101181Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_dense PASSED [0.1673s] [ 2%] 2025-12-04T10:04:28.1102123Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_double PASSED [0.1687s] [ 2%] 2025-12-04T10:04:28.1103090Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_strided PASSED [0.1775s] [ 3%] 2025-12-04T10:04:28.1104077Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast2 PASSED [0.1731s] [ 3%] 2025-12-04T10:04:28.1105070Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_double PASSED [0.1674s] [ 3%] 2025-12-04T10:04:28.1106006Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_int PASSED [0.1720s] [ 3%] 2025-12-04T10:04:28.1106962Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_strided PASSED [0.1813s] [ 3%] 2025-12-04T10:04:28.1107948Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_transposed PASSED [0.5003s] [ 4%] 2025-12-04T10:04:28.1108937Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast3 PASSED [0.4650s] [ 4%] 2025-12-04T10:04:28.1109891Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_strided PASSED [0.4803s] [ 4%] 2025-12-04T10:04:28.1110860Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_transposed PASSED [0.4849s] [ 4%] 2025-12-04T10:04:28.1111854Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast1 PASSED [0.4757s] [ 4%] 2025-12-04T10:04:28.1112857Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast3 PASSED [0.4732s] [ 5%] 2025-12-04T10:04:28.1113879Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_strided PASSED [0.4788s] [ 5%] 2025-12-04T10:04:28.1114867Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_transposed PASSED [0.5024s] [ 5%] 2025-12-04T10:04:28.1115894Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast2 PASSED [0.6525s] [ 5%] 2025-12-04T10:04:28.1116917Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_double PASSED [0.5002s] [ 6%] 2025-12-04T10:04:28.1118055Z inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_int PASSED [0.4825s] [ 6%] 2025-12-04T10:04:28.1119039Z inductor/test_torchinductor.py::GPUTests::test_AllenaiLongformerBase_repro_cuda PASSED [1.9256s] [ 6%] 2025-12-04T10:04:28.1120377Z inductor/test_torchinductor.py::GPUTests::test__dyn_quant_pack_4bit_weight_bf16_cuda SKIPPED [0.0032s] (No _dyn_quant_pack_4bit_weight implementation on CUDA) [ 6%] 2025-12-04T10:04:28.1121651Z inductor/test_torchinductor.py::GPUTests::test__unsafe_masked_index_put_accumulate_cuda PASSED [0.8285s] [ 6%] 2025-12-04T10:04:28.1122563Z inductor/test_torchinductor.py::GPUTests::test_abs_cuda PASSED [0.3412s] [ 7%] 2025-12-04T10:04:28.1123407Z inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool1d_argmax_cuda PASSED [0.4191s] [ 7%] 2025-12-04T10:04:28.1124340Z inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d1_cuda PASSED [3.7544s] [ 7%] 2025-12-04T10:04:28.1125322Z inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool_with_output_size_0_cuda PASSED [0.6595s] [ 7%] 2025-12-04T10:04:28.1126343Z inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d1_cuda PASSED [2.8015s] [ 7%] 2025-12-04T10:04:28.1127236Z inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d3_cuda PASSED [3.5427s] [ 8%] 2025-12-04T10:04:28.1128195Z inductor/test_torchinductor.py::GPUTests::test_adaptive_pool_errors_with_long_cuda PASSED [0.3063s] [ 8%] 2025-12-04T10:04:28.1129112Z inductor/test_torchinductor.py::GPUTests::test_add_complex10_cuda PASSED [0.3264s] [ 8%] 2025-12-04T10:04:28.1129942Z inductor/test_torchinductor.py::GPUTests::test_add_complex4_cuda PASSED [0.8199s] [ 8%] 2025-12-04T10:04:28.1130751Z inductor/test_torchinductor.py::GPUTests::test_add_complex5_cuda PASSED [0.2969s] [ 9%] 2025-12-04T10:04:28.1131569Z inductor/test_torchinductor.py::GPUTests::test_add_complex6_cuda PASSED [0.3036s] [ 9%] 2025-12-04T10:04:28.1132388Z inductor/test_torchinductor.py::GPUTests::test_add_complex8_cuda PASSED [0.2760s] [ 9%] 2025-12-04T10:04:28.1133209Z inductor/test_torchinductor.py::GPUTests::test_add_complex9_cuda PASSED [0.2717s] [ 9%] 2025-12-04T10:04:28.1134032Z inductor/test_torchinductor.py::GPUTests::test_add_const_float_cuda PASSED [0.3111s] [ 9%] 2025-12-04T10:04:28.1134908Z inductor/test_torchinductor.py::GPUTests::test_add_inplace_permuted_cuda PASSED [0.4451s] [ 10%] 2025-12-04T10:04:28.1136184Z inductor/test_torchinductor.py::GPUTests::test_addmm_cuda W1204 09:56:14.521000 14751 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.1137165Z PASSED [1.6170s] [ 10%] 2025-12-04T10:04:28.1137703Z inductor/test_torchinductor.py::GPUTests::test_alexnet_prefix_cuda PASSED [2.7003s] [ 10%] 2025-12-04T10:04:28.1138726Z inductor/test_torchinductor.py::GPUTests::test_allow_reuse_active_if_under_peak_cuda PASSED [1.8689s] [ 10%] 2025-12-04T10:04:28.1139625Z inductor/test_torchinductor.py::GPUTests::test_angle_cuda PASSED [0.8089s] [ 11%] 2025-12-04T10:04:28.1140381Z inductor/test_torchinductor.py::GPUTests::test_any_cuda PASSED [1.5735s] [ 11%] 2025-12-04T10:04:28.1141350Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_with_persistent_cache_cuda SKIPPED [0.0032s] (Requires sm80) [ 11%] 2025-12-04T10:04:28.1142478Z inductor/test_torchinductor.py::GPUTests::test_aoti_eager_with_scalar_cuda SKIPPED [0.0030s] (Requires sm80) [ 11%] 2025-12-04T10:04:28.1143402Z inductor/test_torchinductor.py::GPUTests::test_arange1_cuda PASSED [0.4215s] [ 11%] 2025-12-04T10:04:28.1144166Z inductor/test_torchinductor.py::GPUTests::test_arange2_cuda PASSED [0.1716s] [ 12%] 2025-12-04T10:04:28.1144935Z inductor/test_torchinductor.py::GPUTests::test_arange6_cuda PASSED [0.5170s] [ 12%] 2025-12-04T10:04:28.1145738Z inductor/test_torchinductor.py::GPUTests::test_argmax_argmin1_cuda PASSED [0.7082s] [ 12%] 2025-12-04T10:04:28.1146565Z inductor/test_torchinductor.py::GPUTests::test_argmax_argmin2_cuda PASSED [0.7854s] [ 12%] 2025-12-04T10:04:28.1147388Z inductor/test_torchinductor.py::GPUTests::test_as_strided_cuda PASSED [0.8493s] [ 12%] 2025-12-04T10:04:28.1148305Z inductor/test_torchinductor.py::GPUTests::test_as_strided_on_views_cuda PASSED [0.4618s] [ 13%] 2025-12-04T10:04:28.1149249Z inductor/test_torchinductor.py::GPUTests::test_assert_alignment_op_name_fail_cuda PASSED [0.0033s] [ 13%] 2025-12-04T10:04:28.1150307Z inductor/test_torchinductor.py::GPUTests::test_assert_size_stride_op_name_fail_cuda PASSED [0.0030s] [ 13%] 2025-12-04T10:04:28.1151326Z inductor/test_torchinductor.py::GPUTests::test_assert_size_stride_op_name_pass_cuda PASSED [0.0029s] [ 13%] 2025-12-04T10:04:28.1152237Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d4_cuda PASSED [0.6326s] [ 14%] 2025-12-04T10:04:28.1153047Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d6_cuda PASSED [0.7722s] [ 14%] 2025-12-04T10:04:28.1153840Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d7_cuda PASSED [0.2194s] [ 14%] 2025-12-04T10:04:28.1154648Z inductor/test_torchinductor.py::GPUTests::test_avg_pool2d8_cuda PASSED [1.7794s] [ 14%] 2025-12-04T10:04:28.1155503Z inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward4_cuda PASSED [0.1279s] [ 14%] 2025-12-04T10:04:28.1156369Z inductor/test_torchinductor.py::GPUTests::test_batch_norm_2d_2_cuda PASSED [1.1756s] [ 15%] 2025-12-04T10:04:28.1157201Z inductor/test_torchinductor.py::GPUTests::test_batch_norm_2d_cuda PASSED [0.8729s] [ 15%] 2025-12-04T10:04:28.1158008Z inductor/test_torchinductor.py::GPUTests::test_bitwise2_cuda PASSED [0.3090s] [ 15%] 2025-12-04T10:04:28.1158773Z inductor/test_torchinductor.py::GPUTests::test_bmm2_cuda PASSED [0.1380s] [ 15%] 2025-12-04T10:04:28.1159545Z inductor/test_torchinductor.py::GPUTests::test_both_scalars_cuda PASSED [1.0217s] [ 15%] 2025-12-04T10:04:28.1160416Z inductor/test_torchinductor.py::GPUTests::test_bucketize_add_autotune_cuda PASSED [0.5614s] [ 16%] 2025-12-04T10:04:28.1161361Z inductor/test_torchinductor.py::GPUTests::test_bucketize_computed_offsets_cuda PASSED [0.2047s] [ 16%] 2025-12-04T10:04:28.1162318Z inductor/test_torchinductor.py::GPUTests::test_bucketize_default_kwargs_cuda PASSED [0.1833s] [ 16%] 2025-12-04T10:04:28.1163250Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int16_cuda PASSED [0.7958s] [ 16%] 2025-12-04T10:04:28.1164198Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_int32_cuda PASSED [0.7459s] [ 17%] 2025-12-04T10:04:28.1165143Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int32_uint8_cuda PASSED [0.7922s] [ 17%] 2025-12-04T10:04:28.1166086Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int16_cuda PASSED [0.7804s] [ 17%] 2025-12-04T10:04:28.1167015Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int32_cuda PASSED [0.7898s] [ 17%] 2025-12-04T10:04:28.1167956Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int64_cuda PASSED [0.7523s] [ 17%] 2025-12-04T10:04:28.1168894Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_int8_cuda PASSED [0.7886s] [ 18%] 2025-12-04T10:04:28.1169837Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int64_uint8_cuda PASSED [0.7804s] [ 18%] 2025-12-04T10:04:28.1170761Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_int8_cuda PASSED [0.7467s] [ 18%] 2025-12-04T10:04:28.1171701Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_int8_uint8_cuda PASSED [0.8077s] [ 18%] 2025-12-04T10:04:28.1172641Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int32_cuda PASSED [0.7591s] [ 19%] 2025-12-04T10:04:28.1173570Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int64_cuda PASSED [0.7661s] [ 19%] 2025-12-04T10:04:28.1174509Z inductor/test_torchinductor.py::GPUTests::test_bucketize_int_uint8_int8_cuda PASSED [0.8081s] [ 19%] 2025-12-04T10:04:28.1175482Z inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_neg_cuda PASSED [0.2729s] [ 19%] 2025-12-04T10:04:28.1176487Z inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_pos_cuda PASSED [0.4570s] [ 19%] 2025-12-04T10:04:28.1177412Z inductor/test_torchinductor.py::GPUTests::test_cat_cuda PASSED [1.3303s] [ 20%] 2025-12-04T10:04:28.1178277Z inductor/test_torchinductor.py::GPUTests::test_cat_empty_cuda PASSED [0.9763s] [ 20%] 2025-12-04T10:04:28.1179173Z inductor/test_torchinductor.py::GPUTests::test_cat_empty_index_cuda PASSED [0.3404s] [ 20%] 2025-12-04T10:04:28.1180028Z inductor/test_torchinductor.py::GPUTests::test_cat_single_empty_cuda PASSED [0.2060s] [ 20%] 2025-12-04T10:04:28.1180902Z inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_empty_1d_cuda PASSED [0.6555s] [ 20%] 2025-12-04T10:04:28.1181745Z inductor/test_torchinductor.py::GPUTests::test_cauchy_cuda PASSED [0.2337s] [ 21%] 2025-12-04T10:04:28.1182567Z inductor/test_torchinductor.py::GPUTests::test_chunk_recompiles_cuda PASSED [0.8422s] [ 21%] 2025-12-04T10:04:28.1183449Z inductor/test_torchinductor.py::GPUTests::test_clamp_type_promotion_cuda PASSED [0.2788s] [ 21%] 2025-12-04T10:04:28.1184398Z inductor/test_torchinductor.py::GPUTests::test_clamp_type_promotion_non_tensor_cuda PASSED [0.2505s] [ 21%] 2025-12-04T10:04:28.1185285Z inductor/test_torchinductor.py::GPUTests::test_clone_cuda PASSED [0.3655s] [ 22%] 2025-12-04T10:04:28.1186056Z inductor/test_torchinductor.py::GPUTests::test_compar_cuda PASSED [0.2876s] [ 22%] 2025-12-04T10:04:28.1186868Z inductor/test_torchinductor.py::GPUTests::test_complex_fallback_cuda PASSED [0.3626s] [ 22%] 2025-12-04T10:04:28.1187937Z inductor/test_torchinductor.py::GPUTests::test_complex_from_real_imag_cuda PASSED [0.5370s] [ 22%] 2025-12-04T10:04:28.1188868Z inductor/test_torchinductor.py::GPUTests::test_complex_memory_overlap_cuda PASSED [0.0034s] [ 22%] 2025-12-04T10:04:28.1189804Z inductor/test_torchinductor.py::GPUTests::test_computed_buffer_inlining_cuda PASSED [0.1852s] [ 23%] 2025-12-04T10:04:28.1190705Z inductor/test_torchinductor.py::GPUTests::test_concat_add_inplace_cuda PASSED [0.4960s] [ 23%] 2025-12-04T10:04:28.1191734Z inductor/test_torchinductor.py::GPUTests::test_config_option_dont_assume_alignment_cudagraphs_cuda PASSED [0.8199s] [ 23%] 2025-12-04T10:04:28.1192897Z inductor/test_torchinductor.py::GPUTests::test_config_option_dont_assume_alignment_recompiles_cuda PASSED [0.6402s] [ 23%] 2025-12-04T10:04:28.1193959Z inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumprod_cuda PASSED [0.5138s] [ 23%] 2025-12-04T10:04:28.1194872Z inductor/test_torchinductor.py::GPUTests::test_const_int32_to_float_cuda PASSED [0.7501s] [ 24%] 2025-12-04T10:04:28.1195837Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_2d_strides_nonpositive_cuda PASSED [0.3493s] [ 24%] 2025-12-04T10:04:28.1196796Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_3d_cuda PASSED [0.5433s] [ 24%] 2025-12-04T10:04:28.1197689Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_fill_dtype_cuda PASSED [0.2863s] [ 24%] 2025-12-04T10:04:28.1198610Z inductor/test_torchinductor.py::GPUTests::test_constant_pad_nd_inplace_cuda PASSED [0.1552s] [ 25%] 2025-12-04T10:04:28.1199581Z inductor/test_torchinductor.py::GPUTests::test_conv2d_backward_channels_last_cuda PASSED [0.4891s] [ 25%] 2025-12-04T10:04:28.1200724Z inductor/test_torchinductor.py::GPUTests::test_conv2d_channels_last_cuda SKIPPED [0.0032s] (only support cpu conv2d channels_last) [ 25%] 2025-12-04T10:04:28.1202102Z inductor/test_torchinductor.py::GPUTests::test_conv3d_channels_last_use_block_ptr_False_cuda SKIPPED [0.0030s] (only support cpu conv3d channels_last) [ 25%] 2025-12-04T10:04:28.1203206Z inductor/test_torchinductor.py::GPUTests::test_conv3d_cuda PASSED [0.4488s] [ 25%] 2025-12-04T10:04:28.1204023Z inductor/test_torchinductor.py::GPUTests::test_conv_shape_check_cuda PASSED [0.3886s] [ 26%] 2025-12-04T10:04:28.1204866Z inductor/test_torchinductor.py::GPUTests::test_convolution1_cuda PASSED [0.5743s] [ 26%] 2025-12-04T10:04:28.1205692Z inductor/test_torchinductor.py::GPUTests::test_convolution3_cuda PASSED [0.5215s] [ 26%] 2025-12-04T10:04:28.1207147Z inductor/test_torchinductor.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_False_cuda W1204 09:57:00.088000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1208533Z W1204 09:57:00.089000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1209542Z W1204 09:57:00.090000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1210470Z W1204 09:57:00.091000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1211381Z W1204 09:57:00.092000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1212304Z W1204 09:57:00.092000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1213225Z W1204 09:57:00.093000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1214154Z W1204 09:57:00.094000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1215060Z W1204 09:57:00.095000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1215985Z W1204 09:57:00.096000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1216916Z W1204 09:57:00.096000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1217826Z W1204 09:57:00.097000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1218836Z W1204 09:57:00.098000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1219761Z W1204 09:57:00.099000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1220687Z W1204 09:57:00.100000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1221600Z W1204 09:57:00.100000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1222524Z W1204 09:57:00.101000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1223454Z W1204 09:57:00.102000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1224382Z W1204 09:57:00.103000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1225289Z W1204 09:57:00.103000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1226215Z W1204 09:57:00.104000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1227141Z W1204 09:57:00.105000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1228066Z W1204 09:57:00.106000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1228982Z W1204 09:57:00.107000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1229909Z W1204 09:57:00.107000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1230837Z W1204 09:57:00.108000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1231754Z W1204 09:57:00.109000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1232664Z W1204 09:57:00.110000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1233591Z W1204 09:57:00.111000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1234513Z W1204 09:57:00.112000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1235511Z W1204 09:57:00.112000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1236427Z W1204 09:57:00.113000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1237418Z W1204 09:57:00.114000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1238343Z W1204 09:57:00.115000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1239269Z W1204 09:57:00.115000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1240182Z W1204 09:57:00.116000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1241108Z W1204 09:57:00.117000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1242034Z W1204 09:57:00.118000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1242955Z W1204 09:57:00.118000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1243872Z W1204 09:57:00.119000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1244804Z W1204 09:57:00.120000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1245725Z W1204 09:57:00.121000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1246635Z W1204 09:57:00.122000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1247556Z W1204 09:57:00.122000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1248480Z W1204 09:57:00.123000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1249410Z W1204 09:57:00.124000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1250325Z W1204 09:57:00.125000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1251250Z W1204 09:57:00.126000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1252172Z W1204 09:57:00.126000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1253090Z W1204 09:57:00.127000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1254000Z W1204 09:57:00.128000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1254923Z W1204 09:57:00.129000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1255847Z W1204 09:57:00.129000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1256776Z W1204 09:57:00.130000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1257686Z W1204 09:57:00.131000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1258689Z W1204 09:57:00.132000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1259615Z W1204 09:57:00.132000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1260543Z W1204 09:57:00.133000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1261454Z W1204 09:57:00.134000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1262386Z W1204 09:57:00.135000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1263381Z W1204 09:57:00.136000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1264306Z W1204 09:57:00.136000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1265279Z W1204 09:57:00.137000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1266203Z W1204 09:57:00.138000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1267130Z W1204 09:57:00.139000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1268055Z W1204 09:57:00.139000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1268969Z W1204 09:57:00.140000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1269899Z W1204 09:57:00.141000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1270835Z W1204 09:57:00.142000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1271746Z W1204 09:57:00.143000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1272672Z W1204 09:57:00.143000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1273603Z W1204 09:57:00.144000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1274530Z W1204 09:57:00.145000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1275443Z W1204 09:57:00.146000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1276366Z W1204 09:57:00.147000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1277294Z W1204 09:57:00.147000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1278214Z W1204 09:57:00.148000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1279127Z W1204 09:57:00.149000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1280048Z W1204 09:57:00.150000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1280968Z W1204 09:57:00.150000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1281890Z W1204 09:57:00.151000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1282802Z W1204 09:57:00.152000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1283724Z W1204 09:57:00.153000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1284655Z W1204 09:57:00.153000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1285588Z W1204 09:57:00.154000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1286515Z W1204 09:57:00.155000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1287429Z W1204 09:57:00.156000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1288549Z W1204 09:57:00.157000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1289482Z W1204 09:57:00.157000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1290408Z W1204 09:57:00.158000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1291435Z W1204 09:57:00.159000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1292373Z W1204 09:57:00.160000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1293411Z W1204 09:57:00.160000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1294327Z W1204 09:57:00.161000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1295256Z W1204 09:57:00.162000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1296184Z W1204 09:57:00.163000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1297116Z W1204 09:57:00.164000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1298032Z W1204 09:57:00.164000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1299025Z W1204 09:57:00.165000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1299953Z W1204 09:57:00.166000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1300605Z PASSED [1.4863s] [ 26%] 2025-12-04T10:04:28.1301169Z inductor/test_torchinductor.py::GPUTests::test_copy_with_scalar_src_cuda PASSED [0.3458s] [ 26%] 2025-12-04T10:04:28.1302103Z inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_cpu_scalar_cuda PASSED [0.6930s] [ 27%] 2025-12-04T10:04:28.1303055Z inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_cpu_tensor_cuda PASSED [0.7172s] [ 27%] 2025-12-04T10:04:28.1304398Z inductor/test_torchinductor.py::GPUTests::test_cpu_scalar_with_gpu_tensor_cpp_cuda W1204 09:57:02.211000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1305431Z PASSED [5.6594s] [ 27%] 2025-12-04T10:04:28.1306025Z inductor/test_torchinductor.py::GPUTests::test_cpu_tensor_with_cpu_tensor_cuda PASSED [0.7032s] [ 27%] 2025-12-04T10:04:28.1306898Z inductor/test_torchinductor.py::GPUTests::test_cudnn_rnn_cuda PASSED [3.8658s] [ 28%] 2025-12-04T10:04:28.1307676Z inductor/test_torchinductor.py::GPUTests::test_cummin_cuda PASSED [1.6031s] [ 28%] 2025-12-04T10:04:28.1308490Z inductor/test_torchinductor.py::GPUTests::test_cumprod_zero_dim_cuda PASSED [0.3036s] [ 28%] 2025-12-04T10:04:28.1309319Z inductor/test_torchinductor.py::GPUTests::test_cumsum_inf_cuda PASSED [0.5366s] [ 28%] 2025-12-04T10:04:28.1310211Z inductor/test_torchinductor.py::GPUTests::test_cumsum_pattern_matcher_issue_cuda PASSED [0.3588s] [ 28%] 2025-12-04T10:04:28.1311098Z inductor/test_torchinductor.py::GPUTests::test_custom_op_3_cuda PASSED [0.1262s] [ 29%] 2025-12-04T10:04:28.1311985Z inductor/test_torchinductor.py::GPUTests::test_custom_op_unbacked_symints_cuda PASSED [0.3401s] [ 29%] 2025-12-04T10:04:28.1313045Z inductor/test_torchinductor.py::GPUTests::test_data_type_propogation_cuda SKIPPED [0.0033s] (triton not supported) [ 29%] 2025-12-04T10:04:28.1314053Z inductor/test_torchinductor.py::GPUTests::test_dense_mask_index_cuda PASSED [0.5054s] [ 29%] 2025-12-04T10:04:28.1315017Z inductor/test_torchinductor.py::GPUTests::test_deterministic_codegen_on_graph_break_cuda PASSED [0.7978s] [ 30%] 2025-12-04T10:04:28.1316083Z inductor/test_torchinductor.py::GPUTests::test_deterministic_codegen_with_suffix_cuda PASSED [0.7759s] [ 30%] 2025-12-04T10:04:28.1317025Z inductor/test_torchinductor.py::GPUTests::test_device_assert_cuda PASSED [0.5885s] [ 30%] 2025-12-04T10:04:28.1317826Z inductor/test_torchinductor.py::GPUTests::test_div1_cuda PASSED [0.5955s] [ 30%] 2025-12-04T10:04:28.1318569Z inductor/test_torchinductor.py::GPUTests::test_div2_cuda PASSED [0.5980s] [ 30%] 2025-12-04T10:04:28.1319325Z inductor/test_torchinductor.py::GPUTests::test_div3_cuda PASSED [0.3418s] [ 31%] 2025-12-04T10:04:28.1320075Z inductor/test_torchinductor.py::GPUTests::test_div4_cuda PASSED [0.6809s] [ 31%] 2025-12-04T10:04:28.1320881Z inductor/test_torchinductor.py::GPUTests::test_div7_cuda PASSED [0.3505s] [ 31%] 2025-12-04T10:04:28.1321663Z inductor/test_torchinductor.py::GPUTests::test_div_by_zero_cuda PASSED [0.5707s] [ 31%] 2025-12-04T10:04:28.1322550Z inductor/test_torchinductor.py::GPUTests::test_div_precision_cuda PASSED [1.4589s] [ 31%] 2025-12-04T10:04:28.1323428Z inductor/test_torchinductor.py::GPUTests::test_div_presicion_accuracy_cuda PASSED [0.3627s] [ 32%] 2025-12-04T10:04:28.1324435Z inductor/test_torchinductor.py::GPUTests::test_dont_constant_fold_cuda SKIPPED [0.0032s] (triton not supported) [ 32%] 2025-12-04T10:04:28.1325389Z inductor/test_torchinductor.py::GPUTests::test_dropout3_cuda PASSED [0.6211s] [ 32%] 2025-12-04T10:04:28.1326168Z inductor/test_torchinductor.py::GPUTests::test_dropout_cuda PASSED [0.4554s] [ 32%] 2025-12-04T10:04:28.1326993Z inductor/test_torchinductor.py::GPUTests::test_dropout_trivial_1_cuda PASSED [0.2239s] [ 33%] 2025-12-04T10:04:28.1327868Z inductor/test_torchinductor.py::GPUTests::test_dtype_mismatch_issue_cuda PASSED [1.2558s] [ 33%] 2025-12-04T10:04:28.1328985Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float32_cuda SKIPPED [0.0032s] (uses bfloat16 which requires SM >= 80) [ 33%] 2025-12-04T10:04:28.1330314Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int16_cuda SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 33%] 2025-12-04T10:04:28.1331611Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int8_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 33%] 2025-12-04T10:04:28.1332910Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float64_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 34%] 2025-12-04T10:04:28.1334224Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int16_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 34%] 2025-12-04T10:04:28.1335526Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_int64_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 34%] 2025-12-04T10:04:28.1336837Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float32_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 34%] 2025-12-04T10:04:28.1338147Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float64_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 34%] 2025-12-04T10:04:28.1339519Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int32_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 35%] 2025-12-04T10:04:28.1340816Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int64_cuda SKIPPED [0.0028s] (uses bfloat16 which requires SM >= 80) [ 35%] 2025-12-04T10:04:28.1342111Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_int8_cuda SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 35%] 2025-12-04T10:04:28.1343396Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_uint8_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 35%] 2025-12-04T10:04:28.1344482Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_fusion_cuda PASSED [0.4033s] [ 36%] 2025-12-04T10:04:28.1345564Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float64_cuda SKIPPED [0.0032s] (uses bfloat16 which requires SM >= 80) [ 36%] 2025-12-04T10:04:28.1346849Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int16_cuda SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 36%] 2025-12-04T10:04:28.1348112Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_uint8_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 36%] 2025-12-04T10:04:28.1349386Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_uint8_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 36%] 2025-12-04T10:04:28.1350661Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float16_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 37%] 2025-12-04T10:04:28.1352013Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int16_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 37%] 2025-12-04T10:04:28.1353360Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float16_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 37%] 2025-12-04T10:04:28.1354616Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int8_cuda SKIPPED [0.0028s] (uses bfloat16 which requires SM >= 80) [ 37%] 2025-12-04T10:04:28.1355877Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_uint8_cuda SKIPPED [0.0028s] (uses bfloat16 which requires SM >= 80) [ 38%] 2025-12-04T10:04:28.1357143Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int16_cuda SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 38%] 2025-12-04T10:04:28.1358414Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int64_cuda SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 38%] 2025-12-04T10:04:28.1359675Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_int8_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 38%] 2025-12-04T10:04:28.1360938Z inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_uint8_cuda SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 38%] 2025-12-04T10:04:28.1361955Z inductor/test_torchinductor.py::GPUTests::test_elu_cuda PASSED [0.6015s] [ 39%] 2025-12-04T10:04:28.1363010Z inductor/test_torchinductor.py::GPUTests::test_embedding_bag_byte_unpack_cuda SKIPPED [0.0033s] (No cuda implementation (it returns empty)) [ 39%] 2025-12-04T10:04:28.1364098Z inductor/test_torchinductor.py::GPUTests::test_embedding_bag_cuda PASSED [0.2698s] [ 39%] 2025-12-04T10:04:28.1364947Z inductor/test_torchinductor.py::GPUTests::test_embedding_sparse_cuda PASSED [0.2799s] [ 39%] 2025-12-04T10:04:28.1365798Z inductor/test_torchinductor.py::GPUTests::test_empty_strided_cuda PASSED [0.1503s] [ 39%] 2025-12-04T10:04:28.1366603Z inductor/test_torchinductor.py::GPUTests::test_exp2_cuda PASSED [0.6095s] [ 40%] 2025-12-04T10:04:28.1367343Z inductor/test_torchinductor.py::GPUTests::test_exp_cuda PASSED [0.4673s] [ 40%] 2025-12-04T10:04:28.1368112Z inductor/test_torchinductor.py::GPUTests::test_expand_as_cuda PASSED [0.4193s] [ 40%] 2025-12-04T10:04:28.1368898Z inductor/test_torchinductor.py::GPUTests::test_expand_cuda PASSED [0.4793s] [ 40%] 2025-12-04T10:04:28.1369717Z inductor/test_torchinductor.py::GPUTests::test_expanded_reduction_cuda PASSED [0.7784s] [ 41%] 2025-12-04T10:04:28.1370633Z inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_basic_cuda PASSED [0.0491s] [ 41%] 2025-12-04T10:04:28.1371612Z inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_list_tensor_cuda PASSED [0.7386s] [ 41%] 2025-12-04T10:04:28.1372619Z inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_with_return_cuda PASSED [0.0500s] [ 41%] 2025-12-04T10:04:28.1373591Z inductor/test_torchinductor.py::GPUTests::test_fft_real_input_real_output_cuda PASSED [0.1382s] [ 41%] 2025-12-04T10:04:28.1374453Z inductor/test_torchinductor.py::GPUTests::test_fill1_cuda PASSED [0.3361s] [ 42%] 2025-12-04T10:04:28.1375256Z inductor/test_torchinductor.py::GPUTests::test_float16_to_int16_cuda PASSED [0.1769s] [ 42%] 2025-12-04T10:04:28.1376114Z inductor/test_torchinductor.py::GPUTests::test_float32_to_int32_cuda PASSED [0.1872s] [ 42%] 2025-12-04T10:04:28.1392216Z inductor/test_torchinductor.py::GPUTests::test_float_index_expression_type_promotion_cuda PASSED [0.2198s] [ 42%] 2025-12-04T10:04:28.1393316Z inductor/test_torchinductor.py::GPUTests::test_floordiv_cuda PASSED [0.5368s] [ 42%] 2025-12-04T10:04:28.1394111Z inductor/test_torchinductor.py::GPUTests::test_fmod_cuda PASSED [0.5449s] [ 43%] 2025-12-04T10:04:28.1394915Z inductor/test_torchinductor.py::GPUTests::test_fmod_zero_dim_cuda PASSED [1.2260s] [ 43%] 2025-12-04T10:04:28.1395745Z inductor/test_torchinductor.py::GPUTests::test_full_boolean_cuda PASSED [0.4262s] [ 43%] 2025-12-04T10:04:28.1396741Z inductor/test_torchinductor.py::GPUTests::test_full_like_cuda PASSED [0.3154s] [ 43%] 2025-12-04T10:04:28.1397576Z inductor/test_torchinductor.py::GPUTests::test_full_like_sliced_cuda PASSED [0.3214s] [ 44%] 2025-12-04T10:04:28.1398531Z inductor/test_torchinductor.py::GPUTests::test_full_truncation_cuda PASSED [1.1398s] [ 44%] 2025-12-04T10:04:28.1399449Z inductor/test_torchinductor.py::GPUTests::test_functionalize_rng_wrappers_cuda PASSED [0.0561s] [ 44%] 2025-12-04T10:04:28.1400644Z inductor/test_torchinductor.py::GPUTests::test_fuse_large_params_cuda SKIPPED [0.0030s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 44%] 2025-12-04T10:04:28.1401860Z inductor/test_torchinductor.py::GPUTests::test_fusing_write_into_disjoint_read_cuda PASSED [0.9613s] [ 44%] 2025-12-04T10:04:28.1402757Z inductor/test_torchinductor.py::GPUTests::test_gather1_cuda PASSED [0.5561s] [ 45%] 2025-12-04T10:04:28.1403529Z inductor/test_torchinductor.py::GPUTests::test_gather3_cuda PASSED [0.6310s] [ 45%] 2025-12-04T10:04:28.1404290Z inductor/test_torchinductor.py::GPUTests::test_gelu_cuda PASSED [0.5721s] [ 45%] 2025-12-04T10:04:28.1405106Z inductor/test_torchinductor.py::GPUTests::test_generate_rand_fp8_cuda PASSED [0.0034s] [ 45%] 2025-12-04T10:04:28.1406070Z inductor/test_torchinductor.py::GPUTests::test_generated_code_has_alignment_assert_cuda PASSED [0.1821s] [ 46%] 2025-12-04T10:04:28.1407119Z inductor/test_torchinductor.py::GPUTests::test_generated_code_has_size_stride_assert_cuda PASSED [0.1704s] [ 46%] 2025-12-04T10:04:28.1408031Z inductor/test_torchinductor.py::GPUTests::test_glu_cuda PASSED [0.5370s] [ 46%] 2025-12-04T10:04:28.1408879Z inductor/test_torchinductor.py::GPUTests::test_gpu_scalar_with_gpu_tensor_cuda PASSED [0.1679s] [ 46%] 2025-12-04T10:04:28.1409825Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_arange2_cuda PASSED [0.2992s] [ 46%] 2025-12-04T10:04:28.1410744Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_argmax_cuda PASSED [0.7471s] [ 47%] 2025-12-04T10:04:28.1411698Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_both_scalars_cuda PASSED [0.7134s] [ 47%] 2025-12-04T10:04:28.1412709Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_constant_tensor1_cuda PASSED [0.6886s] [ 47%] 2025-12-04T10:04:28.1414130Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_mutation_real_name_cuda W1204 09:57:43.421000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1415480Z W1204 09:57:43.422000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1416126Z PASSED [0.3635s] [ 47%] 2025-12-04T10:04:28.1417316Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_pad_dynamic_cuda W1204 09:57:44.995000 14751 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:04:28.1418633Z PASSED [3.0611s] [ 47%] 2025-12-04T10:04:28.1419217Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_refcount_cuda PASSED [5.4255s] [ 48%] 2025-12-04T10:04:28.1420196Z inductor/test_torchinductor.py::GPUTests::test_graph_partition_scalar_inputs_cuda PASSED [0.7074s] [ 48%] 2025-12-04T10:04:28.1421121Z inductor/test_torchinductor.py::GPUTests::test_grid_sampler_2d_cuda PASSED [2.8954s] [ 48%] 2025-12-04T10:04:28.1422058Z inductor/test_torchinductor.py::GPUTests::test_grid_sampler_expand_preserves_view_cuda PASSED [4.2957s] [ 48%] 2025-12-04T10:04:28.1422966Z inductor/test_torchinductor.py::GPUTests::test_hardtanh_cuda PASSED [0.5234s] [ 49%] 2025-12-04T10:04:28.1423794Z inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion1_cuda PASSED [0.4723s] [ 49%] 2025-12-04T10:04:28.1424618Z inductor/test_torchinductor.py::GPUTests::test_index2_cuda PASSED [0.5569s] [ 49%] 2025-12-04T10:04:28.1425384Z inductor/test_torchinductor.py::GPUTests::test_index3_cuda PASSED [0.5412s] [ 49%] 2025-12-04T10:04:28.1426252Z inductor/test_torchinductor.py::GPUTests::test_index_float_zero_cuda PASSED [0.4760s] [ 49%] 2025-12-04T10:04:28.1427127Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_cuda PASSED [0.1631s] [ 50%] 2025-12-04T10:04:28.1428030Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_flip_cuda PASSED [0.1705s] [ 50%] 2025-12-04T10:04:28.1429041Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_floordiv_cuda PASSED [0.2098s] [ 50%] 2025-12-04T10:04:28.1429996Z inductor/test_torchinductor.py::GPUTests::test_index_propagation_remainder_cuda PASSED [0.2090s] [ 50%] 2025-12-04T10:04:28.1430884Z inductor/test_torchinductor.py::GPUTests::test_index_put3_cuda PASSED [0.6141s] [ 50%] 2025-12-04T10:04:28.1431746Z inductor/test_torchinductor.py::GPUTests::test_index_put_as_masked_fill_cuda PASSED [0.7433s] [ 51%] 2025-12-04T10:04:28.1432671Z inductor/test_torchinductor.py::GPUTests::test_index_put_failed_reinplace_cuda PASSED [0.4635s] [ 51%] 2025-12-04T10:04:28.1433597Z inductor/test_torchinductor.py::GPUTests::test_index_put_fallback1_cuda PASSED [0.6113s] [ 51%] 2025-12-04T10:04:28.1434485Z inductor/test_torchinductor.py::GPUTests::test_index_put_fallback2_cuda PASSED [0.6457s] [ 51%] 2025-12-04T10:04:28.1435338Z inductor/test_torchinductor.py::GPUTests::test_index_tensor_cuda PASSED [0.5154s] [ 52%] 2025-12-04T10:04:28.1436207Z inductor/test_torchinductor.py::GPUTests::test_indirect_load_broadcast_cuda PASSED [1.0220s] [ 52%] 2025-12-04T10:04:28.1437098Z inductor/test_torchinductor.py::GPUTests::test_inductor_assert_cuda PASSED [0.9723s] [ 52%] 2025-12-04T10:04:28.1438093Z inductor/test_torchinductor.py::GPUTests::test_inductor_layout_optimization_input_mutations_cuda PASSED [0.3098s] [ 52%] 2025-12-04T10:04:28.1439238Z inductor/test_torchinductor.py::GPUTests::test_inductor_triton_bucketize_respects_masking_cuda PASSED [0.2714s] [ 52%] 2025-12-04T10:04:28.1440256Z inductor/test_torchinductor.py::GPUTests::test_inplace_mixed_dtype_ops_cuda PASSED [0.2070s] [ 53%] 2025-12-04T10:04:28.1441163Z inductor/test_torchinductor.py::GPUTests::test_inplace_resize_as_cuda PASSED [0.0581s] [ 53%] 2025-12-04T10:04:28.1442063Z inductor/test_torchinductor.py::GPUTests::test_inplace_where_pointwise_cuda PASSED [0.3979s] [ 53%] 2025-12-04T10:04:28.1442953Z inductor/test_torchinductor.py::GPUTests::test_input_mutation2_cuda PASSED [0.2388s] [ 53%] 2025-12-04T10:04:28.1443835Z inductor/test_torchinductor.py::GPUTests::test_int8_weight_only_quant_cuda PASSED [0.3502s] [ 53%] 2025-12-04T10:04:28.1444766Z inductor/test_torchinductor.py::GPUTests::test_int_input_dynamic_shapes_cuda PASSED [0.8383s] [ 54%] 2025-12-04T10:04:28.1445692Z inductor/test_torchinductor.py::GPUTests::test_invalid_operand_issue1_cuda PASSED [1.0101s] [ 54%] 2025-12-04T10:04:28.1446573Z inductor/test_torchinductor.py::GPUTests::test_isin_tensor_scalar_cuda PASSED [0.9059s] [ 54%] 2025-12-04T10:04:28.1447399Z inductor/test_torchinductor.py::GPUTests::test_isinf2_cuda PASSED [0.3781s] [ 54%] 2025-12-04T10:04:28.1448216Z inductor/test_torchinductor.py::GPUTests::test_large_block_sizes_cuda PASSED [5.7476s] [ 55%] 2025-12-04T10:04:28.1449150Z inductor/test_torchinductor.py::GPUTests::test_large_grid_use_block_ptr_False_cuda PASSED [0.6012s] [ 55%] 2025-12-04T10:04:28.1450090Z inductor/test_torchinductor.py::GPUTests::test_large_offset_pointwise_cuda PASSED [0.5425s] [ 55%] 2025-12-04T10:04:28.1452000Z inductor/test_torchinductor.py::GPUTests::test_layer_norm_cuda W1204 09:58:21.232000 14751 site-packages/torch/_inductor/debug.py:518] [0/0] model__439_inference_445 debug trace: /var/lib/jenkins/workspace/test/torch_compile_debug/run_2025_12_04_09_58_20_873252-pid_14751/torchinductor/model__439_inference_445.0 2025-12-04T10:04:28.1453605Z PASSED [0.5080s] [ 55%] 2025-12-04T10:04:28.1454101Z inductor/test_torchinductor.py::GPUTests::test_lerp_cuda PASSED [0.9043s] [ 55%] 2025-12-04T10:04:28.1454908Z inductor/test_torchinductor.py::GPUTests::test_like_channels_last_cuda PASSED [0.4405s] [ 56%] 2025-12-04T10:04:28.1455752Z inductor/test_torchinductor.py::GPUTests::test_like_rands2_cuda PASSED [0.2737s] [ 56%] 2025-12-04T10:04:28.1456629Z inductor/test_torchinductor.py::GPUTests::test_like_rands_cuda PASSED [0.2578s] [ 56%] 2025-12-04T10:04:28.1457839Z inductor/test_torchinductor.py::GPUTests::test_linear_dynamic_maxautotune_cuda SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 56%] 2025-12-04T10:04:28.1459349Z inductor/test_torchinductor.py::GPUTests::test_linear_float64_cuda SKIPPED [0.0032s] (cuda failed for float64 linear) [ 57%] 2025-12-04T10:04:28.1460331Z inductor/test_torchinductor.py::GPUTests::test_linspace3_cuda PASSED [0.1581s] [ 57%] 2025-12-04T10:04:28.1461426Z inductor/test_torchinductor.py::GPUTests::test_list_clearing_cuda SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 57%] 2025-12-04T10:04:28.1462604Z inductor/test_torchinductor.py::GPUTests::test_lite_dynamic_shape_assertion_cuda PASSED [0.1689s] [ 57%] 2025-12-04T10:04:28.1463545Z inductor/test_torchinductor.py::GPUTests::test_lite_mode_not_decompose_cuda PASSED [0.3972s] [ 57%] 2025-12-04T10:04:28.1464550Z inductor/test_torchinductor.py::GPUTests::test_lite_regional_compile_invoke_subgraph_cuda PASSED [1.8018s] [ 58%] 2025-12-04T10:04:28.1465467Z inductor/test_torchinductor.py::GPUTests::test_log1p_cuda PASSED [2.9492s] [ 58%] 2025-12-04T10:04:28.1466231Z inductor/test_torchinductor.py::GPUTests::test_log2_cuda PASSED [0.4821s] [ 58%] 2025-12-04T10:04:28.1466990Z inductor/test_torchinductor.py::GPUTests::test_logsumexp_cuda PASSED [1.1213s] [ 58%] 2025-12-04T10:04:28.1467790Z inductor/test_torchinductor.py::GPUTests::test_long_tensor_cuda PASSED [0.3182s] [ 58%] 2025-12-04T10:04:28.1468714Z inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_2_cuda PASSED [1.6771s] [ 59%] 2025-12-04T10:04:28.1469751Z inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_3_cuda PASSED [4.0526s] [ 59%] 2025-12-04T10:04:28.1471072Z inductor/test_torchinductor.py::GPUTests::test_mark_dynamic_with_hint_override_cuda SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 59%] 2025-12-04T10:04:28.1472703Z inductor/test_torchinductor.py::GPUTests::test_mark_unbacked_with_hint_override_cuda SKIPPED [0.0002s] (Skipping triton backend only since not big GPU (not enough SM)) [ 59%] 2025-12-04T10:04:28.1473927Z inductor/test_torchinductor.py::GPUTests::test_masked_fill_cuda PASSED [0.4605s] [ 60%] 2025-12-04T10:04:28.1474789Z inductor/test_torchinductor.py::GPUTests::test_masked_fill_promotion_cuda PASSED [0.3778s] [ 60%] 2025-12-04T10:04:28.1475635Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d1_cuda PASSED [1.2187s] [ 60%] 2025-12-04T10:04:28.1476449Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d2_cuda PASSED [2.1463s] [ 60%] 2025-12-04T10:04:28.1477252Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d4_cuda PASSED [1.3696s] [ 60%] 2025-12-04T10:04:28.1478102Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d6_dilation_1_cuda PASSED [2.6652s] [ 61%] 2025-12-04T10:04:28.1479021Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d6_dilation_2_cuda PASSED [2.9262s] [ 61%] 2025-12-04T10:04:28.1479882Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d7_cuda PASSED [0.3971s] [ 61%] 2025-12-04T10:04:28.1480689Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d8_cuda PASSED [1.3097s] [ 61%] 2025-12-04T10:04:28.1481593Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward5_cuda PASSED [0.2650s] [ 61%] 2025-12-04T10:04:28.1482606Z inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward_cuda PASSED [1.7949s] [ 62%] 2025-12-04T10:04:28.1483494Z inductor/test_torchinductor.py::GPUTests::test_mean_cuda PASSED [0.6844s] [ 62%] 2025-12-04T10:04:28.1484306Z inductor/test_torchinductor.py::GPUTests::test_min_max_reduction_cuda PASSED [0.7049s] [ 62%] 2025-12-04T10:04:28.1485203Z inductor/test_torchinductor.py::GPUTests::test_misaligned_address_issue1_cuda PASSED [0.4235s] [ 62%] 2025-12-04T10:04:28.1486156Z inductor/test_torchinductor.py::GPUTests::test_move_arange_cuda PASSED [0.1743s] [ 63%] 2025-12-04T10:04:28.1487362Z inductor/test_torchinductor.py::GPUTests::test_multi_device_cuda W1204 09:58:53.405000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1488847Z W1204 09:58:53.408000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1489770Z W1204 09:58:53.411000 14751 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.1490414Z PASSED [0.8417s] [ 63%] 2025-12-04T10:04:28.1490961Z inductor/test_torchinductor.py::GPUTests::test_multi_threading_cuda PASSED [0.2007s] [ 63%] 2025-12-04T10:04:28.1491815Z inductor/test_torchinductor.py::GPUTests::test_multilayer_any_cuda PASSED [1.0186s] [ 63%] 2025-12-04T10:04:28.1492680Z inductor/test_torchinductor.py::GPUTests::test_multilayer_prime_size_cuda PASSED [0.3756s] [ 63%] 2025-12-04T10:04:28.1493615Z inductor/test_torchinductor.py::GPUTests::test_multilayer_sum_low_prec_cuda PASSED [0.4243s] [ 64%] 2025-12-04T10:04:28.1494543Z inductor/test_torchinductor.py::GPUTests::test_mutations_loop_fusion_cuda PASSED [0.3100s] [ 64%] 2025-12-04T10:04:28.1495492Z inductor/test_torchinductor.py::GPUTests::test_nan_assert_inside_triton_kernel_cuda PASSED [0.2359s] [ 64%] 2025-12-04T10:04:28.1496524Z inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_True_descending_False_cuda PASSED [0.9865s] [ 64%] 2025-12-04T10:04:28.1497575Z inductor/test_torchinductor.py::GPUTests::test_nan_sort_stable_True_descending_True_cuda PASSED [0.9767s] [ 65%] 2025-12-04T10:04:28.1498570Z inductor/test_torchinductor.py::GPUTests::test_narrow_cuda PASSED [0.4190s] [ 65%] 2025-12-04T10:04:28.1499419Z inductor/test_torchinductor.py::GPUTests::test_needs_contiguous_strides_cuda PASSED [0.8448s] [ 65%] 2025-12-04T10:04:28.1500290Z inductor/test_torchinductor.py::GPUTests::test_new_empty_cuda PASSED [0.2079s] [ 65%] 2025-12-04T10:04:28.1501130Z inductor/test_torchinductor.py::GPUTests::test_new_empty_strided_cuda PASSED [0.2061s] [ 65%] 2025-12-04T10:04:28.1501961Z inductor/test_torchinductor.py::GPUTests::test_new_ones_cuda PASSED [0.2820s] [ 66%] 2025-12-04T10:04:28.1502729Z inductor/test_torchinductor.py::GPUTests::test_one_hot_cuda PASSED [0.2458s] [ 66%] 2025-12-04T10:04:28.1503512Z inductor/test_torchinductor.py::GPUTests::test_pad_cast_cuda PASSED [0.9590s] [ 66%] 2025-12-04T10:04:28.1504302Z inductor/test_torchinductor.py::GPUTests::test_pad_single_cuda PASSED [0.3440s] [ 66%] 2025-12-04T10:04:28.1505083Z inductor/test_torchinductor.py::GPUTests::test_pad_view_cuda PASSED [1.2142s] [ 66%] 2025-12-04T10:04:28.1505877Z inductor/test_torchinductor.py::GPUTests::test_philox_rand_cuda PASSED [1.5727s] [ 67%] 2025-12-04T10:04:28.1506769Z inductor/test_torchinductor.py::GPUTests::test_pixel_shuffle_channels_last_cuda PASSED [0.8962s] [ 67%] 2025-12-04T10:04:28.1507685Z inductor/test_torchinductor.py::GPUTests::test_pointwise_airy_ai_cuda PASSED [1.1049s] [ 67%] 2025-12-04T10:04:28.1508559Z inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_y0_cuda PASSED [1.6386s] [ 67%] 2025-12-04T10:04:28.1509457Z inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_y1_cuda PASSED [1.0343s] [ 68%] 2025-12-04T10:04:28.1510431Z inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_t_cuda PASSED [0.4487s] [ 68%] 2025-12-04T10:04:28.1511369Z inductor/test_torchinductor.py::GPUTests::test_pointwise_erf_cuda PASSED [0.3812s] [ 68%] 2025-12-04T10:04:28.1512198Z inductor/test_torchinductor.py::GPUTests::test_pointwise_erfcx_cuda PASSED [5.3364s] [ 68%] 2025-12-04T10:04:28.1513051Z inductor/test_torchinductor.py::GPUTests::test_pointwise_erfinv_cuda PASSED [0.5916s] [ 68%] 2025-12-04T10:04:28.1513906Z inductor/test_torchinductor.py::GPUTests::test_pointwise_exp2_cuda PASSED [0.3646s] [ 69%] 2025-12-04T10:04:28.1514766Z inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaln_cuda PASSED [1.0764s] [ 69%] 2025-12-04T10:04:28.1515826Z inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_h_cuda PASSED [0.2604s] [ 69%] 2025-12-04T10:04:28.1516749Z inductor/test_torchinductor.py::GPUTests::test_pointwise_i1_cuda PASSED [0.4473s] [ 69%] 2025-12-04T10:04:28.1517753Z inductor/test_torchinductor.py::GPUTests::test_pointwise_laguerre_polynomial_l_cuda PASSED [0.2231s] [ 69%] 2025-12-04T10:04:28.1518770Z inductor/test_torchinductor.py::GPUTests::test_pointwise_legendre_polynomial_p_cuda PASSED [0.2389s] [ 70%] 2025-12-04T10:04:28.1519709Z inductor/test_torchinductor.py::GPUTests::test_pointwise_log1p_cuda PASSED [0.3821s] [ 70%] 2025-12-04T10:04:28.1520575Z inductor/test_torchinductor.py::GPUTests::test_pointwise_log_ndtr_cuda PASSED [0.8302s] [ 70%] 2025-12-04T10:04:28.1521506Z inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_i0_cuda PASSED [0.4385s] [ 70%] 2025-12-04T10:04:28.1522476Z inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_i1_cuda PASSED [0.4506s] [ 71%] 2025-12-04T10:04:28.1523466Z inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_k1_cuda PASSED [0.5483s] [ 71%] 2025-12-04T10:04:28.1524421Z inductor/test_torchinductor.py::GPUTests::test_pointwise_multigammaln_cuda PASSED [0.6576s] [ 71%] 2025-12-04T10:04:28.1525316Z inductor/test_torchinductor.py::GPUTests::test_pointwise_ndtri_cuda PASSED [0.4110s] [ 71%] 2025-12-04T10:04:28.1526151Z inductor/test_torchinductor.py::GPUTests::test_pointwise_psi_cuda PASSED [1.0128s] [ 71%] 2025-12-04T10:04:28.1526997Z inductor/test_torchinductor.py::GPUTests::test_pointwise_round_cuda PASSED [0.3813s] [ 72%] 2025-12-04T10:04:28.1527954Z inductor/test_torchinductor.py::GPUTests::test_pointwise_scaled_modified_bessel_k0_cuda PASSED [0.5269s] [ 72%] 2025-12-04T10:04:28.1529040Z inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_v_cuda PASSED [1.0015s] [ 72%] 2025-12-04T10:04:28.1530140Z inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_w_cuda PASSED [0.9784s] [ 72%] 2025-12-04T10:04:28.1531204Z inductor/test_torchinductor.py::GPUTests::test_pointwise_spherical_bessel_j0_cuda PASSED [0.3565s] [ 73%] 2025-12-04T10:04:28.1532126Z inductor/test_torchinductor.py::GPUTests::test_pointwise_zeta_cuda PASSED [1.5277s] [ 73%] 2025-12-04T10:04:28.1533066Z inductor/test_torchinductor.py::GPUTests::test_pow_by_natural_log2_dynamic_shapes_cuda PASSED [0.8713s] [ 73%] 2025-12-04T10:04:28.1533947Z inductor/test_torchinductor.py::GPUTests::test_prod_cuda PASSED [4.0601s] [ 73%] 2025-12-04T10:04:28.1534789Z inductor/test_torchinductor.py::GPUTests::test_rand_like_deterministic_cuda PASSED [0.2847s] [ 73%] 2025-12-04T10:04:28.1535710Z inductor/test_torchinductor.py::GPUTests::test_randint_kernel_count_cuda PASSED [0.2863s] [ 74%] 2025-12-04T10:04:28.1536866Z inductor/test_torchinductor.py::GPUTests::test_randn_with_dtype_and_device_cuda SKIPPED [0.0036s] (only support cpu randn_with_dtype_and_device test) [ 74%] 2025-12-04T10:04:28.1537987Z inductor/test_torchinductor.py::GPUTests::test_reduction4_cuda PASSED [0.7186s] [ 74%] 2025-12-04T10:04:28.1538894Z inductor/test_torchinductor.py::GPUTests::test_reflection_pad2d_cuda PASSED [0.6944s] [ 74%] 2025-12-04T10:04:28.1539728Z inductor/test_torchinductor.py::GPUTests::test_remainder_cuda PASSED [0.5767s] [ 74%] 2025-12-04T10:04:28.1540556Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_clone_cuda PASSED [0.4346s] [ 75%] 2025-12-04T10:04:28.1541420Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_copy_cuda PASSED [0.8558s] [ 75%] 2025-12-04T10:04:28.1543706Z inductor/test_torchinductor.py::GPUTests::test_remove_noop_slice1_cuda SKIPPED [0.0008s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151381 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 75%] 2025-12-04T10:04:28.1546044Z inductor/test_torchinductor.py::GPUTests::test_repeat_as_strided_cuda PASSED [0.3266s] [ 75%] 2025-12-04T10:04:28.1546872Z inductor/test_torchinductor.py::GPUTests::test_repeat_cuda PASSED [0.5123s] [ 76%] 2025-12-04T10:04:28.1547805Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_cuda PASSED [0.2644s] [ 76%] 2025-12-04T10:04:28.1548985Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_cuda PASSED [0.2526s] [ 76%] 2025-12-04T10:04:28.1550097Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_cuda PASSED [0.2550s] [ 76%] 2025-12-04T10:04:28.1551093Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_cuda PASSED [0.9752s] [ 76%] 2025-12-04T10:04:28.1552078Z inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_decomposition_has_clamp_cuda PASSED [0.3024s] [ 77%] 2025-12-04T10:04:28.1553035Z inductor/test_torchinductor.py::GPUTests::test_roi_align_cuda PASSED [0.4670s] [ 77%] 2025-12-04T10:04:28.1554065Z inductor/test_torchinductor.py::GPUTests::test_round_correctness_cuda SKIPPED [0.0034s] (need to debug tl.libdevice on A100/V100) [ 77%] 2025-12-04T10:04:28.1555082Z inductor/test_torchinductor.py::GPUTests::test_round_cuda PASSED [0.4694s] [ 77%] 2025-12-04T10:04:28.1555907Z inductor/test_torchinductor.py::GPUTests::test_rsqrt_dynamic_shapes_cuda PASSED [1.1969s] [ 77%] 2025-12-04T10:04:28.1556806Z inductor/test_torchinductor.py::GPUTests::test_scalar_cpu_tensor_arg_cuda PASSED [2.1551s] [ 78%] 2025-12-04T10:04:28.1557665Z inductor/test_torchinductor.py::GPUTests::test_scalar_input_cuda PASSED [0.2009s] [ 78%] 2025-12-04T10:04:28.1558605Z inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_efficient_attention_cuda PASSED [0.2187s] [ 78%] 2025-12-04T10:04:28.1559540Z inductor/test_torchinductor.py::GPUTests::test_scatter6_cuda PASSED [0.6491s] [ 78%] 2025-12-04T10:04:28.1560344Z inductor/test_torchinductor.py::GPUTests::test_scatter_add2_cuda PASSED [0.4527s] [ 79%] 2025-12-04T10:04:28.1561352Z inductor/test_torchinductor.py::GPUTests::test_scatter_bf16_cuda SKIPPED [0.0032s] (uses bfloat16 which requires SM >= 80) [ 79%] 2025-12-04T10:04:28.1562363Z inductor/test_torchinductor.py::GPUTests::test_scatter_reduce2_cuda PASSED [0.6964s] [ 79%] 2025-12-04T10:04:28.1563219Z inductor/test_torchinductor.py::GPUTests::test_scatter_reduce3_cuda PASSED [0.8382s] [ 79%] 2025-12-04T10:04:28.1564126Z inductor/test_torchinductor.py::GPUTests::test_scheduler_vertical_fusion1_cuda PASSED [0.6392s] [ 79%] 2025-12-04T10:04:28.1565377Z inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_False_cuda SKIPPED [0.0003s] (Does not support SDPA or pre-SM80 hardware) [ 80%] 2025-12-04T10:04:28.1566904Z inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_cuda SKIPPED [0.0002s] (Does not support SDPA or pre-SM80 hardware) [ 80%] 2025-12-04T10:04:28.1568432Z inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_cuda SKIPPED [0.0002s] (Does not support SDPA or pre-SM80 hardware) [ 80%] 2025-12-04T10:04:28.1569635Z inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_cuda PASSED [0.4494s] [ 80%] 2025-12-04T10:04:28.1570551Z inductor/test_torchinductor.py::GPUTests::test_searchsorted_broadcast_cuda PASSED [0.2973s] [ 80%] 2025-12-04T10:04:28.1571483Z inductor/test_torchinductor.py::GPUTests::test_setitem_with_int_parameter_cuda PASSED [0.4964s] [ 81%] 2025-12-04T10:04:28.1572342Z inductor/test_torchinductor.py::GPUTests::test_sgn_cuda PASSED [0.3775s] [ 81%] 2025-12-04T10:04:28.1573169Z inductor/test_torchinductor.py::GPUTests::test_shape_prop_torch_ones_cuda PASSED [0.5836s] [ 81%] 2025-12-04T10:04:28.1574025Z inductor/test_torchinductor.py::GPUTests::test_sign_dtype_cuda PASSED [0.4060s] [ 81%] 2025-12-04T10:04:28.1574836Z inductor/test_torchinductor.py::GPUTests::test_simplify_loops_cuda PASSED [0.3727s] [ 82%] 2025-12-04T10:04:28.1575778Z inductor/test_torchinductor.py::GPUTests::test_single_elem_indirect_cuda PASSED [0.2955s] [ 82%] 2025-12-04T10:04:28.1576764Z inductor/test_torchinductor.py::GPUTests::test_size_asserts_for_multi_output_fallback_cuda PASSED [0.0930s] [ 82%] 2025-12-04T10:04:28.1577785Z inductor/test_torchinductor.py::GPUTests::test_sizehint_issue1_cuda PASSED [0.6229s] [ 82%] 2025-12-04T10:04:28.1578649Z inductor/test_torchinductor.py::GPUTests::test_slice2_cuda PASSED [0.4656s] [ 82%] 2025-12-04T10:04:28.1579451Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter3_cuda PASSED [0.3838s] [ 83%] 2025-12-04T10:04:28.1580283Z inductor/test_torchinductor.py::GPUTests::test_slice_scatter_cuda PASSED [0.5179s] [ 83%] 2025-12-04T10:04:28.1581171Z inductor/test_torchinductor.py::GPUTests::test_slice_view_with_graph_break_cuda PASSED [0.3630s] [ 83%] 2025-12-04T10:04:28.1582043Z inductor/test_torchinductor.py::GPUTests::test_softmax_cuda PASSED [1.2937s] [ 83%] 2025-12-04T10:04:28.1582912Z inductor/test_torchinductor.py::GPUTests::test_softmax_one_kernel_persist_cuda PASSED [0.4376s] [ 84%] 2025-12-04T10:04:28.1583777Z inductor/test_torchinductor.py::GPUTests::test_sort_cuda PASSED [1.6985s] [ 84%] 2025-12-04T10:04:28.1584827Z inductor/test_torchinductor.py::GPUTests::test_sort_stable_cuda SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 84%] 2025-12-04T10:04:28.1585932Z inductor/test_torchinductor.py::GPUTests::test_split_cumprod_cuda PASSED [1.0167s] [ 84%] 2025-12-04T10:04:28.1587364Z inductor/test_torchinductor.py::GPUTests::test_split_failed_cuda E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] failed while attempting to run meta for aten.split_with_sizes.default 2025-12-04T10:04:28.1589094Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] Traceback (most recent call last): 2025-12-04T10:04:28.1590597Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:04:28.1592061Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] r = func(*args, **kwargs) 2025-12-04T10:04:28.1593386Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:04:28.1594735Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return self._op(*args, **kwargs) 2025-12-04T10:04:28.1596182Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 4359, in split_with_sizes 2025-12-04T10:04:28.1597544Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] torch._check_with( 2025-12-04T10:04:28.1598879Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:04:28.1600273Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] raise error_type(message_evaluated) 2025-12-04T10:04:28.1601496Z E1204 09:59:58.525000 14751 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] ValueError: Split sizes add up to 4 but got the tensor's size of 5 2025-12-04T10:04:28.1602330Z PASSED [0.0173s] [ 84%] 2025-12-04T10:04:28.1603086Z inductor/test_torchinductor.py::GPUTests::test_split_reduction_with_int64_size_cuda SKIPPED [0.5079s] (Insufficient cuda memory) [ 85%] 2025-12-04T10:04:28.1604147Z inductor/test_torchinductor.py::GPUTests::test_split_with_list_cuda PASSED [1.3608s] [ 85%] 2025-12-04T10:04:28.1605108Z inductor/test_torchinductor.py::GPUTests::test_split_with_sizes_with_unbacked_symints_cuda PASSED [0.9630s] [ 85%] 2025-12-04T10:04:28.1606222Z inductor/test_torchinductor.py::GPUTests::test_split_with_unbacked_symints_cuda PASSED [0.7427s] [ 85%] 2025-12-04T10:04:28.1607348Z inductor/test_torchinductor.py::GPUTests::test_sqrt_dynamic_shapes_cuda SKIPPED [0.0034s] (sqrt dynamic shapes only supports cpu) [ 85%] 2025-12-04T10:04:28.1608470Z inductor/test_torchinductor.py::GPUTests::test_squeeze1_cuda PASSED [0.3654s] [ 86%] 2025-12-04T10:04:28.1609247Z inductor/test_torchinductor.py::GPUTests::test_stack_cuda PASSED [0.4069s] [ 86%] 2025-12-04T10:04:28.1610039Z inductor/test_torchinductor.py::GPUTests::test_strided_inputs_cuda PASSED [0.1718s] [ 86%] 2025-12-04T10:04:28.1610841Z inductor/test_torchinductor.py::GPUTests::test_sum4_cuda PASSED [1.0041s] [ 86%] 2025-12-04T10:04:28.1611614Z inductor/test_torchinductor.py::GPUTests::test_sum_dtype_cuda PASSED [0.7313s] [ 87%] 2025-12-04T10:04:28.1612396Z inductor/test_torchinductor.py::GPUTests::test_sum_int_cuda PASSED [0.5587s] [ 87%] 2025-12-04T10:04:28.1613180Z inductor/test_torchinductor.py::GPUTests::test_sum_keepdims_cuda PASSED [0.5078s] [ 87%] 2025-12-04T10:04:28.1613968Z inductor/test_torchinductor.py::GPUTests::test_tan_cuda PASSED [0.5277s] [ 87%] 2025-12-04T10:04:28.1614719Z inductor/test_torchinductor.py::GPUTests::test_tanh_cuda PASSED [0.4476s] [ 87%] 2025-12-04T10:04:28.1615479Z inductor/test_torchinductor.py::GPUTests::test_tensor1_cuda PASSED [0.3288s] [ 88%] 2025-12-04T10:04:28.1616251Z inductor/test_torchinductor.py::GPUTests::test_tensor3_cuda PASSED [0.4552s] [ 88%] 2025-12-04T10:04:28.1617098Z inductor/test_torchinductor.py::GPUTests::test_tensor_index_put_slice_cuda PASSED [2.1819s] [ 88%] 2025-12-04T10:04:28.1618017Z inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue3_cuda PASSED [2.3420s] [ 88%] 2025-12-04T10:04:28.1618933Z inductor/test_torchinductor.py::GPUTests::test_topk_cuda PASSED [0.1961s] [ 88%] 2025-12-04T10:04:28.1619758Z inductor/test_torchinductor.py::GPUTests::test_torch_device_split_cuda PASSED [0.0387s] [ 89%] 2025-12-04T10:04:28.1620626Z inductor/test_torchinductor.py::GPUTests::test_transpose_add_cuda PASSED [0.2135s] [ 89%] 2025-12-04T10:04:28.1621428Z inductor/test_torchinductor.py::GPUTests::test_transpose_cuda PASSED [0.4483s] [ 89%] 2025-12-04T10:04:28.1622298Z inductor/test_torchinductor.py::GPUTests::test_triton_kernel_bool_param_cuda PASSED [0.9086s] [ 89%] 2025-12-04T10:04:28.1623157Z inductor/test_torchinductor.py::GPUTests::test_uint_cuda PASSED [0.1946s] [ 90%] 2025-12-04T10:04:28.1623984Z inductor/test_torchinductor.py::GPUTests::test_unbacked_float_item_cuda PASSED [0.3982s] [ 90%] 2025-12-04T10:04:28.1624880Z inductor/test_torchinductor.py::GPUTests::test_unroll_small_reduction_cuda PASSED [2.0759s] [ 90%] 2025-12-04T10:04:28.1625790Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_float16_cuda PASSED [0.9596s] [ 90%] 2025-12-04T10:04:28.1626683Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int16_cuda PASSED [0.4121s] [ 90%] 2025-12-04T10:04:28.1627571Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int64_cuda PASSED [0.4134s] [ 91%] 2025-12-04T10:04:28.1628455Z inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_int8_cuda PASSED [0.4130s] [ 91%] 2025-12-04T10:04:28.1629288Z inductor/test_torchinductor.py::GPUTests::test_unsqueeze_cuda PASSED [0.4911s] [ 91%] 2025-12-04T10:04:28.1630127Z inductor/test_torchinductor.py::GPUTests::test_unsqueeze_inplace_cuda PASSED [0.3587s] [ 91%] 2025-12-04T10:04:28.1631005Z inductor/test_torchinductor.py::GPUTests::test_upsample_bicubic2d_cuda PASSED [10.1623s] [ 92%] 2025-12-04T10:04:28.1632079Z inductor/test_torchinductor.py::GPUTests::test_upsample_cat_conv_cuda SKIPPED [0.0034s] (only support cpu upsample_cat_conv test) [ 92%] 2025-12-04T10:04:28.1633204Z inductor/test_torchinductor.py::GPUTests::test_upsample_nearest2d_backward_cuda PASSED [2.9964s] [ 92%] 2025-12-04T10:04:28.1634517Z inductor/test_torchinductor.py::GPUTests::test_var_mean_div_by_cuda PASSED [0.6100s] [ 92%] 2025-12-04T10:04:28.1635347Z inductor/test_torchinductor.py::GPUTests::test_vdd_clamp_cuda PASSED [0.3630s] [ 92%] 2025-12-04T10:04:28.1636244Z inductor/test_torchinductor.py::GPUTests::test_vertical_fusion1_cuda PASSED [0.5186s] [ 93%] 2025-12-04T10:04:28.1637095Z inductor/test_torchinductor.py::GPUTests::test_view_as_real_cuda PASSED [0.2066s] [ 93%] 2025-12-04T10:04:28.1637955Z inductor/test_torchinductor.py::GPUTests::test_views1_cuda PASSED [7.8819s] [ 93%] 2025-12-04T10:04:28.1638709Z inductor/test_torchinductor.py::GPUTests::test_views2_cuda PASSED [1.6752s] [ 93%] 2025-12-04T10:04:28.1639476Z inductor/test_torchinductor.py::GPUTests::test_views3_cuda PASSED [0.5390s] [ 93%] 2025-12-04T10:04:28.1640308Z inductor/test_torchinductor.py::GPUTests::test_weight_norm_conv2d_cuda PASSED [1.1318s] [ 94%] 2025-12-04T10:04:28.1641173Z inductor/test_torchinductor.py::GPUTests::test_where_broadcast_cuda PASSED [1.0914s] [ 94%] 2025-12-04T10:04:28.1642040Z inductor/test_torchinductor.py::GPUTests::test_where_with_logical_op_cuda PASSED [0.6666s] [ 94%] 2025-12-04T10:04:28.1642877Z inductor/test_torchinductor.py::GPUTests::test_zeros_cuda PASSED [0.9626s] [ 94%] 2025-12-04T10:04:28.1643749Z inductor/test_torchinductor.py::TritonCodeGenTests::test_bandwidth_profiler PASSED [0.7804s] [ 95%] 2025-12-04T10:04:28.1644966Z inductor/test_torchinductor.py::TritonCodeGenTests::test_bf16_atomic_add SKIPPED [0.0036s] (uses bfloat16 atomic add instrs which requires SM >= 90) [ 95%] 2025-12-04T10:04:28.1646294Z inductor/test_torchinductor.py::TritonCodeGenTests::test_codegen_config_option_dont_assume_alignment PASSED [0.6228s] [ 95%] 2025-12-04T10:04:28.1647398Z inductor/test_torchinductor.py::TritonCodeGenTests::test_comment_graph_fragment PASSED [0.2077s] [ 95%] 2025-12-04T10:04:28.1648403Z inductor/test_torchinductor.py::TritonCodeGenTests::test_computed_indirect_mask PASSED [0.1499s] [ 95%] 2025-12-04T10:04:28.1649435Z inductor/test_torchinductor.py::TritonCodeGenTests::test_constant_folding_deallocation PASSED [0.0306s] [ 96%] 2025-12-04T10:04:28.1650508Z inductor/test_torchinductor.py::TritonCodeGenTests::test_divisible_by_16_covers_numel_args PASSED [0.2051s] [ 96%] 2025-12-04T10:04:28.1651690Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt You have not run this instance of FileCheck! 2025-12-04T10:04:28.1652492Z FileCheck checks: 2025-12-04T10:04:28.1652859Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.1653341Z ('RERUN', {'yellow': True}) [4.9526s] [ 96%] 2025-12-04T10:04:28.1654146Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt ('RERUN', {'yellow': True}) [3.9527s] [ 96%] 2025-12-04T10:04:28.1655386Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt You have not run this instance of FileCheck! 2025-12-04T10:04:28.1656173Z FileCheck checks: 2025-12-04T10:04:28.1656549Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.1656995Z FAILED [3.9482s] [ 96%] 2025-12-04T10:04:28.1657176Z 2025-12-04T10:04:28.1657336Z ==================================== RERUNS ==================================== 2025-12-04T10:04:28.1657905Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.1658531Z Traceback (most recent call last): 2025-12-04T10:04:28.1659323Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1660088Z ).run(code[1]) 2025-12-04T10:04:28.1660382Z IndexError: list index out of range 2025-12-04T10:04:28.1660621Z 2025-12-04T10:04:28.1660850Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.1661759Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1662445Z 2025-12-04T10:04:28.1662711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.1663352Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1663835Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1664236Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1664681Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1665549Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1666350Z graph_break [] 2025-12-04T10:04:28.1667367Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1668609Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1669718Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1670707Z warnings.warn( 2025-12-04T10:04:28.1671590Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1672566Z warnings.warn( 2025-12-04T10:04:28.1673460Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1674426Z warnings.warn( 2025-12-04T10:04:28.1675301Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1676266Z warnings.warn( 2025-12-04T10:04:28.1677157Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1678113Z warnings.warn( 2025-12-04T10:04:28.1678562Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.1679112Z Traceback (most recent call last): 2025-12-04T10:04:28.1679889Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1680654Z ).run(code[1]) 2025-12-04T10:04:28.1680942Z IndexError: list index out of range 2025-12-04T10:04:28.1681180Z 2025-12-04T10:04:28.1681408Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.1682304Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1682999Z 2025-12-04T10:04:28.1683266Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.1683898Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1684373Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1684706Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1685148Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1686011Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1686757Z graph_break [] 2025-12-04T10:04:28.1687940Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1689172Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1690276Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1691253Z warnings.warn( 2025-12-04T10:04:28.1692242Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1693294Z warnings.warn( 2025-12-04T10:04:28.1694184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1695152Z warnings.warn( 2025-12-04T10:04:28.1696031Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1696997Z warnings.warn( 2025-12-04T10:04:28.1697886Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1698911Z warnings.warn( 2025-12-04T10:04:28.1699302Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1699779Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1700120Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1700557Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1701421Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1702168Z graph_break [] 2025-12-04T10:04:28.1703182Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1704409Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1705510Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1706481Z warnings.warn( 2025-12-04T10:04:28.1707367Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1708346Z warnings.warn( 2025-12-04T10:04:28.1709238Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1710197Z warnings.warn( 2025-12-04T10:04:28.1711065Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1712021Z warnings.warn( 2025-12-04T10:04:28.1712905Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1713866Z warnings.warn( 2025-12-04T10:04:28.1714154Z =================================== FAILURES =================================== 2025-12-04T10:04:28.1714725Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.1715267Z Traceback (most recent call last): 2025-12-04T10:04:28.1716026Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1716790Z ).run(code[1]) 2025-12-04T10:04:28.1717071Z IndexError: list index out of range 2025-12-04T10:04:28.1717302Z 2025-12-04T10:04:28.1717525Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.1718410Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1719105Z 2025-12-04T10:04:28.1719442Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.1720065Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1720539Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1720930Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1721365Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1722223Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1722949Z graph_break [] 2025-12-04T10:04:28.1723969Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1725193Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1726296Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1727255Z warnings.warn( 2025-12-04T10:04:28.1728135Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1729093Z warnings.warn( 2025-12-04T10:04:28.1729970Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1730917Z warnings.warn( 2025-12-04T10:04:28.1731792Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1732746Z warnings.warn( 2025-12-04T10:04:28.1733622Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1734572Z warnings.warn( 2025-12-04T10:04:28.1734944Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1735408Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1735726Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1736159Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1737018Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1737744Z graph_break [] 2025-12-04T10:04:28.1738849Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1740070Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1741157Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1742128Z warnings.warn( 2025-12-04T10:04:28.1742998Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1743958Z warnings.warn( 2025-12-04T10:04:28.1744840Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1745791Z warnings.warn( 2025-12-04T10:04:28.1746745Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1747705Z warnings.warn( 2025-12-04T10:04:28.1748585Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1749593Z warnings.warn( 2025-12-04T10:04:28.1749972Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1750436Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1750757Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1751187Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1752045Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1752777Z graph_break [] 2025-12-04T10:04:28.1753788Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1755015Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1756104Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1757068Z warnings.warn( 2025-12-04T10:04:28.1757940Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1758908Z warnings.warn( 2025-12-04T10:04:28.1759798Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1760758Z warnings.warn( 2025-12-04T10:04:28.1761632Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1762597Z warnings.warn( 2025-12-04T10:04:28.1763473Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1764423Z warnings.warn( 2025-12-04T10:04:28.1765338Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-bf110f953f5c5355.xml - 2025-12-04T10:04:28.1766419Z =========================== short test summary info ============================ 2025-12-04T10:04:28.1767330Z FAILED [3.9482s] inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt - IndexError: list index out of range 2025-12-04T10:04:28.1768058Z 2025-12-04T10:04:28.1768283Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.1769172Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1769863Z 2025-12-04T10:04:28.1770128Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.1770708Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:04:28.1771246Z ======== 1 failed, 395 passed, 51 skipped, 2 rerun in 319.24s (0:05:19) ======== 2025-12-04T10:04:28.1771757Z You have not run this instance of FileCheck! 2025-12-04T10:04:28.1772128Z FileCheck checks: 2025-12-04T10:04:28.1772494Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.1772925Z Got exit code 1 2025-12-04T10:04:28.1773187Z Retrying single test... 2025-12-04T10:04:28.1773814Z W1204 10:01:15.866000 23154 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.1775041Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-268cc96fdd12a701.xml 2025-12-04T10:04:28.1775935Z ============================= test session starts ============================== 2025-12-04T10:04:28.1776651Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.1777248Z cachedir: .pytest_cache 2025-12-04T10:04:28.1777942Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.1778785Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.1779134Z configfile: pytest.ini 2025-12-04T10:04:28.1779846Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.1780747Z collecting ... collected 981 items / 462 deselected / 519 selected 2025-12-04T10:04:28.1781744Z stepcurrent: skipping 446 already run items. Running only test/inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1782641Z Running 1 items in this shard 2025-12-04T10:04:28.1782856Z 2025-12-04T10:04:28.1783820Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt [W1204 10:01:22.144723527 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1784925Z 2025-12-04T10:04:28.1785447Z [W1204 10:01:37.386048750 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1786119Z 2025-12-04T10:04:28.1786638Z [W1204 10:01:37.390740597 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1787307Z 2025-12-04T10:04:28.1787984Z [W1204 10:01:37.390959526 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1788639Z 2025-12-04T10:04:28.1789176Z [W1204 10:01:37.391602813 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1789833Z 2025-12-04T10:04:28.1790362Z [W1204 10:01:37.391789249 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1791015Z 2025-12-04T10:04:28.1791528Z [W1204 10:01:37.392432464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1792196Z 2025-12-04T10:04:28.1792712Z [W1204 10:01:37.392612281 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1793378Z 2025-12-04T10:04:28.1793900Z [W1204 10:01:37.393601901 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1794553Z 2025-12-04T10:04:28.1795085Z [W1204 10:01:37.393781986 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1795742Z 2025-12-04T10:04:28.1796273Z [W1204 10:01:37.394290666 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1796927Z 2025-12-04T10:04:28.1797443Z [W1204 10:01:37.394489417 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1798110Z 2025-12-04T10:04:28.1798623Z [W1204 10:01:37.394994598 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1799292Z 2025-12-04T10:04:28.1799810Z [W1204 10:01:37.395190702 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1800562Z 2025-12-04T10:04:28.1801096Z [W1204 10:01:37.395627020 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1801823Z 2025-12-04T10:04:28.1802356Z [W1204 10:01:37.395807418 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1803010Z 2025-12-04T10:04:28.1803524Z [W1204 10:01:37.396227496 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1804191Z 2025-12-04T10:04:28.1804705Z [W1204 10:01:37.396405391 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1805370Z 2025-12-04T10:04:28.1805882Z [W1204 10:01:37.398136357 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1806531Z 2025-12-04T10:04:28.1807058Z [W1204 10:01:37.398322283 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1807712Z 2025-12-04T10:04:28.1808242Z [W1204 10:01:37.398750769 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1808892Z 2025-12-04T10:04:28.1809405Z [W1204 10:01:37.398933254 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1810069Z 2025-12-04T10:04:28.1810585Z [W1204 10:01:37.399376867 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1811249Z 2025-12-04T10:04:28.1811761Z [W1204 10:01:37.399555968 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1812411Z 2025-12-04T10:04:28.1812937Z [W1204 10:01:37.399983366 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1813588Z 2025-12-04T10:04:28.1814116Z [W1204 10:01:37.400184854 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1814764Z 2025-12-04T10:04:28.1815229Z W1204 10:01:37.902000 23154 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.1816349Z [W1204 10:01:38.960631127 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1817018Z 2025-12-04T10:04:28.1817531Z [W1204 10:01:38.961646987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1818247Z 2025-12-04T10:04:28.1818785Z [W1204 10:01:38.962225231 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1819434Z 2025-12-04T10:04:28.1819961Z [W1204 10:01:38.962419438 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1820617Z 2025-12-04T10:04:28.1821130Z [W1204 10:01:38.962854104 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1821801Z 2025-12-04T10:04:28.1822313Z [W1204 10:01:38.963039843 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1822977Z 2025-12-04T10:04:28.1823489Z [W1204 10:01:38.963601537 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1824140Z 2025-12-04T10:04:28.1824742Z [W1204 10:01:38.963786800 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1825398Z 2025-12-04T10:04:28.1825926Z [W1204 10:01:39.737237525 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1826632Z 2025-12-04T10:04:28.1827146Z [W1204 10:01:39.738337276 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1827812Z 2025-12-04T10:04:28.1828324Z [W1204 10:01:39.738909057 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1828984Z 2025-12-04T10:04:28.1829493Z [W1204 10:01:39.739124453 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1830154Z 2025-12-04T10:04:28.1830667Z [W1204 10:01:39.739564407 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1831323Z 2025-12-04T10:04:28.1831848Z [W1204 10:01:39.739754169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1832503Z 2025-12-04T10:04:28.1833018Z [W1204 10:01:39.740331199 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1833685Z 2025-12-04T10:04:28.1834199Z [W1204 10:01:39.740527514 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1834861Z 2025-12-04T10:04:28.1835009Z You have not run this instance of FileCheck! 2025-12-04T10:04:28.1835387Z FileCheck checks: 2025-12-04T10:04:28.1835752Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.1836232Z ('RERUN', {'yellow': True}) [23.4779s] [100%] 2025-12-04T10:04:28.1837467Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt [W1204 10:01:42.657658086 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1838561Z 2025-12-04T10:04:28.1839092Z [W1204 10:01:42.657883616 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1839749Z 2025-12-04T10:04:28.1840275Z [W1204 10:01:42.658425581 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1840926Z 2025-12-04T10:04:28.1841442Z [W1204 10:01:42.658617454 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1842105Z 2025-12-04T10:04:28.1842620Z [W1204 10:01:42.659053986 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1843283Z 2025-12-04T10:04:28.1843802Z [W1204 10:01:42.659243847 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1844452Z 2025-12-04T10:04:28.1844976Z [W1204 10:01:42.659789199 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1845629Z 2025-12-04T10:04:28.1846155Z [W1204 10:01:42.659970714 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1846807Z 2025-12-04T10:04:28.1847321Z [W1204 10:01:42.660839853 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1847982Z 2025-12-04T10:04:28.1848496Z [W1204 10:01:42.661027148 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1849158Z 2025-12-04T10:04:28.1849729Z [W1204 10:01:42.661480010 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1850384Z 2025-12-04T10:04:28.1850910Z [W1204 10:01:42.661659419 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1851619Z 2025-12-04T10:04:28.1852142Z [W1204 10:01:42.662119792 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1852794Z 2025-12-04T10:04:28.1853307Z [W1204 10:01:42.662301066 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1853968Z 2025-12-04T10:04:28.1854482Z [W1204 10:01:42.662718051 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1855145Z 2025-12-04T10:04:28.1855662Z [W1204 10:01:42.662896908 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1856310Z 2025-12-04T10:04:28.1856832Z [W1204 10:01:42.663317518 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1857487Z 2025-12-04T10:04:28.1858011Z [W1204 10:01:42.663496395 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1858726Z 2025-12-04T10:04:28.1859242Z [W1204 10:01:42.665001339 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1859909Z 2025-12-04T10:04:28.1860422Z [W1204 10:01:42.665182091 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1861088Z 2025-12-04T10:04:28.1861606Z [W1204 10:01:42.665600956 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1862271Z 2025-12-04T10:04:28.1862785Z [W1204 10:01:42.665779255 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1863438Z 2025-12-04T10:04:28.1863967Z [W1204 10:01:42.666191865 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1864618Z 2025-12-04T10:04:28.1865144Z [W1204 10:01:42.666371351 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1865791Z 2025-12-04T10:04:28.1866302Z [W1204 10:01:42.666789008 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1866963Z 2025-12-04T10:04:28.1867474Z [W1204 10:01:42.666967840 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1868137Z 2025-12-04T10:04:28.1868650Z [W1204 10:01:43.824753349 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1869307Z 2025-12-04T10:04:28.1869832Z [W1204 10:01:43.824991411 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1870481Z 2025-12-04T10:04:28.1871008Z [W1204 10:01:43.825547350 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1871658Z 2025-12-04T10:04:28.1872169Z [W1204 10:01:43.825742303 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1872834Z 2025-12-04T10:04:28.1873346Z [W1204 10:01:43.826165464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1874013Z 2025-12-04T10:04:28.1874596Z [W1204 10:01:43.826363155 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1875305Z 2025-12-04T10:04:28.1875829Z [W1204 10:01:43.826914288 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1876484Z 2025-12-04T10:04:28.1877008Z [W1204 10:01:43.827109796 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1877659Z 2025-12-04T10:04:28.1878172Z [W1204 10:01:44.587617101 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1878834Z 2025-12-04T10:04:28.1879347Z [W1204 10:01:44.587856076 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1880013Z 2025-12-04T10:04:28.1880531Z [W1204 10:01:44.588403176 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1881182Z 2025-12-04T10:04:28.1881712Z [W1204 10:01:44.588603470 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1882361Z 2025-12-04T10:04:28.1882883Z [W1204 10:01:44.589031926 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1883532Z 2025-12-04T10:04:28.1884043Z [W1204 10:01:44.589219296 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1884705Z 2025-12-04T10:04:28.1885222Z [W1204 10:01:44.589772675 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1885885Z 2025-12-04T10:04:28.1886399Z [W1204 10:01:44.589961786 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1887050Z 2025-12-04T10:04:28.1887194Z ('RERUN', {'yellow': True}) [4.2284s] [100%] 2025-12-04T10:04:28.1888168Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt You have not run this instance of FileCheck! 2025-12-04T10:04:28.1888971Z FileCheck checks: 2025-12-04T10:04:28.1889347Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.1890212Z [W1204 10:01:46.876803967 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1890864Z 2025-12-04T10:04:28.1891380Z [W1204 10:01:46.877060838 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1892041Z 2025-12-04T10:04:28.1892557Z [W1204 10:01:46.877643056 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1893213Z 2025-12-04T10:04:28.1893724Z [W1204 10:01:46.877845486 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1894387Z 2025-12-04T10:04:28.1894898Z [W1204 10:01:46.878289097 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1895551Z 2025-12-04T10:04:28.1896080Z [W1204 10:01:46.878477326 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1896728Z 2025-12-04T10:04:28.1897251Z [W1204 10:01:46.879020771 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1897908Z 2025-12-04T10:04:28.1898592Z [W1204 10:01:46.879220193 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1899258Z 2025-12-04T10:04:28.1899774Z [W1204 10:01:46.880093015 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1900507Z 2025-12-04T10:04:28.1901020Z [W1204 10:01:46.880285425 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1901670Z 2025-12-04T10:04:28.1902192Z [W1204 10:01:46.880743622 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1902841Z 2025-12-04T10:04:28.1903368Z [W1204 10:01:46.880929277 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1904019Z 2025-12-04T10:04:28.1904532Z [W1204 10:01:46.881396744 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1905200Z 2025-12-04T10:04:28.1905713Z [W1204 10:01:46.881581647 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1906380Z 2025-12-04T10:04:28.1906897Z [W1204 10:01:46.882010941 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1907547Z 2025-12-04T10:04:28.1908072Z [W1204 10:01:46.882195120 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1908723Z 2025-12-04T10:04:28.1909245Z [W1204 10:01:46.882607697 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1909898Z 2025-12-04T10:04:28.1910411Z [W1204 10:01:46.882791237 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1911071Z 2025-12-04T10:04:28.1911588Z [W1204 10:01:46.884313047 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1912256Z 2025-12-04T10:04:28.1912765Z [W1204 10:01:46.884498158 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1913416Z 2025-12-04T10:04:28.1913938Z [W1204 10:01:46.884925567 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1914584Z 2025-12-04T10:04:28.1915104Z [W1204 10:01:46.885108285 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1915754Z 2025-12-04T10:04:28.1916268Z [W1204 10:01:46.885529758 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1916930Z 2025-12-04T10:04:28.1917443Z [W1204 10:01:46.885713694 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1918100Z 2025-12-04T10:04:28.1918615Z [W1204 10:01:46.886137973 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1919268Z 2025-12-04T10:04:28.1919789Z [W1204 10:01:46.886329357 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1920436Z 2025-12-04T10:04:28.1920961Z [W1204 10:01:47.059665270 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1921613Z 2025-12-04T10:04:28.1922122Z [W1204 10:01:47.059914704 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1922784Z 2025-12-04T10:04:28.1923366Z [W1204 10:01:47.060503637 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1924034Z 2025-12-04T10:04:28.1924542Z [W1204 10:01:47.060707602 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1925251Z 2025-12-04T10:04:28.1925777Z [W1204 10:01:47.061141922 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1926426Z 2025-12-04T10:04:28.1926951Z [W1204 10:01:47.061333971 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1927602Z 2025-12-04T10:04:28.1928113Z [W1204 10:01:47.061886098 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1928778Z 2025-12-04T10:04:28.1929292Z [W1204 10:01:47.062072781 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1929954Z 2025-12-04T10:04:28.1930462Z [W1204 10:01:48.829642008 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1931128Z 2025-12-04T10:04:28.1931639Z [W1204 10:01:48.829894616 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1932287Z 2025-12-04T10:04:28.1932806Z [W1204 10:01:48.830474945 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1933457Z 2025-12-04T10:04:28.1933974Z [W1204 10:01:48.830688168 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1934623Z 2025-12-04T10:04:28.1935138Z [W1204 10:01:48.831144705 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1935792Z 2025-12-04T10:04:28.1936304Z [W1204 10:01:48.831346637 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1936966Z 2025-12-04T10:04:28.1937475Z [W1204 10:01:48.831906301 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1938125Z 2025-12-04T10:04:28.1938706Z [W1204 10:01:48.832103802 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.1939355Z 2025-12-04T10:04:28.1939468Z FAILED [3.9945s] [100%] 2025-12-04T10:04:28.1939646Z 2025-12-04T10:04:28.1939790Z ==================================== RERUNS ==================================== 2025-12-04T10:04:28.1940373Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.1940926Z Traceback (most recent call last): 2025-12-04T10:04:28.1941697Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1942476Z ).run(code[1]) 2025-12-04T10:04:28.1942760Z IndexError: list index out of range 2025-12-04T10:04:28.1942994Z 2025-12-04T10:04:28.1943216Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.1944106Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1944800Z 2025-12-04T10:04:28.1945065Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.1945697Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1946156Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1946488Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1947277Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1948139Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1948645Z graph_break [] 2025-12-04T10:04:28.1949671Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1950896Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1952477Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.1953926Z if out == self.unknown_value: 2025-12-04T10:04:28.1954878Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1955858Z warnings.warn( 2025-12-04T10:04:28.1956752Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1957708Z warnings.warn( 2025-12-04T10:04:28.1958598Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1959563Z warnings.warn( 2025-12-04T10:04:28.1960445Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1961391Z warnings.warn( 2025-12-04T10:04:28.1962275Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1963231Z warnings.warn( 2025-12-04T10:04:28.1963664Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.1964208Z Traceback (most recent call last): 2025-12-04T10:04:28.1964979Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1965754Z ).run(code[1]) 2025-12-04T10:04:28.1966031Z IndexError: list index out of range 2025-12-04T10:04:28.1966273Z 2025-12-04T10:04:28.1966487Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.1967389Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.1968073Z 2025-12-04T10:04:28.1968351Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.1968968Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1969437Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1969776Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1970481Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1971336Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1971790Z graph_break [] 2025-12-04T10:04:28.1972819Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1974026Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1975663Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.1977185Z if out == self.unknown_value: 2025-12-04T10:04:28.1978134Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1979167Z warnings.warn( 2025-12-04T10:04:28.1980059Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1981015Z warnings.warn( 2025-12-04T10:04:28.1981905Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1982863Z warnings.warn( 2025-12-04T10:04:28.1983748Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1984721Z warnings.warn( 2025-12-04T10:04:28.1985597Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1986552Z warnings.warn( 2025-12-04T10:04:28.1986928Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.1987406Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.1987896Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.1988339Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.1989209Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.1989941Z graph_break [] 2025-12-04T10:04:28.1990968Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.1992194Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.1993295Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1994269Z warnings.warn( 2025-12-04T10:04:28.1995150Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1996112Z warnings.warn( 2025-12-04T10:04:28.1997001Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1997954Z warnings.warn( 2025-12-04T10:04:28.1998838Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.1999806Z warnings.warn( 2025-12-04T10:04:28.2000688Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2001635Z warnings.warn( 2025-12-04T10:04:28.2001938Z =================================== FAILURES =================================== 2025-12-04T10:04:28.2002510Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.2003049Z Traceback (most recent call last): 2025-12-04T10:04:28.2003927Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2004699Z ).run(code[1]) 2025-12-04T10:04:28.2004983Z IndexError: list index out of range 2025-12-04T10:04:28.2005309Z 2025-12-04T10:04:28.2005523Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2006430Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2007118Z 2025-12-04T10:04:28.2007384Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2008011Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2008473Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2008807Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2009521Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2010371Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2010829Z graph_break [] 2025-12-04T10:04:28.2011860Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2013085Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2014650Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2016110Z if out == self.unknown_value: 2025-12-04T10:04:28.2017062Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2018029Z warnings.warn( 2025-12-04T10:04:28.2018987Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2019968Z warnings.warn( 2025-12-04T10:04:28.2020863Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2021833Z warnings.warn( 2025-12-04T10:04:28.2022704Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2023670Z warnings.warn( 2025-12-04T10:04:28.2024556Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2025517Z warnings.warn( 2025-12-04T10:04:28.2025881Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2026362Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2026691Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2027116Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2027972Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2028714Z graph_break [] 2025-12-04T10:04:28.2029734Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2031009Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2032112Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2033139Z warnings.warn( 2025-12-04T10:04:28.2034032Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2034985Z warnings.warn( 2025-12-04T10:04:28.2035874Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2036833Z warnings.warn( 2025-12-04T10:04:28.2037714Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2038661Z warnings.warn( 2025-12-04T10:04:28.2039545Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2040506Z warnings.warn( 2025-12-04T10:04:28.2040871Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2041342Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2041675Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2042110Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2042958Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2043695Z graph_break [] 2025-12-04T10:04:28.2044727Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2045949Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2047029Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2048001Z warnings.warn( 2025-12-04T10:04:28.2048890Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2049845Z warnings.warn( 2025-12-04T10:04:28.2050723Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2051684Z warnings.warn( 2025-12-04T10:04:28.2052575Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2053525Z warnings.warn( 2025-12-04T10:04:28.2054405Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2055371Z warnings.warn( 2025-12-04T10:04:28.2056310Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-268cc96fdd12a701.xml - 2025-12-04T10:04:28.2057395Z =========================== short test summary info ============================ 2025-12-04T10:04:28.2058387Z FAILED [3.9945s] inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt - IndexError: list index out of range 2025-12-04T10:04:28.2059133Z 2025-12-04T10:04:28.2059353Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2060335Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2061022Z 2025-12-04T10:04:28.2061288Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2061951Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:04:28.2062493Z ================= 1 failed, 462 deselected, 2 rerun in 31.79s ================== 2025-12-04T10:04:28.2062999Z You have not run this instance of FileCheck! 2025-12-04T10:04:28.2063367Z FileCheck checks: 2025-12-04T10:04:28.2063750Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.2064203Z Got exit code 1 2025-12-04T10:04:28.2064463Z Retrying single test... 2025-12-04T10:04:28.2065104Z W1204 10:02:00.368000 23337 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2066293Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-9d3344bbd0e95b72.xml 2025-12-04T10:04:28.2067184Z ============================= test session starts ============================== 2025-12-04T10:04:28.2067854Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.2068465Z cachedir: .pytest_cache 2025-12-04T10:04:28.2069174Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.2069955Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.2070316Z configfile: pytest.ini 2025-12-04T10:04:28.2071051Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.2071964Z collecting ... collected 981 items / 462 deselected / 519 selected 2025-12-04T10:04:28.2072968Z stepcurrent: skipping 446 already run items. Running only test/inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2073857Z Running 1 items in this shard 2025-12-04T10:04:28.2074069Z 2025-12-04T10:04:28.2075049Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt [W1204 10:02:06.600640755 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2076140Z 2025-12-04T10:04:28.2076672Z [W1204 10:02:21.965994288 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2077330Z 2025-12-04T10:04:28.2077846Z [W1204 10:02:21.970556130 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2078512Z 2025-12-04T10:04:28.2079028Z [W1204 10:02:21.970769992 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2079693Z 2025-12-04T10:04:28.2080210Z [W1204 10:02:21.971394725 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2080866Z 2025-12-04T10:04:28.2081394Z [W1204 10:02:21.971591790 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2082050Z 2025-12-04T10:04:28.2082575Z [W1204 10:02:21.972245608 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2083223Z 2025-12-04T10:04:28.2083738Z [W1204 10:02:21.972425092 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2084399Z 2025-12-04T10:04:28.2084911Z [W1204 10:02:21.973419695 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2085573Z 2025-12-04T10:04:28.2086151Z [W1204 10:02:21.973598007 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2086805Z 2025-12-04T10:04:28.2087329Z [W1204 10:02:21.974101317 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2088204Z 2025-12-04T10:04:28.2088734Z [W1204 10:02:21.974283216 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2089385Z 2025-12-04T10:04:28.2089900Z [W1204 10:02:21.974770943 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2090562Z 2025-12-04T10:04:28.2091076Z [W1204 10:02:21.974950877 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2091741Z 2025-12-04T10:04:28.2092259Z [W1204 10:02:21.975390549 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2092924Z 2025-12-04T10:04:28.2093437Z [W1204 10:02:21.975569654 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2094094Z 2025-12-04T10:04:28.2094621Z [W1204 10:02:21.975986600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2095273Z 2025-12-04T10:04:28.2095801Z [W1204 10:02:21.976166239 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2096453Z 2025-12-04T10:04:28.2096967Z [W1204 10:02:21.977883097 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2097629Z 2025-12-04T10:04:28.2098148Z [W1204 10:02:21.978061794 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2098894Z 2025-12-04T10:04:28.2099410Z [W1204 10:02:21.978489681 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2100069Z 2025-12-04T10:04:28.2100598Z [W1204 10:02:21.978666168 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2100604Z 2025-12-04T10:04:28.2101117Z [W1204 10:02:21.979105033 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2101122Z 2025-12-04T10:04:28.2101649Z [W1204 10:02:21.979288473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2101654Z 2025-12-04T10:04:28.2102171Z [W1204 10:02:21.979709001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2102176Z 2025-12-04T10:04:28.2102703Z [W1204 10:02:21.979891001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2102713Z 2025-12-04T10:04:28.2103180Z W1204 10:02:22.471000 23337 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2103695Z [W1204 10:02:23.517808864 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2103699Z 2025-12-04T10:04:28.2104225Z [W1204 10:02:23.518807325 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2104230Z 2025-12-04T10:04:28.2104742Z [W1204 10:02:23.519381783 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2104747Z 2025-12-04T10:04:28.2105370Z [W1204 10:02:23.519576846 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2105375Z 2025-12-04T10:04:28.2105973Z [W1204 10:02:23.520028082 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2105977Z 2025-12-04T10:04:28.2106504Z [W1204 10:02:23.520216878 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2106508Z 2025-12-04T10:04:28.2107018Z [W1204 10:02:23.520759075 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2107023Z 2025-12-04T10:04:28.2107550Z [W1204 10:02:23.520941825 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2107555Z 2025-12-04T10:04:28.2108072Z [W1204 10:02:24.280906139 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2108076Z 2025-12-04T10:04:28.2108587Z [W1204 10:02:24.282032452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2108610Z 2025-12-04T10:04:28.2109120Z [W1204 10:02:24.282585554 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2109124Z 2025-12-04T10:04:28.2109640Z [W1204 10:02:24.282787579 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2109645Z 2025-12-04T10:04:28.2110172Z [W1204 10:02:24.283224619 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2110177Z 2025-12-04T10:04:28.2110691Z [W1204 10:02:24.283411904 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2110696Z 2025-12-04T10:04:28.2111221Z [W1204 10:02:24.283950941 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2111230Z 2025-12-04T10:04:28.2111738Z [W1204 10:02:24.284136655 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2111743Z 2025-12-04T10:04:28.2111889Z ('RERUN', {'yellow': True}) [23.5329s] [100%] 2025-12-04T10:04:28.2112457Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt You have not run this instance of FileCheck! 2025-12-04T10:04:28.2112559Z FileCheck checks: 2025-12-04T10:04:28.2112781Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.2113302Z [W1204 10:02:27.185944139 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2113306Z 2025-12-04T10:04:28.2113832Z [W1204 10:02:27.186166873 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2113842Z 2025-12-04T10:04:28.2114351Z [W1204 10:02:27.186710653 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2114356Z 2025-12-04T10:04:28.2114870Z [W1204 10:02:27.186901526 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2114886Z 2025-12-04T10:04:28.2115398Z [W1204 10:02:27.187332003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2115402Z 2025-12-04T10:04:28.2115911Z [W1204 10:02:27.187508099 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2115972Z 2025-12-04T10:04:28.2116497Z [W1204 10:02:27.188043487 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2116554Z 2025-12-04T10:04:28.2117064Z [W1204 10:02:27.188225089 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2117069Z 2025-12-04T10:04:28.2117594Z [W1204 10:02:27.189050592 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2117599Z 2025-12-04T10:04:28.2118110Z [W1204 10:02:27.189227122 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2118114Z 2025-12-04T10:04:28.2118637Z [W1204 10:02:27.189668319 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2118642Z 2025-12-04T10:04:28.2119158Z [W1204 10:02:27.189843737 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2119168Z 2025-12-04T10:04:28.2119691Z [W1204 10:02:27.190320402 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2119696Z 2025-12-04T10:04:28.2120206Z [W1204 10:02:27.190502435 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2120210Z 2025-12-04T10:04:28.2120721Z [W1204 10:02:27.190915747 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2120738Z 2025-12-04T10:04:28.2121251Z [W1204 10:02:27.191104052 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2121256Z 2025-12-04T10:04:28.2121773Z [W1204 10:02:27.191508974 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2121778Z 2025-12-04T10:04:28.2122303Z [W1204 10:02:27.191682664 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2122307Z 2025-12-04T10:04:28.2122816Z [W1204 10:02:27.193155744 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2122821Z 2025-12-04T10:04:28.2123341Z [W1204 10:02:27.193332466 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2123345Z 2025-12-04T10:04:28.2123856Z [W1204 10:02:27.193742167 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2123861Z 2025-12-04T10:04:28.2124387Z [W1204 10:02:27.193915662 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2124392Z 2025-12-04T10:04:28.2124899Z [W1204 10:02:27.194321212 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2124908Z 2025-12-04T10:04:28.2125432Z [W1204 10:02:27.194494436 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2125436Z 2025-12-04T10:04:28.2125946Z [W1204 10:02:27.194900542 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2125951Z 2025-12-04T10:04:28.2126457Z [W1204 10:02:27.195094245 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2126462Z 2025-12-04T10:04:28.2127045Z [W1204 10:02:28.335777355 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2127050Z 2025-12-04T10:04:28.2127566Z [W1204 10:02:28.336010452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2127624Z 2025-12-04T10:04:28.2128152Z [W1204 10:02:28.336563808 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2128157Z 2025-12-04T10:04:28.2128670Z [W1204 10:02:28.336758720 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2128675Z 2025-12-04T10:04:28.2129196Z [W1204 10:02:28.337178280 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2129201Z 2025-12-04T10:04:28.2129718Z [W1204 10:02:28.337359302 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2129722Z 2025-12-04T10:04:28.2130244Z [W1204 10:02:28.337896665 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2130254Z 2025-12-04T10:04:28.2130765Z [W1204 10:02:28.338076658 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2130769Z 2025-12-04T10:04:28.2131282Z [W1204 10:02:28.090361552 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2131300Z 2025-12-04T10:04:28.2131808Z [W1204 10:02:28.090605343 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2131813Z 2025-12-04T10:04:28.2132328Z [W1204 10:02:28.091178874 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2132333Z 2025-12-04T10:04:28.2132857Z [W1204 10:02:28.091380397 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2132865Z 2025-12-04T10:04:28.2133377Z [W1204 10:02:28.091802221 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2133381Z 2025-12-04T10:04:28.2133903Z [W1204 10:02:28.091987149 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2133908Z 2025-12-04T10:04:28.2134419Z [W1204 10:02:28.092522107 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2134424Z 2025-12-04T10:04:28.2134948Z [W1204 10:02:28.092707563 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2134956Z 2025-12-04T10:04:28.2135089Z ('RERUN', {'yellow': True}) [4.1746s] [100%] 2025-12-04T10:04:28.2135652Z inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt You have not run this instance of FileCheck! 2025-12-04T10:04:28.2135774Z FileCheck checks: 2025-12-04T10:04:28.2135984Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.2136513Z [W1204 10:02:31.353448616 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2136518Z 2025-12-04T10:04:28.2137031Z [W1204 10:02:31.353696471 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2137036Z 2025-12-04T10:04:28.2137547Z [W1204 10:02:31.354248586 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2137564Z 2025-12-04T10:04:28.2138128Z [W1204 10:02:31.354445980 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2138133Z 2025-12-04T10:04:28.2138764Z [W1204 10:02:31.354869255 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2138769Z 2025-12-04T10:04:28.2139291Z [W1204 10:02:31.355064414 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2139295Z 2025-12-04T10:04:28.2139807Z [W1204 10:02:31.355607578 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2139812Z 2025-12-04T10:04:28.2140335Z [W1204 10:02:31.355790345 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2140340Z 2025-12-04T10:04:28.2140857Z [W1204 10:02:31.356632268 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2140861Z 2025-12-04T10:04:28.2141387Z [W1204 10:02:31.356813305 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2141396Z 2025-12-04T10:04:28.2141909Z [W1204 10:02:31.357261037 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2141914Z 2025-12-04T10:04:28.2142437Z [W1204 10:02:31.357440187 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2142442Z 2025-12-04T10:04:28.2142952Z [W1204 10:02:31.357894821 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2142957Z 2025-12-04T10:04:28.2143472Z [W1204 10:02:31.358076139 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2143490Z 2025-12-04T10:04:28.2144002Z [W1204 10:02:31.358490528 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2144011Z 2025-12-04T10:04:28.2144519Z [W1204 10:02:31.358669753 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2144524Z 2025-12-04T10:04:28.2145051Z [W1204 10:02:31.359089410 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2145056Z 2025-12-04T10:04:28.2145565Z [W1204 10:02:31.359270003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2145570Z 2025-12-04T10:04:28.2146100Z [W1204 10:02:31.360776592 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2146104Z 2025-12-04T10:04:28.2146616Z [W1204 10:02:31.360963644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2146625Z 2025-12-04T10:04:28.2147148Z [W1204 10:02:31.361387969 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2147152Z 2025-12-04T10:04:28.2147662Z [W1204 10:02:31.361580690 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2147667Z 2025-12-04T10:04:28.2148193Z [W1204 10:02:31.361998603 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2148198Z 2025-12-04T10:04:28.2148782Z [W1204 10:02:31.362177977 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2148787Z 2025-12-04T10:04:28.2149299Z [W1204 10:02:31.362593211 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2149367Z 2025-12-04T10:04:28.2149880Z [W1204 10:02:31.362772880 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2149885Z 2025-12-04T10:04:28.2150396Z [W1204 10:02:32.516319720 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2150400Z 2025-12-04T10:04:28.2150925Z [W1204 10:02:32.516566870 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2150931Z 2025-12-04T10:04:28.2151443Z [W1204 10:02:32.517128729 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2151452Z 2025-12-04T10:04:28.2151976Z [W1204 10:02:32.517326812 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2151985Z 2025-12-04T10:04:28.2152497Z [W1204 10:02:32.517750815 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2152502Z 2025-12-04T10:04:28.2153030Z [W1204 10:02:32.517933831 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2153035Z 2025-12-04T10:04:28.2153545Z [W1204 10:02:32.518480320 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2153549Z 2025-12-04T10:04:28.2154059Z [W1204 10:02:32.518663071 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2154077Z 2025-12-04T10:04:28.2154589Z [W1204 10:02:33.277019514 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2154594Z 2025-12-04T10:04:28.2155105Z [W1204 10:02:33.277271121 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2155109Z 2025-12-04T10:04:28.2155630Z [W1204 10:02:33.277822807 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2155634Z 2025-12-04T10:04:28.2156147Z [W1204 10:02:33.278029955 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2156151Z 2025-12-04T10:04:28.2156676Z [W1204 10:02:33.278453389 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2156681Z 2025-12-04T10:04:28.2157193Z [W1204 10:02:33.278641970 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2157197Z 2025-12-04T10:04:28.2157717Z [W1204 10:02:33.279196976 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2157726Z 2025-12-04T10:04:28.2158236Z [W1204 10:02:33.279386226 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2158241Z 2025-12-04T10:04:28.2158353Z FAILED [3.9484s] [100%] 2025-12-04T10:04:28.2158358Z 2025-12-04T10:04:28.2158502Z ==================================== RERUNS ==================================== 2025-12-04T10:04:28.2158781Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.2158916Z Traceback (most recent call last): 2025-12-04T10:04:28.2159508Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2159604Z ).run(code[1]) 2025-12-04T10:04:28.2159732Z IndexError: list index out of range 2025-12-04T10:04:28.2159788Z 2025-12-04T10:04:28.2160005Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2160567Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2160572Z 2025-12-04T10:04:28.2160834Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2161053Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2161177Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2161288Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2161792Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2162016Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2162113Z graph_break [] 2025-12-04T10:04:28.2162989Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2163211Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2164442Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2164559Z if out == self.unknown_value: 2025-12-04T10:04:28.2165297Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2177231Z warnings.warn( 2025-12-04T10:04:28.2178081Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2178307Z warnings.warn( 2025-12-04T10:04:28.2179058Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2179160Z warnings.warn( 2025-12-04T10:04:28.2179900Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2179999Z warnings.warn( 2025-12-04T10:04:28.2180722Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2180846Z warnings.warn( 2025-12-04T10:04:28.2181126Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.2181250Z Traceback (most recent call last): 2025-12-04T10:04:28.2181809Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2181904Z ).run(code[1]) 2025-12-04T10:04:28.2182039Z IndexError: list index out of range 2025-12-04T10:04:28.2182046Z 2025-12-04T10:04:28.2182263Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2182813Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2182820Z 2025-12-04T10:04:28.2183100Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2183321Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2183585Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2183702Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2184206Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2184506Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2184601Z graph_break [] 2025-12-04T10:04:28.2185473Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2185705Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2186925Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2187053Z if out == self.unknown_value: 2025-12-04T10:04:28.2187955Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2188066Z warnings.warn( 2025-12-04T10:04:28.2188793Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2188896Z warnings.warn( 2025-12-04T10:04:28.2189626Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2189727Z warnings.warn( 2025-12-04T10:04:28.2190451Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2190566Z warnings.warn( 2025-12-04T10:04:28.2191289Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2191404Z warnings.warn( 2025-12-04T10:04:28.2191625Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2191737Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2191860Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2192084Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2192585Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2192692Z graph_break [] 2025-12-04T10:04:28.2193566Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2193796Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2194527Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2194627Z warnings.warn( 2025-12-04T10:04:28.2195357Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2195455Z warnings.warn( 2025-12-04T10:04:28.2196185Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2196382Z warnings.warn( 2025-12-04T10:04:28.2197108Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2197289Z warnings.warn( 2025-12-04T10:04:28.2198005Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2198114Z warnings.warn( 2025-12-04T10:04:28.2198260Z =================================== FAILURES =================================== 2025-12-04T10:04:28.2198536Z ______________ TritonCodeGenTests.test_donated_buffer_inplace_gpt ______________ 2025-12-04T10:04:28.2198666Z Traceback (most recent call last): 2025-12-04T10:04:28.2199208Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 16197, in test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2199303Z ).run(code[1]) 2025-12-04T10:04:28.2199436Z IndexError: list index out of range 2025-12-04T10:04:28.2199442Z 2025-12-04T10:04:28.2199657Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2200217Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2200223Z 2025-12-04T10:04:28.2200493Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2200707Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2200827Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2200940Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2201436Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2201666Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2201762Z graph_break [] 2025-12-04T10:04:28.2202646Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2202867Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2204099Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2204211Z if out == self.unknown_value: 2025-12-04T10:04:28.2204938Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2205049Z warnings.warn( 2025-12-04T10:04:28.2205777Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2205882Z warnings.warn( 2025-12-04T10:04:28.2206618Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2206717Z warnings.warn( 2025-12-04T10:04:28.2207442Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2207543Z warnings.warn( 2025-12-04T10:04:28.2208260Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2208365Z warnings.warn( 2025-12-04T10:04:28.2208660Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2208782Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2208893Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2209116Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2209676Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2209773Z graph_break [] 2025-12-04T10:04:28.2210642Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2210868Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2211603Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2211713Z warnings.warn( 2025-12-04T10:04:28.2212436Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2212540Z warnings.warn( 2025-12-04T10:04:28.2213272Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2213371Z warnings.warn( 2025-12-04T10:04:28.2214104Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2214201Z warnings.warn( 2025-12-04T10:04:28.2214925Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2215034Z warnings.warn( 2025-12-04T10:04:28.2215248Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2215365Z frames [('total', 5), ('ok', 5)] 2025-12-04T10:04:28.2215487Z stats [('calls_captured', 103)] 2025-12-04T10:04:28.2215714Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('not_ok', 5)] 2025-12-04T10:04:28.2216225Z inductor [('pattern_matcher_nodes', 246), ('pattern_matcher_count', 199), ('fxgraph_cache_miss', 5), ('removed_pointless_view_pair', 2)] 2025-12-04T10:04:28.2216322Z graph_break [] 2025-12-04T10:04:28.2217189Z aten_mm_info [('aten.mm_1024_2304_768', 3), ('aten.bmm_12_1024_1024_64', 3), ('aten.bmm_12_1024_64_1024', 3), ('aten.addmm_1024_3072_768', 3), ('aten.mm_1024_768_3072', 2), ('aten.addmm_1024_768_768', 2), ('aten.mm_1024_768_768', 1), ('aten.mm_1024_50264_768', 1), ('aten.addmm_1024_768_3072', 1)] 2025-12-04T10:04:28.2217419Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2218153Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2218347Z warnings.warn( 2025-12-04T10:04:28.2219073Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2219171Z warnings.warn( 2025-12-04T10:04:28.2219903Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2220001Z warnings.warn( 2025-12-04T10:04:28.2220732Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2220829Z warnings.warn( 2025-12-04T10:04:28.2221611Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2221784Z warnings.warn( 2025-12-04T10:04:28.2222563Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-9d3344bbd0e95b72.xml - 2025-12-04T10:04:28.2222745Z =========================== short test summary info ============================ 2025-12-04T10:04:28.2223349Z FAILED [3.9484s] inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt - IndexError: list index out of range 2025-12-04T10:04:28.2223356Z 2025-12-04T10:04:28.2223569Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2224127Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2224133Z 2025-12-04T10:04:28.2224406Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2224596Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:04:28.2224804Z ================= 1 failed, 462 deselected, 2 rerun in 31.74s ================== 2025-12-04T10:04:28.2224948Z You have not run this instance of FileCheck! 2025-12-04T10:04:28.2225062Z FileCheck checks: 2025-12-04T10:04:28.2225272Z CHECK-REGEX: reinterpret_tensor\(.*, \(1024, 50257\).*# reuse 2025-12-04T10:04:28.2225371Z Got exit code 1 2025-12-04T10:04:28.2225853Z FAILED CONSISTENTLY: test/inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt 2025-12-04T10:04:28.2226263Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:04:28.2226720Z W1204 10:02:44.754000 23520 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2227317Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-c49d705795b8a118.xml 2025-12-04T10:04:28.2227480Z ============================= test session starts ============================== 2025-12-04T10:04:28.2227848Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.2227960Z cachedir: .pytest_cache 2025-12-04T10:04:28.2228485Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.2228624Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.2228729Z configfile: pytest.ini 2025-12-04T10:04:28.2229283Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.2229511Z collecting ... collected 981 items / 447 deselected / 534 selected 2025-12-04T10:04:28.2229658Z stepcurrent: skipping 447 already run items. 2025-12-04T10:04:28.2229790Z Running 16 items in this shard 2025-12-04T10:04:28.2229796Z 2025-12-04T10:04:28.2230270Z inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads PASSED [6.6333s] [ 6%] 2025-12-04T10:04:28.2230995Z inductor/test_torchinductor.py::TritonCodeGenTests::test_indirect_device_assert SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 12%] 2025-12-04T10:04:28.2231502Z inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_detach_view_backend_aot_eager PASSED [0.0922s] [ 18%] 2025-12-04T10:04:28.2232387Z inductor/test_torchinductor.py::TritonCodeGenTests::test_kernel_names_descriptive W1204 10:02:54.034000 23520 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2232500Z PASSED [3.2434s] [ 25%] 2025-12-04T10:04:28.2232974Z inductor/test_torchinductor.py::TritonCodeGenTests::test_layer_norm_inplaces_after_matmul PASSED [0.7576s] [ 31%] 2025-12-04T10:04:28.2233868Z inductor/test_torchinductor.py::TritonCodeGenTests::test_non_blocking_copy_codegen W1204 10:02:57.425000 23520 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:04:28.2234026Z PASSED [0.1081s] [ 37%] 2025-12-04T10:04:28.2234418Z inductor/test_torchinductor.py::TritonCodeGenTests::test_numpy_autograd PASSED [2.6541s] [ 43%] 2025-12-04T10:04:28.2234865Z inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype PASSED [0.9486s] [ 50%] 2025-12-04T10:04:28.2235546Z inductor/test_torchinductor.py::TritonCodeGenTests::test_red_followed_by_transposed_pointwise SKIPPED [0.0003s] (Triton does not support fp8 on A100) [ 56%] 2025-12-04T10:04:28.2236206Z inductor/test_torchinductor.py::TritonCodeGenTests::test_respect_scaled_grouped_mm_layout_tag SKIPPED [0.0003s] (no scaled_grouped_mm support) [ 62%] 2025-12-04T10:04:28.2236582Z inductor/test_torchinductor.py::TritonCodeGenTests::test_rope_fusion PASSED [0.9312s] [ 68%] 2025-12-04T10:04:28.2237151Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile ('RERUN', {'yellow': True}) [0.4155s] [ 75%] 2025-12-04T10:04:28.2237724Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile ('RERUN', {'yellow': True}) [0.2764s] [ 75%] 2025-12-04T10:04:28.2238198Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile FAILED [0.2724s] [ 75%] 2025-12-04T10:04:28.2238204Z 2025-12-04T10:04:28.2238361Z ==================================== RERUNS ==================================== 2025-12-04T10:04:28.2238647Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2238772Z Traceback (most recent call last): 2025-12-04T10:04:28.2239343Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2239491Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2239961Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2240082Z return compile_fx_aot( 2025-12-04T10:04:28.2240583Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2240715Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2241186Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2241289Z return compile_fx( 2025-12-04T10:04:28.2241769Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2241905Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2242491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2242607Z return _compile_fx_main( 2025-12-04T10:04:28.2243114Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2243329Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2243849Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2243999Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2244509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2244624Z return compile_fx_forward( 2025-12-04T10:04:28.2245152Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2245259Z return inner_compile( 2025-12-04T10:04:28.2245537Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2245719Z return func(*args, **kwds) 2025-12-04T10:04:28.2246218Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2246555Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2247061Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2247231Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2247747Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2247937Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2248442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2248596Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2249136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2249467Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2249990Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2250112Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2250672Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2250790Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2251274Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2251425Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2251682Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2251688Z 2025-12-04T10:04:28.2251916Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2252496Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2252506Z 2025-12-04T10:04:28.2252772Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2253005Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2253110Z unimplemented [] 2025-12-04T10:04:28.2253276Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2253523Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2253621Z graph_break [] 2025-12-04T10:04:28.2253853Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2254076Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2254894Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2255031Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2255760Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2255873Z warnings.warn( 2025-12-04T10:04:28.2256163Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2256283Z Traceback (most recent call last): 2025-12-04T10:04:28.2256853Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2257004Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2257538Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2257650Z return compile_fx_aot( 2025-12-04T10:04:28.2258141Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2258442Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2258917Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2259022Z return compile_fx( 2025-12-04T10:04:28.2259509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2259643Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2260228Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2260341Z return _compile_fx_main( 2025-12-04T10:04:28.2260850Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2261064Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2261589Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2261733Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2262243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2262358Z return compile_fx_forward( 2025-12-04T10:04:28.2262882Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2262991Z return inner_compile( 2025-12-04T10:04:28.2263270Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2263391Z return func(*args, **kwds) 2025-12-04T10:04:28.2263889Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2264154Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2264662Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2264834Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2265344Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2265535Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2266038Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2266191Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2266726Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2267056Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2267578Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2267704Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2268261Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2268381Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2268876Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2269014Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2269267Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2269273Z 2025-12-04T10:04:28.2269557Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2270136Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2270197Z 2025-12-04T10:04:28.2270480Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2270696Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2270803Z unimplemented [] 2025-12-04T10:04:28.2270970Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2271212Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2271313Z graph_break [] 2025-12-04T10:04:28.2271543Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2271757Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2272582Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2272708Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2273437Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2273547Z warnings.warn( 2025-12-04T10:04:28.2273760Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2273860Z unimplemented [] 2025-12-04T10:04:28.2274027Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2274276Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2274373Z graph_break [] 2025-12-04T10:04:28.2274601Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2274821Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2275649Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2275765Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2276490Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2276600Z warnings.warn( 2025-12-04T10:04:28.2276742Z =================================== FAILURES =================================== 2025-12-04T10:04:28.2277039Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2277158Z Traceback (most recent call last): 2025-12-04T10:04:28.2277719Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2277888Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2278349Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2278467Z return compile_fx_aot( 2025-12-04T10:04:28.2278976Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2279095Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2279579Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2279681Z return compile_fx( 2025-12-04T10:04:28.2280146Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2280291Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2280923Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2281036Z return _compile_fx_main( 2025-12-04T10:04:28.2281552Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2281807Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2282340Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2282486Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2282985Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2283110Z return compile_fx_forward( 2025-12-04T10:04:28.2283624Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2283742Z return inner_compile( 2025-12-04T10:04:28.2284022Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2284135Z return func(*args, **kwds) 2025-12-04T10:04:28.2284643Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2284911Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2285403Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2285586Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2286088Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2286290Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2286796Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2286940Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2287486Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2287968Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2288502Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2288629Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2289179Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2289312Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2289798Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2289941Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2290204Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2290210Z 2025-12-04T10:04:28.2290430Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2291025Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2291031Z 2025-12-04T10:04:28.2291297Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2291515Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2291628Z unimplemented [] 2025-12-04T10:04:28.2291783Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2292037Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2292134Z graph_break [] 2025-12-04T10:04:28.2293001Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2293234Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2294060Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2294265Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2295003Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2295105Z warnings.warn( 2025-12-04T10:04:28.2295332Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2295433Z unimplemented [] 2025-12-04T10:04:28.2295588Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2295850Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2295947Z graph_break [] 2025-12-04T10:04:28.2296175Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2296401Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2297222Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2297346Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2298073Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2298229Z warnings.warn( 2025-12-04T10:04:28.2298465Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2298566Z unimplemented [] 2025-12-04T10:04:28.2298720Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2298984Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2299080Z graph_break [] 2025-12-04T10:04:28.2299312Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2299531Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2300348Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2300475Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2301204Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2301315Z warnings.warn( 2025-12-04T10:04:28.2302090Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-c49d705795b8a118.xml - 2025-12-04T10:04:28.2302266Z =========================== short test summary info ============================ 2025-12-04T10:04:28.2303054Z FAILED [0.2724s] inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2303064Z 2025-12-04T10:04:28.2303280Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2303873Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2303878Z 2025-12-04T10:04:28.2304143Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2304323Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:04:28.2304572Z ======= 1 failed, 8 passed, 3 skipped, 447 deselected, 2 rerun in 16.44s ======= 2025-12-04T10:04:28.2304671Z Got exit code 1 2025-12-04T10:04:28.2304847Z Retrying single test... 2025-12-04T10:04:28.2305312Z W1204 10:03:14.101000 24061 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2305958Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-03cc5e31909b0a52.xml 2025-12-04T10:04:28.2306123Z ============================= test session starts ============================== 2025-12-04T10:04:28.2306483Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.2306594Z cachedir: .pytest_cache 2025-12-04T10:04:28.2307124Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.2307248Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.2307357Z configfile: pytest.ini 2025-12-04T10:04:28.2307916Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.2308142Z collecting ... collected 981 items / 462 deselected / 519 selected 2025-12-04T10:04:28.2308823Z stepcurrent: skipping 458 already run items. Running only test/inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2308939Z Running 1 items in this shard 2025-12-04T10:04:28.2308944Z 2025-12-04T10:04:28.2309933Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile [W1204 10:03:16.987344891 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2309940Z 2025-12-04T10:04:28.2310470Z [W1204 10:03:32.218468971 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2310475Z 2025-12-04T10:04:28.2310994Z [W1204 10:03:32.222848442 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2310999Z 2025-12-04T10:04:28.2311518Z [W1204 10:03:32.223005533 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2311527Z 2025-12-04T10:04:28.2312038Z [W1204 10:03:32.223660775 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2312043Z 2025-12-04T10:04:28.2312566Z [W1204 10:03:32.223805570 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2312571Z 2025-12-04T10:04:28.2313028Z W1204 10:03:32.282000 24061 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2313159Z ('RERUN', {'yellow': True}) [18.2475s] [100%] 2025-12-04T10:04:28.2314155Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile [W1204 10:03:34.526170471 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2314161Z 2025-12-04T10:04:28.2314676Z [W1204 10:03:34.526348475 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2314681Z 2025-12-04T10:04:28.2315203Z [W1204 10:03:34.526738489 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2315208Z 2025-12-04T10:04:28.2315719Z [W1204 10:03:34.526878847 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2315724Z 2025-12-04T10:04:28.2316241Z [W1204 10:03:34.527377475 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2316246Z 2025-12-04T10:04:28.2316808Z [W1204 10:03:34.527518402 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2316813Z 2025-12-04T10:04:28.2316953Z ('RERUN', {'yellow': True}) [0.2758s] [100%] 2025-12-04T10:04:28.2318002Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile [W1204 10:03:34.798624983 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2318008Z 2025-12-04T10:04:28.2318521Z [W1204 10:03:34.798801700 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2318536Z 2025-12-04T10:04:28.2319044Z [W1204 10:03:34.799197707 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2319049Z 2025-12-04T10:04:28.2319562Z [W1204 10:03:34.799339891 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2319566Z 2025-12-04T10:04:28.2320088Z [W1204 10:03:34.799814995 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2320098Z 2025-12-04T10:04:28.2320612Z [W1204 10:03:34.799951671 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2320617Z 2025-12-04T10:04:28.2320731Z FAILED [0.2702s] [100%] 2025-12-04T10:04:28.2320736Z 2025-12-04T10:04:28.2320878Z ==================================== RERUNS ==================================== 2025-12-04T10:04:28.2321164Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2321299Z Traceback (most recent call last): 2025-12-04T10:04:28.2321860Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2322029Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2322500Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2322616Z return compile_fx_aot( 2025-12-04T10:04:28.2323121Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2323244Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2323725Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2323827Z return compile_fx( 2025-12-04T10:04:28.2324296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2324442Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2325020Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2325131Z return _compile_fx_main( 2025-12-04T10:04:28.2325648Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2325849Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2326381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2326526Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2327026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2327150Z return compile_fx_forward( 2025-12-04T10:04:28.2327667Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2327777Z return inner_compile( 2025-12-04T10:04:28.2328129Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2328239Z return func(*args, **kwds) 2025-12-04T10:04:28.2328745Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2329059Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2329552Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2329733Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2330238Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2330443Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2330942Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2331086Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2331634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2331960Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2332483Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2332619Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2333166Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2333297Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2333781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2333918Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2334188Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2334193Z 2025-12-04T10:04:28.2334403Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2335003Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2335008Z 2025-12-04T10:04:28.2335274Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2335493Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2335608Z unimplemented [] 2025-12-04T10:04:28.2335763Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2336009Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2336119Z graph_break [] 2025-12-04T10:04:28.2336337Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2336572Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2337790Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2337909Z if out == self.unknown_value: 2025-12-04T10:04:28.2338811Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2338928Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2339670Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2339771Z warnings.warn( 2025-12-04T10:04:28.2340139Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2340272Z Traceback (most recent call last): 2025-12-04T10:04:28.2340829Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2341035Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2341508Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2341621Z return compile_fx_aot( 2025-12-04T10:04:28.2342122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2342242Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2342716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2342827Z return compile_fx( 2025-12-04T10:04:28.2343301Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2343442Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2344016Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2344131Z return _compile_fx_main( 2025-12-04T10:04:28.2344643Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2344842Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2345369Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2345522Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2346026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2346153Z return compile_fx_forward( 2025-12-04T10:04:28.2346668Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2346781Z return inner_compile( 2025-12-04T10:04:28.2347069Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2347181Z return func(*args, **kwds) 2025-12-04T10:04:28.2347674Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2347950Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2348447Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2348628Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2349136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2349326Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2349840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2349981Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2350521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2350838Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2351359Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2351493Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2352039Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2352217Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2352712Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2352903Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2353170Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2353175Z 2025-12-04T10:04:28.2353393Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2353970Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2353989Z 2025-12-04T10:04:28.2354249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2354468Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2354583Z unimplemented [] 2025-12-04T10:04:28.2354742Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2354991Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2355101Z graph_break [] 2025-12-04T10:04:28.2355326Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2355541Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2356768Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2356882Z if out == self.unknown_value: 2025-12-04T10:04:28.2357708Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2357823Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2358554Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2358673Z warnings.warn( 2025-12-04T10:04:28.2358891Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2359007Z unimplemented [] 2025-12-04T10:04:28.2359162Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2359408Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2359517Z graph_break [] 2025-12-04T10:04:28.2359735Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2359950Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2360778Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2360890Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2361633Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2361736Z warnings.warn( 2025-12-04T10:04:28.2361878Z =================================== FAILURES =================================== 2025-12-04T10:04:28.2362177Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2362301Z Traceback (most recent call last): 2025-12-04T10:04:28.2362855Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2363015Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2363472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2363650Z return compile_fx_aot( 2025-12-04T10:04:28.2364147Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2364319Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2364799Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2364903Z return compile_fx( 2025-12-04T10:04:28.2365386Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2365519Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2366093Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2366214Z return _compile_fx_main( 2025-12-04T10:04:28.2366717Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2366921Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2367450Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2367599Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2368106Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2368220Z return compile_fx_forward( 2025-12-04T10:04:28.2368737Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2368857Z return inner_compile( 2025-12-04T10:04:28.2369139Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2369253Z return func(*args, **kwds) 2025-12-04T10:04:28.2369761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2370028Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2370534Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2370711Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2371220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2371419Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2371919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2372071Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2372603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2372929Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2373460Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2373588Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2374148Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2374264Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2374748Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2374896Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2375149Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2375154Z 2025-12-04T10:04:28.2375368Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2376013Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2376019Z 2025-12-04T10:04:28.2376340Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2376569Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2376671Z unimplemented [] 2025-12-04T10:04:28.2376824Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2377081Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2377179Z graph_break [] 2025-12-04T10:04:28.2377402Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2377626Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2378924Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2379059Z if out == self.unknown_value: 2025-12-04T10:04:28.2379880Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2379995Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2380738Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2380840Z warnings.warn( 2025-12-04T10:04:28.2381072Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2381175Z unimplemented [] 2025-12-04T10:04:28.2381331Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2381597Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2381692Z graph_break [] 2025-12-04T10:04:28.2381914Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2382142Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2382958Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2383084Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2383813Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2383911Z warnings.warn( 2025-12-04T10:04:28.2384137Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2384234Z unimplemented [] 2025-12-04T10:04:28.2384392Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2384648Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2384744Z graph_break [] 2025-12-04T10:04:28.2384981Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2385195Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2386000Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2386123Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2386843Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2386956Z warnings.warn( 2025-12-04T10:04:28.2387992Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-03cc5e31909b0a52.xml - 2025-12-04T10:04:28.2388171Z =========================== short test summary info ============================ 2025-12-04T10:04:28.2389040Z FAILED [0.2702s] inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2389046Z 2025-12-04T10:04:28.2389264Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2389847Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2389852Z 2025-12-04T10:04:28.2390119Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2390299Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:04:28.2390517Z ================= 1 failed, 462 deselected, 2 rerun in 18.88s ================== 2025-12-04T10:04:28.2390617Z Got exit code 1 2025-12-04T10:04:28.2390728Z Retrying single test... 2025-12-04T10:04:28.2391185Z W1204 10:03:45.699000 24235 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2391785Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-e6be60d41d66658c.xml 2025-12-04T10:04:28.2391956Z ============================= test session starts ============================== 2025-12-04T10:04:28.2392312Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.2392421Z cachedir: .pytest_cache 2025-12-04T10:04:28.2392951Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.2393077Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.2393193Z configfile: pytest.ini 2025-12-04T10:04:28.2393736Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.2393964Z collecting ... collected 981 items / 462 deselected / 519 selected 2025-12-04T10:04:28.2394650Z stepcurrent: skipping 458 already run items. Running only test/inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2394767Z Running 1 items in this shard 2025-12-04T10:04:28.2394773Z 2025-12-04T10:04:28.2395765Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile [W1204 10:03:48.561949054 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2395771Z 2025-12-04T10:04:28.2396287Z [W1204 10:04:04.384986540 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2396292Z 2025-12-04T10:04:28.2396808Z [W1204 10:04:04.389372209 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2396825Z 2025-12-04T10:04:28.2397341Z [W1204 10:04:04.389534299 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2397347Z 2025-12-04T10:04:28.2397854Z [W1204 10:04:04.390221026 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2397859Z 2025-12-04T10:04:28.2398376Z [W1204 10:04:04.390370074 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2398381Z 2025-12-04T10:04:28.2398840Z W1204 10:04:04.449000 24235 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2398986Z ('RERUN', {'yellow': True}) [18.8243s] [100%] 2025-12-04T10:04:28.2400030Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile [W1204 10:04:06.694383721 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2400089Z 2025-12-04T10:04:28.2400616Z [W1204 10:04:06.694558561 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2400620Z 2025-12-04T10:04:28.2401129Z [W1204 10:04:06.694945518 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2401135Z 2025-12-04T10:04:28.2401645Z [W1204 10:04:06.695100512 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2401650Z 2025-12-04T10:04:28.2402170Z [W1204 10:04:06.695598001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2402179Z 2025-12-04T10:04:28.2402688Z [W1204 10:04:06.695736291 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2402698Z 2025-12-04T10:04:28.2402842Z ('RERUN', {'yellow': True}) [0.2753s] [100%] 2025-12-04T10:04:28.2403819Z inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile [W1204 10:04:06.966568700 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2403825Z 2025-12-04T10:04:28.2404344Z [W1204 10:04:06.966748997 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2404349Z 2025-12-04T10:04:28.2404856Z [W1204 10:04:06.967160934 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2404861Z 2025-12-04T10:04:28.2405388Z [W1204 10:04:06.967306943 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2405393Z 2025-12-04T10:04:28.2405900Z [W1204 10:04:06.967799343 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2405909Z 2025-12-04T10:04:28.2406418Z [W1204 10:04:06.967935749 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:04:28.2406434Z 2025-12-04T10:04:28.2406534Z FAILED [0.2708s] [100%] 2025-12-04T10:04:28.2406538Z 2025-12-04T10:04:28.2406680Z ==================================== RERUNS ==================================== 2025-12-04T10:04:28.2406975Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2407094Z Traceback (most recent call last): 2025-12-04T10:04:28.2407655Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2407817Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2408284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2408410Z return compile_fx_aot( 2025-12-04T10:04:28.2408909Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2409031Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2409515Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2409616Z return compile_fx( 2025-12-04T10:04:28.2410086Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2410229Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2410858Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2410985Z return _compile_fx_main( 2025-12-04T10:04:28.2411545Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2411742Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2412276Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2412421Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2412925Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2413053Z return compile_fx_forward( 2025-12-04T10:04:28.2413564Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2413689Z return inner_compile( 2025-12-04T10:04:28.2413966Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2414074Z return func(*args, **kwds) 2025-12-04T10:04:28.2414583Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2414847Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2415348Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2415516Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2416019Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2416214Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2416722Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2416862Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2417404Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2417733Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2418329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2418456Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2419007Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2419133Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2419626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2419770Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2420023Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2420033Z 2025-12-04T10:04:28.2420248Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2420840Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2420845Z 2025-12-04T10:04:28.2421108Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2421342Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2421445Z unimplemented [] 2025-12-04T10:04:28.2421600Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2421860Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2421958Z graph_break [] 2025-12-04T10:04:28.2422244Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2422473Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2423756Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2423886Z if out == self.unknown_value: 2025-12-04T10:04:28.2424702Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2424814Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2425558Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2425662Z warnings.warn( 2025-12-04T10:04:28.2425962Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2426080Z Traceback (most recent call last): 2025-12-04T10:04:28.2426636Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2426795Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2427256Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2427364Z return compile_fx_aot( 2025-12-04T10:04:28.2427866Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2427987Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2428470Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2428577Z return compile_fx( 2025-12-04T10:04:28.2429043Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2429191Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2429764Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2429874Z return _compile_fx_main( 2025-12-04T10:04:28.2430387Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2430583Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2431116Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2431255Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2431759Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2431884Z return compile_fx_forward( 2025-12-04T10:04:28.2432396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2432520Z return inner_compile( 2025-12-04T10:04:28.2432795Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2432902Z return func(*args, **kwds) 2025-12-04T10:04:28.2433411Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2433667Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2434160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2434339Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2434897Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2435100Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2435653Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2435794Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2436335Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2436652Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2437182Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2437303Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2437852Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2437979Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2438462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2438603Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2438864Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2438869Z 2025-12-04T10:04:28.2439079Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2439662Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2439667Z 2025-12-04T10:04:28.2439930Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2440149Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2440263Z unimplemented [] 2025-12-04T10:04:28.2440418Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2440671Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2440770Z graph_break [] 2025-12-04T10:04:28.2440991Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2441210Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2442424Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2442542Z if out == self.unknown_value: 2025-12-04T10:04:28.2443367Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2443480Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2444214Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2444318Z warnings.warn( 2025-12-04T10:04:28.2444533Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2444643Z unimplemented [] 2025-12-04T10:04:28.2444797Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2445041Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2445145Z graph_break [] 2025-12-04T10:04:28.2445364Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2445590Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2446475Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2446586Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2447380Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2447478Z warnings.warn( 2025-12-04T10:04:28.2447631Z =================================== FAILURES =================================== 2025-12-04T10:04:28.2447912Z ___________ TritonCodeGenTests.test_sdpa_inference_mode_aot_compile ____________ 2025-12-04T10:04:28.2448031Z Traceback (most recent call last): 2025-12-04T10:04:28.2448592Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 15193, in test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2448738Z torch._inductor.aot_compile(traced, inputs) 2025-12-04T10:04:28.2449203Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/__init__.py", line 311, in aot_compile 2025-12-04T10:04:28.2449319Z return compile_fx_aot( 2025-12-04T10:04:28.2449814Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2007, in compile_fx_aot 2025-12-04T10:04:28.2449946Z compiled_artifacts = compile_fx( 2025-12-04T10:04:28.2450419Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2477, in compile_fx 2025-12-04T10:04:28.2450521Z return compile_fx( 2025-12-04T10:04:28.2450999Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2516, in compile_fx 2025-12-04T10:04:28.2451130Z return _maybe_wrap_and_compile_fx_main( 2025-12-04T10:04:28.2451705Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2605, in _maybe_wrap_and_compile_fx_main 2025-12-04T10:04:28.2451826Z return _compile_fx_main( 2025-12-04T10:04:28.2452330Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2788, in _compile_fx_main 2025-12-04T10:04:28.2452532Z return inference_compiler(unlifted_gm, example_inputs_) 2025-12-04T10:04:28.2453058Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_functorch/_aot_autograd/schemas.py", line 1249, in __call__ 2025-12-04T10:04:28.2453200Z return self.compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2453713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2669, in fw_compiler_base 2025-12-04T10:04:28.2453824Z return compile_fx_forward( 2025-12-04T10:04:28.2454350Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2341, in compile_fx_forward 2025-12-04T10:04:28.2454459Z return inner_compile( 2025-12-04T10:04:28.2454733Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:04:28.2454857Z return func(*args, **kwds) 2025-12-04T10:04:28.2455355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 806, in compile_fx_inner 2025-12-04T10:04:28.2455618Z return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( 2025-12-04T10:04:28.2456118Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/repro/after_aot.py", line 146, in debug_wrapper 2025-12-04T10:04:28.2456293Z inner_compiled_fn = compiler_fn(gm, example_inputs) 2025-12-04T10:04:28.2456800Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T10:04:28.2456991Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T10:04:28.2457491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T10:04:28.2457639Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T10:04:28.2458303Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T10:04:28.2458644Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T10:04:28.2459215Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T10:04:28.2459341Z _check_triton_bf16_support(graph) 2025-12-04T10:04:28.2459899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T10:04:28.2460018Z warn_and_skip(node.get_device()) 2025-12-04T10:04:28.2460500Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T10:04:28.2460649Z raise SkipFrame("BF16 is not supported") 2025-12-04T10:04:28.2460899Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2460905Z 2025-12-04T10:04:28.2461131Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2461711Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2461721Z 2025-12-04T10:04:28.2461984Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2462218Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2462317Z unimplemented [] 2025-12-04T10:04:28.2462486Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2462730Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2462827Z graph_break [] 2025-12-04T10:04:28.2463055Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2463271Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2464492Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:04:28.2464618Z if out == self.unknown_value: 2025-12-04T10:04:28.2465433Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2465558Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2466281Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2466381Z warnings.warn( 2025-12-04T10:04:28.2466599Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2466700Z unimplemented [] 2025-12-04T10:04:28.2466868Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2467112Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2467210Z graph_break [] 2025-12-04T10:04:28.2467439Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2467653Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2468460Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2468583Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2469311Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2469416Z warnings.warn( 2025-12-04T10:04:28.2469626Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:04:28.2469787Z unimplemented [] 2025-12-04T10:04:28.2469954Z stats [('calls_captured', 1), ('unique_graphs', 1)] 2025-12-04T10:04:28.2470194Z inductor [('pattern_matcher_nodes', 12), ('pattern_matcher_count', 9)] 2025-12-04T10:04:28.2470362Z graph_break [] 2025-12-04T10:04:28.2470589Z aten_mm_info [('aten.bmm_40_128_128_64', 1), ('aten.bmm_40_128_64_128', 1)] 2025-12-04T10:04:28.2470799Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:04:28.2471618Z /opt/conda/envs/py_3.10/lib/python3.10/copyreg.py:101: FutureWarning: `isinstance(treespec, LeafSpec)` is deprecated, use `isinstance(treespec, TreeSpec) and treespec.is_leaf()` instead. 2025-12-04T10:04:28.2471728Z return cls.__new__(cls, *args) 2025-12-04T10:04:28.2472452Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:04:28.2472558Z warnings.warn( 2025-12-04T10:04:28.2473332Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-e6be60d41d66658c.xml - 2025-12-04T10:04:28.2473507Z =========================== short test summary info ============================ 2025-12-04T10:04:28.2474300Z FAILED [0.2708s] inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T10:04:28.2474305Z 2025-12-04T10:04:28.2474520Z To execute this test, run the following from the base repo dir: 2025-12-04T10:04:28.2475104Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor.py TritonCodeGenTests.test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2475109Z 2025-12-04T10:04:28.2475375Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:04:28.2475560Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:04:28.2475769Z ================= 1 failed, 462 deselected, 2 rerun in 19.46s ================== 2025-12-04T10:04:28.2475863Z Got exit code 1 2025-12-04T10:04:28.2476374Z FAILED CONSISTENTLY: test/inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile 2025-12-04T10:04:28.2476789Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:04:28.2477234Z W1204 10:04:17.882000 24409 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:04:28.2477837Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-27e2dd5fd3a036de.xml 2025-12-04T10:04:28.2477998Z ============================= test session starts ============================== 2025-12-04T10:04:28.2478368Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:04:28.2478475Z cachedir: .pytest_cache 2025-12-04T10:04:28.2478999Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:04:28.2479129Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:04:28.2479239Z configfile: pytest.ini 2025-12-04T10:04:28.2479788Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:04:28.2480014Z collecting ... collected 981 items / 459 deselected / 522 selected 2025-12-04T10:04:28.2480160Z stepcurrent: skipping 459 already run items. 2025-12-04T10:04:28.2480284Z Running 4 items in this shard 2025-12-04T10:04:28.2480289Z 2025-12-04T10:04:28.2480677Z inductor/test_torchinductor.py::TritonCodeGenTests::test_skip_l1_cache PASSED [3.0842s] [ 25%] 2025-12-04T10:04:28.2481089Z inductor/test_torchinductor.py::TritonCodeGenTests::test_split_op_with_sym PASSED [0.6450s] [ 50%] 2025-12-04T10:04:28.2481651Z inductor/test_torchinductor.py::TritonCodeGenTests::test_triton_attrs_dict_constexpr_signature PASSED [0.2217s] [ 75%] 2025-12-04T10:04:28.2482029Z inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_fail PASSED [0.3273s] [100%] 2025-12-04T10:04:28.2482091Z 2025-12-04T10:04:28.2482881Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-27e2dd5fd3a036de.xml - 2025-12-04T10:04:28.2483065Z ====================== 4 passed, 459 deselected in 4.37s ======================= 2025-12-04T10:04:28.2484144Z The following tests failed consistently: ['test/inductor/test_torchinductor.py::TritonCodeGenTests::test_donated_buffer_inplace_gpt', 'test/inductor/test_torchinductor.py::TritonCodeGenTests::test_sdpa_inference_mode_aot_compile'] 2025-12-04T10:04:28.2484159Z 2025-12-04T10:04:28.2484735Z FINISHED PRINTING LOG FILE of inductor/test_torchinductor 1/2 (test/test-reports/inductor.test_torchinductor_1.2_9aedcc93d70c8453_.log) 2025-12-04T10:04:28.2484740Z 2025-12-04T10:04:28.2485109Z Finished inductor/test_torchinductor 1/2 ... [2025-12-04 10:04:28.073790][3072.195002636], took 8.91min 2025-12-04T10:04:28.2485947Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-bf110f953f5c5355.xml 2025-12-04T10:04:28.2486803Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-268cc96fdd12a701.xml 2025-12-04T10:04:28.2567747Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-9d3344bbd0e95b72.xml 2025-12-04T10:04:28.2872847Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-c49d705795b8a118.xml 2025-12-04T10:04:28.3201318Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-03cc5e31909b0a52.xml 2025-12-04T10:04:28.3523157Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-e6be60d41d66658c.xml 2025-12-04T10:04:28.3897413Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-27e2dd5fd3a036de.xml 2025-12-04T10:04:28.7267324Z Uploading logs for 57119749259 to S3 2025-12-04T10:04:28.7703667Z Uploading artifacts took 0.35 seconds 2025-12-04T10:04:28.7704149Z inductor/test_torchinductor 1/2 failed! 2025-12-04T10:04:28.7709727Z Running inductor/test_torchinductor_dynamic_shapes 3/5 ... [2025-12-04 10:04:28.770780][3072.892000978] 2025-12-04T10:04:28.7710410Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:04:28.7714587Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '--shard-id=3', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:04:28.771210] 2025-12-04T10:15:30.9453804Z 2025-12-04T10:15:30.9456926Z PRINTING LOG FILE of inductor/test_torchinductor_dynamic_shapes 3/5 (test/test-reports/inductor.test_torchinductor_dynamic_shapes_3.5_c72f035bd2cef633_.log) 2025-12-04T10:15:30.9459244Z W1204 10:04:37.675000 24633 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:15:30.9461273Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-655beaec67eb1ac4.xml 2025-12-04T10:15:30.9462779Z ============================= test session starts ============================== 2025-12-04T10:15:30.9463637Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:15:30.9465005Z cachedir: .pytest_cache 2025-12-04T10:15:30.9466043Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:15:30.9467582Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:15:30.9468039Z configfile: pytest.ini 2025-12-04T10:15:30.9469245Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:15:30.9470570Z collecting ... collected 1815 items 2025-12-04T10:15:30.9471167Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:15:30.9702481Z Running 371 items in this shard: test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__dyn_quant_matmul_4bit_fp32_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__dyn_quant_pack_4bit_weight_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_abs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adaptive_avg_pool2d1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_add_complex10_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_allow_reuse_active_if_under_peak_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_aoti_eager_cache_hit_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_assert_alignment_op_name_pass_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_assert_size_stride_op_name_pass_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool3d_backward3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bernoulli1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_add_autotune_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int64_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int64_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int8_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int8_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_uint8_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_int_ndigits_zero_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_upcasting_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cauchy_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_computed_buffer_inlining_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_consecutive_split_cumprod_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_constant_pad_2d_strides_nonpositive_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv3d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_functional_bn_fuse_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cos_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cpu_tensor_with_cpu_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cudnn_rnn_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumsum_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_deterministic_codegen_with_suffix_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dist_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div9_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_softmax_symfloat_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_deterministic_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_trivial_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtype_sympy_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float16_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_fusion_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int16_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int16_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int32_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int32_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int8_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_elu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_empty2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_exact_stride_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fallback_mutable_op_list_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fft_real_input_real_output_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_float16_to_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fmin_fmax_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_boolean_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_like_sliced_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_like_transposed_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_functionalize_rng_wrappers_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gather2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gather_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_generated_code_has_alignment_assert_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_glu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_graph_partition_both_scalars_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_graph_partition_constant_tensor1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_grid_sampler_2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_device_assert_masked_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_remainder_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_as_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_fallback2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_select_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inplace_activations_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_int8_weight_only_quant_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_int_input_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_invalid_operand_issue1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_isin_tensor_scalar_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_kwargs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_block_sizes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_tensor_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_leaky_relu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lerp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linspace1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linspace3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lite_mode_fallback_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_low_memory_max_pool_dilation_1_dim_3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_low_memory_max_pool_dilation_2_dim_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mark_dynamic_with_hint_override_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d6_dilation_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d_with_indices_backward5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mean_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_min_max_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_min_max_reduction_nan_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mul_index_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multi_device_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multi_gpu_recompile_on_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_sum_low_prec_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_var_lowp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_assert_inside_triton_kernel_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_sort_stable_True_descending_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_sort_stable_True_descending_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_op_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_specization_over_symbolic_value_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_bessel_j1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_chebyshev_polynomial_w_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_entr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_gammainc_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i0e_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_ndtri_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_t_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_v_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_w_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_zeta_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randint_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randn_generator_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_remove_noop_view_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_require_stride_expanded_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_roi_align_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_reduce1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_shape_padding_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_signbit_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_size_asserts_for_multi_output_fallback_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sort_stable_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sort_transpose_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_special_polygamma_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumsum_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_failed_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_reduction_with_int64_size_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_with_integer_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_squeeze1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum_int_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor_index_slice_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_to_device_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_torch_device_split_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_transpose_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_uint4x2_mixed_mm_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unfold_zero_dimension_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_bilinear2d_b_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest1d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest2d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_vectorized_ops_masked_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_view_as_complex_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_views2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_zeros_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool2d1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool2d_low_prec_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool_errors_with_long_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_max_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex7_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_any_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin_with_nan_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_min_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_as_strided_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_assert_alignment_op_name_fail_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool_errors_with_uint_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bernoulli2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bmm1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_broadcast_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int16_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int32_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int64_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int64_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_uint8_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_nd_tiling_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_unbacked_legacy_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_config_option_dont_assume_alignment_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_const_int32_to_float_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_constant_pad_2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv1d_depthwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv1d_with_permute_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_convolution3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_convolution4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_scan_would_split_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_deterministic_codegen_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_deterministic_codegen_with_suffix_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div_prim_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int32_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_elu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_empty1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expand_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_no_mutated_tensors_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_flip_cat_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fmin_fmax_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_full_boolean_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fuse_large_params_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fuse_tiled_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_gelu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_generated_code_has_alignment_assert_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_gpu_scalar_with_gpu_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_mutation_real_name_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_no_inputs_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_unbacked_symint_as_output_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_horizonal_fusion2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_float_zero_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_device_assert_masked_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_flip_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_as_masked_fill_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_failed_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_remainder_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inner_fn_str_and_stride_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inplace_activations_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inplace_mixed_dtype_ops_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_isinf2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_tensor_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_leaky_relu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linspace2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_lite_dynamic_shape_assertion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logaddexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d7_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d_with_indices_backward2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_min_max_reduction_nan_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mixed_mm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_move_arange_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_var_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_var_lowp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mutations_loop_fusion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_assert_inside_triton_kernel_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_sort_stable_True_descending_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_to_num_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_new_ones_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_no_op_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nonzero_unbacked_refinement_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_one_hot_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_permute1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_airy_ai_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_digamma_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfc_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfinv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_gammaincc_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_i1e_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_logit_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_i0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_ndtr_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow_by_natural_log2_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow_symfloat_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_int64_mod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_kernel_count_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randn_with_dtype_and_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reinterpret_dtypeview_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_remove_noop_slice1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_resize_as_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_resize_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_roi_align_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_round_correctness_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_cpu_tensor_arg_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_output_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_add1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_add2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_searchsorted_broadcast_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sign_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_simplify_loops_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_mutation2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_view_with_graph_break_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_softmax_backward_data_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sort_bool_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumprod_low_prec_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumsum_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sqrt_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_squeeze2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum_int_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_memory_format_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transpose_add_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_triton_argmin_argmax_transpose_logical_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unbacked_floordiv_simplify_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unsigned_constant_tensors_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unspec_inputs_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unspec_inputs_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_nearest2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_var_mean_div_by_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_as_complex_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_as_real_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_detach_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_weight_norm_conv2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_bool_mask_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_dynamic_stride_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_neginf_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_return_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_full_recompiles_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_bool_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_materialize_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_return_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_mark_unbacked_slice_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op2_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_multi_output_unbacked_custom_op_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_non_persistent_dynamic_rblock_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_slice_scatter_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_sort_dynamic_shape_with_check_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_symint_sum_list_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unspecialized_float_operations_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_wrapper_codegen_statically_known_int_or_none_cuda 2025-12-04T10:15:30.9927003Z 2025-12-04T10:15:30.9928738Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py W1204 10:04:40.404000 24633 site-packages/torch/_dynamo/variables/torch.py:1533] [0/0] Calling on only torch.SymInt arguments is not yet supported. 2025-12-04T10:15:30.9931389Z W1204 10:04:40.404000 24633 site-packages/torch/_dynamo/variables/torch.py:1533] [0/0] To support this behavior, we need to allow const-propping tensors that store symint data. 2025-12-04T10:15:30.9932962Z W1204 10:04:40.404000 24633 site-packages/torch/_dynamo/variables/torch.py:1533] [0/0] For now, dynamo will explicitly graph break when it encounters user code with this behavior. 2025-12-04T10:15:30.9934119Z W1204 10:04:40.404000 24633 site-packages/torch/_dynamo/variables/torch.py:1533] [0/0] 2025-12-04T10:15:30.9934688Z XFAIL [18.5404s] [ 0%] 2025-12-04T10:15:30.9935735Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__dyn_quant_matmul_4bit_fp32_input_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.4354s] [ 0%] 2025-12-04T10:15:30.9937569Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__dyn_quant_pack_4bit_weight_bf16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.1143s] [ 0%] 2025-12-04T10:15:30.9939315Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_abs_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7558s] [ 1%] 2025-12-04T10:15:30.9940975Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adaptive_avg_pool2d1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [7.6780s] [ 1%] 2025-12-04T10:15:30.9942675Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_add_complex10_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9310s] [ 1%] 2025-12-04T10:15:30.9944432Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_allow_reuse_active_if_under_peak_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.6091s] [ 1%] 2025-12-04T10:15:30.9946558Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_aoti_eager_cache_hit_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py W1204 10:05:18.995000 24633 site-packages/torch/_export/__init__.py:71] +============================+ 2025-12-04T10:15:30.9948174Z W1204 10:05:18.995000 24633 site-packages/torch/_export/__init__.py:72] | !!! WARNING !!! | 2025-12-04T10:15:30.9949029Z W1204 10:05:18.996000 24633 site-packages/torch/_export/__init__.py:73] +============================+ 2025-12-04T10:15:30.9950750Z W1204 10:05:18.996000 24633 site-packages/torch/_export/__init__.py:74] torch._export.aot_compile()/torch._export.aot_load() is being deprecated, please switch to directly calling torch._inductor.aoti_compile_and_package(torch.export.export())/torch._inductor.aoti_load_package() instead. 2025-12-04T10:15:30.9952222Z PASSED [5.4246s] [ 2%] 2025-12-04T10:15:30.9953141Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange6_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.4237s] [ 2%] 2025-12-04T10:15:30.9954794Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0773s] [ 2%] 2025-12-04T10:15:30.9956547Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2992s] [ 2%] 2025-12-04T10:15:30.9958363Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_assert_alignment_op_name_pass_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0032s] [ 3%] 2025-12-04T10:15:30.9960263Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_assert_size_stride_op_name_pass_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0029s] [ 3%] 2025-12-04T10:15:30.9962088Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool3d_backward3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.4975s] [ 3%] 2025-12-04T10:15:30.9963774Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bernoulli1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0407s] [ 4%] 2025-12-04T10:15:30.9965478Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_add_autotune_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3510s] [ 4%] 2025-12-04T10:15:30.9967234Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int64_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0991s] [ 4%] 2025-12-04T10:15:30.9969005Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int64_int64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.4767s] [ 4%] 2025-12-04T10:15:30.9970759Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int8_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1108s] [ 5%] 2025-12-04T10:15:30.9972515Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_int8_int64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1633s] [ 5%] 2025-12-04T10:15:30.9974257Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_int_uint8_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1719s] [ 5%] 2025-12-04T10:15:30.9976071Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7517s] [ 5%] 2025-12-04T10:15:30.9977901Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_int_ndigits_zero_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7109s] [ 6%] 2025-12-04T10:15:30.9979708Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_upcasting_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0191s] [ 6%] 2025-12-04T10:15:30.9981338Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cauchy_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1142s] [ 6%] 2025-12-04T10:15:30.9983019Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_computed_buffer_inlining_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8454s] [ 7%] 2025-12-04T10:15:30.9984808Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_consecutive_split_cumprod_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8534s] [ 7%] 2025-12-04T10:15:30.9986643Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_constant_pad_2d_strides_nonpositive_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0558s] [ 7%] 2025-12-04T10:15:30.9988511Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv3d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7374s] [ 7%] 2025-12-04T10:15:30.9990188Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_functional_bn_fuse_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.4811s] [ 8%] 2025-12-04T10:15:30.9991905Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.4064s] [ 8%] 2025-12-04T10:15:30.9993630Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cos_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9595s] [ 8%] 2025-12-04T10:15:30.9995284Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cpu_tensor_with_cpu_tensor_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7248s] [ 8%] 2025-12-04T10:15:30.9997149Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cudnn_rnn_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (requires cuda) [ 9%] 2025-12-04T10:15:30.9998879Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumsum_zero_dim_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.6794s] [ 9%] 2025-12-04T10:15:31.0000643Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_deterministic_codegen_with_suffix_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.2897s] [ 9%] 2025-12-04T10:15:31.0002393Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dist_bf16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8299s] [ 9%] 2025-12-04T10:15:31.0003969Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div9_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7578s] [ 10%] 2025-12-04T10:15:31.0005621Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_softmax_symfloat_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.5358s] [ 10%] 2025-12-04T10:15:31.0007300Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_zero_dim_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.5510s] [ 10%] 2025-12-04T10:15:31.0008926Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.6263s] [ 11%] 2025-12-04T10:15:31.0010534Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.3370s] [ 11%] 2025-12-04T10:15:31.0012214Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_deterministic_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0089s] [ 11%] 2025-12-04T10:15:31.0013952Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_trivial_1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1127s] [ 11%] 2025-12-04T10:15:31.0015640Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtype_sympy_expr_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.1434s] [ 12%] 2025-12-04T10:15:31.0017371Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float16_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8327s] [ 12%] 2025-12-04T10:15:31.0019211Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_float32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7423s] [ 12%] 2025-12-04T10:15:31.0020977Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float32_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8648s] [ 12%] 2025-12-04T10:15:31.0022752Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_float32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8696s] [ 13%] 2025-12-04T10:15:31.0024534Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7834s] [ 13%] 2025-12-04T10:15:31.0026304Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_float64_int8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0042s] [ 13%] 2025-12-04T10:15:31.0028099Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_fusion_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8732s] [ 14%] 2025-12-04T10:15:31.0029814Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int16_float32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8022s] [ 14%] 2025-12-04T10:15:31.0031642Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int16_int8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7892s] [ 14%] 2025-12-04T10:15:31.0033391Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int32_float16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8109s] [ 14%] 2025-12-04T10:15:31.0035155Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int32_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7910s] [ 15%] 2025-12-04T10:15:31.0036905Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_float16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0041s] [ 15%] 2025-12-04T10:15:31.0038647Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0037s] [ 15%] 2025-12-04T10:15:31.0040384Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8717s] [ 15%] 2025-12-04T10:15:31.0042114Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_int8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0041s] [ 16%] 2025-12-04T10:15:31.0043847Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int64_uint8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0037s] [ 16%] 2025-12-04T10:15:31.0045590Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_int8_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8301s] [ 16%] 2025-12-04T10:15:31.0047244Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_elu_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1012s] [ 16%] 2025-12-04T10:15:31.0048835Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_empty2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0827s] [ 17%] 2025-12-04T10:15:31.0050463Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_exact_stride_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1455s] [ 17%] 2025-12-04T10:15:31.0052166Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fallback_mutable_op_list_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.5602s] [ 17%] 2025-12-04T10:15:31.0053921Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fft_real_input_real_output_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.2343s] [ 18%] 2025-12-04T10:15:31.0055648Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_float16_to_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7371s] [ 18%] 2025-12-04T10:15:31.0057299Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fmin_fmax_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9127s] [ 18%] 2025-12-04T10:15:31.0059005Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_boolean_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7372s] [ 18%] 2025-12-04T10:15:31.0060689Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_like_sliced_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7571s] [ 19%] 2025-12-04T10:15:31.0062437Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_like_transposed_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8057s] [ 19%] 2025-12-04T10:15:31.0064209Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_functionalize_rng_wrappers_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0545s] [ 19%] 2025-12-04T10:15:31.0066044Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gather2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0032s] [ 19%] 2025-12-04T10:15:31.0067685Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gather_scatter_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1201s] [ 20%] 2025-12-04T10:15:31.0069654Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_generated_code_has_alignment_assert_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (triton backend is required for cpu) [ 20%] 2025-12-04T10:15:31.0071564Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_glu_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6879s] [ 20%] 2025-12-04T10:15:31.0073249Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_graph_partition_both_scalars_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7728s] [ 21%] 2025-12-04T10:15:31.0075087Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_graph_partition_constant_tensor1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.6969s] [ 21%] 2025-12-04T10:15:31.0076854Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_grid_sampler_2d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [6.4655s] [ 21%] 2025-12-04T10:15:31.0078493Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1531s] [ 21%] 2025-12-04T10:15:31.0080248Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_device_assert_masked_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0884s] [ 22%] 2025-12-04T10:15:31.0082119Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_remainder_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0252s] [ 22%] 2025-12-04T10:15:31.0083848Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1926s] [ 22%] 2025-12-04T10:15:31.0085534Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_as_masked_fill_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6887s] [ 22%] 2025-12-04T10:15:31.0087259Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_fallback2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3234s] [ 23%] 2025-12-04T10:15:31.0089100Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_index_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9187s] [ 23%] 2025-12-04T10:15:31.0090760Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_select_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.7463s] [ 23%] 2025-12-04T10:15:31.0092589Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7364s] [ 23%] 2025-12-04T10:15:31.0094456Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inplace_activations_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3373s] [ 24%] 2025-12-04T10:15:31.0096149Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7385s] [ 24%] 2025-12-04T10:15:31.0097934Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.6886s] [ 24%] 2025-12-04T10:15:31.0099727Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_int8_weight_only_quant_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8282s] [ 25%] 2025-12-04T10:15:31.0101566Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_int_input_dynamic_shapes_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8563s] [ 25%] 2025-12-04T10:15:31.0103323Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_invalid_operand_issue1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7659s] [ 25%] 2025-12-04T10:15:31.0105034Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_isin_tensor_scalar_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.1305s] [ 25%] 2025-12-04T10:15:31.0106681Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_kwargs_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.0235s] [ 26%] 2025-12-04T10:15:31.0108323Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_block_sizes_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [18.3523s] [ 26%] 2025-12-04T10:15:31.0110163Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_tensor_reduction_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (cpu not supported) [ 26%] 2025-12-04T10:15:31.0125015Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_leaky_relu_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8752s] [ 26%] 2025-12-04T10:15:31.0126637Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lerp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9949s] [ 27%] 2025-12-04T10:15:31.0128294Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1555s] [ 27%] 2025-12-04T10:15:31.0129954Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linspace1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7764s] [ 27%] 2025-12-04T10:15:31.0131596Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linspace3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0852s] [ 28%] 2025-12-04T10:15:31.0133258Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lite_mode_fallback_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1176s] [ 28%] 2025-12-04T10:15:31.0135043Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_low_memory_max_pool_dilation_1_dim_3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [6.2199s] [ 28%] 2025-12-04T10:15:31.0136902Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_low_memory_max_pool_dilation_2_dim_2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [9.8670s] [ 28%] 2025-12-04T10:15:31.0139135Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mark_dynamic_with_hint_override_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 29%] 2025-12-04T10:15:31.0141209Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_masked_fill_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9580s] [ 29%] 2025-12-04T10:15:31.0142849Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.6791s] [ 29%] 2025-12-04T10:15:31.0144469Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.4010s] [ 29%] 2025-12-04T10:15:31.0146317Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d6_dilation_1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.1155s] [ 30%] 2025-12-04T10:15:31.0148215Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d_with_indices_backward5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.3912s] [ 30%] 2025-12-04T10:15:31.0149937Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mean_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2888s] [ 30%] 2025-12-04T10:15:31.0151561Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_min_max_reduction_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.8879s] [ 30%] 2025-12-04T10:15:31.0153275Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_min_max_reduction_nan_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7543s] [ 31%] 2025-12-04T10:15:31.0154975Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mul_index_expr_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7495s] [ 31%] 2025-12-04T10:15:31.0157039Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multi_device_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py W1204 10:08:04.833000 24633 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:15:31.0158720Z W1204 10:08:04.837000 24633 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:15:31.0159646Z W1204 10:08:04.840000 24633 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:15:31.0160577Z W1204 10:08:04.844000 24633 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:15:31.0161226Z PASSED [1.1537s] [ 31%] 2025-12-04T10:15:31.0162427Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multi_gpu_recompile_on_index_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (requires multiple cuda devices) [ 32%] 2025-12-04T10:15:31.0164452Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_sum_low_prec_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (requires cuda) [ 32%] 2025-12-04T10:15:31.0166275Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_var_lowp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.9047s] [ 32%] 2025-12-04T10:15:31.0168047Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7377s] [ 32%] 2025-12-04T10:15:31.0169868Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_assert_inside_triton_kernel_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.6958s] [ 33%] 2025-12-04T10:15:31.0171715Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_sort_stable_True_descending_False_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1154s] [ 33%] 2025-12-04T10:15:31.0173584Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_nan_sort_stable_True_descending_True_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1097s] [ 33%] 2025-12-04T10:15:31.0175348Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_op_reduction_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7505s] [ 33%] 2025-12-04T10:15:31.0177113Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_specization_over_symbolic_value_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7255s] [ 34%] 2025-12-04T10:15:31.0179061Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_bessel_j1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8308s] [ 34%] 2025-12-04T10:15:31.0180867Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_chebyshev_polynomial_w_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7789s] [ 34%] 2025-12-04T10:15:31.0182699Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_entr_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0416s] [ 35%] 2025-12-04T10:15:31.0184403Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_gammainc_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8616s] [ 35%] 2025-12-04T10:15:31.0186101Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i0e_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7387s] [ 35%] 2025-12-04T10:15:31.0188025Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7570s] [ 35%] 2025-12-04T10:15:31.0189798Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_ndtri_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7699s] [ 36%] 2025-12-04T10:15:31.0191610Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_t_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7701s] [ 36%] 2025-12-04T10:15:31.0193560Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_v_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7793s] [ 36%] 2025-12-04T10:15:31.0195505Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_w_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7765s] [ 36%] 2025-12-04T10:15:31.0197330Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_zeta_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7826s] [ 37%] 2025-12-04T10:15:31.0198956Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.6928s] [ 37%] 2025-12-04T10:15:31.0200531Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randint_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3713s] [ 37%] 2025-12-04T10:15:31.0202174Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randn_generator_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7759s] [ 38%] 2025-12-04T10:15:31.0203836Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9279s] [ 38%] 2025-12-04T10:15:31.0205653Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (Non-deterministic CPU results) [ 38%] 2025-12-04T10:15:31.0207496Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_remove_noop_view_dtype_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1967s] [ 38%] 2025-12-04T10:15:31.0209251Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_require_stride_expanded_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8339s] [ 39%] 2025-12-04T10:15:31.0210939Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_roi_align_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.0456s] [ 39%] 2025-12-04T10:15:31.0212551Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0620s] [ 39%] 2025-12-04T10:15:31.0214298Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_bf16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.3931s] [ 39%] 2025-12-04T10:15:31.0215951Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_reduce1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8038s] [ 40%] 2025-12-04T10:15:31.0218054Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Does not support SDPA or pre-SM80 hardware) [ 40%] 2025-12-04T10:15:31.0220148Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_shape_padding_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.0734s] [ 40%] 2025-12-04T10:15:31.0221783Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_signbit_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0339s] [ 40%] 2025-12-04T10:15:31.0223539Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_size_asserts_for_multi_output_fallback_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1440s] [ 41%] 2025-12-04T10:15:31.0225273Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1609s] [ 41%] 2025-12-04T10:15:31.0226898Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8267s] [ 41%] 2025-12-04T10:15:31.0228547Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sort_stable_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.6289s] [ 42%] 2025-12-04T10:15:31.0230201Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sort_transpose_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.3063s] [ 42%] 2025-12-04T10:15:31.0231891Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_special_polygamma_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.1499s] [ 42%] 2025-12-04T10:15:31.0233556Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumsum_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [5.6919s] [ 42%] 2025-12-04T10:15:31.0235205Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_failed_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0261s] [ 43%] 2025-12-04T10:15:31.0237581Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_reduction_with_int64_size_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (The test fails some times on CI: https://github.com/pytorch/pytorch/actions/runs/15333913377/job/43153170162. Skip for now.) [ 43%] 2025-12-04T10:15:31.0239974Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_with_integer_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7846s] [ 43%] 2025-12-04T10:15:31.0241640Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_squeeze1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9232s] [ 43%] 2025-12-04T10:15:31.0243224Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8062s] [ 44%] 2025-12-04T10:15:31.0244785Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8208s] [ 44%] 2025-12-04T10:15:31.0246364Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum_int_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.3479s] [ 44%] 2025-12-04T10:15:31.0248030Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8103s] [ 45%] 2025-12-04T10:15:31.0249690Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor_index_slice_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.6946s] [ 45%] 2025-12-04T10:15:31.0251719Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (triton backend is required for cpu) [ 45%] 2025-12-04T10:15:31.0253664Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_to_device_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.1369s] [ 45%] 2025-12-04T10:15:31.0255335Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_torch_device_split_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0892s] [ 46%] 2025-12-04T10:15:31.0257002Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_transpose_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6373s] [ 46%] 2025-12-04T10:15:31.0258731Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_uint4x2_mixed_mm_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1521s] [ 46%] 2025-12-04T10:15:31.0260470Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unfold_zero_dimension_tensor_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0927s] [ 46%] 2025-12-04T10:15:31.0262243Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_bilinear2d_b_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.7154s] [ 47%] 2025-12-04T10:15:31.0263983Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest1d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6488s] [ 47%] 2025-12-04T10:15:31.0265761Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest2d_backward_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.4465s] [ 47%] 2025-12-04T10:15:31.0267538Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_vectorized_ops_masked_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9303s] [ 47%] 2025-12-04T10:15:31.0269236Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_view_as_complex_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9836s] [ 48%] 2025-12-04T10:15:31.0270873Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_views2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [5.9968s] [ 48%] 2025-12-04T10:15:31.0272456Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_zeros_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7943s] [ 48%] 2025-12-04T10:15:31.0274173Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.3986s] [ 49%] 2025-12-04T10:15:31.0275954Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool2d1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [5.5080s] [ 49%] 2025-12-04T10:15:31.0277726Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool2d_low_prec_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0460s] [ 49%] 2025-12-04T10:15:31.0279554Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool_errors_with_long_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5916s] [ 49%] 2025-12-04T10:15:31.0281347Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_max_pool2d2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2019s] [ 50%] 2025-12-04T10:15:31.0283125Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6771s] [ 50%] 2025-12-04T10:15:31.0284777Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex7_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2734s] [ 50%] 2025-12-04T10:15:31.0286454Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_any_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.4386s] [ 50%] 2025-12-04T10:15:31.0288254Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0547s] [ 51%] 2025-12-04T10:15:31.0289975Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin_with_nan_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5332s] [ 51%] 2025-12-04T10:15:31.0291701Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_min_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9916s] [ 51%] 2025-12-04T10:15:31.0293391Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_as_strided_scatter_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3096s] [ 52%] 2025-12-04T10:15:31.0295161Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_assert_alignment_op_name_fail_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0034s] [ 52%] 2025-12-04T10:15:31.0296948Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool_errors_with_uint_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3572s] [ 52%] 2025-12-04T10:15:31.0298707Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bernoulli2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.7223s] [ 52%] 2025-12-04T10:15:31.0300332Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bmm1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4236s] [ 53%] 2025-12-04T10:15:31.0301980Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_broadcast_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8623s] [ 53%] 2025-12-04T10:15:31.0303743Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int16_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4696s] [ 53%] 2025-12-04T10:15:31.0305523Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int32_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4367s] [ 53%] 2025-12-04T10:15:31.0307297Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int64_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4716s] [ 54%] 2025-12-04T10:15:31.0309071Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_int64_int8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4753s] [ 54%] 2025-12-04T10:15:31.0310827Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_int_uint8_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4362s] [ 54%] 2025-12-04T10:15:31.0312619Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_nd_tiling_False_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5831s] [ 54%] 2025-12-04T10:15:31.0314352Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3107s] [ 55%] 2025-12-04T10:15:31.0315984Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.8344s] [ 55%] 2025-12-04T10:15:31.0317670Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_empty_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.7042s] [ 55%] 2025-12-04T10:15:31.0319384Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_unbacked_legacy_empty_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0433s] [ 56%] 2025-12-04T10:15:31.0321307Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_config_option_dont_assume_alignment_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3384s] [ 56%] 2025-12-04T10:15:31.0323112Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_const_int32_to_float_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7315s] [ 56%] 2025-12-04T10:15:31.0324814Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_constant_pad_2d_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.2608s] [ 56%] 2025-12-04T10:15:31.0326507Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv1d_depthwise_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0178s] [ 57%] 2025-12-04T10:15:31.0328231Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv1d_with_permute_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.0965s] [ 57%] 2025-12-04T10:15:31.0329942Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_convolution3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [4.0907s] [ 57%] 2025-12-04T10:15:31.0331616Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_convolution4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.8104s] [ 57%] 2025-12-04T10:15:31.0333388Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2241s] [ 58%] 2025-12-04T10:15:31.0335116Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1833s] [ 58%] 2025-12-04T10:15:31.0336813Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_scan_would_split_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4601s] [ 58%] 2025-12-04T10:15:31.0338640Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_deterministic_codegen_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [11.1297s] [ 59%] 2025-12-04T10:15:31.0340478Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_deterministic_codegen_with_suffix_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.4929s] [ 59%] 2025-12-04T10:15:31.0342216Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0769s] [ 59%] 2025-12-04T10:15:31.0343790Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8606s] [ 59%] 2025-12-04T10:15:31.0345390Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div_prim_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6557s] [ 60%] 2025-12-04T10:15:31.0347280Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_float32_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (uses bfloat16 which requires SM >= 80) [ 60%] 2025-12-04T10:15:31.0349430Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_float32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 60%] 2025-12-04T10:15:31.0351631Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int16_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 60%] 2025-12-04T10:15:31.0353744Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int32_uint8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 61%] 2025-12-04T10:15:31.0355943Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_float32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 61%] 2025-12-04T10:15:31.0358081Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int64_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 61%] 2025-12-04T10:15:31.0360220Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_float32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 61%] 2025-12-04T10:15:31.0362342Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 62%] 2025-12-04T10:15:31.0364463Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 62%] 2025-12-04T10:15:31.0366561Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_int8_int8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 62%] 2025-12-04T10:15:31.0368405Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_elu_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0774s] [ 63%] 2025-12-04T10:15:31.0369993Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_empty1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7359s] [ 63%] 2025-12-04T10:15:31.0371604Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expand_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6511s] [ 63%] 2025-12-04T10:15:31.0373354Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_no_mutated_tensors_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0335s] [ 63%] 2025-12-04T10:15:31.0375100Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_flip_cat_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5751s] [ 64%] 2025-12-04T10:15:31.0376721Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fmin_fmax_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7291s] [ 64%] 2025-12-04T10:15:31.0378414Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_full_boolean_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3635s] [ 64%] 2025-12-04T10:15:31.0380382Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fuse_large_params_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 64%] 2025-12-04T10:15:31.0382348Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fuse_tiled_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8074s] [ 65%] 2025-12-04T10:15:31.0383941Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_gelu_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0002s] [ 65%] 2025-12-04T10:15:31.0385745Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_generated_code_has_alignment_assert_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3267s] [ 65%] 2025-12-04T10:15:31.0387574Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_gpu_scalar_with_gpu_tensor_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2011s] [ 66%] 2025-12-04T10:15:31.0389616Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_mutation_real_name_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5249s] [ 66%] 2025-12-04T10:15:31.0391450Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_no_inputs_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7773s] [ 66%] 2025-12-04T10:15:31.0393305Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_graph_partition_unbacked_symint_as_output_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6333s] [ 66%] 2025-12-04T10:15:31.0395146Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_horizonal_fusion2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7731s] [ 67%] 2025-12-04T10:15:31.0396859Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_float_zero_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5698s] [ 67%] 2025-12-04T10:15:31.0398665Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_device_assert_masked_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8926s] [ 67%] 2025-12-04T10:15:31.0400508Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_flip_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2087s] [ 67%] 2025-12-04T10:15:31.0402256Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_as_masked_fill_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3972s] [ 68%] 2025-12-04T10:15:31.0404034Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_failed_reinplace_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5778s] [ 68%] 2025-12-04T10:15:31.0405772Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_remainder_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5418s] [ 68%] 2025-12-04T10:15:31.0407455Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_tensor_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7467s] [ 69%] 2025-12-04T10:15:31.0409271Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5786s] [ 69%] 2025-12-04T10:15:31.0411139Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inner_fn_str_and_stride_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2740s] [ 69%] 2025-12-04T10:15:31.0412881Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inplace_activations_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3721s] [ 69%] 2025-12-04T10:15:31.0414629Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inplace_mixed_dtype_ops_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3461s] [ 70%] 2025-12-04T10:15:31.0416320Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_isinf2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4062s] [ 70%] 2025-12-04T10:15:31.0417992Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_tensor_reduction_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0068s] [ 70%] 2025-12-04T10:15:31.0419759Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_leaky_relu_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7968s] [ 70%] 2025-12-04T10:15:31.0421686Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.0073s] [ 71%] 2025-12-04T10:15:31.0423795Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.0049s] [ 71%] 2025-12-04T10:15:31.0425723Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py FAILED [0.0047s] [ 71%] 2025-12-04T10:15:31.0426702Z 2025-12-04T10:15:31.0426848Z ==================================== RERUNS ==================================== 2025-12-04T10:15:31.0427472Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.0428418Z Traceback (most recent call last): 2025-12-04T10:15:31.0429217Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.0429989Z self.common( 2025-12-04T10:15:31.0430442Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.0430988Z return func(*args, **kwds) 2025-12-04T10:15:31.0431639Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.0432310Z check_model( 2025-12-04T10:15:31.0432879Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.0433582Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.0434224Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.0434882Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0435924Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0436812Z 2025-12-04T10:15:31.0437043Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0438180Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0439081Z 2025-12-04T10:15:31.0439347Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0440084Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.0440673Z Traceback (most recent call last): 2025-12-04T10:15:31.0441455Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.0442224Z self.common( 2025-12-04T10:15:31.0442672Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.0443215Z return func(*args, **kwds) 2025-12-04T10:15:31.0443855Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.0444530Z check_model( 2025-12-04T10:15:31.0445100Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.0445792Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.0446447Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.0447107Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0448148Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0449036Z 2025-12-04T10:15:31.0449254Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0450462Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0451386Z 2025-12-04T10:15:31.0451655Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0452286Z =================================== FAILURES =================================== 2025-12-04T10:15:31.0452892Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.0453482Z Traceback (most recent call last): 2025-12-04T10:15:31.0454271Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.0455053Z self.common( 2025-12-04T10:15:31.0455485Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.0456023Z return func(*args, **kwds) 2025-12-04T10:15:31.0456677Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.0457345Z check_model( 2025-12-04T10:15:31.0457912Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.0458709Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.0459366Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.0460016Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0461057Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0461949Z 2025-12-04T10:15:31.0462179Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0463310Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0464209Z 2025-12-04T10:15:31.0464479Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0465821Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-655beaec67eb1ac4.xml - 2025-12-04T10:15:31.0467073Z =========================== short test summary info ============================ 2025-12-04T10:15:31.0468868Z FAILED [0.0047s] inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0470468Z 2025-12-04T10:15:31.0470696Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0471808Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0472721Z 2025-12-04T10:15:31.0472985Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0473581Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:15:31.0474146Z == 1 failed, 239 passed, 21 skipped, 3 xfailed, 2 rerun in 363.82s (0:06:03) === 2025-12-04T10:15:31.0474640Z Got exit code 1 2025-12-04T10:15:31.0474911Z Retrying single test... 2025-12-04T10:15:31.0475551Z W1204 10:10:56.185000 32507 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:15:31.0476866Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-3eb5ad48105c1cd2.xml 2025-12-04T10:15:31.0477916Z ============================= test session starts ============================== 2025-12-04T10:15:31.0478659Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:15:31.0479272Z cachedir: .pytest_cache 2025-12-04T10:15:31.0479972Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:15:31.0480833Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:15:31.0481189Z configfile: pytest.ini 2025-12-04T10:15:31.0481911Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:15:31.0482828Z collecting ... collected 1815 items / 370 deselected / 1445 selected 2025-12-04T10:15:31.0484061Z stepcurrent: skipping 263 already run items. Running only test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0485170Z Running 1 items in this shard 2025-12-04T10:15:31.0485382Z 2025-12-04T10:15:31.0486755Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py [W1204 10:10:58.537202202 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.0488391Z 2025-12-04T10:15:31.0488918Z [W1204 10:11:14.570299440 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.0489593Z 2025-12-04T10:15:31.0489729Z ('RERUN', {'yellow': True}) [16.0729s] [100%] 2025-12-04T10:15:31.0491366Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py [W1204 10:11:14.615657500 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.0492858Z 2025-12-04T10:15:31.0493399Z [W1204 10:11:14.616472507 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.0494055Z 2025-12-04T10:15:31.0494204Z ('RERUN', {'yellow': True}) [0.0054s] [100%] 2025-12-04T10:15:31.0495818Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py [W1204 10:11:14.622265730 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.0497320Z 2025-12-04T10:15:31.0497843Z [W1204 10:11:14.623043464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.0498604Z 2025-12-04T10:15:31.0498710Z FAILED [0.0049s] [100%] 2025-12-04T10:15:31.0498888Z 2025-12-04T10:15:31.0499047Z ==================================== RERUNS ==================================== 2025-12-04T10:15:31.0499656Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.0500247Z Traceback (most recent call last): 2025-12-04T10:15:31.0501038Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.0501829Z self.common( 2025-12-04T10:15:31.0502264Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.0502811Z return func(*args, **kwds) 2025-12-04T10:15:31.0503468Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.0504136Z check_model( 2025-12-04T10:15:31.0504706Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.0505412Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.0506060Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.0506834Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0507877Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0509651Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.0510655Z C++ CapturedTraceback: 2025-12-04T10:15:31.0512165Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.0514101Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.0515070Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.0516364Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0517707Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.0518695Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0519559Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0520594Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0523761Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0526581Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0527168Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0529499Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0531929Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.0532922Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0535951Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0538630Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0539480Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.0540355Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.0541132Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0541993Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0542915Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0543703Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0544496Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0545433Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0546359Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0547178Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0547880Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0548664Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0549583Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0550517Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0551304Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0552074Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0552859Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0553648Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0554580Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0555501Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0556286Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0557066Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0558001Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0558924Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0559853Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0560788Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0561851Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0563052Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0563980Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0564911Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0565717Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0566580Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0567363Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0568307Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0569101Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0569854Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0570606Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0571476Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0572394Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0573333Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0574268Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0575045Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0575828Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0576755Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0577684Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0578714Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0579647Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0580515Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0581328Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0582071Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0582791Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.0583471Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0584253Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0585175Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0586105Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0587045Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0588111Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0588881Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0589641Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0590558Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0591459Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0592374Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0593292Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0594070Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0594947Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0595875Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0596797Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0597803Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0598712Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0599582Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0600384Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0601118Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0601861Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0602721Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0603650Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0604420Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0605189Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0606110Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0607031Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0607942Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0608869Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0609752Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0610561Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0611310Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0612061Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0612925Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0613848Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0614789Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0615722Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0616661Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0617588Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0618457Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0619250Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0620189Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0621112Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0622044Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0622980Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0623951Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0624754Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0625505Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0626316Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0627172Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0628115Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0629048Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0629982Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0630911Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0631849Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0632778Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0633716Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0634644Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0635582Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0636400Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.0637142Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.0637856Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.0638552Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.0639335Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.0640151Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.0640916Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.0641621Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.0642299Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.0642887Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.0643321Z #141 _start from ??:0 2025-12-04T10:15:31.0643617Z #142 from ??:0 2025-12-04T10:15:31.0643852Z 2025-12-04T10:15:31.0643857Z 2025-12-04T10:15:31.0644071Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0645203Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0646119Z 2025-12-04T10:15:31.0646387Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0647016Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:15:31.0648512Z /var/lib/jenkins/workspace/test/inductor/test_torchinductor.py:5994: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:15:31.0649912Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0650499Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.0651082Z Traceback (most recent call last): 2025-12-04T10:15:31.0651907Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.0652691Z self.common( 2025-12-04T10:15:31.0653132Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.0653725Z return func(*args, **kwds) 2025-12-04T10:15:31.0654357Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.0655031Z check_model( 2025-12-04T10:15:31.0655592Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.0656271Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.0656911Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.0657557Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0658705Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0660354Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.0661369Z C++ CapturedTraceback: 2025-12-04T10:15:31.0662870Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.0664783Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.0665732Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.0667016Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0668365Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.0669351Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0670195Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0671220Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0674384Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0677197Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0677786Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0680170Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0682625Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.0683657Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0686618Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0689391Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0690111Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.0690992Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.0691752Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0692614Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0693541Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0694313Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0695097Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0696022Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0696949Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0697750Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0698525Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0699302Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0700233Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0701152Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0701936Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0702712Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0703499Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0704264Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0705203Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0706135Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0706903Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0707683Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0708614Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0709545Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0710573Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0711506Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0712431Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0713441Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0714356Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0715282Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0716098Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0716799Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0717570Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0718450Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0719260Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0720004Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0720753Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0721619Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0722552Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0723469Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0724399Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0725186Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0725966Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0726886Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0727816Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0728743Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0729661Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0730533Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0731338Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0732099Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0732801Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.0733477Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0734261Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0735192Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0736111Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0737045Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0737978Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0738837Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0739693Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0740623Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0741617Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0742530Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0743461Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0744242Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0745017Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0745933Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0746871Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0747799Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0748737Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0749601Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0750409Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0751161Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0751912Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0752760Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0753692Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0754474Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0755237Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0756172Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0757102Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0758033Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0758961Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0759854Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0760690Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0761449Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0762199Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0763084Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0764033Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0764972Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0765920Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0766871Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0767820Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0768670Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0769476Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0770489Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0771439Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0772372Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0773319Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0774202Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0775022Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0775778Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0776542Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0777424Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0778432Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0779381Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0780326Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0781270Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0782197Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0783139Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0784083Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0785031Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0785959Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0786788Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.0787540Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.0788439Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.0789132Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.0789922Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.0790757Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.0791517Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.0792237Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.0792926Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.0793542Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.0793970Z #141 _start from ??:0 2025-12-04T10:15:31.0794271Z #142 from ??:0 2025-12-04T10:15:31.0794506Z 2025-12-04T10:15:31.0794511Z 2025-12-04T10:15:31.0794740Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0795987Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0796895Z 2025-12-04T10:15:31.0797163Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0797808Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:15:31.0799398Z /var/lib/jenkins/workspace/test/inductor/test_torchinductor.py:5994: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:15:31.0800826Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0801238Z =================================== FAILURES =================================== 2025-12-04T10:15:31.0801864Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.0802452Z Traceback (most recent call last): 2025-12-04T10:15:31.0803225Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.0804009Z self.common( 2025-12-04T10:15:31.0804462Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.0805003Z return func(*args, **kwds) 2025-12-04T10:15:31.0805641Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.0806318Z check_model( 2025-12-04T10:15:31.0806883Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.0807574Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.0808224Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.0808885Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0809923Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0811576Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.0812600Z C++ CapturedTraceback: 2025-12-04T10:15:31.0814112Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.0816041Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.0816996Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.0818292Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0819712Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.0820705Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0821566Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0822581Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0825814Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0828702Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0829302Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0831645Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0834093Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.0835072Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0838067Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0840665Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0841399Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.0842281Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.0843051Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0843918Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0844855Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0845646Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0846416Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0847347Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0848280Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0849087Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0849788Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0850572Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0851501Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0852420Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0853204Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0853987Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0854772Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0855606Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0856545Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0857542Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0858381Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0859172Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0860112Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0861050Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0861969Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0862913Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0863846Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0864783Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0865805Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0866854Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0867680Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0868386Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0869154Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0870032Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0870837Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0871576Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0872326Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0873186Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0874118Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0875035Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0875964Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0876749Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0877534Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0878452Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0879389Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0880318Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0881250Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0882113Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0882921Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0883673Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0884476Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.0885155Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0886001Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0886933Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0888027Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0888958Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0889883Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0890665Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0891441Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0892376Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0893312Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0894229Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0895157Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0895940Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0896721Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0897639Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0898634Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0899570Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0900503Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0901371Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0902183Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0902931Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0903668Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0904513Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0905439Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0906217Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0906983Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0907909Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0908838Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0909764Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0910694Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0911576Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0912391Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0913260Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0914001Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0914953Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0915893Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0916829Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0917756Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0918686Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0919626Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0920412Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0921194Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0922139Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0923074Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0923998Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0924934Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0925818Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.0926630Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.0927381Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.0928126Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0928990Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0929933Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0930859Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0931793Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0932729Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0933644Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0934598Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0935544Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0936487Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0937424Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0938253Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.0939076Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.0939815Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.0940510Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.0941306Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.0942213Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.0942990Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.0943778Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.0944470Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.0945089Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.0945523Z #141 _start from ??:0 2025-12-04T10:15:31.0945833Z #142 from ??:0 2025-12-04T10:15:31.0946071Z 2025-12-04T10:15:31.0946076Z 2025-12-04T10:15:31.0946306Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.0947447Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.0948360Z 2025-12-04T10:15:31.0948630Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.0949270Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:15:31.0950778Z /var/lib/jenkins/workspace/test/inductor/test_torchinductor.py:5994: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:15:31.0952205Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.0953395Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-3eb5ad48105c1cd2.xml - 2025-12-04T10:15:31.0954631Z =========================== short test summary info ============================ 2025-12-04T10:15:31.0956443Z FAILED [0.0049s] inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.0958818Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.0959833Z C++ CapturedTraceback: 2025-12-04T10:15:31.0961329Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.0963260Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.0964231Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.0965526Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0966869Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.0967861Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.0968718Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0969752Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0972966Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0975840Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.0976428Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0978815Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.0981283Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.0982272Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0985253Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.0987987Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.0988703Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.0989585Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.0990369Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.0991227Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.0992172Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0992970Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0993755Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0994684Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0995616Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0996439Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.0997140Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.0997905Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.0998837Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.0999761Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1000539Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1001408Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1002194Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1003122Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1004041Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1004973Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1005754Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1006533Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1007450Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1008385Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1009313Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1010240Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1011162Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1012093Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1013020Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1013950Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1014751Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1015448Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1016234Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1017093Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1017909Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1018733Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1019483Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1020340Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1021270Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1022209Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1023146Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1023917Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1024703Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1025634Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1026553Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1027482Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1028414Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1029284Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1030155Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1030913Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1031625Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.1032366Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1033136Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1034076Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1035010Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1035950Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1036868Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1037662Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1038448Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1039373Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1040310Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1041245Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1042178Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1042951Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1043737Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1044676Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1045611Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1046535Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1047465Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1048339Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1049137Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1062360Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1063136Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1064001Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1064937Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1065737Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1066525Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1067441Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1068366Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1069293Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1070220Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1071267Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1072091Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1072851Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1073668Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1074546Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1075489Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1076432Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1077365Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1078305Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1079245Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1080040Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1080827Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1081768Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1082703Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1083638Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1084582Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1085465Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1086283Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1087026Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1087959Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1088838Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1089781Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1090707Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1091650Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1092588Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1093531Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1094455Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1095393Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1096332Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1097257Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1098076Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.1098889Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.1099618Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.1100430Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.1101219Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.1102050Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.1102902Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.1103604Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.1104290Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.1104908Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.1105335Z #141 _start from ??:0 2025-12-04T10:15:31.1105640Z #142 from ??:0 2025-12-04T10:15:31.1105876Z 2025-12-04T10:15:31.1105894Z 2025-12-04T10:15:31.1106107Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.1107239Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1108150Z 2025-12-04T10:15:31.1108421Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.1109018Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:15:31.1109545Z ================= 1 failed, 370 deselected, 2 rerun in 16.37s ================== 2025-12-04T10:15:31.1109993Z Got exit code 1 2025-12-04T10:15:31.1110247Z Retrying single test... 2025-12-04T10:15:31.1110875Z W1204 10:11:24.627000 32628 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:15:31.1112204Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-50486f79f4e32e59.xml 2025-12-04T10:15:31.1113239Z ============================= test session starts ============================== 2025-12-04T10:15:31.1113904Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:15:31.1114504Z cachedir: .pytest_cache 2025-12-04T10:15:31.1115215Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:15:31.1115988Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:15:31.1116339Z configfile: pytest.ini 2025-12-04T10:15:31.1117062Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:15:31.1117978Z collecting ... collected 1815 items / 370 deselected / 1445 selected 2025-12-04T10:15:31.1119196Z stepcurrent: skipping 263 already run items. Running only test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1120299Z Running 1 items in this shard 2025-12-04T10:15:31.1120511Z 2025-12-04T10:15:31.1121889Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py [W1204 10:11:26.980484181 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.1123399Z 2025-12-04T10:15:31.1123933Z [W1204 10:11:42.222115542 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.1124588Z 2025-12-04T10:15:31.1124718Z ('RERUN', {'yellow': True}) [15.2805s] [100%] 2025-12-04T10:15:31.1126334Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py [W1204 10:11:42.266658261 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.1127891Z 2025-12-04T10:15:31.1128414Z [W1204 10:11:42.267538728 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.1129156Z 2025-12-04T10:15:31.1129287Z ('RERUN', {'yellow': True}) [0.0056s] [100%] 2025-12-04T10:15:31.1130911Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py [W1204 10:11:42.273444151 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.1132392Z 2025-12-04T10:15:31.1132923Z [W1204 10:11:42.274234274 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T10:15:31.1132928Z 2025-12-04T10:15:31.1133030Z FAILED [0.0049s] [100%] 2025-12-04T10:15:31.1133035Z 2025-12-04T10:15:31.1133182Z ==================================== RERUNS ==================================== 2025-12-04T10:15:31.1133515Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.1133639Z Traceback (most recent call last): 2025-12-04T10:15:31.1134200Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.1134300Z self.common( 2025-12-04T10:15:31.1134583Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.1134710Z return func(*args, **kwds) 2025-12-04T10:15:31.1135143Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.1135239Z check_model( 2025-12-04T10:15:31.1135654Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.1135804Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.1136170Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.1136334Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.1137091Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.1137877Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.1137987Z C++ CapturedTraceback: 2025-12-04T10:15:31.1139390Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.1139887Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.1140224Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.1141046Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1141447Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.1141899Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1142173Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1142850Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1145258Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1145599Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1145777Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1147803Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1148110Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.1148658Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1150944Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1151127Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1151547Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.1151872Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.1152176Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1152597Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1152977Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1153238Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1153626Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1154031Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1154413Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1154708Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1154968Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1155346Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1155755Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1156136Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1156470Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1156845Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1157171Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1157539Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1157956Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1158324Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1158580Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1158960Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1159372Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1159740Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1160154Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1160532Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1160952Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1161322Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1161728Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1162108Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1162406Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1162673Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1163042Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1163396Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1163710Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1164005Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1164317Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1164721Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1165091Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1165509Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1165878Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1166139Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1166518Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1166921Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1167304Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1167707Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1168073Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1168493Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1168799Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1169161Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1169429Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.1169683Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1170065Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1170474Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1170838Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1171253Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1171629Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1171894Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1172268Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1172669Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1173048Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1173448Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1173825Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1177137Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1177526Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1177944Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1178391Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1178804Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1179185Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1179533Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1179845Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1180142Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1180479Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1180899Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1181271Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1181531Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1181911Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1182316Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1182683Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1183098Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1183576Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1183948Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1184259Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1184594Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1184906Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1185324Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1185718Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1186129Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1186509Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1187066Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1187684Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1188240Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1188814Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1189389Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1189783Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1190199Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1190706Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1191083Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1191398Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1191714Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1192023Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1192438Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1192828Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1193240Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1193631Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1194050Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1194427Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1194852Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1195229Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1195650Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1196026Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1196319Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.1196635Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.1196995Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.1197282Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.1197707Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.1198033Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.1198335Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.1198604Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.1198873Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.1199083Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.1199187Z #141 _start from ??:0 2025-12-04T10:15:31.1199310Z #142 from ??:0 2025-12-04T10:15:31.1199330Z 2025-12-04T10:15:31.1199335Z 2025-12-04T10:15:31.1199558Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.1200443Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1200452Z 2025-12-04T10:15:31.1200733Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.1200958Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:15:31.1202098Z /var/lib/jenkins/workspace/test/inductor/test_torchinductor.py:5994: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:15:31.1202246Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.1202569Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.1202762Z Traceback (most recent call last): 2025-12-04T10:15:31.1203309Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.1203412Z self.common( 2025-12-04T10:15:31.1203709Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.1203823Z return func(*args, **kwds) 2025-12-04T10:15:31.1204267Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.1204364Z check_model( 2025-12-04T10:15:31.1204766Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.1204928Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.1205285Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.1205427Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.1206215Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.1206982Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.1207105Z C++ CapturedTraceback: 2025-12-04T10:15:31.1208430Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.1208927Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.1209267Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.1210144Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1210591Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.1211032Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1211314Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1211926Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1214350Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1214634Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1214797Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1216830Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1217161Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.1217728Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1220088Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1220283Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1220691Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.1221028Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.1221334Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1221742Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1222127Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1222390Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1222761Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1223246Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1223624Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1223979Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1224678Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1225459Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1226398Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1227336Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1228111Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1228906Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1229689Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1230459Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1231396Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1232331Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1233118Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1233889Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1234815Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1235737Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1236711Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1237627Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1238561Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1239483Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1240408Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1241321Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1242140Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1242844Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1243629Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1244487Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1245321Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1246073Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1246803Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1247660Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1248591Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1249515Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1250502Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1251286Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1252063Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1253022Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1253936Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1254869Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1255793Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1256659Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1257451Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1258202Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1259015Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.1259685Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1260466Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1261399Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1262334Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1263254Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1264187Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1265066Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1265863Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1266786Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1267717Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1268640Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1269568Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1270338Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1271116Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1272054Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1272980Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1273898Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1274822Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1275698Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1276497Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1277234Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1277974Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1278834Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1279830Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1280608Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1281417Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1282344Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1283257Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1284182Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1285113Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1286001Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1286810Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1287572Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1288602Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1289465Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1290405Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1291349Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1292294Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1293222Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1294251Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1295049Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1295836Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1296764Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1297707Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1298705Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1299652Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1300526Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1301358Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1302120Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1302868Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1303738Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1304681Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1305622Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1306550Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1307488Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1308526Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1309467Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1310444Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1311387Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1312329Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1313153Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.1313891Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.1314622Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.1315331Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.1316108Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.1316937Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.1317704Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.1318419Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.1319094Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.1319698Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.1320139Z #141 _start from ??:0 2025-12-04T10:15:31.1320429Z #142 from ??:0 2025-12-04T10:15:31.1320672Z 2025-12-04T10:15:31.1320677Z 2025-12-04T10:15:31.1320895Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.1322069Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1322972Z 2025-12-04T10:15:31.1323249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.1323893Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:15:31.1325380Z /var/lib/jenkins/workspace/test/inductor/test_torchinductor.py:5994: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:15:31.1326810Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.1327229Z =================================== FAILURES =================================== 2025-12-04T10:15:31.1327830Z _ DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda _ 2025-12-04T10:15:31.1328422Z Traceback (most recent call last): 2025-12-04T10:15:31.1329213Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5999, in test_linalg_eig_stride_consistency 2025-12-04T10:15:31.1329997Z self.common( 2025-12-04T10:15:31.1330430Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T10:15:31.1330968Z return func(*args, **kwds) 2025-12-04T10:15:31.1331619Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T10:15:31.1332281Z check_model( 2025-12-04T10:15:31.1332838Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 497, in check_model 2025-12-04T10:15:31.1333536Z correct = ref_model(*ref_inputs, **ref_kwargs) 2025-12-04T10:15:31.1334186Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5994, in fn 2025-12-04T10:15:31.1334831Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.1335946Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.1337621Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.1338740Z C++ CapturedTraceback: 2025-12-04T10:15:31.1340249Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.1342177Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.1343139Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.1344439Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1345790Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.1346762Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1347617Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1348645Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1351814Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1354660Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1355243Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1357578Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1360009Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.1360996Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1363973Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1366700Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1367416Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.1368318Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.1369093Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1369949Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1370870Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1371648Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1372432Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1373352Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1374283Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1375088Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1375779Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1376540Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1377465Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1378456Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1379233Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1380000Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1380823Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1381595Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1382511Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1383432Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1384201Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1384966Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1385879Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1386796Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1387876Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1388799Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1389715Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1390637Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1391559Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1392481Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1393285Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1393981Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1394752Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1395722Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1396526Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1397311Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1398055Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1398903Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1399823Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1400747Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1401671Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1402444Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1403213Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1404137Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1405057Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1405970Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1406891Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1407754Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1408542Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1409370Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1410070Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.1410732Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1411499Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1412425Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1413347Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1414269Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1415187Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1415959Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1416737Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1417651Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1418657Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1419580Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1420504Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1421273Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1422042Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1422963Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1423956Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1424869Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1425820Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1426682Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1427478Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1428212Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1428949Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1429795Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1430715Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1431488Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1432260Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1433182Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1434101Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1435028Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1435961Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1436843Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1437687Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1438448Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1439196Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1440068Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1440997Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1441937Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1442878Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1443808Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1444756Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1445546Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1446332Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1447262Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1448200Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1449135Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1450077Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1450951Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1451769Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1452587Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1453345Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1454244Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1455191Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1456139Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1457079Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1458027Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1459056Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1460015Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1460947Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1461894Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1462845Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1463670Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.1464416Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.1465147Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.1465853Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.1466689Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.1467508Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.1468277Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.1468996Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.1469670Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.1470283Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.1470726Z #141 _start from ??:0 2025-12-04T10:15:31.1471030Z #142 from ??:0 2025-12-04T10:15:31.1471268Z 2025-12-04T10:15:31.1471274Z 2025-12-04T10:15:31.1471492Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.1472631Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1473538Z 2025-12-04T10:15:31.1473822Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.1474460Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:15:31.1475953Z /var/lib/jenkins/workspace/test/inductor/test_torchinductor.py:5994: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T10:15:31.1477376Z eigenvals, eigenvecs = torch.linalg.eig(x) 2025-12-04T10:15:31.1478584Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-50486f79f4e32e59.xml - 2025-12-04T10:15:31.1479827Z =========================== short test summary info ============================ 2025-12-04T10:15:31.1481689Z FAILED [0.0049s] inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T10:15:31.1484094Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T10:15:31.1485119Z C++ CapturedTraceback: 2025-12-04T10:15:31.1486628Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T10:15:31.1488746Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T10:15:31.1489715Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T10:15:31.1491019Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1492384Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T10:15:31.1493385Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T10:15:31.1494233Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1495267Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1498571Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1501396Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T10:15:31.1501994Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1504330Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T10:15:31.1506764Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T10:15:31.1507756Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1510819Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T10:15:31.1513425Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T10:15:31.1514197Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T10:15:31.1515057Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T10:15:31.1515836Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1516703Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1517646Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1518423Z #25 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1519217Z #26 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1520153Z #27 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1521089Z #28 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1521896Z #29 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1522598Z #30 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1523382Z #31 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1524314Z #32 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1525232Z #33 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1526054Z #34 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1526841Z #35 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1527612Z #36 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1528398Z #37 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1529337Z #38 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1530266Z #39 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1531037Z #40 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1531812Z #41 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1532743Z #42 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1533682Z #43 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1534596Z #44 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1535525Z #45 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1536457Z #46 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1537392Z #47 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1538367Z #48 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1539299Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1540115Z #50 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T10:15:31.1540877Z #51 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1541659Z #52 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1542567Z #53 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1543377Z #54 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1544115Z #55 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1544860Z #56 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1545719Z #57 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1546652Z #58 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1547580Z #59 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1548514Z #60 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1549293Z #61 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1550057Z #62 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1550986Z #63 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1551907Z #64 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1552833Z #65 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1553747Z #66 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1554650Z #67 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1555460Z #68 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1556204Z #69 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1556904Z #70 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T10:15:31.1557573Z #71 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1558352Z #72 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1559281Z #73 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1560189Z #74 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1561113Z #75 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1562048Z #76 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1562814Z #77 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1563592Z #78 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1564511Z #79 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1565434Z #80 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1566346Z #81 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1567271Z #82 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1568043Z #83 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1568883Z #84 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1569801Z #85 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1570757Z #86 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1571691Z #87 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1572626Z #88 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1573487Z #89 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1574292Z #90 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1575040Z #91 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1575779Z #92 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1576643Z #93 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1577570Z #94 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1578418Z #95 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1579189Z #96 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1580116Z #97 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1581053Z #98 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1581985Z #99 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1582909Z #100 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1583844Z #101 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1584655Z #102 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1585405Z #103 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1586163Z #104 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1587037Z #105 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1588139Z #106 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1589078Z #107 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1590019Z #108 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1590968Z #109 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1591912Z #110 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1592696Z #111 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T10:15:31.1593483Z #112 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1594427Z #113 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1595374Z #114 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1596309Z #115 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1597246Z #116 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1598231Z #117 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T10:15:31.1599036Z #118 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T10:15:31.1600285Z #119 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T10:15:31.1601043Z #120 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T10:15:31.1601914Z #121 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T10:15:31.1602846Z #122 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1603789Z #123 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1604732Z #124 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1605677Z #125 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1606603Z #126 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1607544Z #127 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1608489Z #128 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1609426Z #129 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T10:15:31.1610356Z #130 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T10:15:31.1611180Z #131 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T10:15:31.1611925Z #132 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T10:15:31.1612710Z #133 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T10:15:31.1613417Z #134 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T10:15:31.1614208Z #135 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T10:15:31.1615046Z #136 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T10:15:31.1615807Z #137 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T10:15:31.1616525Z #138 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T10:15:31.1617215Z #139 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T10:15:31.1617824Z #140 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T10:15:31.1618247Z #141 _start from ??:0 2025-12-04T10:15:31.1618625Z #142 from ??:0 2025-12-04T10:15:31.1618868Z 2025-12-04T10:15:31.1618874Z 2025-12-04T10:15:31.1619107Z To execute this test, run the following from the base repo dir: 2025-12-04T10:15:31.1620224Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_dynamic_shapes.py DynamicShapesGPUTests.test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1621148Z 2025-12-04T10:15:31.1621417Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:15:31.1622004Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:15:31.1622533Z ================= 1 failed, 370 deselected, 2 rerun in 15.58s ================== 2025-12-04T10:15:31.1622969Z Got exit code 1 2025-12-04T10:15:31.1623826Z FAILED CONSISTENTLY: test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda 2025-12-04T10:15:31.1625070Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:15:31.1626150Z W1204 10:11:52.515000 32749 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:15:31.1627465Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-305489af275a491d.xml 2025-12-04T10:15:31.1628540Z ============================= test session starts ============================== 2025-12-04T10:15:31.1629214Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:15:31.1629820Z cachedir: .pytest_cache 2025-12-04T10:15:31.1630523Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:15:31.1631304Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:15:31.1631658Z configfile: pytest.ini 2025-12-04T10:15:31.1632372Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:15:31.1633297Z collecting ... collected 1815 items / 264 deselected / 1551 selected 2025-12-04T10:15:31.1633818Z stepcurrent: skipping 264 already run items. 2025-12-04T10:15:31.1634213Z Running 107 items in this shard 2025-12-04T10:15:31.1634429Z 2025-12-04T10:15:31.1635194Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linspace2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.0363s] [ 0%] 2025-12-04T10:15:31.1636926Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_lite_dynamic_shape_assertion_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2574s] [ 1%] 2025-12-04T10:15:31.1638799Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logaddexp_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Not implemented for CUDA) [ 2%] 2025-12-04T10:15:31.1640590Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [5.1115s] [ 3%] 2025-12-04T10:15:31.1642269Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [4.9779s] [ 4%] 2025-12-04T10:15:31.1643902Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d7_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.8805s] [ 5%] 2025-12-04T10:15:31.1645651Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d_with_indices_backward2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [6.2363s] [ 6%] 2025-12-04T10:15:31.1647449Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_min_max_reduction_nan_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9760s] [ 7%] 2025-12-04T10:15:31.1649208Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mixed_mm_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Requires sm80) [ 8%] 2025-12-04T10:15:31.1650914Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_move_arange_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5167s] [ 9%] 2025-12-04T10:15:31.1652584Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_var_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.0482s] [ 10%] 2025-12-04T10:15:31.1654288Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_var_lowp_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.9962s] [ 11%] 2025-12-04T10:15:31.1656061Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9888s] [ 12%] 2025-12-04T10:15:31.1657921Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mutations_loop_fusion_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3220s] [ 13%] 2025-12-04T10:15:31.1659775Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_assert_inside_triton_kernel_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3977s] [ 14%] 2025-12-04T10:15:31.1661674Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_sort_stable_True_descending_True_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1323s] [ 14%] 2025-12-04T10:15:31.1663437Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nan_to_num_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5321s] [ 15%] 2025-12-04T10:15:31.1665048Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_new_ones_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3057s] [ 16%] 2025-12-04T10:15:31.1666773Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.2812s] [ 17%] 2025-12-04T10:15:31.1668514Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_no_op_reduction_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7286s] [ 18%] 2025-12-04T10:15:31.1670270Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nonzero_unbacked_refinement_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0937s] [ 19%] 2025-12-04T10:15:31.1671981Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_one_hot_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5751s] [ 20%] 2025-12-04T10:15:31.1674182Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_permute1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py W1204 10:12:29.890000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1676242Z W1204 10:12:29.892000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1677515Z W1204 10:12:29.894000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1678798Z W1204 10:12:29.896000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1680074Z W1204 10:12:29.898000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1681351Z W1204 10:12:29.900000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1682639Z W1204 10:12:30.726000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1683906Z W1204 10:12:30.729000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1685186Z W1204 10:12:30.731000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1686469Z W1204 10:12:30.733000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1687892Z W1204 10:12:30.735000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1689166Z W1204 10:12:30.736000 32749 site-packages/torch/utils/_sympy/interp.py:179] [0/0] failed while executing pow_by_natural([VR[2, int_oo], VR[-1, -1]]) 2025-12-04T10:15:31.1689989Z PASSED [1.6955s] [ 21%] 2025-12-04T10:15:31.1691062Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_airy_ai_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1624s] [ 22%] 2025-12-04T10:15:31.1692823Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_digamma_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2913s] [ 23%] 2025-12-04T10:15:31.1694519Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6139s] [ 24%] 2025-12-04T10:15:31.1696199Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfc_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7139s] [ 25%] 2025-12-04T10:15:31.1697897Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfinv_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5048s] [ 26%] 2025-12-04T10:15:31.1699696Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_gammaincc_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1725s] [ 27%] 2025-12-04T10:15:31.1701406Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_i1e_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3080s] [ 28%] 2025-12-04T10:15:31.1703164Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1485s] [ 28%] 2025-12-04T10:15:31.1704941Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_logit_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7397s] [ 29%] 2025-12-04T10:15:31.1706697Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_i0_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2496s] [ 30%] 2025-12-04T10:15:31.1708504Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_ndtr_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6022s] [ 31%] 2025-12-04T10:15:31.1710327Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3338s] [ 32%] 2025-12-04T10:15:31.1712089Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.3528s] [ 33%] 2025-12-04T10:15:31.1713804Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow_by_natural_log2_dynamic_shapes_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9351s] [ 34%] 2025-12-04T10:15:31.1715554Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow_symfloat_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4552s] [ 35%] 2025-12-04T10:15:31.1717244Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_int64_mod_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3217s] [ 36%] 2025-12-04T10:15:31.1718961Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_kernel_count_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3092s] [ 37%] 2025-12-04T10:15:31.1720963Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randn_with_dtype_and_device_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0035s] (only support cpu randn_with_dtype_and_device test) [ 38%] 2025-12-04T10:15:31.1722937Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8792s] [ 39%] 2025-12-04T10:15:31.1724748Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reinterpret_dtypeview_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3648s] [ 40%] 2025-12-04T10:15:31.1726490Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_remove_noop_slice1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3756s] [ 41%] 2025-12-04T10:15:31.1728356Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4029s] [ 42%] 2025-12-04T10:15:31.1729254Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3790s] [ 42%] 2025-12-04T10:15:31.1729999Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_resize_as_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [18.3590s] [ 43%] 2025-12-04T10:15:31.1730748Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_resize_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [9.5564s] [ 44%] 2025-12-04T10:15:31.1731486Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_roi_align_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6934s] [ 45%] 2025-12-04T10:15:31.1732493Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_round_correctness_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (need to debug tl.libdevice on A100/V100) [ 46%] 2025-12-04T10:15:31.1733284Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_cpu_tensor_arg_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.3646s] [ 47%] 2025-12-04T10:15:31.1734056Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_output_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5690s] [ 48%] 2025-12-04T10:15:31.1734847Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.2789s] [ 49%] 2025-12-04T10:15:31.1735762Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_add1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Flaky test, needs debugging) [ 50%] 2025-12-04T10:15:31.1736533Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_add2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6249s] [ 51%] 2025-12-04T10:15:31.1737355Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_searchsorted_broadcast_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6023s] [ 52%] 2025-12-04T10:15:31.1738113Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sign_dtype_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6741s] [ 53%] 2025-12-04T10:15:31.1738969Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_simplify_loops_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.8488s] [ 54%] 2025-12-04T10:15:31.1739717Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3301s] [ 55%] 2025-12-04T10:15:31.1740502Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_mutation2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3892s] [ 56%] 2025-12-04T10:15:31.1741270Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6034s] [ 57%] 2025-12-04T10:15:31.1742048Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5829s] [ 57%] 2025-12-04T10:15:31.1742935Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter_reinplace_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.3264s] [ 58%] 2025-12-04T10:15:31.1743774Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_view_with_graph_break_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4059s] [ 59%] 2025-12-04T10:15:31.1744610Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_softmax_backward_data_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9485s] [ 60%] 2025-12-04T10:15:31.1745356Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sort_bool_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6976s] [ 61%] 2025-12-04T10:15:31.1746243Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumprod_low_prec_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Requires sm80) [ 62%] 2025-12-04T10:15:31.1747015Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumsum_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [6.2177s] [ 63%] 2025-12-04T10:15:31.1748025Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sqrt_dynamic_shapes_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (sqrt dynamic shapes only supports cpu) [ 64%] 2025-12-04T10:15:31.1748769Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_squeeze2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3139s] [ 65%] 2025-12-04T10:15:31.1749503Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.9930s] [ 66%] 2025-12-04T10:15:31.1750234Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum_int_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7628s] [ 67%] 2025-12-04T10:15:31.1751179Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3998s] [ 68%] 2025-12-04T10:15:31.1751980Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3428s] [ 69%] 2025-12-04T10:15:31.1752866Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.9777s] [ 70%] 2025-12-04T10:15:31.1753656Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_memory_format_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.5908s] [ 71%] 2025-12-04T10:15:31.1754418Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transpose_add_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5082s] [ 71%] 2025-12-04T10:15:31.1755358Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_triton_argmin_argmax_transpose_logical_index_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [4.8233s] [ 72%] 2025-12-04T10:15:31.1756199Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unbacked_floordiv_simplify_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8220s] [ 73%] 2025-12-04T10:15:31.1757043Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unsigned_constant_tensors_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.1430s] [ 74%] 2025-12-04T10:15:31.1757850Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unspec_inputs_float32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5671s] [ 75%] 2025-12-04T10:15:31.1758711Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unspec_inputs_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4997s] [ 76%] 2025-12-04T10:15:31.1759522Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_nearest2d_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.5288s] [ 77%] 2025-12-04T10:15:31.1760316Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_var_mean_div_by_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5420s] [ 78%] 2025-12-04T10:15:31.1761104Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_as_complex_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.2647s] [ 79%] 2025-12-04T10:15:31.1761852Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_as_real_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3115s] [ 80%] 2025-12-04T10:15:31.1762617Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_detach_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3171s] [ 81%] 2025-12-04T10:15:31.1763356Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [47.7486s] [ 82%] 2025-12-04T10:15:31.1764090Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [6.0765s] [ 83%] 2025-12-04T10:15:31.1764890Z inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_weight_norm_conv2d_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [4.1233s] [ 84%] 2025-12-04T10:15:31.1765431Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_bool_mask_nobreak_cuda PASSED [1.1413s] [ 85%] 2025-12-04T10:15:31.1766003Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_dynamic_stride_nobreak_cuda PASSED [0.6473s] [ 85%] 2025-12-04T10:15:31.1766566Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_neginf_cuda PASSED [0.6114s] [ 86%] 2025-12-04T10:15:31.1767100Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_return_cuda SKIPPED [0.0003s] [ 87%] 2025-12-04T10:15:31.1767634Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_full_recompiles_cuda PASSED [0.7808s] [ 88%] 2025-12-04T10:15:31.1768164Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_bool_nobreak_cuda PASSED [0.6104s] [ 89%] 2025-12-04T10:15:31.1768707Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_materialize_cuda PASSED [0.7173s] [ 90%] 2025-12-04T10:15:31.1769211Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_return_cuda PASSED [0.6116s] [ 91%] 2025-12-04T10:15:31.1769750Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_mark_unbacked_slice_cuda PASSED [1.0519s] [ 92%] 2025-12-04T10:15:31.1770277Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op2_cuda PASSED [0.7703s] [ 93%] 2025-12-04T10:15:31.1770881Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_multi_output_unbacked_custom_op_cuda PASSED [1.0641s] [ 94%] 2025-12-04T10:15:31.1771486Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_non_persistent_dynamic_rblock_cuda PASSED [1.3400s] [ 95%] 2025-12-04T10:15:31.1772004Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_slice_scatter_cuda PASSED [0.8576s] [ 96%] 2025-12-04T10:15:31.1772865Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_sort_dynamic_shape_with_check_cuda SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 97%] 2025-12-04T10:15:31.1773403Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_symint_sum_list_cuda PASSED [14.1367s] [ 98%] 2025-12-04T10:15:31.1774066Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unspecialized_float_operations_cuda PASSED [4.0835s] [ 99%] 2025-12-04T10:15:31.1774759Z inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_wrapper_codegen_statically_known_int_or_none_cuda PASSED [2.4756s] [100%] 2025-12-04T10:15:31.1774797Z 2025-12-04T10:15:31.1775724Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-305489af275a491d.xml - 2025-12-04T10:15:31.1775969Z ========== 98 passed, 9 skipped, 264 deselected in 212.86s (0:03:32) =========== 2025-12-04T10:15:31.1776769Z The following tests failed consistently: ['test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linalg_eig_stride_consistency_dynamic_shapes_cuda'] 2025-12-04T10:15:31.1776776Z 2025-12-04T10:15:31.1777512Z FINISHED PRINTING LOG FILE of inductor/test_torchinductor_dynamic_shapes 3/5 (test/test-reports/inductor.test_torchinductor_dynamic_shapes_3.5_c72f035bd2cef633_.log) 2025-12-04T10:15:31.1777534Z 2025-12-04T10:15:31.1777981Z Finished inductor/test_torchinductor_dynamic_shapes 3/5 ... [2025-12-04 10:15:30.948235][3735.069446532], took 11.04min 2025-12-04T10:15:31.1779027Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-655beaec67eb1ac4.xml 2025-12-04T10:15:31.1780042Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-3eb5ad48105c1cd2.xml 2025-12-04T10:15:31.1781011Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-50486f79f4e32e59.xml 2025-12-04T10:15:31.1782033Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-305489af275a491d.xml 2025-12-04T10:15:31.4307964Z Uploading logs for 57119749259 to S3 2025-12-04T10:15:31.4805605Z Uploading artifacts took 0.27 seconds 2025-12-04T10:15:31.4805823Z inductor/test_torchinductor_dynamic_shapes 3/5 failed! 2025-12-04T10:15:31.4810732Z Running inductor/test_torchinductor_codegen_dynamic_shapes 3/4 ... [2025-12-04 10:15:31.480885][3735.602105074] 2025-12-04T10:15:31.4810874Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:15:31.4815532Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_codegen_dynamic_shapes.py', '--shard-id=3', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:15:31.481308] 2025-12-04T10:24:47.6894093Z 2025-12-04T10:24:47.6895233Z PRINTING LOG FILE of inductor/test_torchinductor_codegen_dynamic_shapes 3/4 (test/test-reports/inductor.test_torchinductor_codegen_dynamic_shapes_3.4_8476a077570d9691_.log) 2025-12-04T10:24:47.6897321Z W1204 10:15:40.409000 36243 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:24:47.6899261Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-3d2f5f1cdf9aac8a.xml 2025-12-04T10:24:47.6900969Z ============================= test session starts ============================== 2025-12-04T10:24:47.6901826Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:24:47.6902722Z cachedir: .pytest_cache 2025-12-04T10:24:47.6903710Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:24:47.6904891Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:24:47.6905429Z configfile: pytest.ini 2025-12-04T10:24:47.6906983Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:24:47.6908220Z collecting ... collected 1750 items 2025-12-04T10:24:47.6909049Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:24:47.7219766Z Running 439 items in this shard: test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_adaptive_max_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_adaptive_pool_errors_with_long_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex9_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_addmv_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_aliased_buffer_reuse_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_allow_reuse_active_if_under_peak_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_argmin3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_min_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool3d_backward3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool3d_backward4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool3d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool_errors_with_uint_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_baddbmm_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bernoulli1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bernoulli2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bfloat16_to_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bmm2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_both_scalars_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_computed_offsets_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_default_kwargs_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int32_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int64_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int64_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int8_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_uint8_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_uint8_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_nd_tiling_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_buffer_copied_in_graph_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_builtins_round_int_ndigits_pos_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_builtins_round_int_ndigits_zero_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_empty_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_unbacked_2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_chunk_recompiles_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_clamp_type_promotion_non_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_complex_memory_overlap_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_config_option_dont_assume_alignment_recompiles_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_consecutive_split_cumprod_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_constant_pad_fill_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_constant_pad_nd_inplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv2d_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv3d_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_convolution4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_copy_non_blocking_is_pinned_use_cat_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cos_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cpu_scalar_with_gpu_tensor_cpp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cumsum_no_mask_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cumsum_pattern_matcher_issue_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_custom_op_default_layout_constraint_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_custom_scan_would_split_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_deterministic_codegen_with_suffix_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_diagonal_copy_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dist_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div7_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout_trivial_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtype_mismatch_issue_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float16_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float32_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float32_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float32_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int16_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int32_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int32_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int64_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_uint8_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_empty_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_exact_stride_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fallback_mutable_op_basic_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fallback_mutable_op_list_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_flip_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_float32_to_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_full_like_transposed_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fuse_tiled_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_gather1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_gather_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_gelu_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_constant_tensor2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_mutation_real_name_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_no_inputs_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_unbacked_symint_as_output_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_grid_sampler_2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_grid_sampler_expand_preserves_view_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_hardtanh_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_horizonal_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_horizonal_fusion2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_propagation_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_propagation_flip_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_as_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_select_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_indirect_load_broadcast_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inductor_multiple_specializations_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_input_mutation2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_input_mutation5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_isin_tensor_scalar_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_l1_loss_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_grid_use_block_ptr_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_pointwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_lerp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_like_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_like_rands_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linear1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linear_dynamic_maxautotune_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linspace1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_logaddexp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_logsumexp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mark_unbacked_with_hint_override_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_masked_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_matmul_layer_norm_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_min_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d7_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d_with_indices_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mixed_mm_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mul_index_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multi_gpu_device_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_prime_size_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_var_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_var_lowp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_new_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_nonzero_unbacked_refinement_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_output_strides_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pattern_matcher_multi_user_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_permute2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_bessel_j1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_chebyshev_polynomial_t_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_chebyshev_polynomial_v_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_digamma_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_erfcx_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_erfinv_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_expit_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_gammainc_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_i1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_laguerre_polynomial_l_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_log1p_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_modified_bessel_k0_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_psi_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_round_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_xlog1py_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow_by_natural_log2_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow_symfloat_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_profiler_mark_wrapper_call_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_randint_kernel_count_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_randn_generator_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_randn_like_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reflection_pad2d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_clone_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_copy_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_slice_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_Tensor_decomp_int32_nd_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reuse_buffers_with_aliasing_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scalar_output_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scaled_dot_product_attention_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_add1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_reduce1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_reduce2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_searchsorted_broadcast_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_searchsorted_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sgn_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_shape_prop_torch_ones_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_silu_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_simplify_loops_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_single_elem_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sizehint_issue1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_scatter4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_scatter_dtype_consistency_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_view_with_graph_break_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sort_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_special_polygamma_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_cumsum_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_reduction_dynamic_shape_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_with_integer_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sqrt_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum_keepdims_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_tensor3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_tmp_not_defined_issue2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_to_device_constant_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_to_memory_format_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_transpose_add_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_bilinear2d_a_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_bilinear2d_b_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_cat_conv_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_nearest3d_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_var_mean_div_by_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_vertical_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_view_as_real_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_view_uint8_through_differing_bitwidths_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_zero_dim_reductions_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_zeros_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test__dyn_quant_matmul_4bit_bf16_input_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_avg_pool1d_argmax_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_avg_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex6_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adding_tensor_offsets_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_addmv_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_allow_reuse_disable_if_exceed_peak_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_angle_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_aoti_eager_support_str_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin_with_nan_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_min_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_to_float_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_assert_alignment_op_name_pass_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d7_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d_backward2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool3d_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool_errors_with_uint_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_baddbmm_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_batch_norm_2d_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bernoulli1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bitwise2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bitwise3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bool_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_add_autotune_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int16_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int64_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int64_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int8_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_uint8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_nd_tiling_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_builtins_round_int_ndigits_pos_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_inplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_unbacked_empty_1d_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_clamp_type_promotion_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_complex_from_real_imag_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_complex_memory_overlap_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_config_option_dont_assume_alignment_cudagraphs_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_constant_pad_2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv2d_backward_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv3d_channels_last_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_convolution5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cos_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_scalar_with_gpu_tensor_cpp_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_tensor_with_cpu_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_tensor_with_gpu_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cumsum_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cumsum_pattern_matcher_issue_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_op_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_op_fixed_layout_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div4_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div_softmax_symfloat_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dropout2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dropout_deterministic_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtype_sympy_expr_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float64_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float64_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_float16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int64_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_embedding_bag_byte_unpack_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_erfc_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_expanded_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_expm1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fallback_mutable_op_basic_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_flexible_layout_immutable_free_symbols_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_flip_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_forced_buffer_realize_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fractional_max_pool2d3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fractional_max_pool2d5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_boolean_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_like_sliced_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_like_transposed_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_truncation_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_functionalize_rng_wrappers_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fuse_tiled_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_both_scalars_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_constant_tensor1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_scalar_inputs_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_hardtanh_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_horizonal_fusion1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_propagation_nested_indirect_indexing_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_as_masked_fill_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_failed_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inner_reduction_detection_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inplace_activations_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_input_mutation3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_input_mutation5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isin_tensor_scalar_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_layer_norm_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_linspace3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_lite_regional_compile_repeated_blocks_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_log1p_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_logcumsumexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_logcumsumexp_zero_dim_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_long_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mark_unbacked_with_hint_override_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_masked_fill_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d_with_indices_backward4_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_min_max_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mix_device_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mm_views_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mul_softmax_symfloat_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multi_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multi_gpu_recompile_on_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_any_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_sum_low_prec_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_var_lowp_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mutations_loop_fusion_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_nan_to_num_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_neg_max_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_norm_constant_overflow_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_one_hot_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_output_strides_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pad_cast_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pad_single_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_permute2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_airy_ai_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_chebyshev_polynomial_u_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_erf_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_erfcx_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_exp2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_gammainc_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_laguerre_polynomial_l_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_modified_bessel_i1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_modified_bessel_k1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_sinc_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_spherical_bessel_j0_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_xlogy_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_prod_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reduction3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reduction_config_limit_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reflection_pad2d_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reinterpret_dtypeview_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_copy_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_slice1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_slice_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_view_default_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_interleave_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_replication_pad_errors_with_bool_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_roi_align_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_round_correctness_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_round_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scatter6_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scatter_add1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_unaligned_mask_freezing_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_setitem_with_int_parameter_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_shape_padding_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_should_pad_bench_for_bmm_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sign_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice4_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_mutation1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter4_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sort_transpose_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_cumsum_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_reduction_dynamic_shape_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_with_sizes_with_unbacked_symints_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_with_unbacked_symints_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_strided_inputs_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sum_keepdims_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_tensor2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_tensor_index_put_slice_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_to_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_to_memory_format_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_topk_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_uint_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unbind_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_upsample_bicubic2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_upsample_cat_conv_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_vectorized_ops_masked_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_vertical_fusion1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_views5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_xblock_divides_xnumel_dynamic_shapes_cuda 2025-12-04T10:24:47.7526292Z 2025-12-04T10:24:47.7527219Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_adaptive_max_pool2d2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [2.9706s] [ 0%] 2025-12-04T10:24:47.7529192Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_adaptive_pool_errors_with_long_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3029s] [ 0%] 2025-12-04T10:24:47.7531112Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8654s] [ 0%] 2025-12-04T10:24:47.7533026Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex9_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 0%] 2025-12-04T10:24:47.7534875Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_addmv_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7113s] [ 1%] 2025-12-04T10:24:47.7536705Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_aliased_buffer_reuse_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [1.0149s] [ 1%] 2025-12-04T10:24:47.7538709Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_allow_reuse_active_if_under_peak_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.6508s] [ 1%] 2025-12-04T10:24:47.7540747Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7245s] [ 1%] 2025-12-04T10:24:47.7542510Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7657s] [ 2%] 2025-12-04T10:24:47.7544387Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 2%] 2025-12-04T10:24:47.7546268Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_argmin3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1666s] [ 2%] 2025-12-04T10:24:47.7548187Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.3359s] [ 2%] 2025-12-04T10:24:47.7550128Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_min_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2514s] [ 2%] 2025-12-04T10:24:47.7551962Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9805s] [ 3%] 2025-12-04T10:24:47.7553766Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [7.5076s] [ 3%] 2025-12-04T10:24:47.7555581Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.1217s] [ 3%] 2025-12-04T10:24:47.7557397Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [5.0311s] [ 3%] 2025-12-04T10:24:47.7559306Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool3d_backward3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.3376s] [ 4%] 2025-12-04T10:24:47.7561205Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool3d_backward4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.4903s] [ 4%] 2025-12-04T10:24:47.7563216Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool3d_backward_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.4158s] [ 4%] 2025-12-04T10:24:47.7565719Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool_errors_with_uint_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7567828Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] Traceback (most recent call last): 2025-12-04T10:24:47.7569338Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7570803Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] r = func(*args, **kwargs) 2025-12-04T10:24:47.7572118Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7573448Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7574989Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7576373Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] torch._check( 2025-12-04T10:24:47.7577686Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7579318Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7580899Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7582297Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] raise error_type(message_evaluated) 2025-12-04T10:24:47.7583509Z E1204 10:16:25.485000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] RuntimeError: "avg_pool2d" not implemented for 'torch.uint8' 2025-12-04T10:24:47.7584824Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7586013Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] Traceback (most recent call last): 2025-12-04T10:24:47.7587512Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7589186Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] r = func(*args, **kwargs) 2025-12-04T10:24:47.7590519Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7591932Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7593393Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7594792Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] torch._check( 2025-12-04T10:24:47.7596071Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7597642Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7599227Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7600633Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] raise error_type(message_evaluated) 2025-12-04T10:24:47.7601838Z E1204 10:16:25.520000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/1] RuntimeError: "avg_pool2d" not implemented for 'torch.uint16' 2025-12-04T10:24:47.7603158Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7604355Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] Traceback (most recent call last): 2025-12-04T10:24:47.7605923Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7607395Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] r = func(*args, **kwargs) 2025-12-04T10:24:47.7608750Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7610094Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7611557Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7612941Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] torch._check( 2025-12-04T10:24:47.7614223Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7615785Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7617368Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7618814Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] raise error_type(message_evaluated) 2025-12-04T10:24:47.7620026Z E1204 10:16:25.555000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/2] RuntimeError: "avg_pool2d" not implemented for 'torch.uint32' 2025-12-04T10:24:47.7621378Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7622575Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] Traceback (most recent call last): 2025-12-04T10:24:47.7624054Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7625514Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] r = func(*args, **kwargs) 2025-12-04T10:24:47.7626833Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7628172Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7629644Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7631014Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] torch._check( 2025-12-04T10:24:47.7632291Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7633847Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7635438Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7636875Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] raise error_type(message_evaluated) 2025-12-04T10:24:47.7638080Z E1204 10:16:25.589000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/3] RuntimeError: "avg_pool2d" not implemented for 'torch.uint64' 2025-12-04T10:24:47.7639440Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7640627Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] Traceback (most recent call last): 2025-12-04T10:24:47.7642121Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7643560Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] r = func(*args, **kwargs) 2025-12-04T10:24:47.7644883Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7646353Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7647823Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7649193Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] torch._check( 2025-12-04T10:24:47.7650482Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7652086Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7653676Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7655069Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] raise error_type(message_evaluated) 2025-12-04T10:24:47.7656252Z E1204 10:16:25.626000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/4] RuntimeError: "avg_pool2d" not implemented for 'torch.uint8' 2025-12-04T10:24:47.7657563Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7658824Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] Traceback (most recent call last): 2025-12-04T10:24:47.7660335Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7661786Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] r = func(*args, **kwargs) 2025-12-04T10:24:47.7663101Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7664446Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7666040Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7667508Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] torch._check( 2025-12-04T10:24:47.7668772Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7670366Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7671954Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7673348Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] raise error_type(message_evaluated) 2025-12-04T10:24:47.7674547Z E1204 10:16:25.649000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/5] RuntimeError: "avg_pool2d" not implemented for 'torch.uint16' 2025-12-04T10:24:47.7675856Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7677044Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] Traceback (most recent call last): 2025-12-04T10:24:47.7678533Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7679986Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] r = func(*args, **kwargs) 2025-12-04T10:24:47.7681290Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7682671Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7684134Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7685522Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] torch._check( 2025-12-04T10:24:47.7686783Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7688490Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7690082Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7691480Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] raise error_type(message_evaluated) 2025-12-04T10:24:47.7692678Z E1204 10:16:25.672000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/6] RuntimeError: "avg_pool2d" not implemented for 'torch.uint32' 2025-12-04T10:24:47.7693979Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] failed while attempting to run meta for aten.avg_pool2d.default 2025-12-04T10:24:47.7695177Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] Traceback (most recent call last): 2025-12-04T10:24:47.7696665Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7698225Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] r = func(*args, **kwargs) 2025-12-04T10:24:47.7699594Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7700989Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7702457Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2803, in meta_avg_pool2d 2025-12-04T10:24:47.7703840Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] torch._check( 2025-12-04T10:24:47.7705114Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7706664Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7708253Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7709649Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] raise error_type(message_evaluated) 2025-12-04T10:24:47.7710849Z E1204 10:16:25.694000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/7] RuntimeError: "avg_pool2d" not implemented for 'torch.uint64' 2025-12-04T10:24:47.7712163Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:24:47.7713403Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] Traceback (most recent call last): 2025-12-04T10:24:47.7714895Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7716353Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] r = func(*args, **kwargs) 2025-12-04T10:24:47.7717670Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7719000Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7720422Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:24:47.7721830Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] result = fn(*args, **kwargs) 2025-12-04T10:24:47.7723289Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:24:47.7724655Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] torch._check( 2025-12-04T10:24:47.7725931Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7727498Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7729150Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7730577Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] raise error_type(message_evaluated) 2025-12-04T10:24:47.7731766Z E1204 10:16:25.733000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/8] RuntimeError: "avg_pool3d" not implemented for 'torch.uint8' 2025-12-04T10:24:47.7733076Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:24:47.7734275Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] Traceback (most recent call last): 2025-12-04T10:24:47.7735777Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7737227Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] r = func(*args, **kwargs) 2025-12-04T10:24:47.7738616Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7739964Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7741381Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:24:47.7742787Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] result = fn(*args, **kwargs) 2025-12-04T10:24:47.7744256Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:24:47.7745635Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] torch._check( 2025-12-04T10:24:47.7746906Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7748462Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7750044Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7751440Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] raise error_type(message_evaluated) 2025-12-04T10:24:47.7752641Z E1204 10:16:25.758000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/9] RuntimeError: "avg_pool3d" not implemented for 'torch.uint16' 2025-12-04T10:24:47.7753966Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:24:47.7755161Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] Traceback (most recent call last): 2025-12-04T10:24:47.7756650Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7758123Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] r = func(*args, **kwargs) 2025-12-04T10:24:47.7759524Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7760902Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7762315Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:24:47.7763727Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] result = fn(*args, **kwargs) 2025-12-04T10:24:47.7765190Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:24:47.7766592Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] torch._check( 2025-12-04T10:24:47.7767879Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7769432Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7771028Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7772433Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] raise error_type(message_evaluated) 2025-12-04T10:24:47.7773648Z E1204 10:16:25.782000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/10] RuntimeError: "avg_pool3d" not implemented for 'torch.uint32' 2025-12-04T10:24:47.7775002Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] failed while attempting to run meta for aten.avg_pool3d.default 2025-12-04T10:24:47.7776194Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] Traceback (most recent call last): 2025-12-04T10:24:47.7777698Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T10:24:47.7779241Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] r = func(*args, **kwargs) 2025-12-04T10:24:47.7780565Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T10:24:47.7781913Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] return self._op(*args, **kwargs) 2025-12-04T10:24:47.7783347Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T10:24:47.7784770Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] result = fn(*args, **kwargs) 2025-12-04T10:24:47.7786227Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 3001, in meta_avg_pool3d 2025-12-04T10:24:47.7787620Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] torch._check( 2025-12-04T10:24:47.7789231Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T10:24:47.7790812Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T10:24:47.7792452Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T10:24:47.7793855Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] raise error_type(message_evaluated) 2025-12-04T10:24:47.7795057Z E1204 10:16:25.806000 36243 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/11] RuntimeError: "avg_pool3d" not implemented for 'torch.uint64' 2025-12-04T10:24:47.7795892Z PASSED [0.3810s] [ 4%] 2025-12-04T10:24:47.7796926Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_baddbmm_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.3224s] [ 5%] 2025-12-04T10:24:47.7798731Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bernoulli1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0605s] [ 5%] 2025-12-04T10:24:47.7800541Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bernoulli2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1196s] [ 5%] 2025-12-04T10:24:47.7802382Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bfloat16_to_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7807s] [ 5%] 2025-12-04T10:24:47.7804184Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bmm2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.2061s] [ 5%] 2025-12-04T10:24:47.7806011Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_both_scalars_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7803s] [ 6%] 2025-12-04T10:24:47.7808083Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_computed_offsets_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7622s] [ 6%] 2025-12-04T10:24:47.7810135Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_default_kwargs_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1338s] [ 6%] 2025-12-04T10:24:47.7812083Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int32_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1460s] [ 6%] 2025-12-04T10:24:47.7814021Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int64_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1903s] [ 7%] 2025-12-04T10:24:47.7815961Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int64_int8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2546s] [ 7%] 2025-12-04T10:24:47.7817897Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int8_int64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.5412s] [ 7%] 2025-12-04T10:24:47.7819922Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_uint8_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1977s] [ 7%] 2025-12-04T10:24:47.7821838Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_uint8_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2775s] [ 7%] 2025-12-04T10:24:47.7823849Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_nd_tiling_True_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1466s] [ 8%] 2025-12-04T10:24:47.7825773Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_buffer_copied_in_graph_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7753s] [ 8%] 2025-12-04T10:24:47.7827770Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_builtins_round_int_ndigits_pos_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7397s] [ 8%] 2025-12-04T10:24:47.7829781Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_builtins_round_int_ndigits_zero_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7423s] [ 8%] 2025-12-04T10:24:47.7831801Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_empty_index_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7942s] [ 9%] 2025-12-04T10:24:47.7833668Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_unbacked_2d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1344s] [ 9%] 2025-12-04T10:24:47.7835536Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_chunk_recompiles_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9931s] [ 9%] 2025-12-04T10:24:47.7837472Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_clamp_type_promotion_non_tensor_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8185s] [ 9%] 2025-12-04T10:24:47.7839447Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_complex_memory_overlap_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0035s] [ 10%] 2025-12-04T10:24:47.7841492Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_config_option_dont_assume_alignment_recompiles_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0069s] [ 10%] 2025-12-04T10:24:47.7843607Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_consecutive_split_cumprod_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9120s] [ 10%] 2025-12-04T10:24:47.7845555Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_constant_pad_fill_dtype_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0440s] [ 10%] 2025-12-04T10:24:47.7847474Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_constant_pad_nd_inplace_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7232s] [ 10%] 2025-12-04T10:24:47.7849389Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv2d_channels_last_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [1.8557s] [ 11%] 2025-12-04T10:24:47.7851201Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv3d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7387s] [ 11%] 2025-12-04T10:24:47.7853011Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_convolution4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8713s] [ 11%] 2025-12-04T10:24:47.7855372Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_copy_non_blocking_is_pinned_use_cat_False_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py W1204 10:16:54.022000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7857282Z W1204 10:16:54.024000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7858221Z W1204 10:16:54.025000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7859321Z W1204 10:16:54.026000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7860252Z W1204 10:16:54.027000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7861215Z W1204 10:16:54.028000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7862144Z W1204 10:16:54.029000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7863058Z W1204 10:16:54.030000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7863985Z W1204 10:16:54.031000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7864915Z W1204 10:16:54.032000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7865849Z W1204 10:16:54.033000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7866769Z W1204 10:16:54.034000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7867695Z W1204 10:16:54.035000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7868629Z W1204 10:16:54.036000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7869557Z W1204 10:16:54.037000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7870472Z W1204 10:16:54.039000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7871399Z W1204 10:16:54.040000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7872333Z W1204 10:16:54.041000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7873289Z W1204 10:16:54.042000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7874218Z W1204 10:16:54.043000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7875145Z W1204 10:16:54.044000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7876069Z W1204 10:16:54.045000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7876981Z W1204 10:16:54.046000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7877905Z W1204 10:16:54.047000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7878830Z W1204 10:16:54.048000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7879758Z W1204 10:16:54.049000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7880672Z W1204 10:16:54.050000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7881595Z W1204 10:16:54.051000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7882518Z W1204 10:16:54.052000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7883444Z W1204 10:16:54.053000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7884354Z W1204 10:16:54.054000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7885297Z W1204 10:16:54.055000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7886226Z W1204 10:16:54.056000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7887228Z W1204 10:16:54.057000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7888297Z W1204 10:16:54.058000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7889290Z W1204 10:16:54.059000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7890219Z W1204 10:16:54.060000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7891148Z W1204 10:16:54.061000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7892062Z W1204 10:16:54.062000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7892989Z W1204 10:16:54.063000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7893910Z W1204 10:16:54.064000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7894837Z W1204 10:16:54.065000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7895747Z W1204 10:16:54.066000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7896820Z W1204 10:16:54.067000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7897752Z W1204 10:16:54.068000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7898734Z W1204 10:16:54.069000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7899664Z W1204 10:16:54.070000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7900593Z W1204 10:16:54.071000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7901594Z W1204 10:16:54.072000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7902512Z W1204 10:16:54.073000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7903447Z W1204 10:16:54.074000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7904375Z W1204 10:16:54.075000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7905309Z W1204 10:16:54.076000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7906218Z W1204 10:16:54.077000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7907213Z W1204 10:16:54.078000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7908145Z W1204 10:16:54.079000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7909081Z W1204 10:16:54.080000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7909995Z W1204 10:16:54.082000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7910925Z W1204 10:16:54.083000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7911856Z W1204 10:16:54.084000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7912781Z W1204 10:16:54.085000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7913695Z W1204 10:16:54.086000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7914619Z W1204 10:16:54.087000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7915634Z W1204 10:16:54.088000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7916559Z W1204 10:16:54.089000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7917504Z W1204 10:16:54.090000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7918432Z W1204 10:16:54.091000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7919364Z W1204 10:16:54.092000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7920287Z W1204 10:16:54.093000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7921198Z W1204 10:16:54.094000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7922117Z W1204 10:16:54.095000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7923046Z W1204 10:16:54.096000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7923975Z W1204 10:16:54.097000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7924885Z W1204 10:16:54.098000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7925811Z W1204 10:16:54.099000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7926737Z W1204 10:16:54.100000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7927649Z W1204 10:16:54.101000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7928571Z W1204 10:16:54.102000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7929531Z W1204 10:16:54.103000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7930458Z W1204 10:16:54.104000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7931372Z W1204 10:16:54.105000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7932303Z W1204 10:16:54.106000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7933231Z W1204 10:16:54.107000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7934154Z W1204 10:16:54.108000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7935074Z W1204 10:16:54.109000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7935998Z W1204 10:16:54.110000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7936926Z W1204 10:16:54.111000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7937857Z W1204 10:16:54.112000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7938861Z W1204 10:16:54.113000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7939786Z W1204 10:16:54.114000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7940710Z W1204 10:16:54.115000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7941635Z W1204 10:16:54.116000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7942549Z W1204 10:16:54.117000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7943566Z W1204 10:16:54.118000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7944496Z W1204 10:16:54.119000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7945459Z W1204 10:16:54.120000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7946376Z W1204 10:16:54.122000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7947310Z W1204 10:16:54.123000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7948244Z W1204 10:16:54.124000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7949172Z W1204 10:16:54.125000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7949807Z PASSED [2.4607s] [ 11%] 2025-12-04T10:24:47.7950832Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cos_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9885s] [ 12%] 2025-12-04T10:24:47.7953089Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cpu_scalar_with_gpu_tensor_cpp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py W1204 10:16:55.487000 36243 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.7954659Z PASSED [2.1763s] [ 12%] 2025-12-04T10:24:47.7955680Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cumsum_no_mask_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.2058s] [ 12%] 2025-12-04T10:24:47.7957593Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cumsum_pattern_matcher_issue_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1415s] [ 12%] 2025-12-04T10:24:47.7959655Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_custom_op_default_layout_constraint_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7840s] [ 12%] 2025-12-04T10:24:47.7961858Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_custom_scan_would_split_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (associative_scan only supported on GPU) [ 13%] 2025-12-04T10:24:47.7964052Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_deterministic_codegen_with_suffix_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.3641s] [ 13%] 2025-12-04T10:24:47.7965989Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_diagonal_copy_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7385s] [ 13%] 2025-12-04T10:24:47.7967806Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dist_bf16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8704s] [ 13%] 2025-12-04T10:24:47.7969580Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9711s] [ 14%] 2025-12-04T10:24:47.7971329Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0464s] [ 14%] 2025-12-04T10:24:47.7973070Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0457s] [ 14%] 2025-12-04T10:24:47.7974799Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div7_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0561s] [ 14%] 2025-12-04T10:24:47.7976633Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div_zero_dim_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.6217s] [ 15%] 2025-12-04T10:24:47.7978566Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 15%] 2025-12-04T10:24:47.7980555Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout_trivial_1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 15%] 2025-12-04T10:24:47.7982564Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtype_mismatch_issue_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 15%] 2025-12-04T10:24:47.7984534Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float16_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8461s] [ 15%] 2025-12-04T10:24:47.7986492Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float32_float16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9704s] [ 16%] 2025-12-04T10:24:47.7988676Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float32_int64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9108s] [ 16%] 2025-12-04T10:24:47.7990621Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float32_int8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0041s] [ 16%] 2025-12-04T10:24:47.7992568Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_float32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9611s] [ 16%] 2025-12-04T10:24:47.7994530Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.2410s] [ 17%] 2025-12-04T10:24:47.7996555Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int16_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8916s] [ 17%] 2025-12-04T10:24:47.7998481Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int32_float16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9057s] [ 17%] 2025-12-04T10:24:47.8000430Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int32_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8823s] [ 17%] 2025-12-04T10:24:47.8002356Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int64_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9752s] [ 17%] 2025-12-04T10:24:47.8004275Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_float16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9413s] [ 18%] 2025-12-04T10:24:47.8006206Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_float32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9381s] [ 18%] 2025-12-04T10:24:47.8008127Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_int64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9299s] [ 18%] 2025-12-04T10:24:47.8010028Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_uint8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8832s] [ 18%] 2025-12-04T10:24:47.8011950Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_uint8_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9287s] [ 19%] 2025-12-04T10:24:47.8013888Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_empty_strided_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1019s] [ 19%] 2025-12-04T10:24:47.8015749Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_exact_stride_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1851s] [ 19%] 2025-12-04T10:24:47.8017634Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fallback_mutable_op_basic_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.0494s] [ 19%] 2025-12-04T10:24:47.8019694Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fallback_mutable_op_list_tensor_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8269s] [ 20%] 2025-12-04T10:24:47.8021585Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_flip_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1442s] [ 20%] 2025-12-04T10:24:47.8023378Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_float32_to_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8193s] [ 20%] 2025-12-04T10:24:47.8025273Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.5498s] [ 20%] 2025-12-04T10:24:47.8027213Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9111s] [ 20%] 2025-12-04T10:24:47.8029145Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_full_like_transposed_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8429s] [ 21%] 2025-12-04T10:24:47.8031046Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fuse_tiled_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2106s] [ 21%] 2025-12-04T10:24:47.8032826Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_gather1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9669s] [ 21%] 2025-12-04T10:24:47.8034636Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_gather_scatter_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1769s] [ 21%] 2025-12-04T10:24:47.8036440Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_gelu_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0842s] [ 22%] 2025-12-04T10:24:47.8038325Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_constant_tensor2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7271s] [ 22%] 2025-12-04T10:24:47.8040365Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_mutation_real_name_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1356s] [ 22%] 2025-12-04T10:24:47.8042352Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_no_inputs_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7848s] [ 22%] 2025-12-04T10:24:47.8044383Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_unbacked_symint_as_output_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6988s] [ 23%] 2025-12-04T10:24:47.8046375Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_grid_sampler_2d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [7.1944s] [ 23%] 2025-12-04T10:24:47.8048498Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_grid_sampler_expand_preserves_view_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (requires CUDA or XPU) [ 23%] 2025-12-04T10:24:47.8050554Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_hardtanh_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8504s] [ 23%] 2025-12-04T10:24:47.8052431Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_horizonal_fusion1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9881s] [ 23%] 2025-12-04T10:24:47.8054304Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_horizonal_fusion2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9097s] [ 24%] 2025-12-04T10:24:47.8056131Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1816s] [ 24%] 2025-12-04T10:24:47.8057975Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_dynamic_shapes_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.4827s] [ 24%] 2025-12-04T10:24:47.8059937Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_propagation_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7446s] [ 24%] 2025-12-04T10:24:47.8061841Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_propagation_flip_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7459s] [ 25%] 2025-12-04T10:24:47.8063697Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.5392s] [ 25%] 2025-12-04T10:24:47.8065506Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7910s] [ 25%] 2025-12-04T10:24:47.8067413Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_as_masked_fill_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7230s] [ 25%] 2025-12-04T10:24:47.8069302Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_index_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9390s] [ 25%] 2025-12-04T10:24:47.8071139Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_select_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.8593s] [ 26%] 2025-12-04T10:24:47.8073009Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_indirect_load_broadcast_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6017s] [ 26%] 2025-12-04T10:24:47.8075439Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inductor_multiple_specializations_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 26%] 2025-12-04T10:24:47.8077904Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_input_mutation2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 26%] 2025-12-04T10:24:47.8079831Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_input_mutation5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7014s] [ 27%] 2025-12-04T10:24:47.8081698Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_isin_tensor_scalar_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.2062s] [ 27%] 2025-12-04T10:24:47.8083586Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_l1_loss_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0284s] [ 27%] 2025-12-04T10:24:47.8085639Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_grid_use_block_ptr_True_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (triton backend is required for cpu) [ 27%] 2025-12-04T10:24:47.8088038Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_pointwise_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.9480s] [ 28%] 2025-12-04T10:24:47.8089912Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_lerp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 28%] 2025-12-04T10:24:47.8091792Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_like_channels_last_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0677s] [ 28%] 2025-12-04T10:24:47.8093624Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_like_rands_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1988s] [ 28%] 2025-12-04T10:24:47.8095496Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linear1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.7485s] [ 28%] 2025-12-04T10:24:47.8097700Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linear_dynamic_maxautotune_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 29%] 2025-12-04T10:24:47.8099973Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linspace1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7894s] [ 29%] 2025-12-04T10:24:47.8101779Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_logaddexp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6523s] [ 29%] 2025-12-04T10:24:47.8103659Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_logsumexp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.4163s] [ 29%] 2025-12-04T10:24:47.8105884Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mark_unbacked_with_hint_override_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 30%] 2025-12-04T10:24:47.8108113Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_masked_fill_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9265s] [ 30%] 2025-12-04T10:24:47.8109942Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_masked_scatter_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1667s] [ 30%] 2025-12-04T10:24:47.8111803Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_matmul_layer_norm_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.9167s] [ 30%] 2025-12-04T10:24:47.8113627Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_min_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9228s] [ 30%] 2025-12-04T10:24:47.8115410Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.5319s] [ 31%] 2025-12-04T10:24:47.8117226Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.8904s] [ 31%] 2025-12-04T10:24:47.8119044Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d7_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3515s] [ 31%] 2025-12-04T10:24:47.8121034Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d_with_indices_backward_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.4672s] [ 31%] 2025-12-04T10:24:47.8123003Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mixed_mm_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8105s] [ 32%] 2025-12-04T10:24:47.8124796Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mul_index_expr_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7637s] [ 32%] 2025-12-04T10:24:47.8126808Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multi_gpu_device_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (requires multiple cuda devices) [ 32%] 2025-12-04T10:24:47.8128880Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_prime_size_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8632s] [ 32%] 2025-12-04T10:24:47.8130774Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_var_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.3355s] [ 33%] 2025-12-04T10:24:47.8132650Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_var_lowp_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.7045s] [ 33%] 2025-12-04T10:24:47.8134583Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7405s] [ 33%] 2025-12-04T10:24:47.8136481Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_new_empty_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1180s] [ 33%] 2025-12-04T10:24:47.8138497Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8203s] [ 33%] 2025-12-04T10:24:47.8140492Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_nonzero_unbacked_refinement_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7986s] [ 34%] 2025-12-04T10:24:47.8142469Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_output_strides_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 34%] 2025-12-04T10:24:47.8144424Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pattern_matcher_multi_user_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.5623s] [ 34%] 2025-12-04T10:24:47.8146305Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_permute2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.2999s] [ 34%] 2025-12-04T10:24:47.8148232Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_bessel_j1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8457s] [ 35%] 2025-12-04T10:24:47.8150212Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_chebyshev_polynomial_t_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7835s] [ 35%] 2025-12-04T10:24:47.8152253Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_chebyshev_polynomial_v_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7909s] [ 35%] 2025-12-04T10:24:47.8154208Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_digamma_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8191s] [ 35%] 2025-12-04T10:24:47.8156161Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_erfcx_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8719s] [ 35%] 2025-12-04T10:24:47.8158154Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_erfinv_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7676s] [ 36%] 2025-12-04T10:24:47.8160022Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_expit_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9405s] [ 36%] 2025-12-04T10:24:47.8161900Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_gammainc_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8877s] [ 36%] 2025-12-04T10:24:47.8163755Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_i1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7902s] [ 36%] 2025-12-04T10:24:47.8165692Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_laguerre_polynomial_l_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7668s] [ 37%] 2025-12-04T10:24:47.8167829Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7740s] [ 37%] 2025-12-04T10:24:47.8169849Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_log1p_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9120s] [ 37%] 2025-12-04T10:24:47.8171780Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_modified_bessel_k0_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7937s] [ 37%] 2025-12-04T10:24:47.8173735Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_psi_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8201s] [ 38%] 2025-12-04T10:24:47.8175587Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_round_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7649s] [ 38%] 2025-12-04T10:24:47.8177587Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7856s] [ 38%] 2025-12-04T10:24:47.8179658Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_xlog1py_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0529s] [ 38%] 2025-12-04T10:24:47.8181481Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1768s] [ 38%] 2025-12-04T10:24:47.8183353Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow_by_natural_log2_dynamic_shapes_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7874s] [ 39%] 2025-12-04T10:24:47.8185353Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow_symfloat_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7807s] [ 39%] 2025-12-04T10:24:47.8187311Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_profiler_mark_wrapper_call_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 39%] 2025-12-04T10:24:47.8189634Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_randint_kernel_count_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Only valid for GPU!) [ 39%] 2025-12-04T10:24:47.8191746Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_randn_generator_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7741s] [ 40%] 2025-12-04T10:24:47.8193590Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_randn_like_empty_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1494s] [ 40%] 2025-12-04T10:24:47.8195525Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reflection_pad2d_backward_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [13.7986s] [ 40%] 2025-12-04T10:24:47.8197449Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_clone_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0538s] [ 40%] 2025-12-04T10:24:47.8199314Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_copy_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.8878s] [ 41%] 2025-12-04T10:24:47.8201221Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_slice_scatter_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8688s] [ 41%] 2025-12-04T10:24:47.8203144Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.8374s] [ 41%] 2025-12-04T10:24:47.8205130Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_Tensor_decomp_int32_nd_1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3587s] [ 41%] 2025-12-04T10:24:47.8207250Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_Tensor_decomp_int64_nd_1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8821s] [ 41%] 2025-12-04T10:24:47.8209339Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reuse_buffers_with_aliasing_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.2674s] [ 42%] 2025-12-04T10:24:47.8211252Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scalar_output_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1583s] [ 42%] 2025-12-04T10:24:47.8213172Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scaled_dot_product_attention_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.4229s] [ 42%] 2025-12-04T10:24:47.8215048Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8239s] [ 42%] 2025-12-04T10:24:47.8216842Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter5_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0861s] [ 43%] 2025-12-04T10:24:47.8218886Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_add1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Flaky test, needs debugging) [ 43%] 2025-12-04T10:24:47.8220880Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_reduce1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8170s] [ 43%] 2025-12-04T10:24:47.8222752Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_reduce2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9433s] [ 43%] 2025-12-04T10:24:47.8224830Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_searchsorted_broadcast_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (triton backend is required for cpu) [ 43%] 2025-12-04T10:24:47.8226969Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_searchsorted_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1492s] [ 44%] 2025-12-04T10:24:47.8228771Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sgn_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7791s] [ 44%] 2025-12-04T10:24:47.8230650Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_shape_prop_torch_ones_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9941s] [ 44%] 2025-12-04T10:24:47.8232483Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_silu_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9399s] [ 44%] 2025-12-04T10:24:47.8234271Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_simplify_loops_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.1852s] [ 45%] 2025-12-04T10:24:47.8236107Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_single_elem_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.6871s] [ 45%] 2025-12-04T10:24:47.8237946Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sizehint_issue1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.2657s] [ 45%] 2025-12-04T10:24:47.8239757Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1686s] [ 45%] 2025-12-04T10:24:47.8241558Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_scatter4_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9082s] [ 46%] 2025-12-04T10:24:47.8243483Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_scatter_dtype_consistency_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.6977s] [ 46%] 2025-12-04T10:24:47.8245510Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_view_with_graph_break_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.4791s] [ 46%] 2025-12-04T10:24:47.8247357Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sort_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.1733s] [ 46%] 2025-12-04T10:24:47.8249158Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_special_polygamma_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7319s] [ 46%] 2025-12-04T10:24:47.8251042Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_cumsum_index_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0597s] [ 47%] 2025-12-04T10:24:47.8252969Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_reduction_dynamic_shape_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8518s] [ 47%] 2025-12-04T10:24:47.8254912Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_with_integer_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8341s] [ 47%] 2025-12-04T10:24:47.8256803Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sqrt_dynamic_shapes_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.8041s] [ 47%] 2025-12-04T10:24:47.8258716Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum_dtype_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.9512s] [ 48%] 2025-12-04T10:24:47.8260528Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum_keepdims_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7920s] [ 48%] 2025-12-04T10:24:47.8262389Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_tensor3_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.8204s] [ 48%] 2025-12-04T10:24:47.8264497Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (triton backend is required for cpu) [ 48%] 2025-12-04T10:24:47.8266697Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_tmp_not_defined_issue2_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.2423s] [ 48%] 2025-12-04T10:24:47.8268603Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_to_device_constant_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.7987s] [ 49%] 2025-12-04T10:24:47.8270473Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_to_memory_format_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.2691s] [ 49%] 2025-12-04T10:24:47.8272337Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_transpose_add_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.6064s] [ 49%] 2025-12-04T10:24:47.8274323Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_float16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Testing mixed devices) [ 49%] 2025-12-04T10:24:47.8276487Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_float64_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (Testing mixed devices) [ 50%] 2025-12-04T10:24:47.8278636Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int16_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (Testing mixed devices) [ 50%] 2025-12-04T10:24:47.8280814Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int32_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (Testing mixed devices) [ 50%] 2025-12-04T10:24:47.8282953Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int8_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (Testing mixed devices) [ 50%] 2025-12-04T10:24:47.8284970Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_bilinear2d_a_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [4.6216s] [ 51%] 2025-12-04T10:24:47.8286903Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_bilinear2d_b_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.4974s] [ 51%] 2025-12-04T10:24:47.8288967Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_cat_conv_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [3.0350s] [ 51%] 2025-12-04T10:24:47.8290854Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_nearest3d_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [2.8074s] [ 51%] 2025-12-04T10:24:47.8292734Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_var_mean_div_by_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.3061s] [ 51%] 2025-12-04T10:24:47.8294573Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_vertical_fusion1_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [1.0517s] [ 52%] 2025-12-04T10:24:47.8296417Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_view_as_real_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py PASSED [0.7576s] [ 52%] 2025-12-04T10:24:47.8298544Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_view_uint8_through_differing_bitwidths_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.3831s] [ 52%] 2025-12-04T10:24:47.8300600Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_zero_dim_reductions_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 52%] 2025-12-04T10:24:47.8302525Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_zeros_dynamic_shapes_cpu <- test/inductor/test_torchinductor.py XFAIL [0.8049s] [ 53%] 2025-12-04T10:24:47.8304625Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test__dyn_quant_matmul_4bit_bf16_input_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (No _dyn_quant_matmul_4bit implementation on CUDA) [ 53%] 2025-12-04T10:24:47.8306870Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_avg_pool1d_argmax_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.1816s] [ 53%] 2025-12-04T10:24:47.8308816Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_avg_pool2d2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1770s] [ 53%] 2025-12-04T10:24:47.8310698Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex6_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6393s] [ 53%] 2025-12-04T10:24:47.8312539Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5167s] [ 54%] 2025-12-04T10:24:47.8314423Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adding_tensor_offsets_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2681s] [ 54%] 2025-12-04T10:24:47.8316308Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_addmv_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6971s] [ 54%] 2025-12-04T10:24:47.8318217Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_allow_reuse_disable_if_exceed_peak_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.0039s] [ 54%] 2025-12-04T10:24:47.8320131Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_angle_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7784s] [ 55%] 2025-12-04T10:24:47.8322064Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_aoti_eager_support_str_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Requires sm80) [ 55%] 2025-12-04T10:24:47.8324048Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8811s] [ 55%] 2025-12-04T10:24:47.8325980Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.3287s] [ 55%] 2025-12-04T10:24:47.8327965Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin_with_nan_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.7312s] [ 56%] 2025-12-04T10:24:47.8329855Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_min_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8705s] [ 56%] 2025-12-04T10:24:47.8331709Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_to_float_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.7619s] [ 56%] 2025-12-04T10:24:47.8333680Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_assert_alignment_op_name_pass_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0033s] [ 56%] 2025-12-04T10:24:47.8335578Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0660s] [ 56%] 2025-12-04T10:24:47.8337422Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d7_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.2550s] [ 57%] 2025-12-04T10:24:47.8339354Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d_backward2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [8.7067s] [ 57%] 2025-12-04T10:24:47.8341262Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool3d_backward_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.1273s] [ 57%] 2025-12-04T10:24:47.8343198Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool_errors_with_uint_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3623s] [ 57%] 2025-12-04T10:24:47.8345053Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_baddbmm_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.3144s] [ 58%] 2025-12-04T10:24:47.8346867Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_batch_norm_2d_2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [6.9729s] [ 58%] 2025-12-04T10:24:47.8348709Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bernoulli1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3068s] [ 58%] 2025-12-04T10:24:47.8350533Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bitwise2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3577s] [ 58%] 2025-12-04T10:24:47.8352372Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bitwise3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9665s] [ 58%] 2025-12-04T10:24:47.8354143Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bool_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4012s] [ 59%] 2025-12-04T10:24:47.8355989Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_add_autotune_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.1487s] [ 59%] 2025-12-04T10:24:47.8357942Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int16_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5243s] [ 59%] 2025-12-04T10:24:47.8359908Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5260s] [ 59%] 2025-12-04T10:24:47.8361863Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5003s] [ 60%] 2025-12-04T10:24:47.8363801Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5077s] [ 60%] 2025-12-04T10:24:47.8365752Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_uint8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5318s] [ 60%] 2025-12-04T10:24:47.8367698Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int64_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5227s] [ 60%] 2025-12-04T10:24:47.8369713Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int64_uint8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.5137s] [ 61%] 2025-12-04T10:24:47.8371687Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int8_int16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4925s] [ 61%] 2025-12-04T10:24:47.8373613Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int8_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4946s] [ 61%] 2025-12-04T10:24:47.8375558Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_uint8_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4944s] [ 61%] 2025-12-04T10:24:47.8377514Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_nd_tiling_True_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8458s] [ 61%] 2025-12-04T10:24:47.8379579Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2464s] [ 62%] 2025-12-04T10:24:47.8381610Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_builtins_round_int_ndigits_pos_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1685s] [ 62%] 2025-12-04T10:24:47.8383582Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_inplace_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 62%] 2025-12-04T10:24:47.8385521Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_unbacked_empty_1d_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5211s] [ 62%] 2025-12-04T10:24:47.8387616Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_clamp_type_promotion_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2345s] [ 63%] 2025-12-04T10:24:47.8389844Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_complex_from_real_imag_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1623s] [ 63%] 2025-12-04T10:24:47.8391779Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_complex_memory_overlap_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0033s] [ 63%] 2025-12-04T10:24:47.8393850Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_config_option_dont_assume_alignment_cudagraphs_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9696s] [ 63%] 2025-12-04T10:24:47.8395864Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_constant_pad_2d_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.2581s] [ 64%] 2025-12-04T10:24:47.8397799Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv2d_backward_channels_last_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7657s] [ 64%] 2025-12-04T10:24:47.8400116Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv3d_channels_last_use_block_ptr_True_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (only support cpu conv3d channels_last) [ 64%] 2025-12-04T10:24:47.8402274Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv_backward_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [1.0111s] [ 64%] 2025-12-04T10:24:47.8404252Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_convolution5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4143s] [ 64%] 2025-12-04T10:24:47.8406054Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cos_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0455s] [ 65%] 2025-12-04T10:24:47.8407963Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_scalar_with_gpu_tensor_cpp_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.4830s] [ 65%] 2025-12-04T10:24:47.8409947Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1853s] [ 65%] 2025-12-04T10:24:47.8411905Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_tensor_with_cpu_tensor_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7281s] [ 65%] 2025-12-04T10:24:47.8413869Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_tensor_with_gpu_tensor_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0208s] [ 66%] 2025-12-04T10:24:47.8415730Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cumsum_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.5790s] [ 66%] 2025-12-04T10:24:47.8417620Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cumsum_pattern_matcher_issue_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5065s] [ 66%] 2025-12-04T10:24:47.8419588Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_op_1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4846s] [ 66%] 2025-12-04T10:24:47.8421519Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_op_fixed_layout_channels_last_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3421s] [ 66%] 2025-12-04T10:24:47.8423484Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4593s] [ 67%] 2025-12-04T10:24:47.8425230Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4230s] [ 67%] 2025-12-04T10:24:47.8427067Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div_softmax_symfloat_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.5042s] [ 67%] 2025-12-04T10:24:47.8428987Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dropout2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 67%] 2025-12-04T10:24:47.8430997Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dropout_deterministic_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 68%] 2025-12-04T10:24:47.8432979Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtype_sympy_expr_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.1233s] [ 68%] 2025-12-04T10:24:47.8435070Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (uses bfloat16 which requires SM >= 80) [ 68%] 2025-12-04T10:24:47.8437405Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_int8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 68%] 2025-12-04T10:24:47.8439797Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_uint8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 69%] 2025-12-04T10:24:47.8442220Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float64_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 69%] 2025-12-04T10:24:47.8444586Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float64_int8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 69%] 2025-12-04T10:24:47.8446909Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_float16_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (uses bfloat16 which requires SM >= 80) [ 69%] 2025-12-04T10:24:47.8449246Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_float32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 69%] 2025-12-04T10:24:47.8451568Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_int32_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 70%] 2025-12-04T10:24:47.8453883Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int64_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0029s] (uses bfloat16 which requires SM >= 80) [ 70%] 2025-12-04T10:24:47.8456229Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_embedding_bag_byte_unpack_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (No cuda implementation (it returns empty)) [ 70%] 2025-12-04T10:24:47.8458404Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_erfc_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2648s] [ 70%] 2025-12-04T10:24:47.8460234Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_expanded_reduction_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9551s] [ 71%] 2025-12-04T10:24:47.8462062Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_expm1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.0527s] [ 71%] 2025-12-04T10:24:47.8463926Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fallback_mutable_op_basic_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0472s] [ 71%] 2025-12-04T10:24:47.8465960Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_flexible_layout_immutable_free_symbols_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0035s] [ 71%] 2025-12-04T10:24:47.8467897Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_flip_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4338s] [ 71%] 2025-12-04T10:24:47.8469928Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_forced_buffer_realize_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 72%] 2025-12-04T10:24:47.8471918Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fractional_max_pool2d3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7043s] [ 72%] 2025-12-04T10:24:47.8473868Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fractional_max_pool2d5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.1981s] [ 72%] 2025-12-04T10:24:47.8475829Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_boolean_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2936s] [ 72%] 2025-12-04T10:24:47.8477696Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_like_sliced_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2206s] [ 73%] 2025-12-04T10:24:47.8479786Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_like_transposed_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3410s] [ 73%] 2025-12-04T10:24:47.8481676Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_full_truncation_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.2580s] [ 73%] 2025-12-04T10:24:47.8483610Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_functionalize_rng_wrappers_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0553s] [ 73%] 2025-12-04T10:24:47.8485535Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fuse_tiled_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3960s] [ 74%] 2025-12-04T10:24:47.8487451Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_both_scalars_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7699s] [ 74%] 2025-12-04T10:24:47.8489673Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_constant_tensor1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6920s] [ 74%] 2025-12-04T10:24:47.8491697Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_scalar_inputs_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7807s] [ 74%] 2025-12-04T10:24:47.8493623Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_hardtanh_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2913s] [ 74%] 2025-12-04T10:24:47.8495551Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_horizonal_fusion1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5100s] [ 75%] 2025-12-04T10:24:47.8497464Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_dynamic_shapes_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.1352s] [ 75%] 2025-12-04T10:24:47.8499585Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_propagation_nested_indirect_indexing_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5652s] [ 75%] 2025-12-04T10:24:47.8501630Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_as_masked_fill_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6961s] [ 75%] 2025-12-04T10:24:47.8503580Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_failed_reinplace_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3016s] [ 76%] 2025-12-04T10:24:47.8505499Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_index_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3408s] [ 76%] 2025-12-04T10:24:47.8507384Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_reinplace_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2902s] [ 76%] 2025-12-04T10:24:47.8509422Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.6402s] [ 76%] 2025-12-04T10:24:47.8531191Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inner_reduction_detection_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4824s] [ 76%] 2025-12-04T10:24:47.8533408Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inplace_activations_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7246s] [ 77%] 2025-12-04T10:24:47.8535574Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_input_mutation3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 77%] 2025-12-04T10:24:47.8537522Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_input_mutation5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1649s] [ 77%] 2025-12-04T10:24:47.8539510Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isin_tensor_scalar_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9835s] [ 77%] 2025-12-04T10:24:47.8541456Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.8937s] [ 78%] 2025-12-04T10:24:47.8543428Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.8834s] [ 78%] 2025-12-04T10:24:47.8545276Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py FAILED [0.8757s] [ 78%] 2025-12-04T10:24:47.8546240Z 2025-12-04T10:24:47.8546386Z ==================================== RERUNS ==================================== 2025-12-04T10:24:47.8546998Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8547583Z Traceback (most recent call last): 2025-12-04T10:24:47.8548287Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8549102Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8550019Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8550801Z return check_codegen( 2025-12-04T10:24:47.8551543Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8552438Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8553313Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8554089Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8554543Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8554883Z 2025-12-04T10:24:47.8555101Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8556202Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8557069Z 2025-12-04T10:24:47.8557344Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8557991Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8558529Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8559193Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8560348Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 5), ('async_compile_cache_hit', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8561256Z graph_break [] 2025-12-04T10:24:47.8561635Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8562808Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8563780Z warnings.warn( 2025-12-04T10:24:47.8564710Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8565682Z warnings.warn( 2025-12-04T10:24:47.8566150Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8566730Z Traceback (most recent call last): 2025-12-04T10:24:47.8567386Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8568194Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8569097Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8569882Z return check_codegen( 2025-12-04T10:24:47.8570636Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8571519Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8572383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8573168Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8573617Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8573938Z 2025-12-04T10:24:47.8574153Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8575236Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8576146Z 2025-12-04T10:24:47.8576413Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8577047Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8577565Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8578229Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8579468Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 5), ('async_compile_cache_hit', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8580383Z graph_break [] 2025-12-04T10:24:47.8580750Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8581864Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8582851Z warnings.warn( 2025-12-04T10:24:47.8583741Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8584718Z warnings.warn( 2025-12-04T10:24:47.8585103Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8585633Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8586281Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8587443Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 5), ('async_compile_cache_hit', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8588500Z graph_break [] 2025-12-04T10:24:47.8588878Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8590074Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8591049Z warnings.warn( 2025-12-04T10:24:47.8591949Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8592950Z warnings.warn( 2025-12-04T10:24:47.8593267Z =================================== FAILURES =================================== 2025-12-04T10:24:47.8593883Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8594465Z Traceback (most recent call last): 2025-12-04T10:24:47.8595106Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8595919Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8596836Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8597621Z return check_codegen( 2025-12-04T10:24:47.8598363Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8599259Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8600127Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8600904Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8601355Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8601673Z 2025-12-04T10:24:47.8601906Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8602993Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8603902Z 2025-12-04T10:24:47.8604172Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8604819Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8605358Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8606025Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8607181Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 5), ('async_compile_cache_hit', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8608086Z graph_break [] 2025-12-04T10:24:47.8608462Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8609550Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8610528Z warnings.warn( 2025-12-04T10:24:47.8611420Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8612385Z warnings.warn( 2025-12-04T10:24:47.8612760Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8613293Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8613952Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8615116Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 5), ('async_compile_cache_hit', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8616002Z graph_break [] 2025-12-04T10:24:47.8616379Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8617554Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8618613Z warnings.warn( 2025-12-04T10:24:47.8619499Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8620526Z warnings.warn( 2025-12-04T10:24:47.8620909Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8621429Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8622087Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8623244Z inductor [('triton_bundler_save_kernel', 24), ('async_compile_cache_miss', 6), ('fxgraph_cache_miss', 5), ('async_compile_cache_hit', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8624146Z graph_break [] 2025-12-04T10:24:47.8624510Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8625609Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8626585Z warnings.warn( 2025-12-04T10:24:47.8627466Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8628438Z warnings.warn( 2025-12-04T10:24:47.8629617Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-3d2f5f1cdf9aac8a.xml - 2025-12-04T10:24:47.8630937Z =========================== short test summary info ============================ 2025-12-04T10:24:47.8632115Z FAILED [0.8757s] inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8633154Z 2025-12-04T10:24:47.8633375Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8634459Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8635324Z 2025-12-04T10:24:47.8635604Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8636199Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:24:47.8636767Z == 1 failed, 257 passed, 44 skipped, 41 xfailed, 2 rerun in 390.29s (0:06:30) == 2025-12-04T10:24:47.8637263Z Got exit code 1 2025-12-04T10:24:47.8637536Z Retrying single test... 2025-12-04T10:24:47.8638163Z W1204 10:22:25.388000 45853 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:24:47.8639586Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-53adf85a57cb1562.xml 2025-12-04T10:24:47.8640719Z ============================= test session starts ============================== 2025-12-04T10:24:47.8641389Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:24:47.8641986Z cachedir: .pytest_cache 2025-12-04T10:24:47.8642706Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:24:47.8643499Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:24:47.8643850Z configfile: pytest.ini 2025-12-04T10:24:47.8644583Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:24:47.8645500Z collecting ... collected 1750 items / 438 deselected / 1312 selected 2025-12-04T10:24:47.8646765Z stepcurrent: skipping 342 already run items. Running only test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8647829Z Running 1 items in this shard 2025-12-04T10:24:47.8648083Z 2025-12-04T10:24:47.8649115Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [3.9187s] [100%] 2025-12-04T10:24:47.8651087Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.8566s] [100%] 2025-12-04T10:24:47.8652956Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py FAILED [0.8898s] [100%] 2025-12-04T10:24:47.8653909Z 2025-12-04T10:24:47.8654066Z ==================================== RERUNS ==================================== 2025-12-04T10:24:47.8654668Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8655249Z Traceback (most recent call last): 2025-12-04T10:24:47.8655905Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8656702Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8657611Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8658448Z return check_codegen( 2025-12-04T10:24:47.8659204Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8660178Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8661057Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8661899Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8662347Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8662668Z 2025-12-04T10:24:47.8662886Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8663971Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8664834Z 2025-12-04T10:24:47.8665113Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8665747Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8666262Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8667107Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8668159Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8668754Z graph_break [] 2025-12-04T10:24:47.8669134Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8670239Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8671403Z warnings.warn( 2025-12-04T10:24:47.8672281Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8673250Z warnings.warn( 2025-12-04T10:24:47.8673729Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8674310Z Traceback (most recent call last): 2025-12-04T10:24:47.8674960Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8675840Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8676758Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8677552Z return check_codegen( 2025-12-04T10:24:47.8678299Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8679193Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8680054Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8680832Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8681280Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8681602Z 2025-12-04T10:24:47.8681837Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8682912Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8683788Z 2025-12-04T10:24:47.8684058Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8684693Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8685222Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8686056Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8687104Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8687903Z graph_break [] 2025-12-04T10:24:47.8688323Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8689507Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8690492Z warnings.warn( 2025-12-04T10:24:47.8691386Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8692362Z warnings.warn( 2025-12-04T10:24:47.8692727Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8693250Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8693900Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8694934Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8695715Z graph_break [] 2025-12-04T10:24:47.8696087Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8697178Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8698134Z warnings.warn( 2025-12-04T10:24:47.8699083Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8700045Z warnings.warn( 2025-12-04T10:24:47.8700354Z =================================== FAILURES =================================== 2025-12-04T10:24:47.8700947Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8701523Z Traceback (most recent call last): 2025-12-04T10:24:47.8702173Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8703072Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8703971Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8704777Z return check_codegen( 2025-12-04T10:24:47.8705524Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8706394Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8707247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8708113Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8708553Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8708868Z 2025-12-04T10:24:47.8709078Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8710153Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8711019Z 2025-12-04T10:24:47.8711286Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8711907Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8712416Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8713250Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8714286Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8714876Z graph_break [] 2025-12-04T10:24:47.8715227Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8716357Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8717316Z warnings.warn( 2025-12-04T10:24:47.8718199Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8719146Z warnings.warn( 2025-12-04T10:24:47.8719527Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8720055Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8720696Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8721734Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8722520Z graph_break [] 2025-12-04T10:24:47.8722891Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8723973Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8724948Z warnings.warn( 2025-12-04T10:24:47.8725836Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8726801Z warnings.warn( 2025-12-04T10:24:47.8727167Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8727695Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8728353Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8729389Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8730176Z graph_break [] 2025-12-04T10:24:47.8730631Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8731732Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8732729Z warnings.warn( 2025-12-04T10:24:47.8733618Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8734586Z warnings.warn( 2025-12-04T10:24:47.8735749Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-53adf85a57cb1562.xml - 2025-12-04T10:24:47.8737073Z =========================== short test summary info ============================ 2025-12-04T10:24:47.8738335Z FAILED [0.8898s] inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8739336Z 2025-12-04T10:24:47.8739570Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8740649Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8741513Z 2025-12-04T10:24:47.8741783Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8742380Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:24:47.8742914Z ================== 1 failed, 438 deselected, 2 rerun in 5.79s ================== 2025-12-04T10:24:47.8743366Z Got exit code 1 2025-12-04T10:24:47.8743623Z Retrying single test... 2025-12-04T10:24:47.8744307Z W1204 10:22:44.660000 46123 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:24:47.8745723Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-ae6d18145ba692e1.xml 2025-12-04T10:24:47.8746847Z ============================= test session starts ============================== 2025-12-04T10:24:47.8747516Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:24:47.8748118Z cachedir: .pytest_cache 2025-12-04T10:24:47.8748841Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:24:47.8749626Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:24:47.8749986Z configfile: pytest.ini 2025-12-04T10:24:47.8750722Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:24:47.8751641Z collecting ... collected 1750 items / 438 deselected / 1312 selected 2025-12-04T10:24:47.8752842Z stepcurrent: skipping 342 already run items. Running only test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8753915Z Running 1 items in this shard 2025-12-04T10:24:47.8754125Z 2025-12-04T10:24:47.8755056Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [3.9215s] [100%] 2025-12-04T10:24:47.8757024Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.8545s] [100%] 2025-12-04T10:24:47.8758884Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py FAILED [0.8562s] [100%] 2025-12-04T10:24:47.8759907Z 2025-12-04T10:24:47.8760054Z ==================================== RERUNS ==================================== 2025-12-04T10:24:47.8760661Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8761274Z Traceback (most recent call last): 2025-12-04T10:24:47.8761921Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8762730Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8763647Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8764409Z return check_codegen( 2025-12-04T10:24:47.8765155Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8766053Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8766921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8767697Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8768149Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8768469Z 2025-12-04T10:24:47.8768699Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8769778Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8770645Z 2025-12-04T10:24:47.8770910Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8771548Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8772075Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8772960Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8773992Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8774598Z graph_break [] 2025-12-04T10:24:47.8774975Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8776066Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8777043Z warnings.warn( 2025-12-04T10:24:47.8777933Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8778983Z warnings.warn( 2025-12-04T10:24:47.8779448Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8780035Z Traceback (most recent call last): 2025-12-04T10:24:47.8780692Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8781484Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8782404Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8783184Z return check_codegen( 2025-12-04T10:24:47.8783930Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8784812Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8785673Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8786455Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8786905Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8787225Z 2025-12-04T10:24:47.8787504Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8788750Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8789679Z 2025-12-04T10:24:47.8789963Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8790598Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8791116Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8791968Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8793011Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8793609Z graph_break [] 2025-12-04T10:24:47.8793989Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8795091Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8796069Z warnings.warn( 2025-12-04T10:24:47.8796951Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8797922Z warnings.warn( 2025-12-04T10:24:47.8798302Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8798820Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8799474Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8800523Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8801360Z graph_break [] 2025-12-04T10:24:47.8801724Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8802821Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8803796Z warnings.warn( 2025-12-04T10:24:47.8804685Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8805640Z warnings.warn( 2025-12-04T10:24:47.8805954Z =================================== FAILURES =================================== 2025-12-04T10:24:47.8806566Z _________ DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda __________ 2025-12-04T10:24:47.8807135Z Traceback (most recent call last): 2025-12-04T10:24:47.8807789Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 8265, in test_isinf 2025-12-04T10:24:47.8808587Z self.common(fn, [torch.tensor(values, dtype=dtype)], check_lowp=False) 2025-12-04T10:24:47.8809502Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 457, in common 2025-12-04T10:24:47.8810268Z return check_codegen( 2025-12-04T10:24:47.8811014Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor_codegen_dynamic_shapes.py", line 87, in check_codegen 2025-12-04T10:24:47.8811911Z code = run_and_get_triton_code(run, *example_inputs, **kwargs) 2025-12-04T10:24:47.8812774Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T10:24:47.8813548Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T10:24:47.8813990Z AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8814311Z 2025-12-04T10:24:47.8814616Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8815692Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8816601Z 2025-12-04T10:24:47.8816870Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8817502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8818034Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8818947Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8819996Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8820606Z graph_break [] 2025-12-04T10:24:47.8820992Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8822086Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8823066Z warnings.warn( 2025-12-04T10:24:47.8823969Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8824935Z warnings.warn( 2025-12-04T10:24:47.8825312Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8825840Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8826497Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8827532Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8828362Z graph_break [] 2025-12-04T10:24:47.8828747Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8829843Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8830808Z warnings.warn( 2025-12-04T10:24:47.8831699Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8832677Z warnings.warn( 2025-12-04T10:24:47.8833063Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:24:47.8833582Z stats [('calls_captured', 10), ('unique_graphs', 3)] 2025-12-04T10:24:47.8834249Z aot_autograd [('total', 5), ('autograd_cache_miss', 5), ('autograd_cache_saved', 3), ('ok', 3), ('not_ok', 2)] 2025-12-04T10:24:47.8835300Z inductor [('triton_bundler_save_kernel', 24), ('fxgraph_cache_miss', 5), ('async_compile_cache_miss', 3), ('triton_bundler_save_static_autotuner', 3)] 2025-12-04T10:24:47.8836072Z graph_break [] 2025-12-04T10:24:47.8836443Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:24:47.8837537Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8838498Z warnings.warn( 2025-12-04T10:24:47.8839374Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T10:24:47.8840337Z warnings.warn( 2025-12-04T10:24:47.8841509Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-ae6d18145ba692e1.xml - 2025-12-04T10:24:47.8842914Z =========================== short test summary info ============================ 2025-12-04T10:24:47.8844087Z FAILED [0.8562s] inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T10:24:47.8845124Z 2025-12-04T10:24:47.8845342Z To execute this test, run the following from the base repo dir: 2025-12-04T10:24:47.8846430Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_torchinductor_codegen_dynamic_shapes.py DynamicShapesCodegenGPUTests.test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8847290Z 2025-12-04T10:24:47.8847569Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:24:47.8848147Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:24:47.8848676Z ================== 1 failed, 438 deselected, 2 rerun in 5.76s ================== 2025-12-04T10:24:47.8849126Z Got exit code 1 2025-12-04T10:24:47.8849938Z FAILED CONSISTENTLY: test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda 2025-12-04T10:24:47.8851132Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:24:47.8852130Z W1204 10:23:03.834000 46393 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:24:47.8853542Z Test results will be stored in test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-7f9a6ec5b091c39f.xml 2025-12-04T10:24:47.8854769Z ============================= test session starts ============================== 2025-12-04T10:24:47.8855465Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:24:47.8856064Z cachedir: .pytest_cache 2025-12-04T10:24:47.8856821Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:24:47.8857597Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:24:47.8857945Z configfile: pytest.ini 2025-12-04T10:24:47.8858745Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:24:47.8859661Z collecting ... collected 1750 items / 343 deselected / 1407 selected 2025-12-04T10:24:47.8860170Z stepcurrent: skipping 343 already run items. 2025-12-04T10:24:47.8860564Z Running 96 items in this shard 2025-12-04T10:24:47.8860774Z 2025-12-04T10:24:47.8862664Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_layer_norm_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py W1204 10:23:09.867000 46393 site-packages/torch/_inductor/debug.py:518] [0/0_1] model__1_inference_0 debug trace: /var/lib/jenkins/workspace/test/torch_compile_debug/run_2025_12_04_10_23_08_949803-pid_46393/torchinductor/model__1_inference_0.0 2025-12-04T10:24:47.8864775Z PASSED [3.7939s] [ 1%] 2025-12-04T10:24:47.8865791Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_linspace3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1320s] [ 2%] 2025-12-04T10:24:47.8867750Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_lite_regional_compile_repeated_blocks_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4206s] [ 3%] 2025-12-04T10:24:47.8869691Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_log1p_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.5842s] [ 4%] 2025-12-04T10:24:47.8871519Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_logcumsumexp_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [6.5718s] [ 5%] 2025-12-04T10:24:47.8873481Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_logcumsumexp_zero_dim_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4518s] [ 6%] 2025-12-04T10:24:47.8875350Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_long_tensor_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8150s] [ 7%] 2025-12-04T10:24:47.8877619Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mark_unbacked_with_hint_override_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 8%] 2025-12-04T10:24:47.8879868Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_masked_fill_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8131s] [ 9%] 2025-12-04T10:24:47.8881701Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.0628s] [ 10%] 2025-12-04T10:24:47.8883522Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.7577s] [ 11%] 2025-12-04T10:24:47.8885436Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d_with_indices_backward4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [12.9904s] [ 12%] 2025-12-04T10:24:47.8887457Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_min_max_reduction_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 13%] 2025-12-04T10:24:47.8889604Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mix_device_index_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5715s] [ 14%] 2025-12-04T10:24:47.8891984Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mm_views_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py W1204 10:23:42.268000 46393 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:24:47.8893510Z XFAIL [0.9809s] [ 15%] 2025-12-04T10:24:47.8894405Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mul_softmax_symfloat_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [2.7710s] [ 16%] 2025-12-04T10:24:47.8895653Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multi_device_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py W1204 10:23:46.197000 46393 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.8896055Z W1204 10:23:46.201000 46393 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.8896458Z W1204 10:23:46.204000 46393 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:24:47.8896577Z PASSED [1.1475s] [ 17%] 2025-12-04T10:24:47.8897560Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multi_gpu_recompile_on_index_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 18%] 2025-12-04T10:24:47.8898491Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_any_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.7757s] [ 19%] 2025-12-04T10:24:47.8899479Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_sum_low_prec_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9278s] [ 20%] 2025-12-04T10:24:47.8900362Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_var_lowp_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.7702s] [ 21%] 2025-12-04T10:24:47.8901410Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mutable_custom_op_fixed_layout_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7403s] [ 22%] 2025-12-04T10:24:47.8902405Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mutations_loop_fusion_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 23%] 2025-12-04T10:24:47.8903253Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_nan_to_num_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8310s] [ 25%] 2025-12-04T10:24:47.8904087Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_neg_max_uint8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2758s] [ 26%] 2025-12-04T10:24:47.8905037Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.1201s] [ 27%] 2025-12-04T10:24:47.8905933Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_norm_constant_overflow_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8102s] [ 28%] 2025-12-04T10:24:47.8906758Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_one_hot_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5816s] [ 29%] 2025-12-04T10:24:47.8907678Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_output_strides_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 30%] 2025-12-04T10:24:47.8908502Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pad_cast_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4523s] [ 31%] 2025-12-04T10:24:47.8909387Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pad_single_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1735s] [ 32%] 2025-12-04T10:24:47.8910210Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_permute2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.3168s] [ 33%] 2025-12-04T10:24:47.8911085Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_airy_ai_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1367s] [ 34%] 2025-12-04T10:24:47.8912031Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_chebyshev_polynomial_u_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [1.2833s] [ 35%] 2025-12-04T10:24:47.8912904Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_erf_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2526s] [ 36%] 2025-12-04T10:24:47.8913764Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_erfcx_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2501s] [ 37%] 2025-12-04T10:24:47.8914618Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_exp2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2314s] [ 38%] 2025-12-04T10:24:47.8915504Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_gammainc_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1531s] [ 39%] 2025-12-04T10:24:47.8916444Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_laguerre_polynomial_l_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1359s] [ 40%] 2025-12-04T10:24:47.8917556Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_modified_bessel_i1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.2407s] [ 41%] 2025-12-04T10:24:47.8918480Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_modified_bessel_k1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1335s] [ 42%] 2025-12-04T10:24:47.8919521Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1367s] [ 43%] 2025-12-04T10:24:47.8920379Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_sinc_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3471s] [ 44%] 2025-12-04T10:24:47.8921317Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_spherical_bessel_j0_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1355s] [ 45%] 2025-12-04T10:24:47.8922184Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_xlogy_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3557s] [ 46%] 2025-12-04T10:24:47.8922994Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_prod_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.8717s] [ 47%] 2025-12-04T10:24:47.8923851Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reduction3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3122s] [ 48%] 2025-12-04T10:24:47.8924748Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reduction_config_limit_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0033s] [ 50%] 2025-12-04T10:24:47.8925729Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reflection_pad2d_backward_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [9.1915s] [ 51%] 2025-12-04T10:24:47.8926629Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reinterpret_dtypeview_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1586s] [ 52%] 2025-12-04T10:24:47.8927511Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_copy_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7955s] [ 53%] 2025-12-04T10:24:47.8928387Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_slice1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3856s] [ 54%] 2025-12-04T10:24:47.8929292Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_slice_scatter_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3556s] [ 55%] 2025-12-04T10:24:47.8930209Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_view_default_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3794s] [ 56%] 2025-12-04T10:24:47.8931205Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5085s] [ 57%] 2025-12-04T10:24:47.8932093Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_interleave_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.2028s] [ 58%] 2025-12-04T10:24:47.8933027Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_replication_pad_errors_with_bool_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3593s] [ 59%] 2025-12-04T10:24:47.8933921Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_roi_align_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.3177s] [ 60%] 2025-12-04T10:24:47.8935007Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_round_correctness_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (need to debug tl.libdevice on A100/V100) [ 61%] 2025-12-04T10:24:47.8935861Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_round_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3651s] [ 62%] 2025-12-04T10:24:47.8936689Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scatter6_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7307s] [ 63%] 2025-12-04T10:24:47.8937686Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scatter_add1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Flaky test, needs debugging) [ 64%] 2025-12-04T10:24:47.8938971Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Does not support SDPA or pre-SM80 hardware) [ 65%] 2025-12-04T10:24:47.8940179Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Does not support SDPA or pre-SM80 hardware) [ 66%] 2025-12-04T10:24:47.8941120Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_unaligned_mask_freezing_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4132s] [ 67%] 2025-12-04T10:24:47.8942036Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_setitem_with_int_parameter_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4659s] [ 68%] 2025-12-04T10:24:47.8942936Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_shape_padding_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.2620s] [ 69%] 2025-12-04T10:24:47.8943838Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_should_pad_bench_for_bmm_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.0042s] [ 70%] 2025-12-04T10:24:47.8944681Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sign_dtype_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3672s] [ 71%] 2025-12-04T10:24:47.8945491Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1641s] [ 72%] 2025-12-04T10:24:47.8946356Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_mutation1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7149s] [ 73%] 2025-12-04T10:24:47.8947226Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter3_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3151s] [ 75%] 2025-12-04T10:24:47.8948084Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter4_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3003s] [ 76%] 2025-12-04T10:24:47.8948948Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sort_transpose_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1666s] [ 77%] 2025-12-04T10:24:47.8949819Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_cumsum_index_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.0126s] [ 78%] 2025-12-04T10:24:47.8950693Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.3160s] [ 79%] 2025-12-04T10:24:47.8951631Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_reduction_dynamic_shape_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4592s] [ 80%] 2025-12-04T10:24:47.8952633Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_with_sizes_with_unbacked_symints_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.9934s] [ 81%] 2025-12-04T10:24:47.8953565Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_with_unbacked_symints_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7441s] [ 82%] 2025-12-04T10:24:47.8954493Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_strided_inputs_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 83%] 2025-12-04T10:24:47.8955399Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sum_keepdims_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3295s] [ 84%] 2025-12-04T10:24:47.8956283Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_tensor2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1740s] [ 85%] 2025-12-04T10:24:47.8957182Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_tensor_index_put_slice_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [6.9600s] [ 86%] 2025-12-04T10:24:47.8958071Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_to_device_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 87%] 2025-12-04T10:24:47.8958972Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_to_memory_format_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.3826s] [ 88%] 2025-12-04T10:24:47.8959781Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_topk_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1567s] [ 89%] 2025-12-04T10:24:47.8960590Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_uint_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.1619s] [ 90%] 2025-12-04T10:24:47.8961415Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unbind_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1728s] [ 91%] 2025-12-04T10:24:47.8962292Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_int64_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5856s] [ 92%] 2025-12-04T10:24:47.8963182Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_int8_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.5041s] [ 93%] 2025-12-04T10:24:47.8964067Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_upsample_bicubic2d_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [8.2939s] [ 94%] 2025-12-04T10:24:47.8965159Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_upsample_cat_conv_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (only support cpu upsample_cat_conv test) [ 95%] 2025-12-04T10:24:47.8966052Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_vectorized_ops_masked_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.4297s] [ 96%] 2025-12-04T10:24:47.8966980Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_vertical_fusion1_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.7069s] [ 97%] 2025-12-04T10:24:47.8967809Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_views5_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py XFAIL [0.1684s] [ 98%] 2025-12-04T10:24:47.8968732Z inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_xblock_divides_xnumel_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [0.3917s] [100%] 2025-12-04T10:24:47.8968738Z 2025-12-04T10:24:47.8969768Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-7f9a6ec5b091c39f.xml - 2025-12-04T10:24:47.8970026Z ==== 65 passed, 12 skipped, 343 deselected, 19 xfailed in 98.81s (0:01:38) ===== 2025-12-04T10:24:47.8970805Z The following tests failed consistently: ['test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf_dynamic_shapes_cuda'] 2025-12-04T10:24:47.8970818Z 2025-12-04T10:24:47.8971706Z FINISHED PRINTING LOG FILE of inductor/test_torchinductor_codegen_dynamic_shapes 3/4 (test/test-reports/inductor.test_torchinductor_codegen_dynamic_shapes_3.4_8476a077570d9691_.log) 2025-12-04T10:24:47.8971717Z 2025-12-04T10:24:47.8972253Z Finished inductor/test_torchinductor_codegen_dynamic_shapes 3/4 ... [2025-12-04 10:24:47.691160][4291.812372178], took 9.27min 2025-12-04T10:24:47.8973329Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-3d2f5f1cdf9aac8a.xml 2025-12-04T10:24:47.8974388Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-53adf85a57cb1562.xml 2025-12-04T10:24:47.8975502Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-ae6d18145ba692e1.xml 2025-12-04T10:24:47.8976560Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-7f9a6ec5b091c39f.xml 2025-12-04T10:24:48.0430054Z Uploading logs for 57119749259 to S3 2025-12-04T10:24:48.0932482Z Uploading artifacts took 0.22 seconds 2025-12-04T10:24:48.0932847Z inductor/test_torchinductor_codegen_dynamic_shapes 3/4 failed! 2025-12-04T10:24:48.0937442Z Running inductor/test_torchinductor_opinfo 4/17 ... [2025-12-04 10:24:48.093579][4292.214800093] 2025-12-04T10:24:48.0937638Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:24:48.0942097Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=4', '--num-shards=17', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:24:48.093967] 2025-12-04T10:32:44.4175110Z 2025-12-04T10:32:44.4176840Z inductor/test_torchinductor_opinfo 4/17 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_4.17_3a9725e48b5f31f6_.log 2025-12-04T10:32:44.4288968Z Running 185 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___radd___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___ror___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rxor___cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__native_batch_norm_legit_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acos_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_arange_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argsort_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_copy_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_asin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_baddbmm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bfloat16_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_xor_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bool_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bucketize_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bucketize_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ceil_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_chunk_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_max_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_min_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_min_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_conj_physical_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_contiguous_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_corrcoef_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cos_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumulative_trapezoid_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_deg2rad_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_embed_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagflat_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_digamma_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_digamma_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dist_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_trunc_rounding_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dsplit_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_einsum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_permuted_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erf_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfc_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfinv_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_as_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_copy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_copy_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expm1_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftshift_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_power_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_power_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmin_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_frexp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_full_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gather_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_geqrf_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gt_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_half_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hsplit_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hstack_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_prod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isfinite_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isfinite_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_item_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kthvalue_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vecdot_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vector_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linspace_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log10_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_softmax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_softmax_with_dtype_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_not_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logspace_tensor_overload_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lu_solve_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mH_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumsum_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_scatter_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_minimum_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_movedim_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mul_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nan_to_num_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nan_to_num_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nansum_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_strided_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_alpha_dropout_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_bilinear_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_celu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_elu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_gaussian_nll_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_grid_sample_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardshrink_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_pool2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool1d_grad_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_prelu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softsign_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_upsample_bilinear_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_fro_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_nuc_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polar_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_positive_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_put_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_put_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scalar_tensor_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_amax_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sigmoid_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_kaiser_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signbit_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_erfcx_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i0e_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i1_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtr_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_spherical_bessel_j0_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_xlog1py_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_zeta_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_list_args_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_copy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_multiple_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_stack_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sub_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_take_along_dim_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensor_split_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_consecutive_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_cuda_uint32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_chunk_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_split_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vstack_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_where_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zeros_cuda_float64 2025-12-04T10:32:44.4398026Z 2025-12-04T10:32:44.4398546Z Finished inductor/test_torchinductor_opinfo 4/17 ... [2025-12-04 10:32:44.417864][4768.539083282], took 7.94min 2025-12-04T10:32:44.4400015Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-b99623f068a7d8ad.xml 2025-12-04T10:32:44.5032094Z Running inductor/test_torchinductor_opinfo 9/17 ... [2025-12-04 10:32:44.502970][4768.624191169] 2025-12-04T10:32:44.5032732Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:32:44.5035771Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=9', '--num-shards=17', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:32:44.503316] 2025-12-04T10:41:40.8921992Z 2025-12-04T10:41:40.8923482Z inductor/test_torchinductor_opinfo 9/17 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_9.17_99683e4d5a5c32ee_.log 2025-12-04T10:41:40.9054622Z Running 219 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___radd___cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rmatmul___cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rmod___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rpow___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_abs_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acos_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addmv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addr_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addr_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_alias_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amax_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argsort_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argwhere_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_partial_views_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_asinh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atanh_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bernoulli_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_and_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_right_shift_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_xor_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cdouble_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cfloat_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_complex_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cosh_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cosh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cross_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumprod_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumsum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagflat_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_floor_rounding_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_like_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_permuted_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erf_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfc_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft2_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftshift_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft2_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfftn_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfftn_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_floor_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_floor_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_full_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gcd_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_grid_sampler_2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_grid_sampler_3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gt_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hash_tensor_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_histc_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hsplit_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_igammac_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_igammac_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_add_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_add_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_add_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_fill_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_fill_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_inner_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isinf_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isposinf_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_item_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_item_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_unary_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_unary_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kthvalue_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_det_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_diagonal_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_eigh_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_pinv_hermitian_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_tensorinv_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_tensorinv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linspace_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log10_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_softmax_with_dtype_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logaddexp_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_and_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_and_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logspace_tensor_overload_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logsumexp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_median_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_prod_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_var_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_binary_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_no_dim_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_meshgrid_variadic_tensors_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_binary_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_with_dim_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_minimum_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_multinomial_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nanmedian_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nansum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_native_dropout_backward_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_neg_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_ones_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_alpha_dropout_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_avg_pool3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_bilinear_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_embedding_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_gelu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardtanh_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_interpolate_area_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_interpolate_nearest-exact_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_kl_div_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_kl_div_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_mse_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_multilabel_margin_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_scaled_dot_product_attention_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softmin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softshrink_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nonzero_static_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_inf_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ormqr_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_outer_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_permute_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_prod_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_quantile_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rad2deg_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rand_like_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ravel_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_as_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resize__cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_conj_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_roll_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_roll_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rsub_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_amin_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sign_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_bartlett_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_cosine_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_gaussian_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_hamming_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinh_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_slice_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_erfcx_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i0_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i1_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_scaled_modified_bessel_k0_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_zeta_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_copy_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tan_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tan_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensor_split_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tile_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_to_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unflatten_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_chunk_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsqueeze_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vdot_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vstack_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_where_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_xlogy_cuda_float16 2025-12-04T10:41:40.9183344Z 2025-12-04T10:41:40.9183767Z Finished inductor/test_torchinductor_opinfo 9/17 ... [2025-12-04 10:41:40.892546][5305.013763495], took 8.94min 2025-12-04T10:41:40.9185244Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-5a150b3a059b9a30.xml 2025-12-04T10:41:40.9800035Z Running inductor/test_torchinductor_opinfo 14/17 ... [2025-12-04 10:41:40.979758][5305.100979646] 2025-12-04T10:41:40.9800668Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:41:40.9803970Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=14', '--num-shards=17', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:41:40.980126] 2025-12-04T10:49:17.8102882Z 2025-12-04T10:49:17.8103974Z inductor/test_torchinductor_opinfo 14/17 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_14.17_f7d5878415016f45_.log 2025-12-04T10:49:17.8227003Z Running 206 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rmod___cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rxor___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__segment_reduce_offsets_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__softmax_backward_data_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_add_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addcmul_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addcmul_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_aminmax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_any_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atan_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atanh_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_1d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_1d_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_2d_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bincount_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_tensors_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bucketize_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_byte_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cartesian_prod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cartesian_prod_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cauchy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_char_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_char_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_min_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumsum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_deg2rad_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_deg2rad_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_embed_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_embed_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_scatter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_trunc_rounding_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_strided_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_equal_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfc_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_exponential_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfftn_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfftn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftshift_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfft2_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfftn_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft2_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfftn_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fill_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_power_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_floor_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gather_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_half_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hash_tensor_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_histc_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_histc_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hstack_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_i0_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amin_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isclose_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isnan_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isposinf_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isposinf_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_item_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_unary_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kron_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cross_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_householder_product_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_factor_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_solve_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_svdvals_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linspace_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_softmax_with_dtype_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logspace_tensor_overload_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logsumexp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logsumexp_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_fill_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_log_softmax_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_logsumexp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_median_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_select_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_select_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_var_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_matrix_exp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_binary_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_maximum_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mean_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_binary_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_no_dim_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_multinomial_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_strided_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_ones_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_avg_pool1d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_avg_pool2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_max_pool2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_similarity_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_dropout3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_dropout_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_elu_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardtanh_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_interpolate_bicubic_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_l1_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_pool3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_normalize_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_circular_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_constant_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pairwise_distance_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_relu6_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nonzero_static_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_normal_number_mean_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_like_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_pinverse_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_put_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_put_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_remainder_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_renorm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_repeat_interleave_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resize_as__cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_neg_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_sum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sigmoid_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sign_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sign_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_general_hamming_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_nuttall_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_softmax_with_dtype_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sparse_sampled_addmm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_erfcx_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i0e_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i1_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i0_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i1_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtr_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtr_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_square_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sub_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_take_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tile_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__efficient_attention_forward_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__flash_attention_forward_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trace_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_copy_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_uniform_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_split_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsqueeze_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsqueeze_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_xlogy_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zeros_cuda_uint8 2025-12-04T10:49:17.8350663Z 2025-12-04T10:49:17.8351100Z Finished inductor/test_torchinductor_opinfo 14/17 ... [2025-12-04 10:49:17.810060][5761.931278021], took 7.61min 2025-12-04T10:49:17.8352701Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-a7e0d08ca6140a5e.xml 2025-12-04T10:49:18.2579947Z Uploading artifacts took 0.35 seconds 2025-12-04T10:49:18.2585091Z Running dynamo/test_activation_checkpointing 1/1 ... [2025-12-04 10:49:18.258308][5762.379528929] 2025-12-04T10:49:18.2585747Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:49:18.2589697Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_activation_checkpointing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:49:18.258720] 2025-12-04T10:51:07.5712721Z 2025-12-04T10:51:07.5714187Z PRINTING LOG FILE of dynamo/test_activation_checkpointing 1/1 (test/test-reports/dynamo.test_activation_checkpointing_1.1_2178259ee06de350_.log) 2025-12-04T10:51:07.5716388Z Test results will be stored in test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-102e4c3a904c277d.xml 2025-12-04T10:51:07.5718040Z ============================= test session starts ============================== 2025-12-04T10:51:07.5719161Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:51:07.5720178Z cachedir: .pytest_cache 2025-12-04T10:51:07.5721125Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:51:07.5722445Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:51:07.5722808Z configfile: pytest.ini 2025-12-04T10:51:07.5723585Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:51:07.5724959Z collecting ... collected 64 items 2025-12-04T10:51:07.5725533Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:51:07.5773614Z Running 64 items in this shard: test/dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_simple_forward_backward, test/dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_with_no_annotations_warns_and_returns_unchanged, test/dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_with_rng_ops_raises_error, test/dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_with_selective_checkpoint_policy, test/dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_joint_graph_passes_permute_optimization, test/dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_joint_graph_passes_view_optimization, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_autocast_flash_attention_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_custom_rule_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_custom_rule_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_inplace_op_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_inplace_op_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_invalid_context_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_invalid_context_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_list_ops_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_list_ops_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_no_functionalization_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_no_functionalization_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_recompute_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_recompute_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_outplace_op_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_outplace_op_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_parametrization_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_parametrization_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_partial_ctx_fn_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_partial_ctx_fn_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_random_op_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_random_op_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_tensor_subclass_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_tensor_subclass_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_triton_kernel_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_triton_kernel_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_distributed_utils_checkpoint_wrapper_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_dynamo_does_not_trace_getattr_as_top_frame_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_error_msg_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_fallback_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_kwargs_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_list_inputs_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_nonlocal_list_mutation_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_nonlocal_list_mutation_hidden_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_nonlocal_mutation_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_return_same_element_twice_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_sac_with_partial_context_fn_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_symints_location_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_decomps_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_decomps_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_dropout_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_via_global_checkpoint_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_via_global_checkpoint_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_with_kwargs_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_with_kwargs_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_module_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_module_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_multiple_checkpoints_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_multiple_checkpoints_partition_fn1_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_must_save_tensor_that_has_backward_hook_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_rand_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_recomputed_rand_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_sequential_layers_partition_fn0_cuda, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_sequential_layers_partition_fn1_cuda 2025-12-04T10:51:07.5821853Z 2025-12-04T10:51:07.5822515Z dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_simple_forward_backward PASSED [0.3610s] [ 1%] 2025-12-04T10:51:07.5824070Z dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_with_no_annotations_warns_and_returns_unchanged PASSED [0.1496s] [ 3%] 2025-12-04T10:51:07.5825629Z dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_with_rng_ops_raises_error PASSED [0.0547s] [ 4%] 2025-12-04T10:51:07.5827364Z dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_ac_rematerialize_with_selective_checkpoint_policy W1204 10:49:29.175000 81418 site-packages/torch/_logging/_internal.py:1204] [0/0] 2025-12-04T10:51:07.5829110Z W1204 10:49:29.175000 81418 site-packages/torch/_logging/_internal.py:1204] [0/0] Detected that context_fn is passed to torch.utils.checkpoint under torch.compile. 2025-12-04T10:51:07.5830579Z W1204 10:49:29.175000 81418 site-packages/torch/_logging/_internal.py:1204] [0/0] Please make sure the checkpointed region does not contain in-place ops (e.g. torch.relu_). 2025-12-04T10:51:07.5831675Z W1204 10:49:29.175000 81418 site-packages/torch/_logging/_internal.py:1204] [0/0] 2025-12-04T10:51:07.5832190Z PASSED [0.2460s] [ 6%] 2025-12-04T10:51:07.5833026Z dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_joint_graph_passes_permute_optimization PASSED [0.2911s] [ 7%] 2025-12-04T10:51:07.5834500Z dynamo/test_activation_checkpointing.py::RematerializeACNodesPassTests::test_joint_graph_passes_view_optimization PASSED [0.0965s] [ 9%] 2025-12-04T10:51:07.5835920Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_autocast_flash_attention_cuda PASSED [2.7547s] [ 10%] 2025-12-04T10:51:07.5837975Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_custom_rule_partition_fn0_cuda W1204 10:49:33.659000 81418 site-packages/torch/_inductor/utils.py:1703] [1/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:51:07.5839466Z PASSED [3.1488s] [ 12%] 2025-12-04T10:51:07.5840471Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_custom_rule_partition_fn1_cuda PASSED [0.8822s] [ 14%] 2025-12-04T10:51:07.5843033Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_inplace_op_partition_fn0_cuda SKIPPED [0.0004s] (In-place op support in selective checkpointing + torch.compile requires TorchDispatchMode + torch.compile work to complete) [ 15%] 2025-12-04T10:51:07.5846016Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_inplace_op_partition_fn1_cuda SKIPPED [0.0002s] (In-place op support in selective checkpointing + torch.compile requires TorchDispatchMode + torch.compile work to complete) [ 17%] 2025-12-04T10:51:07.5848399Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_invalid_context_partition_fn0_cuda PASSED [0.0521s] [ 18%] 2025-12-04T10:51:07.5850191Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_invalid_context_partition_fn1_cuda PASSED [0.0434s] [ 20%] 2025-12-04T10:51:07.5852048Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_list_ops_partition_fn0_cuda PASSED [2.2074s] [ 21%] 2025-12-04T10:51:07.5853786Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_list_ops_partition_fn1_cuda PASSED [0.6802s] [ 23%] 2025-12-04T10:51:07.5855763Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_no_functionalization_partition_fn0_cuda PASSED [1.6293s] [ 25%] 2025-12-04T10:51:07.5857855Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_no_functionalization_partition_fn1_cuda PASSED [0.7942s] [ 26%] 2025-12-04T10:51:07.5859902Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_partition_fn0_cuda PASSED [0.7914s] [ 28%] 2025-12-04T10:51:07.5861780Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_not_recompute_gemm_partition_fn1_cuda PASSED [0.8033s] [ 29%] 2025-12-04T10:51:07.5863616Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_recompute_partition_fn0_cuda PASSED [0.2282s] [ 31%] 2025-12-04T10:51:07.5865404Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_must_recompute_partition_fn1_cuda PASSED [0.2213s] [ 32%] 2025-12-04T10:51:07.5867163Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_outplace_op_partition_fn0_cuda PASSED [2.9677s] [ 34%] 2025-12-04T10:51:07.5868920Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_outplace_op_partition_fn1_cuda PASSED [0.8732s] [ 35%] 2025-12-04T10:51:07.5870754Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_parametrization_partition_fn0_cuda PASSED [0.1495s] [ 37%] 2025-12-04T10:51:07.5872569Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_parametrization_partition_fn1_cuda PASSED [0.1306s] [ 39%] 2025-12-04T10:51:07.5874359Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_partial_ctx_fn_partition_fn0_cuda PASSED [0.8003s] [ 40%] 2025-12-04T10:51:07.5876140Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_partial_ctx_fn_partition_fn1_cuda PASSED [0.7974s] [ 42%] 2025-12-04T10:51:07.5877898Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_random_op_partition_fn0_cuda PASSED [2.3218s] [ 43%] 2025-12-04T10:51:07.5879641Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_random_op_partition_fn1_cuda PASSED [1.9912s] [ 45%] 2025-12-04T10:51:07.5881412Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_tensor_subclass_partition_fn0_cuda PASSED [1.8736s] [ 46%] 2025-12-04T10:51:07.5883197Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_tensor_subclass_partition_fn1_cuda PASSED [1.1584s] [ 48%] 2025-12-04T10:51:07.5884981Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_triton_kernel_partition_fn0_cuda PASSED [2.6941s] [ 50%] 2025-12-04T10:51:07.5886762Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_compile_selective_checkpoint_triton_kernel_partition_fn1_cuda PASSED [1.4607s] [ 51%] 2025-12-04T10:51:07.5888737Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_distributed_utils_checkpoint_wrapper_cuda PASSED [0.0772s] [ 53%] 2025-12-04T10:51:07.5890345Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_dynamo_does_not_trace_getattr_as_top_frame_cuda PASSED [0.0884s] [ 54%] 2025-12-04T10:51:07.5891845Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_error_msg_cuda PASSED [0.0168s] [ 56%] 2025-12-04T10:51:07.5893146Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_fallback_cuda PASSED [0.1464s] [ 57%] 2025-12-04T10:51:07.5894434Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_kwargs_cuda PASSED [0.1016s] [ 59%] 2025-12-04T10:51:07.5895733Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_list_inputs_cuda PASSED [0.0597s] [ 60%] 2025-12-04T10:51:07.5897103Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_nonlocal_list_mutation_cuda PASSED [0.0547s] [ 62%] 2025-12-04T10:51:07.5898632Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_nonlocal_list_mutation_hidden_cuda PASSED [0.1232s] [ 64%] 2025-12-04T10:51:07.5900077Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_nonlocal_mutation_cuda PASSED [0.0474s] [ 65%] 2025-12-04T10:51:07.5901542Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda ('RERUN', {'yellow': True}) [0.9376s] [ 67%] 2025-12-04T10:51:07.5903081Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda ('RERUN', {'yellow': True}) [0.5679s] [ 67%] 2025-12-04T10:51:07.5904537Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda FAILED [0.5649s] [ 67%] 2025-12-04T10:51:07.5905345Z 2025-12-04T10:51:07.5905497Z ==================================== RERUNS ==================================== 2025-12-04T10:51:07.5906137Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.5906724Z Traceback (most recent call last): 2025-12-04T10:51:07.5907503Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.5908273Z count_ops( 2025-12-04T10:51:07.5908878Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.5909582Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.5909970Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.5910222Z 2025-12-04T10:51:07.5910227Z 2025-12-04T10:51:07.5910232Z 2025-12-04T10:51:07.5910476Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.5910993Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.5912437Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.5914018Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.5914638Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.5915037Z 2025-12-04T10:51:07.5915888Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.5916759Z 2025-12-04T10:51:07.5916979Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.5918047Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.5918889Z 2025-12-04T10:51:07.5919224Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.5919868Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.5920427Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.5920941Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.5922611Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 4), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_hit', 2)] 2025-12-04T10:51:07.5924160Z graph_break [] 2025-12-04T10:51:07.5924656Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.5925261Z Traceback (most recent call last): 2025-12-04T10:51:07.5926021Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.5926785Z count_ops( 2025-12-04T10:51:07.5927392Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.5928104Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.5928498Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.5928761Z 2025-12-04T10:51:07.5928766Z 2025-12-04T10:51:07.5928771Z 2025-12-04T10:51:07.5929007Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.5929534Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.5930951Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.5932557Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.5933180Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.5933598Z 2025-12-04T10:51:07.5934442Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.5935318Z 2025-12-04T10:51:07.5935534Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.5936603Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.5937443Z 2025-12-04T10:51:07.5937725Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.5938429Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.5938970Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.5939493Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.5941172Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 4), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_hit', 2)] 2025-12-04T10:51:07.5942713Z graph_break [] 2025-12-04T10:51:07.5943092Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.5943618Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.5944130Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.5945229Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('async_compile_cache_miss', 4), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_hit', 2)] 2025-12-04T10:51:07.5946313Z graph_break [] 2025-12-04T10:51:07.5946621Z =================================== FAILURES =================================== 2025-12-04T10:51:07.5947244Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.5947880Z Traceback (most recent call last): 2025-12-04T10:51:07.5948648Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.5949410Z count_ops( 2025-12-04T10:51:07.5950001Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.5950716Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.5951106Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.5951358Z 2025-12-04T10:51:07.5951363Z 2025-12-04T10:51:07.5951367Z 2025-12-04T10:51:07.5951614Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.5952129Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.5953561Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.5955124Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.5955749Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.5956154Z 2025-12-04T10:51:07.5957009Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.5957866Z 2025-12-04T10:51:07.5958095Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.5959155Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.5960044Z 2025-12-04T10:51:07.5960313Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.5960947Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.5961467Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.5961965Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.5963639Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('async_compile_cache_miss', 4), ('benchmarking.InductorBenchmarker.benchmark', 4), ('benchmarking.InductorBenchmarker.benchmark_gpu', 4), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_hit', 2)] 2025-12-04T10:51:07.5965190Z graph_break [] 2025-12-04T10:51:07.5965566Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.5966081Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.5966597Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.5967708Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('async_compile_cache_miss', 4), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_hit', 2)] 2025-12-04T10:51:07.5968707Z graph_break [] 2025-12-04T10:51:07.5969072Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.5969592Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.5970101Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.5971215Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('async_compile_cache_miss', 4), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_hit', 2)] 2025-12-04T10:51:07.5972198Z graph_break [] 2025-12-04T10:51:07.5973308Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-102e4c3a904c277d.xml - 2025-12-04T10:51:07.5974488Z =========================== short test summary info ============================ 2025-12-04T10:51:07.5975613Z FAILED [0.5649s] dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda - AssertionError: In graph GraphModule() 2025-12-04T10:51:07.5976531Z 2025-12-04T10:51:07.5976536Z 2025-12-04T10:51:07.5976540Z 2025-12-04T10:51:07.5976778Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.5977305Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.5978925Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.5980507Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.5981116Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.5981535Z 2025-12-04T10:51:07.5982396Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.5983253Z 2025-12-04T10:51:07.5983483Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.5984536Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.5985389Z 2025-12-04T10:51:07.5985659Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.5986295Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:51:07.5986836Z ============== 1 failed, 40 passed, 2 skipped, 2 rerun in 35.56s =============== 2025-12-04T10:51:07.5987280Z Got exit code 1 2025-12-04T10:51:07.5987547Z Retrying single test... 2025-12-04T10:51:07.5988606Z Test results will be stored in test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-ea0dea10ce3f91a8.xml 2025-12-04T10:51:07.5989585Z ============================= test session starts ============================== 2025-12-04T10:51:07.5990252Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:51:07.5990861Z cachedir: .pytest_cache 2025-12-04T10:51:07.5991579Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:51:07.5992357Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:51:07.5992713Z configfile: pytest.ini 2025-12-04T10:51:07.5993454Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:51:07.5994345Z collecting ... collected 64 items / 63 deselected / 1 selected 2025-12-04T10:51:07.5995496Z stepcurrent: skipping 42 already run items. Running only test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda 2025-12-04T10:51:07.5996543Z Running 1 items in this shard 2025-12-04T10:51:07.5996755Z 2025-12-04T10:51:07.5997477Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda ('RERUN', {'yellow': True}) [3.2766s] [100%] 2025-12-04T10:51:07.5999016Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda ('RERUN', {'yellow': True}) [0.5040s] [100%] 2025-12-04T10:51:07.6000475Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda FAILED [0.5039s] [100%] 2025-12-04T10:51:07.6001235Z 2025-12-04T10:51:07.6002313Z ==================================== RERUNS ==================================== 2025-12-04T10:51:07.6002978Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.6003631Z Traceback (most recent call last): 2025-12-04T10:51:07.6004401Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.6005170Z count_ops( 2025-12-04T10:51:07.6005776Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.6006479Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.6006873Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6007127Z 2025-12-04T10:51:07.6007132Z 2025-12-04T10:51:07.6007136Z 2025-12-04T10:51:07.6007385Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6007917Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6009342Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6010917Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6011561Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6011981Z 2025-12-04T10:51:07.6012821Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6013695Z 2025-12-04T10:51:07.6013913Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6014983Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6015893Z 2025-12-04T10:51:07.6016177Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6016803Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6017333Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6018353Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6019356Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6019808Z graph_break [] 2025-12-04T10:51:07.6020306Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.6020916Z Traceback (most recent call last): 2025-12-04T10:51:07.6021675Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.6022454Z count_ops( 2025-12-04T10:51:07.6023068Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.6023783Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.6024178Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6024445Z 2025-12-04T10:51:07.6024449Z 2025-12-04T10:51:07.6024454Z 2025-12-04T10:51:07.6024689Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6025220Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6026639Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6028195Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6028894Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6029303Z 2025-12-04T10:51:07.6030140Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6031043Z 2025-12-04T10:51:07.6031260Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6032326Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6033166Z 2025-12-04T10:51:07.6033447Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6034066Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6034592Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6035528Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6036527Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6036977Z graph_break [] 2025-12-04T10:51:07.6037354Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6037878Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6038380Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6039370Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6040245Z graph_break [] 2025-12-04T10:51:07.6040553Z =================================== FAILURES =================================== 2025-12-04T10:51:07.6041171Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.6041817Z Traceback (most recent call last): 2025-12-04T10:51:07.6042584Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.6043354Z count_ops( 2025-12-04T10:51:07.6043947Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.6044665Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.6045060Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6045315Z 2025-12-04T10:51:07.6045319Z 2025-12-04T10:51:07.6045324Z 2025-12-04T10:51:07.6045561Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6046093Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6047527Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6049095Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6049706Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6050121Z 2025-12-04T10:51:07.6050975Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6051831Z 2025-12-04T10:51:07.6052059Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6053110Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6053959Z 2025-12-04T10:51:07.6054226Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6054859Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6055441Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6056358Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6057378Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6057837Z graph_break [] 2025-12-04T10:51:07.6058212Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6058798Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6059306Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6060297Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6061160Z graph_break [] 2025-12-04T10:51:07.6061538Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6062057Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6062565Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6063541Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6064411Z graph_break [] 2025-12-04T10:51:07.6065437Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-ea0dea10ce3f91a8.xml - 2025-12-04T10:51:07.6066620Z =========================== short test summary info ============================ 2025-12-04T10:51:07.6067711Z FAILED [0.5039s] dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda - AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6068677Z 2025-12-04T10:51:07.6068682Z 2025-12-04T10:51:07.6068687Z 2025-12-04T10:51:07.6068921Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6069450Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6070875Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6072432Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6073059Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6073475Z 2025-12-04T10:51:07.6074332Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6075191Z 2025-12-04T10:51:07.6075412Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6076474Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6077329Z 2025-12-04T10:51:07.6077596Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6078190Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:51:07.6078707Z ================== 1 failed, 63 deselected, 2 rerun in 4.32s =================== 2025-12-04T10:51:07.6079152Z Got exit code 1 2025-12-04T10:51:07.6079421Z Retrying single test... 2025-12-04T10:51:07.6080276Z Test results will be stored in test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-70230936f8120c6b.xml 2025-12-04T10:51:07.6081264Z ============================= test session starts ============================== 2025-12-04T10:51:07.6082015Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:51:07.6082624Z cachedir: .pytest_cache 2025-12-04T10:51:07.6083369Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:51:07.6084169Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:51:07.6084528Z configfile: pytest.ini 2025-12-04T10:51:07.6085250Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:51:07.6086152Z collecting ... collected 64 items / 63 deselected / 1 selected 2025-12-04T10:51:07.6087306Z stepcurrent: skipping 42 already run items. Running only test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda 2025-12-04T10:51:07.6088533Z Running 1 items in this shard 2025-12-04T10:51:07.6088747Z 2025-12-04T10:51:07.6089464Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda ('RERUN', {'yellow': True}) [3.2960s] [100%] 2025-12-04T10:51:07.6091020Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda ('RERUN', {'yellow': True}) [0.5048s] [100%] 2025-12-04T10:51:07.6092484Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda FAILED [0.5062s] [100%] 2025-12-04T10:51:07.6093228Z 2025-12-04T10:51:07.6093389Z ==================================== RERUNS ==================================== 2025-12-04T10:51:07.6094009Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.6094620Z Traceback (most recent call last): 2025-12-04T10:51:07.6095395Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.6096238Z count_ops( 2025-12-04T10:51:07.6096839Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.6097565Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.6097963Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6098215Z 2025-12-04T10:51:07.6098220Z 2025-12-04T10:51:07.6098224Z 2025-12-04T10:51:07.6098543Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6099064Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6100487Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6102053Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6102674Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6103076Z 2025-12-04T10:51:07.6103936Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6104794Z 2025-12-04T10:51:07.6105024Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6106077Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6106932Z 2025-12-04T10:51:07.6107201Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6107832Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6108358Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6109368Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6110370Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6110832Z graph_break [] 2025-12-04T10:51:07.6111372Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.6111970Z Traceback (most recent call last): 2025-12-04T10:51:07.6112736Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.6113504Z count_ops( 2025-12-04T10:51:07.6114098Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.6114816Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.6115207Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6115458Z 2025-12-04T10:51:07.6115462Z 2025-12-04T10:51:07.6115469Z 2025-12-04T10:51:07.6115715Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6116232Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6117653Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6119216Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6119838Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6120239Z 2025-12-04T10:51:07.6121096Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6121965Z 2025-12-04T10:51:07.6122182Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6123287Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6124130Z 2025-12-04T10:51:07.6124399Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6125038Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6125563Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6126495Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6127480Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6127943Z graph_break [] 2025-12-04T10:51:07.6128321Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6128830Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6129343Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6130334Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6131212Z graph_break [] 2025-12-04T10:51:07.6131499Z =================================== FAILURES =================================== 2025-12-04T10:51:07.6132128Z ______ ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda _______ 2025-12-04T10:51:07.6132728Z Traceback (most recent call last): 2025-12-04T10:51:07.6133486Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 1752, in test_pattern_matcher 2025-12-04T10:51:07.6134249Z count_ops( 2025-12-04T10:51:07.6134852Z File "/var/lib/jenkins/workspace/test/dynamo/test_activation_checkpointing.py", line 103, in count_ops 2025-12-04T10:51:07.6135572Z assert actual_count == freq, err_msg 2025-12-04T10:51:07.6136013Z AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6136279Z 2025-12-04T10:51:07.6136284Z 2025-12-04T10:51:07.6136288Z 2025-12-04T10:51:07.6136525Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6137087Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6138583Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6140136Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6140759Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6141174Z 2025-12-04T10:51:07.6142030Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6142895Z 2025-12-04T10:51:07.6143110Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6144171Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6145024Z 2025-12-04T10:51:07.6145295Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6145927Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6146437Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6147369Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6148360Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6148850Z graph_break [] 2025-12-04T10:51:07.6149233Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6149753Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6150264Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6151242Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6152118Z graph_break [] 2025-12-04T10:51:07.6152495Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:51:07.6153013Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:51:07.6153509Z aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('ok', 1)] 2025-12-04T10:51:07.6154496Z inductor [('pattern_matcher_nodes', 55), ('pattern_matcher_count', 12), ('extern_calls', 11), ('fuse_attention', 2), ('fxgraph_cache_bypass', 2), ('async_compile_cache_miss', 2)] 2025-12-04T10:51:07.6155378Z graph_break [] 2025-12-04T10:51:07.6156392Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-70230936f8120c6b.xml - 2025-12-04T10:51:07.6157583Z =========================== short test summary info ============================ 2025-12-04T10:51:07.6158684Z FAILED [0.5062s] dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda - AssertionError: In graph GraphModule() 2025-12-04T10:51:07.6159588Z 2025-12-04T10:51:07.6159592Z 2025-12-04T10:51:07.6159597Z 2025-12-04T10:51:07.6159848Z def forward(self, primals_1, primals_2, primals_3, fwd_rng_state_0): 2025-12-04T10:51:07.6160378Z sin = torch.ops.aten.sin.default(primals_1) 2025-12-04T10:51:07.6161855Z graphsafe_run_with_rng_state = torch.ops.higher_order.graphsafe_run_with_rng_state(torch.ops.aten._scaled_dot_product_efficient_attention.default, sin, primals_2, primals_3, None, True, scale = 0.17677669529663687, rng_state = fwd_rng_state_0); fwd_rng_state_0 = None 2025-12-04T10:51:07.6163431Z getitem = graphsafe_run_with_rng_state[0]; graphsafe_run_with_rng_state = None 2025-12-04T10:51:07.6164087Z return (getitem, primals_1, primals_2, primals_3, sin) 2025-12-04T10:51:07.6164507Z 2025-12-04T10:51:07.6165354Z # To see more debug info, please use `graph_module.print_readable()`, expected aten._scaled_dot_product_flash_attention.default to have occurred 1 times in the graph, but got 0. 2025-12-04T10:51:07.6166233Z 2025-12-04T10:51:07.6166450Z To execute this test, run the following from the base repo dir: 2025-12-04T10:51:07.6167522Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_activation_checkpointing.py ActivationCheckpointingViaTagsTestsCUDA.test_pattern_matcher_cuda 2025-12-04T10:51:07.6168366Z 2025-12-04T10:51:07.6168650Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:51:07.6169244Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:51:07.6169775Z ================== 1 failed, 63 deselected, 2 rerun in 4.34s =================== 2025-12-04T10:51:07.6170229Z Got exit code 1 2025-12-04T10:51:07.6171016Z FAILED CONSISTENTLY: test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda 2025-12-04T10:51:07.6172197Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T10:51:07.6173437Z Test results will be stored in test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-327d67e6c79eec23.xml 2025-12-04T10:51:07.6174425Z ============================= test session starts ============================== 2025-12-04T10:51:07.6175078Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:51:07.6175736Z cachedir: .pytest_cache 2025-12-04T10:51:07.6176455Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:51:07.6177240Z rootdir: /var/lib/jenkins/workspace 2025-12-04T10:51:07.6177587Z configfile: pytest.ini 2025-12-04T10:51:07.6178409Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:51:07.6179317Z collecting ... collected 64 items / 43 deselected / 21 selected 2025-12-04T10:51:07.6179803Z stepcurrent: skipping 43 already run items. 2025-12-04T10:51:07.6180193Z Running 21 items in this shard 2025-12-04T10:51:07.6180410Z 2025-12-04T10:51:07.6181086Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_return_same_element_twice_cuda PASSED [0.2458s] [ 4%] 2025-12-04T10:51:07.6182815Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_sac_with_partial_context_fn_cuda W1204 10:50:53.188000 82340 site-packages/torch/_logging/_internal.py:1204] [0/0] 2025-12-04T10:51:07.6184512Z W1204 10:50:53.188000 82340 site-packages/torch/_logging/_internal.py:1204] [0/0] Detected that context_fn is passed to torch.utils.checkpoint under torch.compile. 2025-12-04T10:51:07.6185987Z W1204 10:50:53.188000 82340 site-packages/torch/_logging/_internal.py:1204] [0/0] Please make sure the checkpointed region does not contain in-place ops (e.g. torch.relu_). 2025-12-04T10:51:07.6187090Z W1204 10:50:53.188000 82340 site-packages/torch/_logging/_internal.py:1204] [0/0] 2025-12-04T10:51:07.6187616Z PASSED [0.1322s] [ 9%] 2025-12-04T10:51:07.6188592Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_symints_location_cuda PASSED [1.1920s] [ 14%] 2025-12-04T10:51:07.6190018Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_decomps_partition_fn0_cuda PASSED [0.1632s] [ 19%] 2025-12-04T10:51:07.6191620Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_decomps_partition_fn1_cuda PASSED [0.1301s] [ 23%] 2025-12-04T10:51:07.6193472Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_dropout_cuda W1204 10:50:57.755000 82340 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T10:51:07.6194796Z PASSED [3.2263s] [ 28%] 2025-12-04T10:51:07.6195653Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_partition_fn0_cuda PASSED [0.1118s] [ 33%] 2025-12-04T10:51:07.6197126Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_partition_fn1_cuda PASSED [0.0977s] [ 38%] 2025-12-04T10:51:07.6198707Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_via_global_checkpoint_partition_fn0_cuda PASSED [0.0972s] [ 42%] 2025-12-04T10:51:07.6200415Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_via_global_checkpoint_partition_fn1_cuda PASSED [0.0943s] [ 47%] 2025-12-04T10:51:07.6202054Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_with_kwargs_partition_fn0_cuda PASSED [0.0999s] [ 52%] 2025-12-04T10:51:07.6203649Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_function_with_kwargs_partition_fn1_cuda PASSED [0.0985s] [ 57%] 2025-12-04T10:51:07.6205168Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_module_partition_fn0_cuda PASSED [0.1233s] [ 61%] 2025-12-04T10:51:07.6206616Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_module_partition_fn1_cuda PASSED [0.1159s] [ 66%] 2025-12-04T10:51:07.6208136Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_multiple_checkpoints_partition_fn0_cuda PASSED [0.1295s] [ 71%] 2025-12-04T10:51:07.6209792Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_multiple_checkpoints_partition_fn1_cuda PASSED [0.1250s] [ 76%] 2025-12-04T10:51:07.6211405Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_must_save_tensor_that_has_backward_hook_cuda PASSED [4.7831s] [ 80%] 2025-12-04T10:51:07.6212877Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_rand_cuda PASSED [0.6234s] [ 85%] 2025-12-04T10:51:07.6214219Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_recomputed_rand_cuda PASSED [0.8115s] [ 90%] 2025-12-04T10:51:07.6215715Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_sequential_layers_partition_fn0_cuda PASSED [0.1647s] [ 95%] 2025-12-04T10:51:07.6217285Z dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_tags_sequential_layers_partition_fn1_cuda PASSED [0.1592s] [100%] 2025-12-04T10:51:07.6218141Z 2025-12-04T10:51:07.6219087Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-327d67e6c79eec23.xml - 2025-12-04T10:51:07.6220298Z ====================== 21 passed, 43 deselected in 12.79s ====================== 2025-12-04T10:51:07.6221363Z The following tests failed consistently: ['test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTestsCUDA::test_pattern_matcher_cuda'] 2025-12-04T10:51:07.6222246Z 2025-12-04T10:51:07.6222913Z FINISHED PRINTING LOG FILE of dynamo/test_activation_checkpointing 1/1 (test/test-reports/dynamo.test_activation_checkpointing_1.1_2178259ee06de350_.log) 2025-12-04T10:51:07.6223728Z 2025-12-04T10:51:07.6224143Z Finished dynamo/test_activation_checkpointing 1/1 ... [2025-12-04 10:51:07.571519][5871.692735426], took 1.82min 2025-12-04T10:51:07.6225693Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-102e4c3a904c277d.xml 2025-12-04T10:51:07.6801854Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-ea0dea10ce3f91a8.xml 2025-12-04T10:51:07.7089546Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-70230936f8120c6b.xml 2025-12-04T10:51:07.7387654Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-327d67e6c79eec23.xml 2025-12-04T10:51:08.0161961Z Uploading logs for 57119749259 to S3 2025-12-04T10:51:08.0892919Z Uploading artifacts took 0.32 seconds 2025-12-04T10:51:08.0893393Z dynamo/test_activation_checkpointing 1/1 failed! 2025-12-04T10:51:08.0898316Z Running inductor/test_flex_attention 1/1 ... [2025-12-04 10:51:08.089662][5872.210883126] 2025-12-04T10:51:08.0899010Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:51:08.0902675Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_attention.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:51:08.090025] 2025-12-04T10:51:17.4671936Z 2025-12-04T10:51:17.4672882Z PRINTING LOG FILE of inductor/test_flex_attention 1/1 (test/test-reports/inductor.test_flex_attention_1.1_20245f741d40e72b_.log) 2025-12-04T10:51:17.4673715Z Traceback (most recent call last): 2025-12-04T10:51:17.4674357Z File "/var/lib/jenkins/workspace/test/inductor/test_flex_attention.py", line 447, in 2025-12-04T10:51:17.4675083Z @large_tensor_test_class("2GB", device=test_device[0]) 2025-12-04T10:51:17.4675744Z NameError: name 'test_device' is not defined 2025-12-04T10:51:17.4676122Z Got exit code 1 2025-12-04T10:51:17.4676732Z No stepcurrent file found. Either pytest didn't get to run (e.g. import error) or file got deleted (contact dev infra) 2025-12-04T10:51:17.4677309Z 2025-12-04T10:51:17.4677901Z FINISHED PRINTING LOG FILE of inductor/test_flex_attention 1/1 (test/test-reports/inductor.test_flex_attention_1.1_20245f741d40e72b_.log) 2025-12-04T10:51:17.4678605Z 2025-12-04T10:51:17.4678982Z Finished inductor/test_flex_attention 1/1 ... [2025-12-04 10:51:17.466974][5881.588196514], took 0.16min 2025-12-04T10:51:18.0457055Z Uploading logs for 57119749259 to S3 2025-12-04T10:51:18.1579507Z Uploading artifacts took 0.69 seconds 2025-12-04T10:51:18.1579940Z inductor/test_flex_attention 1/1 failed! 2025-12-04T10:51:18.1584630Z Running inductor/test_cuda_select_algorithm 1/5 ... [2025-12-04 10:51:18.158302][5882.27952331] 2025-12-04T10:51:18.1585251Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T10:51:18.1589376Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cuda_select_algorithm.py', '--shard-id=1', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:51:18.158680] 2025-12-04T11:07:12.9560541Z 2025-12-04T11:07:12.9561908Z PRINTING LOG FILE of inductor/test_cuda_select_algorithm 1/5 (test/test-reports/inductor.test_cuda_select_algorithm_1.5_fb546dd2d7cc6229_.log) 2025-12-04T11:07:12.9563915Z W1204 10:51:27.141000 82834 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:12.9565982Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b8b19f7b3ae0b98c.xml 2025-12-04T11:07:12.9567844Z ============================= test session starts ============================== 2025-12-04T11:07:12.9569007Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:12.9569644Z cachedir: .pytest_cache 2025-12-04T11:07:12.9570751Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:12.9571882Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:12.9572475Z configfile: pytest.ini 2025-12-04T11:07:12.9573632Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:12.9574906Z collecting ... collected 58 items 2025-12-04T11:07:12.9575616Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T11:07:12.9590921Z Running 11 items in this shard: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16, test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:12.9605971Z 2025-12-04T11:07:12.9607538Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [4.1628s] [ 9%] 2025-12-04T11:07:12.9610863Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.7724s] [ 9%] 2025-12-04T11:07:12.9614067Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 FAILED [0.7656s] [ 9%] 2025-12-04T11:07:12.9615715Z 2025-12-04T11:07:12.9615962Z ==================================== RERUNS ==================================== 2025-12-04T11:07:12.9617319Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:12.9618785Z Traceback (most recent call last): 2025-12-04T11:07:12.9620195Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:12.9621978Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:12.9623228Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:12.9624565Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:12.9626159Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:12.9627622Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:12.9628297Z AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9628561Z 2025-12-04T11:07:12.9628721Z Expected 3 but got 6. 2025-12-04T11:07:12.9629182Z Absolute difference: 3 2025-12-04T11:07:12.9629688Z Relative difference: 1.0 2025-12-04T11:07:12.9630020Z 2025-12-04T11:07:12.9630391Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9632317Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9633889Z 2025-12-04T11:07:12.9634283Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9635349Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9636182Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9637459Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9639017Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9639763Z graph_break [] 2025-12-04T11:07:12.9640270Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9641899Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9643055Z warnings.warn( 2025-12-04T11:07:12.9643967Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9645034Z warnings.warn( 2025-12-04T11:07:12.9645751Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:12.9646564Z Traceback (most recent call last): 2025-12-04T11:07:12.9647339Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:12.9648257Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:12.9649089Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:12.9649863Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:12.9650693Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:12.9651587Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:12.9652066Z AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9652319Z 2025-12-04T11:07:12.9652427Z Expected 3 but got 6. 2025-12-04T11:07:12.9652720Z Absolute difference: 3 2025-12-04T11:07:12.9653018Z Relative difference: 1.0 2025-12-04T11:07:12.9653211Z 2025-12-04T11:07:12.9653440Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9654725Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9655806Z 2025-12-04T11:07:12.9656076Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9656713Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9657197Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9658026Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9659017Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9659497Z graph_break [] 2025-12-04T11:07:12.9659909Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9661016Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9661998Z warnings.warn( 2025-12-04T11:07:12.9662895Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9663854Z warnings.warn( 2025-12-04T11:07:12.9664238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9664723Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9665161Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9666067Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9666851Z graph_break [] 2025-12-04T11:07:12.9667225Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9668314Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9669298Z warnings.warn( 2025-12-04T11:07:12.9670191Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9671165Z warnings.warn( 2025-12-04T11:07:12.9671468Z =================================== FAILURES =================================== 2025-12-04T11:07:12.9672354Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:12.9673156Z Traceback (most recent call last): 2025-12-04T11:07:12.9673931Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:12.9674847Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:12.9675676Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:12.9676447Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:12.9677279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:12.9678168Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:12.9678647Z AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9678898Z 2025-12-04T11:07:12.9679026Z Expected 3 but got 6. 2025-12-04T11:07:12.9679306Z Absolute difference: 3 2025-12-04T11:07:12.9679607Z Relative difference: 1.0 2025-12-04T11:07:12.9679799Z 2025-12-04T11:07:12.9680030Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9681318Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9682405Z 2025-12-04T11:07:12.9682673Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9683307Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9683790Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9684532Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9685508Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9685986Z graph_break [] 2025-12-04T11:07:12.9686368Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9687544Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9688749Z warnings.warn( 2025-12-04T11:07:12.9689653Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9690613Z warnings.warn( 2025-12-04T11:07:12.9691075Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9691563Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9692017Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9692927Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9693706Z graph_break [] 2025-12-04T11:07:12.9694091Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9695184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9696159Z warnings.warn( 2025-12-04T11:07:12.9697053Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9698024Z warnings.warn( 2025-12-04T11:07:12.9698457Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9698938Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9699488Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9700398Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9701161Z graph_break [] 2025-12-04T11:07:12.9701542Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9702645Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9703607Z warnings.warn( 2025-12-04T11:07:12.9704497Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9705467Z warnings.warn( 2025-12-04T11:07:12.9706486Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b8b19f7b3ae0b98c.xml - 2025-12-04T11:07:12.9707635Z =========================== short test summary info ============================ 2025-12-04T11:07:12.9708934Z FAILED [0.7656s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9710046Z 2025-12-04T11:07:12.9710154Z Expected 3 but got 6. 2025-12-04T11:07:12.9710449Z Absolute difference: 3 2025-12-04T11:07:12.9710738Z Relative difference: 1.0 2025-12-04T11:07:12.9710943Z 2025-12-04T11:07:12.9711162Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9712454Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9713528Z 2025-12-04T11:07:12.9713905Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9714493Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:12.9714991Z ========================== 1 failed, 2 rerun in 5.73s ========================== 2025-12-04T11:07:12.9715454Z Got exit code 1 2025-12-04T11:07:12.9715713Z Retrying single test... 2025-12-04T11:07:12.9716354Z W1204 10:51:46.521000 83004 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:12.9717615Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-cdcb3ea635f515ef.xml 2025-12-04T11:07:12.9718593Z ============================= test session starts ============================== 2025-12-04T11:07:12.9719247Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:12.9719855Z cachedir: .pytest_cache 2025-12-04T11:07:12.9720574Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:12.9721370Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:12.9721714Z configfile: pytest.ini 2025-12-04T11:07:12.9722446Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:12.9723352Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:12.9724718Z stepcurrent: skipping 0 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9726002Z Running 1 items in this shard 2025-12-04T11:07:12.9726227Z 2025-12-04T11:07:12.9727547Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:51:50.262537542 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9729035Z 2025-12-04T11:07:12.9729559Z [W1204 10:52:05.532233201 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9730215Z 2025-12-04T11:07:12.9730744Z [W1204 10:52:05.532506617 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9731395Z 2025-12-04T11:07:12.9731923Z [W1204 10:52:05.533132399 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9732572Z 2025-12-04T11:07:12.9733086Z [W1204 10:52:05.533346067 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9733756Z 2025-12-04T11:07:12.9734277Z [W1204 10:52:05.535167758 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9734941Z 2025-12-04T11:07:12.9735452Z [W1204 10:52:05.535348868 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9736110Z 2025-12-04T11:07:12.9736635Z [W1204 10:52:05.535662740 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9737283Z 2025-12-04T11:07:12.9737810Z [W1204 10:52:05.535834389 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9738529Z 2025-12-04T11:07:12.9739047Z [W1204 10:52:05.546469265 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9739713Z 2025-12-04T11:07:12.9740296Z [W1204 10:52:05.546696302 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9740959Z 2025-12-04T11:07:12.9741473Z [W1204 10:52:05.546882677 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9742168Z 2025-12-04T11:07:12.9742679Z [W1204 10:52:05.547166852 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9743329Z 2025-12-04T11:07:12.9743857Z [W1204 10:52:05.547337827 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9744508Z 2025-12-04T11:07:12.9745032Z [W1204 10:52:05.547633789 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9745684Z 2025-12-04T11:07:12.9746204Z [W1204 10:52:05.547814833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9746867Z 2025-12-04T11:07:12.9747381Z [W1204 10:52:05.548095343 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9748049Z 2025-12-04T11:07:12.9748563Z [W1204 10:52:05.548262629 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9749219Z 2025-12-04T11:07:12.9749749Z [W1204 10:52:05.667888968 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9750406Z 2025-12-04T11:07:12.9750931Z [W1204 10:52:05.668165609 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9751582Z 2025-12-04T11:07:12.9752132Z [W1204 10:52:05.668352742 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9752799Z 2025-12-04T11:07:12.9753312Z [W1204 10:52:05.668627464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9753980Z 2025-12-04T11:07:12.9754497Z [W1204 10:52:05.668796221 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9755153Z 2025-12-04T11:07:12.9755679Z [W1204 10:52:05.669081058 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9756331Z 2025-12-04T11:07:12.9756857Z [W1204 10:52:05.669249648 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9757507Z 2025-12-04T11:07:12.9758020Z [W1204 10:52:05.669521250 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9758778Z 2025-12-04T11:07:12.9759292Z [W1204 10:52:05.669687091 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9759958Z 2025-12-04T11:07:12.9760474Z [W1204 10:52:07.742535273 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9761122Z 2025-12-04T11:07:12.9761647Z [W1204 10:52:07.743766602 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9762301Z 2025-12-04T11:07:12.9762827Z [W1204 10:52:07.743965281 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9763480Z 2025-12-04T11:07:12.9763994Z [W1204 10:52:07.744251371 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9764659Z 2025-12-04T11:07:12.9765242Z [W1204 10:52:07.744429388 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9765937Z 2025-12-04T11:07:12.9766453Z [W1204 10:52:07.744724766 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9767106Z 2025-12-04T11:07:12.9767656Z [W1204 10:52:07.744902784 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9768310Z 2025-12-04T11:07:12.9768838Z [W1204 10:52:07.745182334 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9769491Z 2025-12-04T11:07:12.9770007Z [W1204 10:52:07.745359473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9770673Z 2025-12-04T11:07:12.9771190Z [W1204 10:52:07.753661085 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9771857Z 2025-12-04T11:07:12.9772372Z [W1204 10:52:07.753901340 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9773038Z 2025-12-04T11:07:12.9773554Z [W1204 10:52:07.754091820 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9774208Z 2025-12-04T11:07:12.9774737Z [W1204 10:52:07.754366003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9775392Z 2025-12-04T11:07:12.9775921Z [W1204 10:52:07.754541397 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9776607Z 2025-12-04T11:07:12.9777123Z [W1204 10:52:07.754830796 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9777794Z 2025-12-04T11:07:12.9778378Z [W1204 10:52:07.755006684 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9779050Z 2025-12-04T11:07:12.9779569Z [W1204 10:52:07.755305996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9780223Z 2025-12-04T11:07:12.9780753Z [W1204 10:52:07.755490644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9781408Z 2025-12-04T11:07:12.9781936Z [W1204 10:52:07.869726948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9782596Z 2025-12-04T11:07:12.9783114Z [W1204 10:52:07.869971301 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9783783Z 2025-12-04T11:07:12.9784299Z [W1204 10:52:07.870186008 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9784970Z 2025-12-04T11:07:12.9785486Z [W1204 10:52:07.870470974 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9786146Z 2025-12-04T11:07:12.9786673Z [W1204 10:52:07.870644362 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9787325Z 2025-12-04T11:07:12.9788052Z [W1204 10:52:07.870936237 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9788714Z 2025-12-04T11:07:12.9789358Z [W1204 10:52:07.871123527 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9790025Z 2025-12-04T11:07:12.9790541Z [W1204 10:52:07.871411672 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9791249Z 2025-12-04T11:07:12.9791763Z [W1204 10:52:07.871584531 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9792412Z 2025-12-04T11:07:12.9792564Z ('RERUN', {'yellow': True}) [19.4869s] [100%] 2025-12-04T11:07:12.9794148Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:52:08.291690260 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9795597Z 2025-12-04T11:07:12.9796121Z [W1204 10:52:08.291962587 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9796783Z 2025-12-04T11:07:12.9797300Z [W1204 10:52:08.292151192 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9797964Z 2025-12-04T11:07:12.9798478Z [W1204 10:52:08.292429807 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9799129Z 2025-12-04T11:07:12.9799658Z [W1204 10:52:08.292608991 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9800307Z 2025-12-04T11:07:12.9800833Z [W1204 10:52:08.292902480 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9801482Z 2025-12-04T11:07:12.9802069Z [W1204 10:52:08.293074798 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9802736Z 2025-12-04T11:07:12.9803256Z [W1204 10:52:08.293352105 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9803923Z 2025-12-04T11:07:12.9804437Z [W1204 10:52:08.293521868 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9805101Z 2025-12-04T11:07:12.9805618Z [W1204 10:52:08.301865169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9806270Z 2025-12-04T11:07:12.9806796Z [W1204 10:52:08.302097778 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9807446Z 2025-12-04T11:07:12.9807976Z [W1204 10:52:08.302285776 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9808629Z 2025-12-04T11:07:12.9809148Z [W1204 10:52:08.302555287 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9809814Z 2025-12-04T11:07:12.9810330Z [W1204 10:52:08.302727598 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9810996Z 2025-12-04T11:07:12.9811509Z [W1204 10:52:08.303015071 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9812164Z 2025-12-04T11:07:12.9812689Z [W1204 10:52:08.303200025 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9813344Z 2025-12-04T11:07:12.9813870Z [W1204 10:52:08.303482427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9814581Z 2025-12-04T11:07:12.9815097Z [W1204 10:52:08.303652571 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9815791Z 2025-12-04T11:07:12.9816307Z [W1204 10:52:08.418325424 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9816968Z 2025-12-04T11:07:12.9817478Z [W1204 10:52:08.418573473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9818129Z 2025-12-04T11:07:12.9818740Z [W1204 10:52:08.418760841 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9819391Z 2025-12-04T11:07:12.9819918Z [W1204 10:52:08.419037864 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9820571Z 2025-12-04T11:07:12.9821086Z [W1204 10:52:08.419224668 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9821748Z 2025-12-04T11:07:12.9822264Z [W1204 10:52:08.419520557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9822929Z 2025-12-04T11:07:12.9823440Z [W1204 10:52:08.419690322 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9824091Z 2025-12-04T11:07:12.9824619Z [W1204 10:52:08.419965976 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9825271Z 2025-12-04T11:07:12.9825799Z [W1204 10:52:08.420160375 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9826484Z 2025-12-04T11:07:12.9827001Z [W1204 10:52:08.580985610 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9827669Z 2025-12-04T11:07:12.9828185Z [W1204 10:52:08.581252811 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9828852Z 2025-12-04T11:07:12.9829363Z [W1204 10:52:08.581447514 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9830015Z 2025-12-04T11:07:12.9830539Z [W1204 10:52:08.581727540 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9831188Z 2025-12-04T11:07:12.9831716Z [W1204 10:52:08.581905727 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9832370Z 2025-12-04T11:07:12.9832890Z [W1204 10:52:08.582200275 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9833549Z 2025-12-04T11:07:12.9834061Z [W1204 10:52:08.582377465 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9834727Z 2025-12-04T11:07:12.9835238Z [W1204 10:52:08.582658069 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9835886Z 2025-12-04T11:07:12.9836414Z [W1204 10:52:08.582831607 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9837065Z 2025-12-04T11:07:12.9837593Z [W1204 10:52:08.590940432 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9838245Z 2025-12-04T11:07:12.9838821Z [W1204 10:52:08.591191727 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9839485Z 2025-12-04T11:07:12.9839997Z [W1204 10:52:08.591386559 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9840691Z 2025-12-04T11:07:12.9841205Z [W1204 10:52:08.591663383 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9841867Z 2025-12-04T11:07:12.9842379Z [W1204 10:52:08.591839157 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9843030Z 2025-12-04T11:07:12.9843555Z [W1204 10:52:08.592129098 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9844203Z 2025-12-04T11:07:12.9844735Z [W1204 10:52:08.592303981 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9845385Z 2025-12-04T11:07:12.9845893Z [W1204 10:52:08.592583495 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9846559Z 2025-12-04T11:07:12.9847072Z [W1204 10:52:08.592757883 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9847735Z 2025-12-04T11:07:12.9848249Z [W1204 10:52:08.706846295 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9848897Z 2025-12-04T11:07:12.9849422Z [W1204 10:52:08.707097782 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9850076Z 2025-12-04T11:07:12.9850606Z [W1204 10:52:08.707302632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9851287Z 2025-12-04T11:07:12.9851801Z [W1204 10:52:08.707582261 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9852465Z 2025-12-04T11:07:12.9852977Z [W1204 10:52:08.707756099 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9853640Z 2025-12-04T11:07:12.9854155Z [W1204 10:52:08.708048910 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9854806Z 2025-12-04T11:07:12.9855334Z [W1204 10:52:08.708223272 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9855983Z 2025-12-04T11:07:12.9856509Z [W1204 10:52:08.708503290 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9857167Z 2025-12-04T11:07:12.9857679Z [W1204 10:52:08.708675640 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9858415Z 2025-12-04T11:07:12.9858551Z ('RERUN', {'yellow': True}) [0.7983s] [100%] 2025-12-04T11:07:12.9860128Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:52:08.066374853 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9861567Z 2025-12-04T11:07:12.9862100Z [W1204 10:52:08.066646996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9862752Z 2025-12-04T11:07:12.9863281Z [W1204 10:52:08.066834933 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9864003Z 2025-12-04T11:07:12.9864520Z [W1204 10:52:08.067129253 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9865213Z 2025-12-04T11:07:12.9865734Z [W1204 10:52:08.067313696 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9866395Z 2025-12-04T11:07:12.9866908Z [W1204 10:52:08.067609523 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9867558Z 2025-12-04T11:07:12.9868082Z [W1204 10:52:08.067784454 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9868732Z 2025-12-04T11:07:12.9869255Z [W1204 10:52:08.068067216 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9869908Z 2025-12-04T11:07:12.9870426Z [W1204 10:52:08.068239085 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9871093Z 2025-12-04T11:07:12.9871606Z [W1204 10:52:08.076262292 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9872269Z 2025-12-04T11:07:12.9872785Z [W1204 10:52:08.076496446 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9873444Z 2025-12-04T11:07:12.9873959Z [W1204 10:52:08.076684580 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9874613Z 2025-12-04T11:07:12.9875137Z [W1204 10:52:08.076955475 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9875825Z 2025-12-04T11:07:12.9876353Z [W1204 10:52:08.077124998 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9877005Z 2025-12-04T11:07:12.9877523Z [W1204 10:52:08.077412797 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9878190Z 2025-12-04T11:07:12.9878704Z [W1204 10:52:08.077582478 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9879369Z 2025-12-04T11:07:12.9879887Z [W1204 10:52:08.077855776 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9880537Z 2025-12-04T11:07:12.9881063Z [W1204 10:52:08.078024124 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9881719Z 2025-12-04T11:07:12.9882250Z [W1204 10:52:09.191676316 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9882900Z 2025-12-04T11:07:12.9883415Z [W1204 10:52:09.191909918 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9884084Z 2025-12-04T11:07:12.9884599Z [W1204 10:52:09.192104025 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9885266Z 2025-12-04T11:07:12.9885784Z [W1204 10:52:09.192375274 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9886436Z 2025-12-04T11:07:12.9886967Z [W1204 10:52:09.192544525 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9887624Z 2025-12-04T11:07:12.9888467Z [W1204 10:52:09.192831422 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9889125Z 2025-12-04T11:07:12.9889640Z [W1204 10:52:09.192999648 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9890347Z 2025-12-04T11:07:12.9890862Z [W1204 10:52:09.193273089 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9891531Z 2025-12-04T11:07:12.9892045Z [W1204 10:52:09.193438866 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9892699Z 2025-12-04T11:07:12.9893228Z [W1204 10:52:09.351705644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9893878Z 2025-12-04T11:07:12.9894415Z [W1204 10:52:09.351953395 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9895070Z 2025-12-04T11:07:12.9895584Z [W1204 10:52:09.352146570 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9896253Z 2025-12-04T11:07:12.9896769Z [W1204 10:52:09.352422557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9897429Z 2025-12-04T11:07:12.9897941Z [W1204 10:52:09.352598387 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9898655Z 2025-12-04T11:07:12.9899185Z [W1204 10:52:09.352889180 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9899833Z 2025-12-04T11:07:12.9900366Z [W1204 10:52:09.353064813 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9901064Z 2025-12-04T11:07:12.9901578Z [W1204 10:52:09.353346054 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9902252Z 2025-12-04T11:07:12.9902765Z [W1204 10:52:09.353518614 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9903431Z 2025-12-04T11:07:12.9903944Z [W1204 10:52:09.361561736 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9904595Z 2025-12-04T11:07:12.9905117Z [W1204 10:52:09.361797388 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9905770Z 2025-12-04T11:07:12.9906300Z [W1204 10:52:09.361989314 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9906956Z 2025-12-04T11:07:12.9907468Z [W1204 10:52:09.362265090 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9908139Z 2025-12-04T11:07:12.9908652Z [W1204 10:52:09.362438879 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9909313Z 2025-12-04T11:07:12.9909825Z [W1204 10:52:09.362728784 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9910488Z 2025-12-04T11:07:12.9910997Z [W1204 10:52:09.362904096 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9911647Z 2025-12-04T11:07:12.9912172Z [W1204 10:52:09.363207726 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9912824Z 2025-12-04T11:07:12.9913406Z [W1204 10:52:09.363385530 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9914059Z 2025-12-04T11:07:12.9914605Z [W1204 10:52:09.476530584 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9915269Z 2025-12-04T11:07:12.9915782Z [W1204 10:52:09.476769644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9916447Z 2025-12-04T11:07:12.9916961Z [W1204 10:52:09.476959870 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9917611Z 2025-12-04T11:07:12.9918134Z [W1204 10:52:09.477237323 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9918788Z 2025-12-04T11:07:12.9919317Z [W1204 10:52:09.477419587 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9919970Z 2025-12-04T11:07:12.9920482Z [W1204 10:52:09.477714504 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9921144Z 2025-12-04T11:07:12.9921656Z [W1204 10:52:09.477888847 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9922321Z 2025-12-04T11:07:12.9922835Z [W1204 10:52:09.478168129 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9923484Z 2025-12-04T11:07:12.9924006Z [W1204 10:52:09.478341799 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:12.9924695Z 2025-12-04T11:07:12.9924813Z FAILED [0.7672s] [100%] 2025-12-04T11:07:12.9924996Z 2025-12-04T11:07:12.9925139Z ==================================== RERUNS ==================================== 2025-12-04T11:07:12.9925970Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:12.9926777Z Traceback (most recent call last): 2025-12-04T11:07:12.9927559Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:12.9928484Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:12.9929323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:12.9930095Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:12.9930927Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:12.9931825Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:12.9932303Z AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9932554Z 2025-12-04T11:07:12.9932677Z Expected 3 but got 6. 2025-12-04T11:07:12.9932956Z Absolute difference: 3 2025-12-04T11:07:12.9933255Z Relative difference: 1.0 2025-12-04T11:07:12.9933446Z 2025-12-04T11:07:12.9933674Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9934960Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9936046Z 2025-12-04T11:07:12.9936313Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9936952Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9937441Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9938314Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9939234Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9939749Z graph_break [] 2025-12-04T11:07:12.9940130Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9941693Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:12.9943171Z if out == self.unknown_value: 2025-12-04T11:07:12.9944131Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9945110Z warnings.warn( 2025-12-04T11:07:12.9945995Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9946964Z warnings.warn( 2025-12-04T11:07:12.9947671Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:12.9948459Z Traceback (most recent call last): 2025-12-04T11:07:12.9949245Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:12.9950158Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:12.9950986Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:12.9951742Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:12.9952626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:12.9953514Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:12.9953989Z AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9954241Z 2025-12-04T11:07:12.9954347Z Expected 3 but got 6. 2025-12-04T11:07:12.9954641Z Absolute difference: 3 2025-12-04T11:07:12.9954940Z Relative difference: 1.0 2025-12-04T11:07:12.9955132Z 2025-12-04T11:07:12.9955349Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9956649Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9957735Z 2025-12-04T11:07:12.9958007Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9958646Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9959121Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9959873Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9960775Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9961249Z graph_break [] 2025-12-04T11:07:12.9961614Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9963197Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:12.9964668Z if out == self.unknown_value: 2025-12-04T11:07:12.9965607Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9966688Z warnings.warn( 2025-12-04T11:07:12.9967587Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9968599Z warnings.warn( 2025-12-04T11:07:12.9968977Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9969467Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9969926Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9970838Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9971604Z graph_break [] 2025-12-04T11:07:12.9971987Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9973096Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9974065Z warnings.warn( 2025-12-04T11:07:12.9974960Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9975935Z warnings.warn( 2025-12-04T11:07:12.9976252Z =================================== FAILURES =================================== 2025-12-04T11:07:12.9977080Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:12.9977886Z Traceback (most recent call last): 2025-12-04T11:07:12.9978755Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:12.9979683Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:12.9980573Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:12.9981347Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:12.9982188Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:12.9983067Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:12.9983545Z AssertionError: Scalars are not equal! 2025-12-04T11:07:12.9983808Z 2025-12-04T11:07:12.9983914Z Expected 3 but got 6. 2025-12-04T11:07:12.9984203Z Absolute difference: 3 2025-12-04T11:07:12.9984491Z Relative difference: 1.0 2025-12-04T11:07:12.9984690Z 2025-12-04T11:07:12.9984904Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:12.9986198Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:12.9987273Z 2025-12-04T11:07:12.9987562Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:12.9988404Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9988893Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9989644Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:12.9990540Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:12.9991016Z graph_break [] 2025-12-04T11:07:12.9991399Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:12.9992979Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:12.9994550Z if out == self.unknown_value: 2025-12-04T11:07:12.9995516Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9996544Z warnings.warn( 2025-12-04T11:07:12.9997436Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:12.9998400Z warnings.warn( 2025-12-04T11:07:12.9998782Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:12.9999265Z stats [('calls_captured', 36)] 2025-12-04T11:07:12.9999700Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0000609Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0001384Z graph_break [] 2025-12-04T11:07:13.0001767Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0002850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0003826Z warnings.warn( 2025-12-04T11:07:13.0004721Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0005694Z warnings.warn( 2025-12-04T11:07:13.0006064Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0006546Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0006994Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0007887Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0008707Z graph_break [] 2025-12-04T11:07:13.0009083Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0010184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0011138Z warnings.warn( 2025-12-04T11:07:13.0012033Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0012999Z warnings.warn( 2025-12-04T11:07:13.0014002Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-cdcb3ea635f515ef.xml - 2025-12-04T11:07:13.0015160Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0016460Z FAILED [0.7672s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0017557Z 2025-12-04T11:07:13.0017679Z Expected 3 but got 6. 2025-12-04T11:07:13.0017962Z Absolute difference: 3 2025-12-04T11:07:13.0018342Z Relative difference: 1.0 2025-12-04T11:07:13.0018553Z 2025-12-04T11:07:13.0018770Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0020072Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0021143Z 2025-12-04T11:07:13.0021411Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0022004Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0022609Z ================== 1 failed, 10 deselected, 2 rerun in 21.09s ================== 2025-12-04T11:07:13.0023058Z Got exit code 1 2025-12-04T11:07:13.0023314Z Retrying single test... 2025-12-04T11:07:13.0023982Z W1204 10:52:20.120000 83179 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0025241Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-2c53b8c4a22bc4ac.xml 2025-12-04T11:07:13.0026201Z ============================= test session starts ============================== 2025-12-04T11:07:13.0026871Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0027479Z cachedir: .pytest_cache 2025-12-04T11:07:13.0028197Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0028981Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0029343Z configfile: pytest.ini 2025-12-04T11:07:13.0030077Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0030984Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.0032359Z stepcurrent: skipping 0 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0033629Z Running 1 items in this shard 2025-12-04T11:07:13.0033837Z 2025-12-04T11:07:13.0035160Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:52:23.839234551 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0036637Z 2025-12-04T11:07:13.0037176Z [W1204 10:52:39.319724830 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0037836Z 2025-12-04T11:07:13.0038363Z [W1204 10:52:39.319979453 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0039015Z 2025-12-04T11:07:13.0039531Z [W1204 10:52:39.320630676 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0040194Z 2025-12-04T11:07:13.0040707Z [W1204 10:52:39.320847139 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0041374Z 2025-12-04T11:07:13.0041887Z [W1204 10:52:39.322636245 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0042543Z 2025-12-04T11:07:13.0043072Z [W1204 10:52:39.322815745 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0043726Z 2025-12-04T11:07:13.0044256Z [W1204 10:52:39.323154966 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0044907Z 2025-12-04T11:07:13.0045422Z [W1204 10:52:39.323335788 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0046091Z 2025-12-04T11:07:13.0046605Z [W1204 10:52:39.333855850 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0047271Z 2025-12-04T11:07:13.0047790Z [W1204 10:52:39.334090330 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0048445Z 2025-12-04T11:07:13.0049034Z [W1204 10:52:39.334280775 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0049687Z 2025-12-04T11:07:13.0050214Z [W1204 10:52:39.334559525 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0050898Z 2025-12-04T11:07:13.0051410Z [W1204 10:52:39.334731682 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0052085Z 2025-12-04T11:07:13.0052601Z [W1204 10:52:39.335022657 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0053269Z 2025-12-04T11:07:13.0053786Z [W1204 10:52:39.335207957 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0054443Z 2025-12-04T11:07:13.0054977Z [W1204 10:52:39.335499050 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0055632Z 2025-12-04T11:07:13.0056162Z [W1204 10:52:39.335670101 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0056820Z 2025-12-04T11:07:13.0057336Z [W1204 10:52:39.454796267 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0058004Z 2025-12-04T11:07:13.0058579Z [W1204 10:52:39.455101031 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0059245Z 2025-12-04T11:07:13.0059761Z [W1204 10:52:39.455293369 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0060452Z 2025-12-04T11:07:13.0060990Z [W1204 10:52:39.455572464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0061642Z 2025-12-04T11:07:13.0062170Z [W1204 10:52:39.455741435 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0062828Z 2025-12-04T11:07:13.0063342Z [W1204 10:52:39.456033489 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0064011Z 2025-12-04T11:07:13.0064525Z [W1204 10:52:39.456201824 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0065189Z 2025-12-04T11:07:13.0065701Z [W1204 10:52:39.456478060 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0066361Z 2025-12-04T11:07:13.0066882Z [W1204 10:52:39.456647724 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0067537Z 2025-12-04T11:07:13.0068068Z [W1204 10:52:41.529075990 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0068721Z 2025-12-04T11:07:13.0069247Z [W1204 10:52:41.530278519 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0069896Z 2025-12-04T11:07:13.0070413Z [W1204 10:52:41.530486601 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0071077Z 2025-12-04T11:07:13.0071588Z [W1204 10:52:41.530780595 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0072251Z 2025-12-04T11:07:13.0072765Z [W1204 10:52:41.530959871 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0073475Z 2025-12-04T11:07:13.0074001Z [W1204 10:52:41.531280168 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0074699Z 2025-12-04T11:07:13.0075223Z [W1204 10:52:41.531459748 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0075878Z 2025-12-04T11:07:13.0076393Z [W1204 10:52:41.531746486 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0077053Z 2025-12-04T11:07:13.0077567Z [W1204 10:52:41.531920595 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0078231Z 2025-12-04T11:07:13.0078748Z [W1204 10:52:41.540132153 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0079404Z 2025-12-04T11:07:13.0079938Z [W1204 10:52:41.540363361 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0080592Z 2025-12-04T11:07:13.0081117Z [W1204 10:52:41.540554111 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0081770Z 2025-12-04T11:07:13.0082281Z [W1204 10:52:41.540831144 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0082951Z 2025-12-04T11:07:13.0083463Z [W1204 10:52:41.541005910 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0084126Z 2025-12-04T11:07:13.0084638Z [W1204 10:52:41.541297905 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0085320Z 2025-12-04T11:07:13.0085857Z [W1204 10:52:41.541473452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0086510Z 2025-12-04T11:07:13.0087040Z [W1204 10:52:41.541755822 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0087691Z 2025-12-04T11:07:13.0088415Z [W1204 10:52:41.541930736 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0089082Z 2025-12-04T11:07:13.0089596Z [W1204 10:52:41.655850452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0090267Z 2025-12-04T11:07:13.0090781Z [W1204 10:52:41.656089925 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0091435Z 2025-12-04T11:07:13.0091967Z [W1204 10:52:41.656281969 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0092623Z 2025-12-04T11:07:13.0093149Z [W1204 10:52:41.656561416 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0093803Z 2025-12-04T11:07:13.0094316Z [W1204 10:52:41.656736121 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0094979Z 2025-12-04T11:07:13.0095489Z [W1204 10:52:41.657030478 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0096154Z 2025-12-04T11:07:13.0096668Z [W1204 10:52:41.657205584 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0097318Z 2025-12-04T11:07:13.0097943Z [W1204 10:52:41.657493489 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0098659Z 2025-12-04T11:07:13.0099190Z [W1204 10:52:41.657669190 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0099892Z 2025-12-04T11:07:13.0100030Z ('RERUN', {'yellow': True}) [19.6810s] [100%] 2025-12-04T11:07:13.0101618Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:52:41.080469147 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0103067Z 2025-12-04T11:07:13.0103587Z [W1204 10:52:41.080748252 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0104255Z 2025-12-04T11:07:13.0104773Z [W1204 10:52:41.080938139 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0105424Z 2025-12-04T11:07:13.0105948Z [W1204 10:52:41.081222201 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0106599Z 2025-12-04T11:07:13.0107124Z [W1204 10:52:41.081399218 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0107773Z 2025-12-04T11:07:13.0108288Z [W1204 10:52:41.081693500 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0108949Z 2025-12-04T11:07:13.0109462Z [W1204 10:52:41.081866363 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0110178Z 2025-12-04T11:07:13.0110697Z [W1204 10:52:41.082147515 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0111349Z 2025-12-04T11:07:13.0111871Z [W1204 10:52:41.082318642 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0112524Z 2025-12-04T11:07:13.0113051Z [W1204 10:52:41.090623636 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0113704Z 2025-12-04T11:07:13.0114220Z [W1204 10:52:41.090858393 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0114884Z 2025-12-04T11:07:13.0115396Z [W1204 10:52:41.091045475 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0116061Z 2025-12-04T11:07:13.0116583Z [W1204 10:52:41.091333977 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0117236Z 2025-12-04T11:07:13.0117763Z [W1204 10:52:41.091506870 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0118417Z 2025-12-04T11:07:13.0118944Z [W1204 10:52:41.091797569 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0119595Z 2025-12-04T11:07:13.0120111Z [W1204 10:52:41.091969674 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0120773Z 2025-12-04T11:07:13.0121284Z [W1204 10:52:41.092250356 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0121947Z 2025-12-04T11:07:13.0122528Z [W1204 10:52:41.092420909 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0123181Z 2025-12-04T11:07:13.0123705Z [W1204 10:52:42.206566852 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0124388Z 2025-12-04T11:07:13.0124912Z [W1204 10:52:42.206801209 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0125559Z 2025-12-04T11:07:13.0126071Z [W1204 10:52:42.206987814 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0126734Z 2025-12-04T11:07:13.0127248Z [W1204 10:52:42.207279742 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0127908Z 2025-12-04T11:07:13.0128421Z [W1204 10:52:42.207453813 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0129079Z 2025-12-04T11:07:13.0129601Z [W1204 10:52:42.207743892 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0130251Z 2025-12-04T11:07:13.0130779Z [W1204 10:52:42.207914231 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0131429Z 2025-12-04T11:07:13.0131943Z [W1204 10:52:42.208192435 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0132607Z 2025-12-04T11:07:13.0133121Z [W1204 10:52:42.208360762 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0133787Z 2025-12-04T11:07:13.0134301Z [W1204 10:52:42.367475294 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0134999Z 2025-12-04T11:07:13.0135517Z [W1204 10:52:42.367725149 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0136169Z 2025-12-04T11:07:13.0136700Z [W1204 10:52:42.367919524 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0137352Z 2025-12-04T11:07:13.0137879Z [W1204 10:52:42.368212259 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0138596Z 2025-12-04T11:07:13.0139112Z [W1204 10:52:42.368391160 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0139778Z 2025-12-04T11:07:13.0140293Z [W1204 10:52:42.368690838 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0140960Z 2025-12-04T11:07:13.0141479Z [W1204 10:52:42.368868493 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0142126Z 2025-12-04T11:07:13.0142655Z [W1204 10:52:42.369154984 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0143309Z 2025-12-04T11:07:13.0143836Z [W1204 10:52:42.369332496 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0144487Z 2025-12-04T11:07:13.0145000Z [W1204 10:52:42.377389196 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0145664Z 2025-12-04T11:07:13.0146180Z [W1204 10:52:42.377624097 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0146846Z 2025-12-04T11:07:13.0147424Z [W1204 10:52:42.377817135 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0148077Z 2025-12-04T11:07:13.0148604Z [W1204 10:52:42.378095498 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0149290Z 2025-12-04T11:07:13.0149816Z [W1204 10:52:42.378272945 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0150469Z 2025-12-04T11:07:13.0150981Z [W1204 10:52:42.378569287 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0151644Z 2025-12-04T11:07:13.0152158Z [W1204 10:52:42.378747139 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0152836Z 2025-12-04T11:07:13.0153361Z [W1204 10:52:42.379033648 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0154014Z 2025-12-04T11:07:13.0154545Z [W1204 10:52:42.379224123 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0155198Z 2025-12-04T11:07:13.0155725Z [W1204 10:52:42.493422155 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0156377Z 2025-12-04T11:07:13.0156891Z [W1204 10:52:42.493677718 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0157561Z 2025-12-04T11:07:13.0158078Z [W1204 10:52:42.493871015 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0158745Z 2025-12-04T11:07:13.0159295Z [W1204 10:52:42.494158564 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0159947Z 2025-12-04T11:07:13.0160476Z [W1204 10:52:42.494336312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0161132Z 2025-12-04T11:07:13.0161662Z [W1204 10:52:42.494636760 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0162318Z 2025-12-04T11:07:13.0162833Z [W1204 10:52:42.494815389 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0163499Z 2025-12-04T11:07:13.0164012Z [W1204 10:52:42.495117948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0164681Z 2025-12-04T11:07:13.0165196Z [W1204 10:52:42.495298229 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0165870Z 2025-12-04T11:07:13.0166006Z ('RERUN', {'yellow': True}) [0.7995s] [100%] 2025-12-04T11:07:13.0167586Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:52:42.853637760 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0169016Z 2025-12-04T11:07:13.0169552Z [W1204 10:52:42.853909354 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0170205Z 2025-12-04T11:07:13.0170720Z [W1204 10:52:42.854097266 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0171386Z 2025-12-04T11:07:13.0171978Z [W1204 10:52:42.854427556 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0172646Z 2025-12-04T11:07:13.0173162Z [W1204 10:52:42.854629323 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0173843Z 2025-12-04T11:07:13.0174372Z [W1204 10:52:42.854954758 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0175025Z 2025-12-04T11:07:13.0175551Z [W1204 10:52:42.855142184 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0176204Z 2025-12-04T11:07:13.0176721Z [W1204 10:52:42.855444550 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0177386Z 2025-12-04T11:07:13.0177898Z [W1204 10:52:42.855618057 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0178645Z 2025-12-04T11:07:13.0179166Z [W1204 10:52:42.864877427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0179819Z 2025-12-04T11:07:13.0180348Z [W1204 10:52:42.865111186 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0181000Z 2025-12-04T11:07:13.0181525Z [W1204 10:52:42.865296957 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0182175Z 2025-12-04T11:07:13.0182685Z [W1204 10:52:42.865569711 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0183354Z 2025-12-04T11:07:13.0183867Z [W1204 10:52:42.865740431 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0184568Z 2025-12-04T11:07:13.0185084Z [W1204 10:52:42.866028281 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0185738Z 2025-12-04T11:07:13.0186268Z [W1204 10:52:42.866199710 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0186922Z 2025-12-04T11:07:13.0187451Z [W1204 10:52:42.866475198 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0188346Z 2025-12-04T11:07:13.0188864Z [W1204 10:52:42.866644217 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0189526Z 2025-12-04T11:07:13.0190042Z [W1204 10:52:42.981209669 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0190712Z 2025-12-04T11:07:13.0191230Z [W1204 10:52:42.981466822 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0191880Z 2025-12-04T11:07:13.0192412Z [W1204 10:52:42.981654048 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0193065Z 2025-12-04T11:07:13.0193593Z [W1204 10:52:42.981937145 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0194242Z 2025-12-04T11:07:13.0194756Z [W1204 10:52:42.982110179 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0195425Z 2025-12-04T11:07:13.0195939Z [W1204 10:52:42.982404995 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0196606Z 2025-12-04T11:07:13.0197223Z [W1204 10:52:42.982575611 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0197877Z 2025-12-04T11:07:13.0198408Z [W1204 10:52:42.982851887 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0199103Z 2025-12-04T11:07:13.0199631Z [W1204 10:52:42.983020069 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0200281Z 2025-12-04T11:07:13.0200794Z [W1204 10:52:43.142720299 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0201453Z 2025-12-04T11:07:13.0201967Z [W1204 10:52:43.142975288 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0202632Z 2025-12-04T11:07:13.0203151Z [W1204 10:52:43.143185302 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0203817Z 2025-12-04T11:07:13.0204329Z [W1204 10:52:43.143471824 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0204984Z 2025-12-04T11:07:13.0205506Z [W1204 10:52:43.143648732 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0206159Z 2025-12-04T11:07:13.0206682Z [W1204 10:52:43.143946203 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0207337Z 2025-12-04T11:07:13.0207852Z [W1204 10:52:43.144132791 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0208516Z 2025-12-04T11:07:13.0209074Z [W1204 10:52:43.144419513 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0209736Z 2025-12-04T11:07:13.0210252Z [W1204 10:52:43.144594707 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0210905Z 2025-12-04T11:07:13.0211437Z [W1204 10:52:43.152604654 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0212084Z 2025-12-04T11:07:13.0212609Z [W1204 10:52:43.152839127 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0213254Z 2025-12-04T11:07:13.0213765Z [W1204 10:52:43.153030319 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0214426Z 2025-12-04T11:07:13.0214941Z [W1204 10:52:43.153306437 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0215608Z 2025-12-04T11:07:13.0216119Z [W1204 10:52:43.153482630 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0216770Z 2025-12-04T11:07:13.0217297Z [W1204 10:52:43.153776551 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0217948Z 2025-12-04T11:07:13.0218545Z [W1204 10:52:43.153952750 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0219197Z 2025-12-04T11:07:13.0219710Z [W1204 10:52:43.154236702 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0220372Z 2025-12-04T11:07:13.0220886Z [W1204 10:52:43.154410813 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0221630Z 2025-12-04T11:07:13.0222147Z [W1204 10:52:43.267909386 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0222830Z 2025-12-04T11:07:13.0223357Z [W1204 10:52:43.268153018 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0224009Z 2025-12-04T11:07:13.0224536Z [W1204 10:52:43.268345629 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0225188Z 2025-12-04T11:07:13.0225703Z [W1204 10:52:43.268626416 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0226365Z 2025-12-04T11:07:13.0226879Z [W1204 10:52:43.268802346 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0227544Z 2025-12-04T11:07:13.0228063Z [W1204 10:52:43.269098312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0228716Z 2025-12-04T11:07:13.0229245Z [W1204 10:52:43.269275530 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0229896Z 2025-12-04T11:07:13.0230424Z [W1204 10:52:43.269558168 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0231077Z 2025-12-04T11:07:13.0231590Z [W1204 10:52:43.269731464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0232257Z 2025-12-04T11:07:13.0232360Z FAILED [0.7721s] [100%] 2025-12-04T11:07:13.0232549Z 2025-12-04T11:07:13.0232691Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.0233568Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0234360Z Traceback (most recent call last): 2025-12-04T11:07:13.0235149Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0236066Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0236906Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0237658Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0238500Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0239386Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0239865Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0240115Z 2025-12-04T11:07:13.0240226Z Expected 3 but got 6. 2025-12-04T11:07:13.0240518Z Absolute difference: 3 2025-12-04T11:07:13.0240820Z Relative difference: 1.0 2025-12-04T11:07:13.0241011Z 2025-12-04T11:07:13.0241229Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0242531Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0243616Z 2025-12-04T11:07:13.0243886Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0244523Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0244994Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0245751Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0272916Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0273416Z graph_break [] 2025-12-04T11:07:13.0273818Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0275473Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0276958Z if out == self.unknown_value: 2025-12-04T11:07:13.0277910Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0278890Z warnings.warn( 2025-12-04T11:07:13.0279788Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0280772Z warnings.warn( 2025-12-04T11:07:13.0281469Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0282278Z Traceback (most recent call last): 2025-12-04T11:07:13.0283071Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0283978Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0284815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0285590Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0286438Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0287353Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0288095Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0288354Z 2025-12-04T11:07:13.0288480Z Expected 3 but got 6. 2025-12-04T11:07:13.0288765Z Absolute difference: 3 2025-12-04T11:07:13.0289070Z Relative difference: 1.0 2025-12-04T11:07:13.0289264Z 2025-12-04T11:07:13.0289498Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0290806Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0291874Z 2025-12-04T11:07:13.0292143Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0292779Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0293263Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0294034Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0294929Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0295408Z graph_break [] 2025-12-04T11:07:13.0295788Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0297359Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0298898Z if out == self.unknown_value: 2025-12-04T11:07:13.0299867Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0300847Z warnings.warn( 2025-12-04T11:07:13.0301866Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0302846Z warnings.warn( 2025-12-04T11:07:13.0303237Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0303775Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0304215Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0305121Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0305902Z graph_break [] 2025-12-04T11:07:13.0306269Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0307367Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0308349Z warnings.warn( 2025-12-04T11:07:13.0309248Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0310210Z warnings.warn( 2025-12-04T11:07:13.0310532Z =================================== FAILURES =================================== 2025-12-04T11:07:13.0311372Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0312175Z Traceback (most recent call last): 2025-12-04T11:07:13.0312950Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0313864Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0314698Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0315499Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0316345Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0317240Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0317728Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0319378Z 2025-12-04T11:07:13.0319488Z Expected 3 but got 6. 2025-12-04T11:07:13.0319782Z Absolute difference: 3 2025-12-04T11:07:13.0320081Z Relative difference: 1.0 2025-12-04T11:07:13.0320273Z 2025-12-04T11:07:13.0320492Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0321791Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0322873Z 2025-12-04T11:07:13.0323141Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0323782Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0324254Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0325006Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0325916Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0326390Z graph_break [] 2025-12-04T11:07:13.0326755Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0328346Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0329814Z if out == self.unknown_value: 2025-12-04T11:07:13.0330845Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0331816Z warnings.warn( 2025-12-04T11:07:13.0332719Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0333728Z warnings.warn( 2025-12-04T11:07:13.0334118Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0334587Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0335033Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0335940Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0336702Z graph_break [] 2025-12-04T11:07:13.0337078Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0338186Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0339238Z warnings.warn( 2025-12-04T11:07:13.0340120Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0341090Z warnings.warn( 2025-12-04T11:07:13.0341470Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0341936Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0342362Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0343246Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0344006Z graph_break [] 2025-12-04T11:07:13.0344404Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0345489Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0346440Z warnings.warn( 2025-12-04T11:07:13.0347308Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0348250Z warnings.warn( 2025-12-04T11:07:13.0349248Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-2c53b8c4a22bc4ac.xml - 2025-12-04T11:07:13.0350390Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0351667Z FAILED [0.7721s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0352770Z 2025-12-04T11:07:13.0352872Z Expected 3 but got 6. 2025-12-04T11:07:13.0353162Z Absolute difference: 3 2025-12-04T11:07:13.0353452Z Relative difference: 1.0 2025-12-04T11:07:13.0353635Z 2025-12-04T11:07:13.0353845Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0355128Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0356194Z 2025-12-04T11:07:13.0356461Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0357047Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0357562Z ================== 1 failed, 10 deselected, 2 rerun in 21.28s ================== 2025-12-04T11:07:13.0358007Z Got exit code 1 2025-12-04T11:07:13.0359089Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0360530Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.0361515Z W1204 10:52:53.931000 83354 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0362757Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c3b4fd201ecf8b1e.xml 2025-12-04T11:07:13.0363713Z ============================= test session starts ============================== 2025-12-04T11:07:13.0364372Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0364956Z cachedir: .pytest_cache 2025-12-04T11:07:13.0365669Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0366445Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0366774Z configfile: pytest.ini 2025-12-04T11:07:13.0367502Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0368391Z collecting ... collected 58 items / 1 deselected / 57 selected 2025-12-04T11:07:13.0368880Z stepcurrent: skipping 1 already run items. 2025-12-04T11:07:13.0369241Z Running 10 items in this shard 2025-12-04T11:07:13.0369457Z 2025-12-04T11:07:13.0370366Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [4.1545s] [ 10%] 2025-12-04T11:07:13.0372283Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.7729s] [ 10%] 2025-12-04T11:07:13.0374149Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 FAILED [0.7630s] [ 10%] 2025-12-04T11:07:13.0375092Z 2025-12-04T11:07:13.0375233Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.0376039Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0376822Z Traceback (most recent call last): 2025-12-04T11:07:13.0377593Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0378560Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0379383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0380129Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0380951Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0381816Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0382272Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0382513Z 2025-12-04T11:07:13.0382616Z Expected 3 but got 6. 2025-12-04T11:07:13.0382880Z Absolute difference: 3 2025-12-04T11:07:13.0383153Z Relative difference: 1.0 2025-12-04T11:07:13.0383338Z 2025-12-04T11:07:13.0383545Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0384820Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0385891Z 2025-12-04T11:07:13.0386219Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0386832Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0387325Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0388290Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0389173Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0389628Z graph_break [] 2025-12-04T11:07:13.0389986Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0391063Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0392027Z warnings.warn( 2025-12-04T11:07:13.0392899Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0393849Z warnings.warn( 2025-12-04T11:07:13.0394533Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0395320Z Traceback (most recent call last): 2025-12-04T11:07:13.0396084Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0396980Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0397786Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0398536Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0399357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0400301Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0400759Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0401008Z 2025-12-04T11:07:13.0401118Z Expected 3 but got 6. 2025-12-04T11:07:13.0401391Z Absolute difference: 3 2025-12-04T11:07:13.0401674Z Relative difference: 1.0 2025-12-04T11:07:13.0401862Z 2025-12-04T11:07:13.0402070Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0403359Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0404423Z 2025-12-04T11:07:13.0404688Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0405297Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0405763Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0406511Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0407392Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0407842Z graph_break [] 2025-12-04T11:07:13.0408198Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0409285Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0410237Z warnings.warn( 2025-12-04T11:07:13.0411118Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0412075Z warnings.warn( 2025-12-04T11:07:13.0412445Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0412995Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0413429Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0414322Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0415120Z graph_break [] 2025-12-04T11:07:13.0415477Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0416572Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0417544Z warnings.warn( 2025-12-04T11:07:13.0418494Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0419451Z warnings.warn( 2025-12-04T11:07:13.0419762Z =================================== FAILURES =================================== 2025-12-04T11:07:13.0420571Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0421359Z Traceback (most recent call last): 2025-12-04T11:07:13.0422132Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0423033Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0423833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0424584Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0425409Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0426330Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0426785Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0427045Z 2025-12-04T11:07:13.0427145Z Expected 3 but got 6. 2025-12-04T11:07:13.0427426Z Absolute difference: 3 2025-12-04T11:07:13.0427705Z Relative difference: 1.0 2025-12-04T11:07:13.0427899Z 2025-12-04T11:07:13.0428111Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0429407Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0430472Z 2025-12-04T11:07:13.0430744Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0431354Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0431822Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0432565Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0433450Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0433904Z graph_break [] 2025-12-04T11:07:13.0434259Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0435347Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0436303Z warnings.warn( 2025-12-04T11:07:13.0437179Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0438139Z warnings.warn( 2025-12-04T11:07:13.0438512Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0438968Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0439460Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0440358Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0441143Z graph_break [] 2025-12-04T11:07:13.0441496Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0442575Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0443528Z warnings.warn( 2025-12-04T11:07:13.0444396Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0445351Z warnings.warn( 2025-12-04T11:07:13.0445717Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0446192Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0446614Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0447504Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0448270Z graph_break [] 2025-12-04T11:07:13.0448633Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0449724Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0450690Z warnings.warn( 2025-12-04T11:07:13.0451582Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0452568Z warnings.warn( 2025-12-04T11:07:13.0453579Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c3b4fd201ecf8b1e.xml - 2025-12-04T11:07:13.0454736Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0456026Z FAILED [0.7630s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0457126Z 2025-12-04T11:07:13.0457232Z Expected 3 but got 6. 2025-12-04T11:07:13.0457523Z Absolute difference: 3 2025-12-04T11:07:13.0457821Z Relative difference: 1.0 2025-12-04T11:07:13.0458009Z 2025-12-04T11:07:13.0458239Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0459596Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0460683Z 2025-12-04T11:07:13.0460950Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0461543Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0462067Z =================== 1 failed, 1 deselected, 2 rerun in 5.72s =================== 2025-12-04T11:07:13.0462498Z Got exit code 1 2025-12-04T11:07:13.0462764Z Retrying single test... 2025-12-04T11:07:13.0463400Z W1204 10:53:13.376000 83524 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0464637Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7b498694d0586007.xml 2025-12-04T11:07:13.0465595Z ============================= test session starts ============================== 2025-12-04T11:07:13.0466347Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0466951Z cachedir: .pytest_cache 2025-12-04T11:07:13.0467649Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0468467Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0468820Z configfile: pytest.ini 2025-12-04T11:07:13.0469538Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0470434Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.0471811Z stepcurrent: skipping 1 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0473090Z Running 1 items in this shard 2025-12-04T11:07:13.0473301Z 2025-12-04T11:07:13.0474629Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:53:16.065360957 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0476084Z 2025-12-04T11:07:13.0476605Z [W1204 10:53:32.742441346 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0477277Z 2025-12-04T11:07:13.0477794Z [W1204 10:53:32.742699829 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0478458Z 2025-12-04T11:07:13.0478973Z [W1204 10:53:32.743327283 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0479656Z 2025-12-04T11:07:13.0480184Z [W1204 10:53:32.743549596 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0480832Z 2025-12-04T11:07:13.0481355Z [W1204 10:53:32.745342046 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0482003Z 2025-12-04T11:07:13.0482517Z [W1204 10:53:32.745523732 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0483178Z 2025-12-04T11:07:13.0483690Z [W1204 10:53:32.745835421 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0484348Z 2025-12-04T11:07:13.0484860Z [W1204 10:53:32.746010218 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0485512Z 2025-12-04T11:07:13.0486042Z [W1204 10:53:32.756530169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0486692Z 2025-12-04T11:07:13.0487211Z [W1204 10:53:32.756765931 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0488137Z 2025-12-04T11:07:13.0488659Z [W1204 10:53:32.756955465 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0489320Z 2025-12-04T11:07:13.0489831Z [W1204 10:53:32.757230443 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0490493Z 2025-12-04T11:07:13.0491005Z [W1204 10:53:32.757401996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0491656Z 2025-12-04T11:07:13.0492172Z [W1204 10:53:32.757692153 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0492934Z 2025-12-04T11:07:13.0493459Z [W1204 10:53:32.757864733 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0494154Z 2025-12-04T11:07:13.0494667Z [W1204 10:53:32.758139922 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0495333Z 2025-12-04T11:07:13.0495845Z [W1204 10:53:32.758316265 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0496511Z 2025-12-04T11:07:13.0497022Z [W1204 10:53:32.876401731 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0497685Z 2025-12-04T11:07:13.0498199Z [W1204 10:53:32.876681833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0498914Z 2025-12-04T11:07:13.0499445Z [W1204 10:53:32.876873278 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0500096Z 2025-12-04T11:07:13.0500605Z [W1204 10:53:32.877147595 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0501271Z 2025-12-04T11:07:13.0501780Z [W1204 10:53:32.877316767 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0502447Z 2025-12-04T11:07:13.0502959Z [W1204 10:53:32.877605165 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0503627Z 2025-12-04T11:07:13.0504139Z [W1204 10:53:32.877775864 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0504835Z 2025-12-04T11:07:13.0505363Z [W1204 10:53:32.878050676 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0506014Z 2025-12-04T11:07:13.0506542Z [W1204 10:53:32.878220177 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0507192Z 2025-12-04T11:07:13.0507709Z [W1204 10:53:34.946121222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0508372Z 2025-12-04T11:07:13.0508884Z [W1204 10:53:34.947318896 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0509546Z 2025-12-04T11:07:13.0510061Z [W1204 10:53:34.947518801 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0510711Z 2025-12-04T11:07:13.0511240Z [W1204 10:53:34.947806710 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0511891Z 2025-12-04T11:07:13.0512419Z [W1204 10:53:34.947989370 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0513066Z 2025-12-04T11:07:13.0513577Z [W1204 10:53:34.948285538 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0514238Z 2025-12-04T11:07:13.0514748Z [W1204 10:53:34.948465911 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0515412Z 2025-12-04T11:07:13.0515924Z [W1204 10:53:34.948753399 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0516575Z 2025-12-04T11:07:13.0517157Z [W1204 10:53:34.948930697 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0517812Z 2025-12-04T11:07:13.0518337Z [W1204 10:53:34.957008483 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0519014Z 2025-12-04T11:07:13.0519525Z [W1204 10:53:34.957243326 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0520183Z 2025-12-04T11:07:13.0520692Z [W1204 10:53:34.957436663 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0521352Z 2025-12-04T11:07:13.0521864Z [W1204 10:53:34.957710731 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0522512Z 2025-12-04T11:07:13.0523044Z [W1204 10:53:34.957887956 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0523697Z 2025-12-04T11:07:13.0524217Z [W1204 10:53:34.958178921 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0524868Z 2025-12-04T11:07:13.0525380Z [W1204 10:53:34.958357589 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0526040Z 2025-12-04T11:07:13.0526556Z [W1204 10:53:34.958639756 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0527217Z 2025-12-04T11:07:13.0527728Z [W1204 10:53:34.958817069 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0528382Z 2025-12-04T11:07:13.0528905Z [W1204 10:53:34.072149579 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0529587Z 2025-12-04T11:07:13.0530104Z [W1204 10:53:34.072396113 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0530753Z 2025-12-04T11:07:13.0531263Z [W1204 10:53:34.072589755 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0531921Z 2025-12-04T11:07:13.0532431Z [W1204 10:53:34.072865682 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0533091Z 2025-12-04T11:07:13.0533600Z [W1204 10:53:34.073042924 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0534266Z 2025-12-04T11:07:13.0534774Z [W1204 10:53:34.073333655 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0535432Z 2025-12-04T11:07:13.0535961Z [W1204 10:53:34.073507181 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0536609Z 2025-12-04T11:07:13.0537131Z [W1204 10:53:34.073787424 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0537779Z 2025-12-04T11:07:13.0538348Z [W1204 10:53:34.073961395 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0539057Z 2025-12-04T11:07:13.0539189Z ('RERUN', {'yellow': True}) [19.8538s] [100%] 2025-12-04T11:07:13.0540782Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:53:35.488157768 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0542294Z 2025-12-04T11:07:13.0542822Z [W1204 10:53:35.488430473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0543506Z 2025-12-04T11:07:13.0544030Z [W1204 10:53:35.488624081 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0544678Z 2025-12-04T11:07:13.0545189Z [W1204 10:53:35.488901140 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0545853Z 2025-12-04T11:07:13.0546364Z [W1204 10:53:35.489078445 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0547022Z 2025-12-04T11:07:13.0547536Z [W1204 10:53:35.489369854 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0548188Z 2025-12-04T11:07:13.0548716Z [W1204 10:53:35.489541685 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0549366Z 2025-12-04T11:07:13.0549886Z [W1204 10:53:35.489822855 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0550539Z 2025-12-04T11:07:13.0551050Z [W1204 10:53:35.489994245 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0551708Z 2025-12-04T11:07:13.0552219Z [W1204 10:53:35.498273062 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0552877Z 2025-12-04T11:07:13.0553385Z [W1204 10:53:35.498499069 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0554067Z 2025-12-04T11:07:13.0554594Z [W1204 10:53:35.498687172 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0555244Z 2025-12-04T11:07:13.0555765Z [W1204 10:53:35.498956030 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0556411Z 2025-12-04T11:07:13.0556920Z [W1204 10:53:35.499140781 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0557579Z 2025-12-04T11:07:13.0558094Z [W1204 10:53:35.499433632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0558756Z 2025-12-04T11:07:13.0559264Z [W1204 10:53:35.499606076 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0559917Z 2025-12-04T11:07:13.0560444Z [W1204 10:53:35.499882855 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0561090Z 2025-12-04T11:07:13.0561611Z [W1204 10:53:35.500074116 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0562263Z 2025-12-04T11:07:13.0562772Z [W1204 10:53:35.614125917 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0563436Z 2025-12-04T11:07:13.0563947Z [W1204 10:53:35.614363032 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0564607Z 2025-12-04T11:07:13.0565120Z [W1204 10:53:35.614553361 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0565779Z 2025-12-04T11:07:13.0566377Z [W1204 10:53:35.614826754 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0567026Z 2025-12-04T11:07:13.0567549Z [W1204 10:53:35.614998118 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0568230Z 2025-12-04T11:07:13.0568740Z [W1204 10:53:35.615305701 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0569401Z 2025-12-04T11:07:13.0569912Z [W1204 10:53:35.615481461 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0570570Z 2025-12-04T11:07:13.0571075Z [W1204 10:53:35.615759300 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0571740Z 2025-12-04T11:07:13.0572257Z [W1204 10:53:35.615928983 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0572907Z 2025-12-04T11:07:13.0573432Z [W1204 10:53:35.775415142 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0574082Z 2025-12-04T11:07:13.0574602Z [W1204 10:53:35.775658205 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0575251Z 2025-12-04T11:07:13.0575762Z [W1204 10:53:35.775851974 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0576425Z 2025-12-04T11:07:13.0576933Z [W1204 10:53:35.776128717 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0577595Z 2025-12-04T11:07:13.0578143Z [W1204 10:53:35.776305265 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0578849Z 2025-12-04T11:07:13.0579376Z [W1204 10:53:35.776598192 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0580026Z 2025-12-04T11:07:13.0580552Z [W1204 10:53:35.776775544 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0581203Z 2025-12-04T11:07:13.0581714Z [W1204 10:53:35.777058960 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0582375Z 2025-12-04T11:07:13.0582883Z [W1204 10:53:35.777234103 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0583548Z 2025-12-04T11:07:13.0584058Z [W1204 10:53:35.785327408 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0584711Z 2025-12-04T11:07:13.0585236Z [W1204 10:53:35.785559156 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0585886Z 2025-12-04T11:07:13.0586410Z [W1204 10:53:35.785752869 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0587057Z 2025-12-04T11:07:13.0587566Z [W1204 10:53:35.786029119 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0588517Z 2025-12-04T11:07:13.0589148Z [W1204 10:53:35.786205225 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0589818Z 2025-12-04T11:07:13.0590330Z [W1204 10:53:35.786497318 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0590983Z 2025-12-04T11:07:13.0591621Z [W1204 10:53:35.786675795 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0592309Z 2025-12-04T11:07:13.0592833Z [W1204 10:53:35.786958279 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0593485Z 2025-12-04T11:07:13.0593995Z [W1204 10:53:35.787146892 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0594657Z 2025-12-04T11:07:13.0595168Z [W1204 10:53:35.900450615 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0595827Z 2025-12-04T11:07:13.0596337Z [W1204 10:53:35.900685607 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0596991Z 2025-12-04T11:07:13.0597520Z [W1204 10:53:35.900880456 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0598170Z 2025-12-04T11:07:13.0598690Z [W1204 10:53:35.901156408 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0599338Z 2025-12-04T11:07:13.0599849Z [W1204 10:53:35.901340222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0600508Z 2025-12-04T11:07:13.0601022Z [W1204 10:53:35.901633999 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0601681Z 2025-12-04T11:07:13.0602194Z [W1204 10:53:35.901812265 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0602902Z 2025-12-04T11:07:13.0603417Z [W1204 10:53:35.902093308 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0604071Z 2025-12-04T11:07:13.0604591Z [W1204 10:53:35.902270792 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0605246Z 2025-12-04T11:07:13.0605382Z ('RERUN', {'yellow': True}) [0.7895s] [100%] 2025-12-04T11:07:13.0606953Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:53:36.262505588 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0608412Z 2025-12-04T11:07:13.0608928Z [W1204 10:53:36.262778386 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0609591Z 2025-12-04T11:07:13.0610106Z [W1204 10:53:36.262969777 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0610755Z 2025-12-04T11:07:13.0611275Z [W1204 10:53:36.263267458 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0611928Z 2025-12-04T11:07:13.0612446Z [W1204 10:53:36.263448210 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0613097Z 2025-12-04T11:07:13.0613604Z [W1204 10:53:36.263740845 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0614263Z 2025-12-04T11:07:13.0614774Z [W1204 10:53:36.263914833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0615436Z 2025-12-04T11:07:13.0616009Z [W1204 10:53:36.264193350 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0616656Z 2025-12-04T11:07:13.0617177Z [W1204 10:53:36.264365052 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0617858Z 2025-12-04T11:07:13.0618467Z [W1204 10:53:36.272381280 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0619119Z 2025-12-04T11:07:13.0619627Z [W1204 10:53:36.272613778 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0620284Z 2025-12-04T11:07:13.0620798Z [W1204 10:53:36.272802295 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0621461Z 2025-12-04T11:07:13.0621979Z [W1204 10:53:36.273083369 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0622625Z 2025-12-04T11:07:13.0623146Z [W1204 10:53:36.273258890 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0623796Z 2025-12-04T11:07:13.0624317Z [W1204 10:53:36.273565157 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0624967Z 2025-12-04T11:07:13.0625474Z [W1204 10:53:36.273742187 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0626137Z 2025-12-04T11:07:13.0626648Z [W1204 10:53:36.274037479 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0627307Z 2025-12-04T11:07:13.0627861Z [W1204 10:53:36.274211737 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0628515Z 2025-12-04T11:07:13.0629029Z [W1204 10:53:36.388323554 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0629680Z 2025-12-04T11:07:13.0630200Z [W1204 10:53:36.388569936 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0630849Z 2025-12-04T11:07:13.0631358Z [W1204 10:53:36.388759331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0632023Z 2025-12-04T11:07:13.0632532Z [W1204 10:53:36.389036572 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0633192Z 2025-12-04T11:07:13.0633706Z [W1204 10:53:36.389220582 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0634365Z 2025-12-04T11:07:13.0634877Z [W1204 10:53:36.389511943 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0635524Z 2025-12-04T11:07:13.0636050Z [W1204 10:53:36.389682340 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0636055Z 2025-12-04T11:07:13.0636562Z [W1204 10:53:36.389957535 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0636567Z 2025-12-04T11:07:13.0637081Z [W1204 10:53:36.390148464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0637086Z 2025-12-04T11:07:13.0637595Z [W1204 10:53:36.550208332 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0637658Z 2025-12-04T11:07:13.0638176Z [W1204 10:53:36.550470545 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0638211Z 2025-12-04T11:07:13.0638721Z [W1204 10:53:36.550666500 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0638726Z 2025-12-04T11:07:13.0639233Z [W1204 10:53:36.550949761 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0639246Z 2025-12-04T11:07:13.0639751Z [W1204 10:53:36.551143835 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0639756Z 2025-12-04T11:07:13.0640260Z [W1204 10:53:36.551441717 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0640268Z 2025-12-04T11:07:13.0640783Z [W1204 10:53:36.551620257 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0640788Z 2025-12-04T11:07:13.0641299Z [W1204 10:53:36.551900747 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0641303Z 2025-12-04T11:07:13.0641815Z [W1204 10:53:36.552075931 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0641820Z 2025-12-04T11:07:13.0642328Z [W1204 10:53:36.560081765 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0642333Z 2025-12-04T11:07:13.0642847Z [W1204 10:53:36.560314489 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0642898Z 2025-12-04T11:07:13.0643411Z [W1204 10:53:36.560507821 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0643416Z 2025-12-04T11:07:13.0643937Z [W1204 10:53:36.560781957 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0643944Z 2025-12-04T11:07:13.0644450Z [W1204 10:53:36.560958819 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0644455Z 2025-12-04T11:07:13.0644960Z [W1204 10:53:36.561248964 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0644965Z 2025-12-04T11:07:13.0645483Z [W1204 10:53:36.561425305 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0645490Z 2025-12-04T11:07:13.0646001Z [W1204 10:53:36.561705599 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0646006Z 2025-12-04T11:07:13.0646528Z [W1204 10:53:36.561881305 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0646535Z 2025-12-04T11:07:13.0647042Z [W1204 10:53:36.674590934 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0647047Z 2025-12-04T11:07:13.0647566Z [W1204 10:53:36.674826493 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0647571Z 2025-12-04T11:07:13.0648074Z [W1204 10:53:36.675018628 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0648081Z 2025-12-04T11:07:13.0648655Z [W1204 10:53:36.675314894 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0648660Z 2025-12-04T11:07:13.0649170Z [W1204 10:53:36.675491382 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0649203Z 2025-12-04T11:07:13.0649709Z [W1204 10:53:36.675784826 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0649722Z 2025-12-04T11:07:13.0650229Z [W1204 10:53:36.675966997 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0650234Z 2025-12-04T11:07:13.0650739Z [W1204 10:53:36.676252084 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0650744Z 2025-12-04T11:07:13.0651267Z [W1204 10:53:36.676425917 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0651272Z 2025-12-04T11:07:13.0651376Z FAILED [0.7726s] [100%] 2025-12-04T11:07:13.0651381Z 2025-12-04T11:07:13.0651529Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.0652076Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0652197Z Traceback (most recent call last): 2025-12-04T11:07:13.0652754Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0652990Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0653467Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0653630Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0654204Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0654423Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0654555Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0654561Z 2025-12-04T11:07:13.0654673Z Expected 3 but got 6. 2025-12-04T11:07:13.0654782Z Absolute difference: 3 2025-12-04T11:07:13.0654888Z Relative difference: 1.0 2025-12-04T11:07:13.0654893Z 2025-12-04T11:07:13.0655117Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0656062Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0656067Z 2025-12-04T11:07:13.0656334Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0656574Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0656689Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0657232Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0657460Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0657554Z graph_break [] 2025-12-04T11:07:13.0657783Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0659077Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0659213Z if out == self.unknown_value: 2025-12-04T11:07:13.0660012Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0660118Z warnings.warn( 2025-12-04T11:07:13.0660858Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0660991Z warnings.warn( 2025-12-04T11:07:13.0661543Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0661662Z Traceback (most recent call last): 2025-12-04T11:07:13.0662204Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0662444Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0662898Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0663070Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0663618Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0663826Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0663972Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0663978Z 2025-12-04T11:07:13.0664082Z Expected 3 but got 6. 2025-12-04T11:07:13.0664187Z Absolute difference: 3 2025-12-04T11:07:13.0664303Z Relative difference: 1.0 2025-12-04T11:07:13.0664308Z 2025-12-04T11:07:13.0664522Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0665471Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0665521Z 2025-12-04T11:07:13.0665788Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0666011Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0666133Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0666672Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0666899Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0667003Z graph_break [] 2025-12-04T11:07:13.0667220Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0668443Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0668561Z if out == self.unknown_value: 2025-12-04T11:07:13.0669301Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0669409Z warnings.warn( 2025-12-04T11:07:13.0670133Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0670249Z warnings.warn( 2025-12-04T11:07:13.0670461Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0670577Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0670815Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0671348Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0671446Z graph_break [] 2025-12-04T11:07:13.0671688Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0672482Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0672623Z warnings.warn( 2025-12-04T11:07:13.0673346Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0673444Z warnings.warn( 2025-12-04T11:07:13.0673602Z =================================== FAILURES =================================== 2025-12-04T11:07:13.0674140Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0674276Z Traceback (most recent call last): 2025-12-04T11:07:13.0674819Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0675051Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0675524Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0675687Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0676226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0676440Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0676570Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0676575Z 2025-12-04T11:07:13.0676690Z Expected 3 but got 6. 2025-12-04T11:07:13.0676796Z Absolute difference: 3 2025-12-04T11:07:13.0676903Z Relative difference: 1.0 2025-12-04T11:07:13.0676908Z 2025-12-04T11:07:13.0677132Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0678114Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0678120Z 2025-12-04T11:07:13.0678395Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0678614Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0678726Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0679271Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0679497Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0679594Z graph_break [] 2025-12-04T11:07:13.0679821Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0681037Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0681161Z if out == self.unknown_value: 2025-12-04T11:07:13.0681897Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0682000Z warnings.warn( 2025-12-04T11:07:13.0682742Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0682845Z warnings.warn( 2025-12-04T11:07:13.0683077Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0683194Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0683423Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0684032Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0684133Z graph_break [] 2025-12-04T11:07:13.0684350Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0685133Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0685235Z warnings.warn( 2025-12-04T11:07:13.0685970Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0686072Z warnings.warn( 2025-12-04T11:07:13.0686290Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0686422Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0686656Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0687209Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0687310Z graph_break [] 2025-12-04T11:07:13.0687529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0688591Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0688699Z warnings.warn( 2025-12-04T11:07:13.0689426Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0689543Z warnings.warn( 2025-12-04T11:07:13.0690387Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7b498694d0586007.xml - 2025-12-04T11:07:13.0690651Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0691631Z FAILED [0.7726s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0691640Z 2025-12-04T11:07:13.0691749Z Expected 3 but got 6. 2025-12-04T11:07:13.0691875Z Absolute difference: 3 2025-12-04T11:07:13.0691988Z Relative difference: 1.0 2025-12-04T11:07:13.0691993Z 2025-12-04T11:07:13.0692230Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0693176Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0693184Z 2025-12-04T11:07:13.0693458Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0693656Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0693857Z ================== 1 failed, 10 deselected, 2 rerun in 21.45s ================== 2025-12-04T11:07:13.0693977Z Got exit code 1 2025-12-04T11:07:13.0694087Z Retrying single test... 2025-12-04T11:07:13.0694541Z W1204 10:53:47.224000 83699 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0695226Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-f04dfce55afddfa2.xml 2025-12-04T11:07:13.0695393Z ============================= test session starts ============================== 2025-12-04T11:07:13.0695761Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0695875Z cachedir: .pytest_cache 2025-12-04T11:07:13.0696484Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0696627Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0696734Z configfile: pytest.ini 2025-12-04T11:07:13.0697332Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0697566Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.0698662Z stepcurrent: skipping 1 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0698797Z Running 1 items in this shard 2025-12-04T11:07:13.0698802Z 2025-12-04T11:07:13.0700128Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:53:50.952971405 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0700137Z 2025-12-04T11:07:13.0700675Z [W1204 10:54:06.821224169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0700682Z 2025-12-04T11:07:13.0701202Z [W1204 10:54:06.821502951 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0701207Z 2025-12-04T11:07:13.0701724Z [W1204 10:54:06.822114036 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0701741Z 2025-12-04T11:07:13.0702253Z [W1204 10:54:06.822325103 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0702298Z 2025-12-04T11:07:13.0702813Z [W1204 10:54:06.824151494 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0702819Z 2025-12-04T11:07:13.0703344Z [W1204 10:54:06.824336957 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0703352Z 2025-12-04T11:07:13.0703863Z [W1204 10:54:06.824653513 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0703868Z 2025-12-04T11:07:13.0704397Z [W1204 10:54:06.824828795 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0704402Z 2025-12-04T11:07:13.0704914Z [W1204 10:54:06.835338201 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0704921Z 2025-12-04T11:07:13.0705448Z [W1204 10:54:06.835564904 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0705453Z 2025-12-04T11:07:13.0705966Z [W1204 10:54:06.835750931 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0705973Z 2025-12-04T11:07:13.0706496Z [W1204 10:54:06.836029323 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0706500Z 2025-12-04T11:07:13.0707012Z [W1204 10:54:06.836201378 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0707017Z 2025-12-04T11:07:13.0707527Z [W1204 10:54:06.836494314 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0707544Z 2025-12-04T11:07:13.0708120Z [W1204 10:54:06.836665363 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0708125Z 2025-12-04T11:07:13.0708636Z [W1204 10:54:06.836944542 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0708671Z 2025-12-04T11:07:13.0709194Z [W1204 10:54:06.837113925 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0709199Z 2025-12-04T11:07:13.0709710Z [W1204 10:54:06.956853065 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0709714Z 2025-12-04T11:07:13.0710239Z [W1204 10:54:06.957159459 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0710244Z 2025-12-04T11:07:13.0710757Z [W1204 10:54:06.957349966 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0710768Z 2025-12-04T11:07:13.0711290Z [W1204 10:54:06.957639248 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0711297Z 2025-12-04T11:07:13.0711808Z [W1204 10:54:06.957810125 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0711813Z 2025-12-04T11:07:13.0712326Z [W1204 10:54:06.958099828 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0712343Z 2025-12-04T11:07:13.0712857Z [W1204 10:54:06.958269261 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0712862Z 2025-12-04T11:07:13.0713373Z [W1204 10:54:06.958542855 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0713411Z 2025-12-04T11:07:13.0713941Z [W1204 10:54:06.958710117 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0713948Z 2025-12-04T11:07:13.0714464Z [W1204 10:54:08.031146331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0714469Z 2025-12-04T11:07:13.0714993Z [W1204 10:54:08.032367350 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0714998Z 2025-12-04T11:07:13.0715511Z [W1204 10:54:08.032577222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0715516Z 2025-12-04T11:07:13.0716044Z [W1204 10:54:08.032866148 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0716051Z 2025-12-04T11:07:13.0716568Z [W1204 10:54:08.033046682 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0716573Z 2025-12-04T11:07:13.0717101Z [W1204 10:54:08.033343811 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0717105Z 2025-12-04T11:07:13.0717618Z [W1204 10:54:08.033526811 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0717623Z 2025-12-04T11:07:13.0718133Z [W1204 10:54:08.033809757 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0718150Z 2025-12-04T11:07:13.0718664Z [W1204 10:54:08.033985066 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0718670Z 2025-12-04T11:07:13.0719240Z [W1204 10:54:08.042208626 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0719245Z 2025-12-04T11:07:13.0719772Z [W1204 10:54:08.042450342 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0719805Z 2025-12-04T11:07:13.0720317Z [W1204 10:54:08.042645668 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0720322Z 2025-12-04T11:07:13.0720847Z [W1204 10:54:08.042924239 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0720851Z 2025-12-04T11:07:13.0721365Z [W1204 10:54:08.043112874 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0721372Z 2025-12-04T11:07:13.0721901Z [W1204 10:54:08.043413841 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0721906Z 2025-12-04T11:07:13.0722417Z [W1204 10:54:08.043592695 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0722425Z 2025-12-04T11:07:13.0722947Z [W1204 10:54:08.043875303 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0722952Z 2025-12-04T11:07:13.0723464Z [W1204 10:54:08.044052127 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0723468Z 2025-12-04T11:07:13.0723979Z [W1204 10:54:09.158270267 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0724027Z 2025-12-04T11:07:13.0724539Z [W1204 10:54:09.158514257 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0724544Z 2025-12-04T11:07:13.0725055Z [W1204 10:54:09.158706910 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0725062Z 2025-12-04T11:07:13.0725586Z [W1204 10:54:09.158988145 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0725590Z 2025-12-04T11:07:13.0726102Z [W1204 10:54:09.159178861 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0726106Z 2025-12-04T11:07:13.0726630Z [W1204 10:54:09.159477682 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0726634Z 2025-12-04T11:07:13.0727157Z [W1204 10:54:09.159654010 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0727162Z 2025-12-04T11:07:13.0727689Z [W1204 10:54:09.159932190 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0727697Z 2025-12-04T11:07:13.0728208Z [W1204 10:54:09.160131393 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0728212Z 2025-12-04T11:07:13.0728348Z ('RERUN', {'yellow': True}) [20.0808s] [100%] 2025-12-04T11:07:13.0729680Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:54:09.576216536 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0729685Z 2025-12-04T11:07:13.0730276Z [W1204 10:54:09.576491614 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0730281Z 2025-12-04T11:07:13.0730811Z [W1204 10:54:09.576685433 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0730844Z 2025-12-04T11:07:13.0731361Z [W1204 10:54:09.576972585 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0731365Z 2025-12-04T11:07:13.0731890Z [W1204 10:54:09.577149557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0731895Z 2025-12-04T11:07:13.0732410Z [W1204 10:54:09.577439961 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0732415Z 2025-12-04T11:07:13.0732941Z [W1204 10:54:09.577616129 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0732948Z 2025-12-04T11:07:13.0733462Z [W1204 10:54:09.577894024 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0733469Z 2025-12-04T11:07:13.0733992Z [W1204 10:54:09.578065920 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0733997Z 2025-12-04T11:07:13.0734506Z [W1204 10:54:09.586313222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0734511Z 2025-12-04T11:07:13.0735020Z [W1204 10:54:09.586548251 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0735038Z 2025-12-04T11:07:13.0735550Z [W1204 10:54:09.586737312 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0735588Z 2025-12-04T11:07:13.0736100Z [W1204 10:54:09.587010324 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0736107Z 2025-12-04T11:07:13.0736637Z [W1204 10:54:09.587195837 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0736641Z 2025-12-04T11:07:13.0737154Z [W1204 10:54:09.587491311 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0737159Z 2025-12-04T11:07:13.0737682Z [W1204 10:54:09.587665220 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0737687Z 2025-12-04T11:07:13.0738200Z [W1204 10:54:09.587948204 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0738207Z 2025-12-04T11:07:13.0738812Z [W1204 10:54:09.588119419 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0738817Z 2025-12-04T11:07:13.0739333Z [W1204 10:54:09.702745518 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0739338Z 2025-12-04T11:07:13.0739865Z [W1204 10:54:09.702980001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0739869Z 2025-12-04T11:07:13.0740380Z [W1204 10:54:09.703184644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0740384Z 2025-12-04T11:07:13.0740896Z [W1204 10:54:09.703464499 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0740917Z 2025-12-04T11:07:13.0741492Z [W1204 10:54:09.703635435 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0741497Z 2025-12-04T11:07:13.0742013Z [W1204 10:54:09.703924365 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0742046Z 2025-12-04T11:07:13.0742571Z [W1204 10:54:09.704095823 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0742576Z 2025-12-04T11:07:13.0743085Z [W1204 10:54:09.704370172 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0743090Z 2025-12-04T11:07:13.0743620Z [W1204 10:54:09.704539806 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0743626Z 2025-12-04T11:07:13.0744140Z [W1204 10:54:09.864638368 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0744145Z 2025-12-04T11:07:13.0744668Z [W1204 10:54:09.864885582 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0744674Z 2025-12-04T11:07:13.0745185Z [W1204 10:54:09.865079828 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0745190Z 2025-12-04T11:07:13.0745701Z [W1204 10:54:09.865361434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0745721Z 2025-12-04T11:07:13.0746231Z [W1204 10:54:09.865540331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0746265Z 2025-12-04T11:07:13.0746784Z [W1204 10:54:09.865846915 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0746789Z 2025-12-04T11:07:13.0747314Z [W1204 10:54:09.866026211 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0747321Z 2025-12-04T11:07:13.0747834Z [W1204 10:54:09.866310486 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0747839Z 2025-12-04T11:07:13.0748365Z [W1204 10:54:09.866486910 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0748370Z 2025-12-04T11:07:13.0748881Z [W1204 10:54:09.874531363 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0748885Z 2025-12-04T11:07:13.0749412Z [W1204 10:54:09.874766600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0749416Z 2025-12-04T11:07:13.0749929Z [W1204 10:54:09.874959934 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0749936Z 2025-12-04T11:07:13.0750459Z [W1204 10:54:09.875254151 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0750464Z 2025-12-04T11:07:13.0750973Z [W1204 10:54:09.875432001 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0750977Z 2025-12-04T11:07:13.0751488Z [W1204 10:54:09.875726376 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0751504Z 2025-12-04T11:07:13.0752074Z [W1204 10:54:09.875904412 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0752081Z 2025-12-04T11:07:13.0752590Z [W1204 10:54:09.876188643 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0752624Z 2025-12-04T11:07:13.0753148Z [W1204 10:54:09.876365893 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0753153Z 2025-12-04T11:07:13.0753667Z [W1204 10:54:09.990684501 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0753672Z 2025-12-04T11:07:13.0754194Z [W1204 10:54:09.990922137 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0754198Z 2025-12-04T11:07:13.0754711Z [W1204 10:54:09.991127789 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0754722Z 2025-12-04T11:07:13.0755247Z [W1204 10:54:09.991409695 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0755253Z 2025-12-04T11:07:13.0755765Z [W1204 10:54:09.991583544 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0755770Z 2025-12-04T11:07:13.0756292Z [W1204 10:54:09.991875696 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0756296Z 2025-12-04T11:07:13.0756809Z [W1204 10:54:09.992049694 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0756813Z 2025-12-04T11:07:13.0757323Z [W1204 10:54:09.992331195 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0757370Z 2025-12-04T11:07:13.0757886Z [W1204 10:54:09.992505381 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0757892Z 2025-12-04T11:07:13.0758027Z ('RERUN', {'yellow': True}) [0.7933s] [100%] 2025-12-04T11:07:13.0759350Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 [W1204 10:54:10.347392284 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0759355Z 2025-12-04T11:07:13.0759867Z [W1204 10:54:10.347656414 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0759871Z 2025-12-04T11:07:13.0760396Z [W1204 10:54:10.347846134 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0760403Z 2025-12-04T11:07:13.0760916Z [W1204 10:54:10.348122578 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0760923Z 2025-12-04T11:07:13.0761453Z [W1204 10:54:10.348298185 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0761458Z 2025-12-04T11:07:13.0761971Z [W1204 10:54:10.348589210 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0761975Z 2025-12-04T11:07:13.0762488Z [W1204 10:54:10.348770224 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0762507Z 2025-12-04T11:07:13.0763018Z [W1204 10:54:10.349052947 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0763025Z 2025-12-04T11:07:13.0763600Z [W1204 10:54:10.349224789 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0763605Z 2025-12-04T11:07:13.0764158Z [W1204 10:54:10.357612321 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0764163Z 2025-12-04T11:07:13.0764678Z [W1204 10:54:10.357850081 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0764683Z 2025-12-04T11:07:13.0765209Z [W1204 10:54:10.358039296 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0765213Z 2025-12-04T11:07:13.0765723Z [W1204 10:54:10.358310223 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0765730Z 2025-12-04T11:07:13.0766263Z [W1204 10:54:10.358482009 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0766268Z 2025-12-04T11:07:13.0766780Z [W1204 10:54:10.358769635 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0766787Z 2025-12-04T11:07:13.0767313Z [W1204 10:54:10.358941680 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0767318Z 2025-12-04T11:07:13.0767827Z [W1204 10:54:10.359240603 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0767831Z 2025-12-04T11:07:13.0768342Z [W1204 10:54:10.359416557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0768394Z 2025-12-04T11:07:13.0768913Z [W1204 10:54:10.473684229 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0768917Z 2025-12-04T11:07:13.0769427Z [W1204 10:54:10.473918420 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0769434Z 2025-12-04T11:07:13.0769957Z [W1204 10:54:10.474107968 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0769962Z 2025-12-04T11:07:13.0770475Z [W1204 10:54:10.474382558 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0770479Z 2025-12-04T11:07:13.0771001Z [W1204 10:54:10.474552003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0771005Z 2025-12-04T11:07:13.0771523Z [W1204 10:54:10.474839071 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0771528Z 2025-12-04T11:07:13.0772056Z [W1204 10:54:10.475009816 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0772062Z 2025-12-04T11:07:13.0772574Z [W1204 10:54:10.475306473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0772579Z 2025-12-04T11:07:13.0773105Z [W1204 10:54:10.475479007 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0773109Z 2025-12-04T11:07:13.0773623Z [W1204 10:54:10.635337364 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0773628Z 2025-12-04T11:07:13.0774215Z [W1204 10:54:10.635584954 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0774235Z 2025-12-04T11:07:13.0774752Z [W1204 10:54:10.635779468 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0774786Z 2025-12-04T11:07:13.0775298Z [W1204 10:54:10.636058019 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0775302Z 2025-12-04T11:07:13.0775827Z [W1204 10:54:10.636236519 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0775832Z 2025-12-04T11:07:13.0776340Z [W1204 10:54:10.636528845 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0776344Z 2025-12-04T11:07:13.0776868Z [W1204 10:54:10.636706434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0776879Z 2025-12-04T11:07:13.0777393Z [W1204 10:54:10.636989185 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0777400Z 2025-12-04T11:07:13.0777923Z [W1204 10:54:10.637173798 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0777928Z 2025-12-04T11:07:13.0778522Z [W1204 10:54:10.645149169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0778527Z 2025-12-04T11:07:13.0779037Z [W1204 10:54:10.645381970 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0779052Z 2025-12-04T11:07:13.0779562Z [W1204 10:54:10.645574899 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0779602Z 2025-12-04T11:07:13.0780119Z [W1204 10:54:10.645850739 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0780126Z 2025-12-04T11:07:13.0780647Z [W1204 10:54:10.646025622 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0780651Z 2025-12-04T11:07:13.0781164Z [W1204 10:54:10.646314638 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0781169Z 2025-12-04T11:07:13.0781695Z [W1204 10:54:10.646491201 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0781700Z 2025-12-04T11:07:13.0782210Z [W1204 10:54:10.646771974 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0782217Z 2025-12-04T11:07:13.0782746Z [W1204 10:54:10.646948358 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0782750Z 2025-12-04T11:07:13.0783262Z [W1204 10:54:10.760801882 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0783267Z 2025-12-04T11:07:13.0783795Z [W1204 10:54:10.761044270 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0783800Z 2025-12-04T11:07:13.0784311Z [W1204 10:54:10.761238269 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0784316Z 2025-12-04T11:07:13.0784823Z [W1204 10:54:10.761519421 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0784842Z 2025-12-04T11:07:13.0785409Z [W1204 10:54:10.761694943 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0785415Z 2025-12-04T11:07:13.0785923Z [W1204 10:54:10.761988767 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0785957Z 2025-12-04T11:07:13.0786481Z [W1204 10:54:10.762165652 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0786486Z 2025-12-04T11:07:13.0786997Z [W1204 10:54:10.762446992 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0787002Z 2025-12-04T11:07:13.0787527Z [W1204 10:54:10.762622545 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0787533Z 2025-12-04T11:07:13.0787638Z FAILED [0.7685s] [100%] 2025-12-04T11:07:13.0787642Z 2025-12-04T11:07:13.0788036Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.0788623Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0788753Z Traceback (most recent call last): 2025-12-04T11:07:13.0789315Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0789549Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0790019Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0790196Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0790737Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0791026Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0791163Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0791168Z 2025-12-04T11:07:13.0791275Z Expected 3 but got 6. 2025-12-04T11:07:13.0791397Z Absolute difference: 3 2025-12-04T11:07:13.0791507Z Relative difference: 1.0 2025-12-04T11:07:13.0791512Z 2025-12-04T11:07:13.0791742Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0792688Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0792694Z 2025-12-04T11:07:13.0792964Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0793201Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0793318Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0793869Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0794093Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0794193Z graph_break [] 2025-12-04T11:07:13.0794426Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0795647Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0795764Z if out == self.unknown_value: 2025-12-04T11:07:13.0796511Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0796613Z warnings.warn( 2025-12-04T11:07:13.0797426Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0797529Z warnings.warn( 2025-12-04T11:07:13.0799088Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0799225Z Traceback (most recent call last): 2025-12-04T11:07:13.0799768Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0800014Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0800477Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0800638Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0801190Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0801404Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0801537Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0801559Z 2025-12-04T11:07:13.0801664Z Expected 3 but got 6. 2025-12-04T11:07:13.0801769Z Absolute difference: 3 2025-12-04T11:07:13.0801890Z Relative difference: 1.0 2025-12-04T11:07:13.0801896Z 2025-12-04T11:07:13.0802113Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0803055Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0803061Z 2025-12-04T11:07:13.0803347Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0803571Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0803731Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0804273Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0804499Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0804609Z graph_break [] 2025-12-04T11:07:13.0804827Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0806054Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0806174Z if out == self.unknown_value: 2025-12-04T11:07:13.0806904Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0807018Z warnings.warn( 2025-12-04T11:07:13.0807749Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0807854Z warnings.warn( 2025-12-04T11:07:13.0808082Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0808198Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0808439Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0808972Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0809072Z graph_break [] 2025-12-04T11:07:13.0809297Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0810087Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0810204Z warnings.warn( 2025-12-04T11:07:13.0810926Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0811059Z warnings.warn( 2025-12-04T11:07:13.0811214Z =================================== FAILURES =================================== 2025-12-04T11:07:13.0811758Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.0811881Z Traceback (most recent call last): 2025-12-04T11:07:13.0812438Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 204, in test_int8_woq_mm_concat_cuda 2025-12-04T11:07:13.0812673Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 3) 2025-12-04T11:07:13.0813156Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0813323Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0813866Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0814089Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0814223Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0814228Z 2025-12-04T11:07:13.0814346Z Expected 3 but got 6. 2025-12-04T11:07:13.0814455Z Absolute difference: 3 2025-12-04T11:07:13.0814565Z Relative difference: 1.0 2025-12-04T11:07:13.0814570Z 2025-12-04T11:07:13.0814801Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0815743Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0815797Z 2025-12-04T11:07:13.0816069Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0816296Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0816415Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0816962Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0817189Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0817284Z graph_break [] 2025-12-04T11:07:13.0817510Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0818811Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0818996Z if out == self.unknown_value: 2025-12-04T11:07:13.0819730Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0819833Z warnings.warn( 2025-12-04T11:07:13.0820574Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0820676Z warnings.warn( 2025-12-04T11:07:13.0820892Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0821022Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0821248Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0821793Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0821895Z graph_break [] 2025-12-04T11:07:13.0822181Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0822925Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0823054Z warnings.warn( 2025-12-04T11:07:13.0823791Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0823891Z warnings.warn( 2025-12-04T11:07:13.0824107Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0824231Z stats [('calls_captured', 36)] 2025-12-04T11:07:13.0824463Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0824999Z inductor [('pattern_matcher_nodes', 36), ('woq_matcher_nodes', 24), ('pattern_matcher_count', 18), ('woq_matcher_count', 6), ('fxgraph_cache_miss', 2)] 2025-12-04T11:07:13.0825122Z graph_break [] 2025-12-04T11:07:13.0825338Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0826076Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0826179Z warnings.warn( 2025-12-04T11:07:13.0826902Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0827012Z warnings.warn( 2025-12-04T11:07:13.0827859Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-f04dfce55afddfa2.xml - 2025-12-04T11:07:13.0828040Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0829055Z FAILED [0.7685s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0829063Z 2025-12-04T11:07:13.0829170Z Expected 3 but got 6. 2025-12-04T11:07:13.0829287Z Absolute difference: 3 2025-12-04T11:07:13.0829397Z Relative difference: 1.0 2025-12-04T11:07:13.0829402Z 2025-12-04T11:07:13.0829626Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0830566Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0830571Z 2025-12-04T11:07:13.0830840Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0831027Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0831234Z ================== 1 failed, 10 deselected, 2 rerun in 21.67s ================== 2025-12-04T11:07:13.0831338Z Got exit code 1 2025-12-04T11:07:13.0832205Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.0832622Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.0833080Z W1204 10:54:21.527000 83874 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0833751Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a20deb48b5165834.xml 2025-12-04T11:07:13.0833929Z ============================= test session starts ============================== 2025-12-04T11:07:13.0834338Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0834450Z cachedir: .pytest_cache 2025-12-04T11:07:13.0834989Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0835144Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0835254Z configfile: pytest.ini 2025-12-04T11:07:13.0835806Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0836022Z collecting ... collected 58 items / 2 deselected / 56 selected 2025-12-04T11:07:13.0836177Z stepcurrent: skipping 2 already run items. 2025-12-04T11:07:13.0836288Z Running 9 items in this shard 2025-12-04T11:07:13.0836294Z 2025-12-04T11:07:13.0837167Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8525s] [ 11%] 2025-12-04T11:07:13.0838053Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4276s] [ 11%] 2025-12-04T11:07:13.0838832Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 FAILED [0.4409s] [ 11%] 2025-12-04T11:07:13.0838837Z 2025-12-04T11:07:13.0838992Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.0839496Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.0839617Z Traceback (most recent call last): 2025-12-04T11:07:13.0840141Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.0840412Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.0840891Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0841058Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0841601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0841821Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0841953Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0841958Z 2025-12-04T11:07:13.0842064Z Expected 1 but got 2. 2025-12-04T11:07:13.0842179Z Absolute difference: 1 2025-12-04T11:07:13.0842289Z Relative difference: 1.0 2025-12-04T11:07:13.0842294Z 2025-12-04T11:07:13.0842521Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0843435Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0843442Z 2025-12-04T11:07:13.0843711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0843944Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0844058Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0844967Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0845194Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0845294Z graph_break [] 2025-12-04T11:07:13.0845521Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0846314Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0846434Z warnings.warn( 2025-12-04T11:07:13.0847163Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0847295Z warnings.warn( 2025-12-04T11:07:13.0847817Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.0847941Z Traceback (most recent call last): 2025-12-04T11:07:13.0848456Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.0848698Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.0849160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0849337Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0849880Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0850089Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0850231Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0850236Z 2025-12-04T11:07:13.0850342Z Expected 1 but got 2. 2025-12-04T11:07:13.0850466Z Absolute difference: 1 2025-12-04T11:07:13.0850577Z Relative difference: 1.0 2025-12-04T11:07:13.0850582Z 2025-12-04T11:07:13.0850799Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0851724Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0851730Z 2025-12-04T11:07:13.0852033Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0852269Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0852388Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0853291Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0853534Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0853635Z graph_break [] 2025-12-04T11:07:13.0853853Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0854605Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0854709Z warnings.warn( 2025-12-04T11:07:13.0855454Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0855556Z warnings.warn( 2025-12-04T11:07:13.0855774Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0855901Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0856129Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0857036Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0857135Z graph_break [] 2025-12-04T11:07:13.0857348Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0858094Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0858329Z warnings.warn( 2025-12-04T11:07:13.0859060Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0859223Z warnings.warn( 2025-12-04T11:07:13.0859370Z =================================== FAILURES =================================== 2025-12-04T11:07:13.0859893Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.0860014Z Traceback (most recent call last): 2025-12-04T11:07:13.0860523Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.0860767Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.0861228Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0861396Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0861950Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0862155Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0862303Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0862309Z 2025-12-04T11:07:13.0862413Z Expected 1 but got 2. 2025-12-04T11:07:13.0862519Z Absolute difference: 1 2025-12-04T11:07:13.0862641Z Relative difference: 1.0 2025-12-04T11:07:13.0862646Z 2025-12-04T11:07:13.0862863Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0863783Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0863819Z 2025-12-04T11:07:13.0864091Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0864308Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0864437Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0865325Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0865566Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0865665Z graph_break [] 2025-12-04T11:07:13.0865879Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0866627Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0866729Z warnings.warn( 2025-12-04T11:07:13.0867461Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0867572Z warnings.warn( 2025-12-04T11:07:13.0867790Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0867917Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0868143Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0869043Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0869151Z graph_break [] 2025-12-04T11:07:13.0869361Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0870158Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0870257Z warnings.warn( 2025-12-04T11:07:13.0870978Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0871120Z warnings.warn( 2025-12-04T11:07:13.0871333Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0871446Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0871684Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0872579Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0872688Z graph_break [] 2025-12-04T11:07:13.0872902Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0873628Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0873740Z warnings.warn( 2025-12-04T11:07:13.0874465Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0874574Z warnings.warn( 2025-12-04T11:07:13.0875419Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a20deb48b5165834.xml - 2025-12-04T11:07:13.0875591Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0876539Z FAILED [0.4409s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0876580Z 2025-12-04T11:07:13.0876686Z Expected 1 but got 2. 2025-12-04T11:07:13.0876804Z Absolute difference: 1 2025-12-04T11:07:13.0876913Z Relative difference: 1.0 2025-12-04T11:07:13.0876921Z 2025-12-04T11:07:13.0877138Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0878053Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0878059Z 2025-12-04T11:07:13.0878323Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0878514Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0878711Z =================== 1 failed, 2 deselected, 2 rerun in 4.75s =================== 2025-12-04T11:07:13.0878815Z Got exit code 1 2025-12-04T11:07:13.0878933Z Retrying single test... 2025-12-04T11:07:13.0879387Z W1204 10:54:40.851000 84043 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0880056Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-688c2e6fbd08d334.xml 2025-12-04T11:07:13.0880231Z ============================= test session starts ============================== 2025-12-04T11:07:13.0880584Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0880705Z cachedir: .pytest_cache 2025-12-04T11:07:13.0881230Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0881356Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0881473Z configfile: pytest.ini 2025-12-04T11:07:13.0882019Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0882296Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.0883299Z stepcurrent: skipping 2 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0883444Z Running 1 items in this shard 2025-12-04T11:07:13.0883450Z 2025-12-04T11:07:13.0884744Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 [W1204 10:54:46.418996631 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0884751Z 2025-12-04T11:07:13.0885274Z [W1204 10:55:01.705471917 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0885282Z 2025-12-04T11:07:13.0885815Z [W1204 10:55:01.705731531 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0885820Z 2025-12-04T11:07:13.0886337Z [W1204 10:55:01.713043906 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0886342Z 2025-12-04T11:07:13.0886864Z [W1204 10:55:01.713726849 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0886869Z 2025-12-04T11:07:13.0887380Z [W1204 10:55:01.713917036 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0887385Z 2025-12-04T11:07:13.0888237Z [W1204 10:55:01.720752970 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0888312Z 2025-12-04T11:07:13.0888831Z [W1204 10:55:01.721500067 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0888835Z 2025-12-04T11:07:13.0889348Z [W1204 10:55:01.721684998 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0889355Z 2025-12-04T11:07:13.0889880Z [W1204 10:55:01.852993386 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0889885Z 2025-12-04T11:07:13.0890397Z [W1204 10:55:01.854719949 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0890402Z 2025-12-04T11:07:13.0890922Z [W1204 10:55:01.854932473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0890929Z 2025-12-04T11:07:13.0891444Z [W1204 10:55:01.858842203 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0891448Z 2025-12-04T11:07:13.0891971Z [W1204 10:55:01.859519275 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0891979Z 2025-12-04T11:07:13.0892489Z [W1204 10:55:01.859716245 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0892494Z 2025-12-04T11:07:13.0893015Z [W1204 10:55:01.865760783 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0893019Z 2025-12-04T11:07:13.0893532Z [W1204 10:55:01.866435250 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0893539Z 2025-12-04T11:07:13.0894126Z [W1204 10:55:01.866630929 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0894144Z 2025-12-04T11:07:13.0894275Z ('RERUN', {'yellow': True}) [19.1554s] [100%] 2025-12-04T11:07:13.0895556Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 [W1204 10:55:02.248264934 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0895601Z 2025-12-04T11:07:13.0896128Z [W1204 10:55:02.248977162 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0896132Z 2025-12-04T11:07:13.0896638Z [W1204 10:55:02.249176800 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0896646Z 2025-12-04T11:07:13.0897171Z [W1204 10:55:02.253121074 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0897175Z 2025-12-04T11:07:13.0897683Z [W1204 10:55:02.253737411 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0897690Z 2025-12-04T11:07:13.0898211Z [W1204 10:55:02.253926852 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0898216Z 2025-12-04T11:07:13.0898797Z [W1204 10:55:02.259955852 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0898802Z 2025-12-04T11:07:13.0899319Z [W1204 10:55:02.260624771 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0899324Z 2025-12-04T11:07:13.0899876Z [W1204 10:55:02.260817210 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0899881Z 2025-12-04T11:07:13.0900391Z [W1204 10:55:02.344437699 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0900408Z 2025-12-04T11:07:13.0900917Z [W1204 10:55:02.345061900 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0900922Z 2025-12-04T11:07:13.0901437Z [W1204 10:55:02.345256219 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0901442Z 2025-12-04T11:07:13.0901966Z [W1204 10:55:02.349113303 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0901971Z 2025-12-04T11:07:13.0902480Z [W1204 10:55:02.349719819 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0902492Z 2025-12-04T11:07:13.0903013Z [W1204 10:55:02.349912796 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0903020Z 2025-12-04T11:07:13.0903528Z [W1204 10:55:02.355860862 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0903533Z 2025-12-04T11:07:13.0904053Z [W1204 10:55:02.356634797 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0904058Z 2025-12-04T11:07:13.0904567Z [W1204 10:55:02.356828869 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0904572Z 2025-12-04T11:07:13.0904703Z ('RERUN', {'yellow': True}) [0.4490s] [100%] 2025-12-04T11:07:13.0906076Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 [W1204 10:55:02.670970746 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0906113Z 2025-12-04T11:07:13.0906625Z [W1204 10:55:02.671653392 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0906630Z 2025-12-04T11:07:13.0907150Z [W1204 10:55:02.671849740 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0907154Z 2025-12-04T11:07:13.0907663Z [W1204 10:55:02.675722249 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0907667Z 2025-12-04T11:07:13.0908193Z [W1204 10:55:02.676328927 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0908204Z 2025-12-04T11:07:13.0908715Z [W1204 10:55:02.676520718 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0908723Z 2025-12-04T11:07:13.0909245Z [W1204 10:55:02.682527446 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0909250Z 2025-12-04T11:07:13.0909759Z [W1204 10:55:02.683150981 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0909763Z 2025-12-04T11:07:13.0910287Z [W1204 10:55:02.683339376 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0910292Z 2025-12-04T11:07:13.0910807Z [W1204 10:55:02.765739443 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0910841Z 2025-12-04T11:07:13.0911356Z [W1204 10:55:02.766364415 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0911372Z 2025-12-04T11:07:13.0911883Z [W1204 10:55:02.766561383 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0911888Z 2025-12-04T11:07:13.0912395Z [W1204 10:55:02.770441130 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0912400Z 2025-12-04T11:07:13.0912920Z [W1204 10:55:02.771075657 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0912925Z 2025-12-04T11:07:13.0913432Z [W1204 10:55:02.771271589 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0913439Z 2025-12-04T11:07:13.0913963Z [W1204 10:55:02.777163975 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0913968Z 2025-12-04T11:07:13.0914481Z [W1204 10:55:02.777932838 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0914488Z 2025-12-04T11:07:13.0915008Z [W1204 10:55:02.778128771 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0915013Z 2025-12-04T11:07:13.0915116Z FAILED [0.4193s] [100%] 2025-12-04T11:07:13.0915121Z 2025-12-04T11:07:13.0915264Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.0915788Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.0915916Z Traceback (most recent call last): 2025-12-04T11:07:13.0916501Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.0916781Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.0917342Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0917521Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0918072Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0918298Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0918437Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0918443Z 2025-12-04T11:07:13.0918551Z Expected 1 but got 2. 2025-12-04T11:07:13.0918675Z Absolute difference: 1 2025-12-04T11:07:13.0918788Z Relative difference: 1.0 2025-12-04T11:07:13.0918797Z 2025-12-04T11:07:13.0919017Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0919941Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0919949Z 2025-12-04T11:07:13.0920222Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0920459Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0920577Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0921478Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0921722Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0921861Z graph_break [] 2025-12-04T11:07:13.0922101Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0923324Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0923446Z if out == self.unknown_value: 2025-12-04T11:07:13.0924192Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0924295Z warnings.warn( 2025-12-04T11:07:13.0925038Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0925143Z warnings.warn( 2025-12-04T11:07:13.0925656Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.0925800Z Traceback (most recent call last): 2025-12-04T11:07:13.0926317Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.0926553Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.0927026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0927193Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0927746Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0927958Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0928093Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0928099Z 2025-12-04T11:07:13.0928224Z Expected 1 but got 2. 2025-12-04T11:07:13.0928334Z Absolute difference: 1 2025-12-04T11:07:13.0928502Z Relative difference: 1.0 2025-12-04T11:07:13.0928518Z 2025-12-04T11:07:13.0928735Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0929650Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0929685Z 2025-12-04T11:07:13.0929966Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0930187Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0930315Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0931215Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0931452Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0931564Z graph_break [] 2025-12-04T11:07:13.0931782Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0933001Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0933136Z if out == self.unknown_value: 2025-12-04T11:07:13.0933868Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0933985Z warnings.warn( 2025-12-04T11:07:13.0934714Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0934844Z warnings.warn( 2025-12-04T11:07:13.0935080Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0935198Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0935436Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0936337Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0936436Z graph_break [] 2025-12-04T11:07:13.0936665Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0937401Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0937511Z warnings.warn( 2025-12-04T11:07:13.0938308Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0938414Z warnings.warn( 2025-12-04T11:07:13.0938575Z =================================== FAILURES =================================== 2025-12-04T11:07:13.0939090Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.0939211Z Traceback (most recent call last): 2025-12-04T11:07:13.0939738Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.0939970Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.0940442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.0940607Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.0941213Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.0941435Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.0941565Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0941600Z 2025-12-04T11:07:13.0941709Z Expected 1 but got 2. 2025-12-04T11:07:13.0941831Z Absolute difference: 1 2025-12-04T11:07:13.0941941Z Relative difference: 1.0 2025-12-04T11:07:13.0941946Z 2025-12-04T11:07:13.0942176Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0943085Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0943091Z 2025-12-04T11:07:13.0943360Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0943600Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0943720Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0944633Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0944863Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0944959Z graph_break [] 2025-12-04T11:07:13.0945185Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0946397Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.0946527Z if out == self.unknown_value: 2025-12-04T11:07:13.0947293Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0947395Z warnings.warn( 2025-12-04T11:07:13.0948136Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0948240Z warnings.warn( 2025-12-04T11:07:13.0948472Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0948586Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0948816Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0949733Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0949833Z graph_break [] 2025-12-04T11:07:13.0950052Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0950795Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0950902Z warnings.warn( 2025-12-04T11:07:13.0951641Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0951742Z warnings.warn( 2025-12-04T11:07:13.0951958Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.0952086Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.0952316Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.0953229Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.0953403Z graph_break [] 2025-12-04T11:07:13.0953619Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.0954359Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0954493Z warnings.warn( 2025-12-04T11:07:13.0955216Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.0955328Z warnings.warn( 2025-12-04T11:07:13.0956174Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-688c2e6fbd08d334.xml - 2025-12-04T11:07:13.0956357Z =========================== short test summary info ============================ 2025-12-04T11:07:13.0957304Z FAILED [0.4193s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.0957313Z 2025-12-04T11:07:13.0957423Z Expected 1 but got 2. 2025-12-04T11:07:13.0957543Z Absolute difference: 1 2025-12-04T11:07:13.0957656Z Relative difference: 1.0 2025-12-04T11:07:13.0957661Z 2025-12-04T11:07:13.0957889Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.0958796Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0958802Z 2025-12-04T11:07:13.0959071Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.0959262Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.0959503Z ================== 1 failed, 10 deselected, 2 rerun in 20.06s ================== 2025-12-04T11:07:13.0959617Z Got exit code 1 2025-12-04T11:07:13.0959725Z Retrying single test... 2025-12-04T11:07:13.0960177Z W1204 10:55:13.460000 84217 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.0960855Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a49a4c623eea732f.xml 2025-12-04T11:07:13.0961022Z ============================= test session starts ============================== 2025-12-04T11:07:13.0961386Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.0961497Z cachedir: .pytest_cache 2025-12-04T11:07:13.0962020Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.0962164Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.0962276Z configfile: pytest.ini 2025-12-04T11:07:13.0962823Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.0963058Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.0964046Z stepcurrent: skipping 2 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.0964173Z Running 1 items in this shard 2025-12-04T11:07:13.0964178Z 2025-12-04T11:07:13.0965465Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 [W1204 10:55:18.051739920 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0965473Z 2025-12-04T11:07:13.0966066Z [W1204 10:55:34.280977525 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0966072Z 2025-12-04T11:07:13.0966631Z [W1204 10:55:34.281224964 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0966636Z 2025-12-04T11:07:13.0967150Z [W1204 10:55:34.288472689 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0967166Z 2025-12-04T11:07:13.0967676Z [W1204 10:55:34.289135908 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0967681Z 2025-12-04T11:07:13.0968192Z [W1204 10:55:34.289328314 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0968200Z 2025-12-04T11:07:13.0968728Z [W1204 10:55:34.296132987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0968733Z 2025-12-04T11:07:13.0969243Z [W1204 10:55:34.296872638 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0969250Z 2025-12-04T11:07:13.0969775Z [W1204 10:55:34.297059962 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0969780Z 2025-12-04T11:07:13.0970287Z [W1204 10:55:34.428247299 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0970292Z 2025-12-04T11:07:13.0970815Z [W1204 10:55:34.429710904 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0970849Z 2025-12-04T11:07:13.0971366Z [W1204 10:55:34.429912237 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0971371Z 2025-12-04T11:07:13.0971894Z [W1204 10:55:34.433804260 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0971901Z 2025-12-04T11:07:13.0972410Z [W1204 10:55:34.434427056 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0972414Z 2025-12-04T11:07:13.0972924Z [W1204 10:55:34.434622438 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0972928Z 2025-12-04T11:07:13.0973451Z [W1204 10:55:34.440661845 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0973455Z 2025-12-04T11:07:13.0973971Z [W1204 10:55:34.441304260 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0973975Z 2025-12-04T11:07:13.0974498Z [W1204 10:55:34.441500219 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0974505Z 2025-12-04T11:07:13.0974638Z ('RERUN', {'yellow': True}) [19.1152s] [100%] 2025-12-04T11:07:13.0975929Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 [W1204 10:55:34.821216447 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0975935Z 2025-12-04T11:07:13.0976445Z [W1204 10:55:34.821910733 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0976452Z 2025-12-04T11:07:13.0977033Z [W1204 10:55:34.822109280 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0977038Z 2025-12-04T11:07:13.0977551Z [W1204 10:55:34.825998958 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0977585Z 2025-12-04T11:07:13.0978099Z [W1204 10:55:34.826605857 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0978116Z 2025-12-04T11:07:13.0978690Z [W1204 10:55:34.826795672 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0978695Z 2025-12-04T11:07:13.0979207Z [W1204 10:55:34.832870452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0979212Z 2025-12-04T11:07:13.0979749Z [W1204 10:55:34.833481084 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0979754Z 2025-12-04T11:07:13.0980267Z [W1204 10:55:34.833668359 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0980274Z 2025-12-04T11:07:13.0980796Z [W1204 10:55:34.917007754 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0980801Z 2025-12-04T11:07:13.0981312Z [W1204 10:55:34.917629088 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0981318Z 2025-12-04T11:07:13.0981839Z [W1204 10:55:34.917825650 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0981844Z 2025-12-04T11:07:13.0982357Z [W1204 10:55:34.921731423 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0982403Z 2025-12-04T11:07:13.0982925Z [W1204 10:55:34.922343434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0982932Z 2025-12-04T11:07:13.0983443Z [W1204 10:55:34.922540105 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0983448Z 2025-12-04T11:07:13.0983959Z [W1204 10:55:34.928477236 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0983976Z 2025-12-04T11:07:13.0984487Z [W1204 10:55:34.929248021 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0984491Z 2025-12-04T11:07:13.0985003Z [W1204 10:55:34.929444637 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0985011Z 2025-12-04T11:07:13.0985158Z ('RERUN', {'yellow': True}) [0.4488s] [100%] 2025-12-04T11:07:13.0986446Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 [W1204 10:55:35.241663675 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0986454Z 2025-12-04T11:07:13.0986977Z [W1204 10:55:35.242334457 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0986982Z 2025-12-04T11:07:13.0999679Z [W1204 10:55:35.242533330 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.0999697Z 2025-12-04T11:07:13.1000317Z [W1204 10:55:35.246407695 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1000346Z 2025-12-04T11:07:13.1001056Z [W1204 10:55:35.247015527 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1001112Z 2025-12-04T11:07:13.1001630Z [W1204 10:55:35.247219349 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1001635Z 2025-12-04T11:07:13.1002158Z [W1204 10:55:35.253247567 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1002163Z 2025-12-04T11:07:13.1002675Z [W1204 10:55:35.253856136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1002679Z 2025-12-04T11:07:13.1003209Z [W1204 10:55:35.254051108 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1003217Z 2025-12-04T11:07:13.1003732Z [W1204 10:55:35.336836387 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1003737Z 2025-12-04T11:07:13.1004262Z [W1204 10:55:35.337459463 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1004268Z 2025-12-04T11:07:13.1004781Z [W1204 10:55:35.337656098 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1004786Z 2025-12-04T11:07:13.1005308Z [W1204 10:55:35.341532873 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1005313Z 2025-12-04T11:07:13.1005824Z [W1204 10:55:35.342143337 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1005898Z 2025-12-04T11:07:13.1006412Z [W1204 10:55:35.342339629 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1006429Z 2025-12-04T11:07:13.1006942Z [W1204 10:55:35.348229954 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1006949Z 2025-12-04T11:07:13.1007460Z [W1204 10:55:35.348999684 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1007465Z 2025-12-04T11:07:13.1007988Z [W1204 10:55:35.349196616 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1007992Z 2025-12-04T11:07:13.1008100Z FAILED [0.4179s] [100%] 2025-12-04T11:07:13.1008105Z 2025-12-04T11:07:13.1008259Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1008781Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1008912Z Traceback (most recent call last): 2025-12-04T11:07:13.1009446Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1009683Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1010164Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1010329Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1010872Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1011091Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1011226Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1011234Z 2025-12-04T11:07:13.1011341Z Expected 1 but got 2. 2025-12-04T11:07:13.1011517Z Absolute difference: 1 2025-12-04T11:07:13.1011630Z Relative difference: 1.0 2025-12-04T11:07:13.1011635Z 2025-12-04T11:07:13.1011860Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1012802Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1012808Z 2025-12-04T11:07:13.1013084Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1013323Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1013441Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1014354Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1014592Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1014691Z graph_break [] 2025-12-04T11:07:13.1014929Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1016153Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1016289Z if out == self.unknown_value: 2025-12-04T11:07:13.1017017Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1017120Z warnings.warn( 2025-12-04T11:07:13.1017857Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1017992Z warnings.warn( 2025-12-04T11:07:13.1018623Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1018752Z Traceback (most recent call last): 2025-12-04T11:07:13.1019265Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1019511Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1019972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1020136Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1020690Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1020897Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1021049Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1021056Z 2025-12-04T11:07:13.1021161Z Expected 1 but got 2. 2025-12-04T11:07:13.1021266Z Absolute difference: 1 2025-12-04T11:07:13.1021388Z Relative difference: 1.0 2025-12-04T11:07:13.1021395Z 2025-12-04T11:07:13.1021616Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1022527Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1022545Z 2025-12-04T11:07:13.1022814Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1023030Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1023155Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1024126Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1024357Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1024497Z graph_break [] 2025-12-04T11:07:13.1024718Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1025946Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1026065Z if out == self.unknown_value: 2025-12-04T11:07:13.1026796Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1026911Z warnings.warn( 2025-12-04T11:07:13.1027641Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1027755Z warnings.warn( 2025-12-04T11:07:13.1027974Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1028090Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1028329Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1029227Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1029338Z graph_break [] 2025-12-04T11:07:13.1029555Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1030287Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1030445Z warnings.warn( 2025-12-04T11:07:13.1031170Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1031275Z warnings.warn( 2025-12-04T11:07:13.1031436Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1031951Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1032085Z Traceback (most recent call last): 2025-12-04T11:07:13.1032601Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1032837Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1033311Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1033481Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1034035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1034249Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1034383Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1034389Z 2025-12-04T11:07:13.1034511Z Expected 1 but got 2. 2025-12-04T11:07:13.1034616Z Absolute difference: 1 2025-12-04T11:07:13.1034726Z Relative difference: 1.0 2025-12-04T11:07:13.1034731Z 2025-12-04T11:07:13.1034960Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1035874Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1035882Z 2025-12-04T11:07:13.1036225Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1036445Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1036591Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1037497Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1037724Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1037835Z graph_break [] 2025-12-04T11:07:13.1038052Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1039271Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1039399Z if out == self.unknown_value: 2025-12-04T11:07:13.1040128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1040235Z warnings.warn( 2025-12-04T11:07:13.1040955Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1041048Z warnings.warn( 2025-12-04T11:07:13.1041267Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1041375Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1041597Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1042504Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1042633Z graph_break [] 2025-12-04T11:07:13.1042860Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1043594Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1043694Z warnings.warn( 2025-12-04T11:07:13.1044430Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1044531Z warnings.warn( 2025-12-04T11:07:13.1044758Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1044873Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1045098Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1046011Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1046112Z graph_break [] 2025-12-04T11:07:13.1046328Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1047066Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1047166Z warnings.warn( 2025-12-04T11:07:13.1047902Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1048001Z warnings.warn( 2025-12-04T11:07:13.1048901Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a49a4c623eea732f.xml - 2025-12-04T11:07:13.1049087Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1050027Z FAILED [0.4179s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1050077Z 2025-12-04T11:07:13.1050202Z Expected 1 but got 2. 2025-12-04T11:07:13.1050317Z Absolute difference: 1 2025-12-04T11:07:13.1050428Z Relative difference: 1.0 2025-12-04T11:07:13.1050433Z 2025-12-04T11:07:13.1050659Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1051565Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1051573Z 2025-12-04T11:07:13.1051855Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1052033Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1052233Z ================== 1 failed, 10 deselected, 2 rerun in 20.01s ================== 2025-12-04T11:07:13.1052347Z Got exit code 1 2025-12-04T11:07:13.1053173Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1053602Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.1054052Z W1204 10:55:46.106000 84391 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1054714Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-9d29992f2be81c73.xml 2025-12-04T11:07:13.1054930Z ============================= test session starts ============================== 2025-12-04T11:07:13.1055283Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1055406Z cachedir: .pytest_cache 2025-12-04T11:07:13.1055930Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1056054Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1056176Z configfile: pytest.ini 2025-12-04T11:07:13.1056721Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1056934Z collecting ... collected 58 items / 3 deselected / 55 selected 2025-12-04T11:07:13.1057091Z stepcurrent: skipping 3 already run items. 2025-12-04T11:07:13.1057207Z Running 8 items in this shard 2025-12-04T11:07:13.1057214Z 2025-12-04T11:07:13.1058104Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8151s] [ 12%] 2025-12-04T11:07:13.1059041Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4277s] [ 12%] 2025-12-04T11:07:13.1059826Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 FAILED [0.4118s] [ 12%] 2025-12-04T11:07:13.1059843Z 2025-12-04T11:07:13.1059985Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1060500Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1060633Z Traceback (most recent call last): 2025-12-04T11:07:13.1061213Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1061448Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1061952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1062115Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1062664Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1062870Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1063007Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1063012Z 2025-12-04T11:07:13.1063124Z Expected 1 but got 2. 2025-12-04T11:07:13.1063229Z Absolute difference: 1 2025-12-04T11:07:13.1063339Z Relative difference: 1.0 2025-12-04T11:07:13.1063346Z 2025-12-04T11:07:13.1063565Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1064479Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1064487Z 2025-12-04T11:07:13.1064761Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1064983Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1065097Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1066000Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1066226Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1066368Z graph_break [] 2025-12-04T11:07:13.1066583Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1067321Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1067430Z warnings.warn( 2025-12-04T11:07:13.1068157Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1068269Z warnings.warn( 2025-12-04T11:07:13.1068773Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1068893Z Traceback (most recent call last): 2025-12-04T11:07:13.1069420Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1069654Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1070118Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1070293Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1070831Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1071045Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1071175Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1071180Z 2025-12-04T11:07:13.1071284Z Expected 1 but got 2. 2025-12-04T11:07:13.1071400Z Absolute difference: 1 2025-12-04T11:07:13.1071506Z Relative difference: 1.0 2025-12-04T11:07:13.1071512Z 2025-12-04T11:07:13.1071738Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1072705Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1072713Z 2025-12-04T11:07:13.1072980Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1073206Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1073352Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1074253Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1074478Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1074574Z graph_break [] 2025-12-04T11:07:13.1074801Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1075543Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1075647Z warnings.warn( 2025-12-04T11:07:13.1076389Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1076492Z warnings.warn( 2025-12-04T11:07:13.1076718Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1076832Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1077053Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1077959Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1078058Z graph_break [] 2025-12-04T11:07:13.1078282Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1079052Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1079152Z warnings.warn( 2025-12-04T11:07:13.1079889Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1079989Z warnings.warn( 2025-12-04T11:07:13.1080135Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1080654Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1080774Z Traceback (most recent call last): 2025-12-04T11:07:13.1081293Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1081531Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1081998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1082173Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1082715Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1082931Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1083060Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1083066Z 2025-12-04T11:07:13.1083170Z Expected 1 but got 2. 2025-12-04T11:07:13.1083285Z Absolute difference: 1 2025-12-04T11:07:13.1083396Z Relative difference: 1.0 2025-12-04T11:07:13.1083402Z 2025-12-04T11:07:13.1083620Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1084599Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1084608Z 2025-12-04T11:07:13.1084877Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1085137Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1085252Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1086149Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1086383Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1086480Z graph_break [] 2025-12-04T11:07:13.1086704Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1087445Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1087549Z warnings.warn( 2025-12-04T11:07:13.1088588Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1088695Z warnings.warn( 2025-12-04T11:07:13.1088908Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1089032Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1089261Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1090164Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1090262Z graph_break [] 2025-12-04T11:07:13.1090548Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1091292Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1091388Z warnings.warn( 2025-12-04T11:07:13.1092119Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1092216Z warnings.warn( 2025-12-04T11:07:13.1092427Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1092549Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1092780Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1093673Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1093785Z graph_break [] 2025-12-04T11:07:13.1094000Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1094741Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1094838Z warnings.warn( 2025-12-04T11:07:13.1095563Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1095674Z warnings.warn( 2025-12-04T11:07:13.1096522Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-9d29992f2be81c73.xml - 2025-12-04T11:07:13.1096703Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1097742Z FAILED [0.4118s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1097750Z 2025-12-04T11:07:13.1097894Z Expected 1 but got 2. 2025-12-04T11:07:13.1098012Z Absolute difference: 1 2025-12-04T11:07:13.1098119Z Relative difference: 1.0 2025-12-04T11:07:13.1098124Z 2025-12-04T11:07:13.1098428Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1099337Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1099342Z 2025-12-04T11:07:13.1099610Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1099802Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1100006Z =================== 1 failed, 3 deselected, 2 rerun in 4.69s =================== 2025-12-04T11:07:13.1100115Z Got exit code 1 2025-12-04T11:07:13.1100219Z Retrying single test... 2025-12-04T11:07:13.1100666Z W1204 10:56:05.365000 84560 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1101340Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a570f380a97ffdf.xml 2025-12-04T11:07:13.1101501Z ============================= test session starts ============================== 2025-12-04T11:07:13.1101852Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1101972Z cachedir: .pytest_cache 2025-12-04T11:07:13.1102493Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1102662Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1102771Z configfile: pytest.ini 2025-12-04T11:07:13.1103321Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1103550Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1104542Z stepcurrent: skipping 3 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1104663Z Running 1 items in this shard 2025-12-04T11:07:13.1104668Z 2025-12-04T11:07:13.1105951Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 [W1204 10:56:10.904554897 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1105960Z 2025-12-04T11:07:13.1106488Z [W1204 10:56:26.196168396 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1106502Z 2025-12-04T11:07:13.1107012Z [W1204 10:56:26.196427587 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1107019Z 2025-12-04T11:07:13.1107530Z [W1204 10:56:26.203643724 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1107536Z 2025-12-04T11:07:13.1108053Z [W1204 10:56:26.204321855 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1108057Z 2025-12-04T11:07:13.1108566Z [W1204 10:56:26.204511254 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1108573Z 2025-12-04T11:07:13.1109168Z [W1204 10:56:26.211343433 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1109174Z 2025-12-04T11:07:13.1109686Z [W1204 10:56:26.212086813 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1109722Z 2025-12-04T11:07:13.1110243Z [W1204 10:56:26.212274896 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1110248Z 2025-12-04T11:07:13.1110756Z [W1204 10:56:26.343719600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1110761Z 2025-12-04T11:07:13.1111281Z [W1204 10:56:26.345238691 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1111288Z 2025-12-04T11:07:13.1111803Z [W1204 10:56:26.345441538 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1111807Z 2025-12-04T11:07:13.1112314Z [W1204 10:56:26.349292128 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1112330Z 2025-12-04T11:07:13.1112843Z [W1204 10:56:26.349925106 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1112848Z 2025-12-04T11:07:13.1113355Z [W1204 10:56:26.350145155 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1113359Z 2025-12-04T11:07:13.1113881Z [W1204 10:56:26.356141576 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1113886Z 2025-12-04T11:07:13.1114428Z [W1204 10:56:26.356756330 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1114433Z 2025-12-04T11:07:13.1114952Z [W1204 10:56:26.356947150 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1114960Z 2025-12-04T11:07:13.1115091Z ('RERUN', {'yellow': True}) [19.1322s] [100%] 2025-12-04T11:07:13.1116378Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 [W1204 10:56:26.726529787 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1116384Z 2025-12-04T11:07:13.1116896Z [W1204 10:56:26.727250296 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1116901Z 2025-12-04T11:07:13.1117433Z [W1204 10:56:26.727448785 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1117438Z 2025-12-04T11:07:13.1117943Z [W1204 10:56:26.731414993 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1117950Z 2025-12-04T11:07:13.1118459Z [W1204 10:56:26.732034314 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1118475Z 2025-12-04T11:07:13.1118984Z [W1204 10:56:26.732224348 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1118989Z 2025-12-04T11:07:13.1119500Z [W1204 10:56:26.738233180 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1119504Z 2025-12-04T11:07:13.1120026Z [W1204 10:56:26.738841688 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1120092Z 2025-12-04T11:07:13.1120603Z [W1204 10:56:26.739028975 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1120637Z 2025-12-04T11:07:13.1121156Z [W1204 10:56:26.822595559 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1121161Z 2025-12-04T11:07:13.1121671Z [W1204 10:56:26.823242766 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1121676Z 2025-12-04T11:07:13.1122195Z [W1204 10:56:26.823439051 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1122200Z 2025-12-04T11:07:13.1122709Z [W1204 10:56:26.827310199 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1122716Z 2025-12-04T11:07:13.1123227Z [W1204 10:56:26.827923144 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1123242Z 2025-12-04T11:07:13.1123752Z [W1204 10:56:26.828117459 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1123757Z 2025-12-04T11:07:13.1124265Z [W1204 10:56:26.834100655 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1124270Z 2025-12-04T11:07:13.1124787Z [W1204 10:56:26.834882937 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1124792Z 2025-12-04T11:07:13.1125300Z [W1204 10:56:26.835088504 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1125333Z 2025-12-04T11:07:13.1125477Z ('RERUN', {'yellow': True}) [0.4396s] [100%] 2025-12-04T11:07:13.1126756Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 [W1204 10:56:27.144786516 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1126763Z 2025-12-04T11:07:13.1127287Z [W1204 10:56:27.145471184 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1127292Z 2025-12-04T11:07:13.1127801Z [W1204 10:56:27.145669837 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1127805Z 2025-12-04T11:07:13.1128324Z [W1204 10:56:27.149605683 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1128331Z 2025-12-04T11:07:13.1128845Z [W1204 10:56:27.150269713 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1128849Z 2025-12-04T11:07:13.1129361Z [W1204 10:56:27.150464788 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1129375Z 2025-12-04T11:07:13.1129884Z [W1204 10:56:27.156445124 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1129889Z 2025-12-04T11:07:13.1130394Z [W1204 10:56:27.157084199 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1130398Z 2025-12-04T11:07:13.1130912Z [W1204 10:56:27.157271348 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1130919Z 2025-12-04T11:07:13.1131488Z [W1204 10:56:27.240296888 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1131493Z 2025-12-04T11:07:13.1132010Z [W1204 10:56:27.240926056 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1132044Z 2025-12-04T11:07:13.1132553Z [W1204 10:56:27.241122325 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1132558Z 2025-12-04T11:07:13.1133077Z [W1204 10:56:27.244982611 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1133081Z 2025-12-04T11:07:13.1133590Z [W1204 10:56:27.245595879 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1133597Z 2025-12-04T11:07:13.1134117Z [W1204 10:56:27.245791927 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1134121Z 2025-12-04T11:07:13.1134626Z [W1204 10:56:27.251729407 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1134633Z 2025-12-04T11:07:13.1135144Z [W1204 10:56:27.252504471 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1135149Z 2025-12-04T11:07:13.1135668Z [W1204 10:56:27.252698857 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1135672Z 2025-12-04T11:07:13.1135772Z FAILED [0.4151s] [100%] 2025-12-04T11:07:13.1135777Z 2025-12-04T11:07:13.1135930Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1136475Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1136596Z Traceback (most recent call last): 2025-12-04T11:07:13.1137121Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1137356Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1137831Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1137993Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1138598Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1138819Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1138956Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1138965Z 2025-12-04T11:07:13.1139081Z Expected 1 but got 2. 2025-12-04T11:07:13.1139189Z Absolute difference: 1 2025-12-04T11:07:13.1139301Z Relative difference: 1.0 2025-12-04T11:07:13.1139306Z 2025-12-04T11:07:13.1139536Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1140454Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1140459Z 2025-12-04T11:07:13.1140739Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1140958Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1141073Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1141983Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1142292Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1142392Z graph_break [] 2025-12-04T11:07:13.1142619Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1143869Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1143995Z if out == self.unknown_value: 2025-12-04T11:07:13.1144725Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1144823Z warnings.warn( 2025-12-04T11:07:13.1145554Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1145664Z warnings.warn( 2025-12-04T11:07:13.1146181Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1146304Z Traceback (most recent call last): 2025-12-04T11:07:13.1146815Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1147061Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1147518Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1147682Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1148231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1148436Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1148610Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1148619Z 2025-12-04T11:07:13.1148722Z Expected 1 but got 2. 2025-12-04T11:07:13.1148828Z Absolute difference: 1 2025-12-04T11:07:13.1148950Z Relative difference: 1.0 2025-12-04T11:07:13.1148957Z 2025-12-04T11:07:13.1149170Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1150088Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1150094Z 2025-12-04T11:07:13.1150360Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1150576Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1150701Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1151600Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1151837Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1151938Z graph_break [] 2025-12-04T11:07:13.1152153Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1153380Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1153494Z if out == self.unknown_value: 2025-12-04T11:07:13.1154233Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1154335Z warnings.warn( 2025-12-04T11:07:13.1155114Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1155228Z warnings.warn( 2025-12-04T11:07:13.1155444Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1155590Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1155826Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1156726Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1156834Z graph_break [] 2025-12-04T11:07:13.1157050Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1157778Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1157893Z warnings.warn( 2025-12-04T11:07:13.1158613Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1158724Z warnings.warn( 2025-12-04T11:07:13.1158867Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1159376Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1159509Z Traceback (most recent call last): 2025-12-04T11:07:13.1160024Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1160255Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1160724Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1160926Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1161466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1161683Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1161816Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1161823Z 2025-12-04T11:07:13.1161938Z Expected 1 but got 2. 2025-12-04T11:07:13.1162045Z Absolute difference: 1 2025-12-04T11:07:13.1162155Z Relative difference: 1.0 2025-12-04T11:07:13.1162160Z 2025-12-04T11:07:13.1162382Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1163291Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1163298Z 2025-12-04T11:07:13.1163581Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1163799Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1163913Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1164819Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1165045Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1165154Z graph_break [] 2025-12-04T11:07:13.1165369Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1166580Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1166770Z if out == self.unknown_value: 2025-12-04T11:07:13.1167502Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1167631Z warnings.warn( 2025-12-04T11:07:13.1168367Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1168468Z warnings.warn( 2025-12-04T11:07:13.1168693Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1168806Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1169035Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1169952Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1170051Z graph_break [] 2025-12-04T11:07:13.1170282Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1171016Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1171118Z warnings.warn( 2025-12-04T11:07:13.1171856Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1171955Z warnings.warn( 2025-12-04T11:07:13.1172171Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1172297Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1172520Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1173460Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1173561Z graph_break [] 2025-12-04T11:07:13.1173773Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1174513Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1174613Z warnings.warn( 2025-12-04T11:07:13.1175350Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1175448Z warnings.warn( 2025-12-04T11:07:13.1176291Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a570f380a97ffdf.xml - 2025-12-04T11:07:13.1176482Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1177416Z FAILED [0.4151s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1177424Z 2025-12-04T11:07:13.1177540Z Expected 1 but got 2. 2025-12-04T11:07:13.1177647Z Absolute difference: 1 2025-12-04T11:07:13.1177753Z Relative difference: 1.0 2025-12-04T11:07:13.1177758Z 2025-12-04T11:07:13.1177984Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1178968Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1178978Z 2025-12-04T11:07:13.1179326Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1179507Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1179702Z ================== 1 failed, 10 deselected, 2 rerun in 20.02s ================== 2025-12-04T11:07:13.1179846Z Got exit code 1 2025-12-04T11:07:13.1179956Z Retrying single test... 2025-12-04T11:07:13.1180405Z W1204 10:56:37.865000 84734 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1181082Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b091dc2605282b32.xml 2025-12-04T11:07:13.1181247Z ============================= test session starts ============================== 2025-12-04T11:07:13.1181609Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1181720Z cachedir: .pytest_cache 2025-12-04T11:07:13.1182248Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1182381Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1182490Z configfile: pytest.ini 2025-12-04T11:07:13.1183034Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1183261Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1184251Z stepcurrent: skipping 3 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1184371Z Running 1 items in this shard 2025-12-04T11:07:13.1184376Z 2025-12-04T11:07:13.1185662Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 [W1204 10:56:43.399263927 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1185714Z 2025-12-04T11:07:13.1186244Z [W1204 10:56:58.083372836 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1186252Z 2025-12-04T11:07:13.1186768Z [W1204 10:56:58.083632197 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1186773Z 2025-12-04T11:07:13.1187294Z [W1204 10:56:58.090890611 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1187299Z 2025-12-04T11:07:13.1188086Z [W1204 10:56:58.091579823 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1188097Z 2025-12-04T11:07:13.1188622Z [W1204 10:56:58.091769133 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1188639Z 2025-12-04T11:07:13.1189152Z [W1204 10:56:58.098527691 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1189159Z 2025-12-04T11:07:13.1189666Z [W1204 10:56:58.099268298 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1189670Z 2025-12-04T11:07:13.1190194Z [W1204 10:56:58.099455134 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1190199Z 2025-12-04T11:07:13.1190708Z [W1204 10:56:59.229965875 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1190713Z 2025-12-04T11:07:13.1191345Z [W1204 10:56:59.231435501 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1191350Z 2025-12-04T11:07:13.1191862Z [W1204 10:56:59.231642215 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1191904Z 2025-12-04T11:07:13.1192426Z [W1204 10:56:59.235473542 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1192431Z 2025-12-04T11:07:13.1192942Z [W1204 10:56:59.236079442 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1192946Z 2025-12-04T11:07:13.1193471Z [W1204 10:56:59.236273870 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1193475Z 2025-12-04T11:07:13.1193991Z [W1204 10:56:59.242233718 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1193998Z 2025-12-04T11:07:13.1194510Z [W1204 10:56:59.242864043 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1194517Z 2025-12-04T11:07:13.1195036Z [W1204 10:56:59.243073053 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1195041Z 2025-12-04T11:07:13.1195172Z ('RERUN', {'yellow': True}) [19.5218s] [100%] 2025-12-04T11:07:13.1196459Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 [W1204 10:56:59.610102078 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1196465Z 2025-12-04T11:07:13.1197022Z [W1204 10:56:59.610815808 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1197027Z 2025-12-04T11:07:13.1197545Z [W1204 10:56:59.611011426 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1197552Z 2025-12-04T11:07:13.1198062Z [W1204 10:56:59.614895579 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1198068Z 2025-12-04T11:07:13.1198587Z [W1204 10:56:59.615519169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1198591Z 2025-12-04T11:07:13.1199103Z [W1204 10:56:59.615709007 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1199108Z 2025-12-04T11:07:13.1199618Z [W1204 10:56:59.621714453 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1199641Z 2025-12-04T11:07:13.1200154Z [W1204 10:56:59.622324446 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1200161Z 2025-12-04T11:07:13.1200667Z [W1204 10:56:59.622510999 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1200672Z 2025-12-04T11:07:13.1201192Z [W1204 10:56:59.705313068 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1201196Z 2025-12-04T11:07:13.1201707Z [W1204 10:56:59.705946234 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1201712Z 2025-12-04T11:07:13.1202234Z [W1204 10:56:59.706141954 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1202241Z 2025-12-04T11:07:13.1202803Z [W1204 10:56:59.709965572 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1202835Z 2025-12-04T11:07:13.1203359Z [W1204 10:56:59.710601992 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1203364Z 2025-12-04T11:07:13.1203873Z [W1204 10:56:59.710799857 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1203878Z 2025-12-04T11:07:13.1204399Z [W1204 10:56:59.716703532 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1204404Z 2025-12-04T11:07:13.1204912Z [W1204 10:56:59.717474285 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1204919Z 2025-12-04T11:07:13.1205432Z [W1204 10:56:59.717667045 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1205446Z 2025-12-04T11:07:13.1205579Z ('RERUN', {'yellow': True}) [0.4360s] [100%] 2025-12-04T11:07:13.1206852Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 [W1204 10:56:59.020413108 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1206858Z 2025-12-04T11:07:13.1207381Z [W1204 10:56:59.021095136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1207385Z 2025-12-04T11:07:13.1207901Z [W1204 10:56:59.021288703 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1207936Z 2025-12-04T11:07:13.1208461Z [W1204 10:56:59.025183196 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1208466Z 2025-12-04T11:07:13.1208978Z [W1204 10:56:59.025794745 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1208982Z 2025-12-04T11:07:13.1209497Z [W1204 10:56:59.025986823 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1209502Z 2025-12-04T11:07:13.1210012Z [W1204 10:56:59.031983786 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1210017Z 2025-12-04T11:07:13.1210527Z [W1204 10:56:59.032597166 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1210542Z 2025-12-04T11:07:13.1211057Z [W1204 10:56:59.032784779 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1211061Z 2025-12-04T11:07:13.1211572Z [W1204 10:56:59.115160171 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1211578Z 2025-12-04T11:07:13.1212095Z [W1204 10:56:59.115784049 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1212099Z 2025-12-04T11:07:13.1212609Z [W1204 10:56:59.115978141 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1212614Z 2025-12-04T11:07:13.1213135Z [W1204 10:56:59.119786964 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1213142Z 2025-12-04T11:07:13.1213706Z [W1204 10:56:59.120413003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1213711Z 2025-12-04T11:07:13.1214232Z [W1204 10:56:59.120610547 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1214266Z 2025-12-04T11:07:13.1214776Z [W1204 10:56:59.126471795 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1214781Z 2025-12-04T11:07:13.1215303Z [W1204 10:56:59.127250119 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1215307Z 2025-12-04T11:07:13.1215816Z [W1204 10:56:59.127443366 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1215823Z 2025-12-04T11:07:13.1215925Z FAILED [0.4077s] [100%] 2025-12-04T11:07:13.1215930Z 2025-12-04T11:07:13.1216087Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1216598Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1216735Z Traceback (most recent call last): 2025-12-04T11:07:13.1217253Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1217484Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1217965Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1218129Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1218781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1219024Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1219160Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1219166Z 2025-12-04T11:07:13.1219282Z Expected 1 but got 2. 2025-12-04T11:07:13.1219388Z Absolute difference: 1 2025-12-04T11:07:13.1219500Z Relative difference: 1.0 2025-12-04T11:07:13.1219506Z 2025-12-04T11:07:13.1219732Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1220647Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1220653Z 2025-12-04T11:07:13.1220933Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1221155Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1221270Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1222189Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1222418Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1222527Z graph_break [] 2025-12-04T11:07:13.1222744Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1223965Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1224094Z if out == self.unknown_value: 2025-12-04T11:07:13.1224826Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1224938Z warnings.warn( 2025-12-04T11:07:13.1225721Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1225865Z warnings.warn( 2025-12-04T11:07:13.1226382Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1226502Z Traceback (most recent call last): 2025-12-04T11:07:13.1227013Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1227255Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1227718Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1227890Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1228434Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1228641Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1228781Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1228789Z 2025-12-04T11:07:13.1228894Z Expected 1 but got 2. 2025-12-04T11:07:13.1229011Z Absolute difference: 1 2025-12-04T11:07:13.1229119Z Relative difference: 1.0 2025-12-04T11:07:13.1229124Z 2025-12-04T11:07:13.1229336Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1230254Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1230259Z 2025-12-04T11:07:13.1230526Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1230755Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1230906Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1231807Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1232048Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1232149Z graph_break [] 2025-12-04T11:07:13.1232361Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1233588Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1233706Z if out == self.unknown_value: 2025-12-04T11:07:13.1234444Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1234548Z warnings.warn( 2025-12-04T11:07:13.1235270Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1235381Z warnings.warn( 2025-12-04T11:07:13.1235596Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1235717Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1235947Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1236844Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1236955Z graph_break [] 2025-12-04T11:07:13.1237173Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1237965Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1238093Z warnings.warn( 2025-12-04T11:07:13.1238813Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1238921Z warnings.warn( 2025-12-04T11:07:13.1239064Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1239572Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1239704Z Traceback (most recent call last): 2025-12-04T11:07:13.1240214Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1240461Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1240922Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1241085Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1241637Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1241843Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1241975Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1241993Z 2025-12-04T11:07:13.1242096Z Expected 1 but got 2. 2025-12-04T11:07:13.1242203Z Absolute difference: 1 2025-12-04T11:07:13.1242320Z Relative difference: 1.0 2025-12-04T11:07:13.1242325Z 2025-12-04T11:07:13.1242535Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1243475Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1243481Z 2025-12-04T11:07:13.1243758Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1243977Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1244103Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1244994Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1245221Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1245329Z graph_break [] 2025-12-04T11:07:13.1245544Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1246778Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1246897Z if out == self.unknown_value: 2025-12-04T11:07:13.1247630Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1247741Z warnings.warn( 2025-12-04T11:07:13.1248465Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1248576Z warnings.warn( 2025-12-04T11:07:13.1248790Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1248901Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1249142Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1250092Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1250217Z graph_break [] 2025-12-04T11:07:13.1250439Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1251164Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1251269Z warnings.warn( 2025-12-04T11:07:13.1251990Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1252090Z warnings.warn( 2025-12-04T11:07:13.1252312Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1252428Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1252658Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1253560Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1253659Z graph_break [] 2025-12-04T11:07:13.1253884Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1254612Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1254710Z warnings.warn( 2025-12-04T11:07:13.1255443Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1255573Z warnings.warn( 2025-12-04T11:07:13.1256428Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b091dc2605282b32.xml - 2025-12-04T11:07:13.1256602Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1257542Z FAILED [0.4077s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1257558Z 2025-12-04T11:07:13.1257666Z Expected 1 but got 2. 2025-12-04T11:07:13.1257775Z Absolute difference: 1 2025-12-04T11:07:13.1257895Z Relative difference: 1.0 2025-12-04T11:07:13.1257900Z 2025-12-04T11:07:13.1258116Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1259102Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1259112Z 2025-12-04T11:07:13.1259389Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1259573Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1259782Z ================== 1 failed, 10 deselected, 2 rerun in 20.40s ================== 2025-12-04T11:07:13.1259880Z Got exit code 1 2025-12-04T11:07:13.1260710Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1261136Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.1261586Z W1204 10:57:10.723000 84908 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1262336Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a84aa230dcc2c72.xml 2025-12-04T11:07:13.1262500Z ============================= test session starts ============================== 2025-12-04T11:07:13.1262883Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1263003Z cachedir: .pytest_cache 2025-12-04T11:07:13.1263533Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1263657Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1263774Z configfile: pytest.ini 2025-12-04T11:07:13.1264318Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1264540Z collecting ... collected 58 items / 4 deselected / 54 selected 2025-12-04T11:07:13.1264682Z stepcurrent: skipping 4 already run items. 2025-12-04T11:07:13.1264796Z Running 7 items in this shard 2025-12-04T11:07:13.1264802Z 2025-12-04T11:07:13.1265711Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8271s] [ 14%] 2025-12-04T11:07:13.1266673Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4282s] [ 14%] 2025-12-04T11:07:13.1267465Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 FAILED [0.4327s] [ 14%] 2025-12-04T11:07:13.1267471Z 2025-12-04T11:07:13.1267614Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1268174Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1268301Z Traceback (most recent call last): 2025-12-04T11:07:13.1268815Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1269060Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1269521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1269683Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1270232Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1270441Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1270582Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1270590Z 2025-12-04T11:07:13.1270697Z Expected 1 but got 2. 2025-12-04T11:07:13.1270801Z Absolute difference: 1 2025-12-04T11:07:13.1270923Z Relative difference: 1.0 2025-12-04T11:07:13.1270929Z 2025-12-04T11:07:13.1271145Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1272053Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1272070Z 2025-12-04T11:07:13.1272338Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1272552Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1272675Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1273574Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1273882Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1273983Z graph_break [] 2025-12-04T11:07:13.1274200Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1274978Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1275082Z warnings.warn( 2025-12-04T11:07:13.1275808Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1275920Z warnings.warn( 2025-12-04T11:07:13.1276425Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1276559Z Traceback (most recent call last): 2025-12-04T11:07:13.1277079Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1277311Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1277783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1277950Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1278491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1278711Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1278839Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1278845Z 2025-12-04T11:07:13.1278961Z Expected 1 but got 2. 2025-12-04T11:07:13.1279064Z Absolute difference: 1 2025-12-04T11:07:13.1279169Z Relative difference: 1.0 2025-12-04T11:07:13.1279174Z 2025-12-04T11:07:13.1279429Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1280338Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1280345Z 2025-12-04T11:07:13.1280623Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1280838Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1280951Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1281856Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1282083Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1282193Z graph_break [] 2025-12-04T11:07:13.1282409Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1283146Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1283262Z warnings.warn( 2025-12-04T11:07:13.1283980Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1284079Z warnings.warn( 2025-12-04T11:07:13.1284303Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1284415Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1284653Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1285548Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1285703Z graph_break [] 2025-12-04T11:07:13.1285931Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1286662Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1286806Z warnings.warn( 2025-12-04T11:07:13.1287526Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1287622Z warnings.warn( 2025-12-04T11:07:13.1287995Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1288603Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1288726Z Traceback (most recent call last): 2025-12-04T11:07:13.1289258Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1289489Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1289961Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1290126Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1290662Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1290873Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1291002Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1291008Z 2025-12-04T11:07:13.1291127Z Expected 1 but got 2. 2025-12-04T11:07:13.1291228Z Absolute difference: 1 2025-12-04T11:07:13.1291334Z Relative difference: 1.0 2025-12-04T11:07:13.1291405Z 2025-12-04T11:07:13.1291634Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1292543Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1292551Z 2025-12-04T11:07:13.1292826Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1293043Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1293161Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1294069Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1294289Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1294390Z graph_break [] 2025-12-04T11:07:13.1294620Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1295354Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1295465Z warnings.warn( 2025-12-04T11:07:13.1296188Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1296286Z warnings.warn( 2025-12-04T11:07:13.1296513Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1296624Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1296851Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1297751Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1297928Z graph_break [] 2025-12-04T11:07:13.1298156Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1298946Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1299095Z warnings.warn( 2025-12-04T11:07:13.1299822Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1299918Z warnings.warn( 2025-12-04T11:07:13.1300142Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1300251Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1300472Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1301379Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1301475Z graph_break [] 2025-12-04T11:07:13.1301687Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1302424Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1302523Z warnings.warn( 2025-12-04T11:07:13.1303254Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1303349Z warnings.warn( 2025-12-04T11:07:13.1304195Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a84aa230dcc2c72.xml - 2025-12-04T11:07:13.1304409Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1305343Z FAILED [0.4327s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1305351Z 2025-12-04T11:07:13.1305465Z Expected 1 but got 2. 2025-12-04T11:07:13.1305569Z Absolute difference: 1 2025-12-04T11:07:13.1305677Z Relative difference: 1.0 2025-12-04T11:07:13.1305682Z 2025-12-04T11:07:13.1305906Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1306812Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1306820Z 2025-12-04T11:07:13.1307093Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1307270Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1307460Z =================== 1 failed, 4 deselected, 2 rerun in 4.72s =================== 2025-12-04T11:07:13.1307566Z Got exit code 1 2025-12-04T11:07:13.1307668Z Retrying single test... 2025-12-04T11:07:13.1308123Z W1204 10:57:30.135000 85077 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1308786Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-622e1501e18bdb77.xml 2025-12-04T11:07:13.1308946Z ============================= test session starts ============================== 2025-12-04T11:07:13.1309305Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1309409Z cachedir: .pytest_cache 2025-12-04T11:07:13.1309989Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1310126Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1310230Z configfile: pytest.ini 2025-12-04T11:07:13.1310819Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1311034Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1312022Z stepcurrent: skipping 4 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1312147Z Running 1 items in this shard 2025-12-04T11:07:13.1312152Z 2025-12-04T11:07:13.1313441Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 [W1204 10:57:35.687150107 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1313449Z 2025-12-04T11:07:13.1313983Z [W1204 10:57:51.564281185 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1313990Z 2025-12-04T11:07:13.1314501Z [W1204 10:57:51.564539134 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1314506Z 2025-12-04T11:07:13.1315023Z [W1204 10:57:51.571902014 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1315028Z 2025-12-04T11:07:13.1315536Z [W1204 10:57:51.572581658 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1315571Z 2025-12-04T11:07:13.1316092Z [W1204 10:57:51.572770208 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1316096Z 2025-12-04T11:07:13.1316605Z [W1204 10:57:51.579611930 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1316612Z 2025-12-04T11:07:13.1317118Z [W1204 10:57:51.580387109 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1317130Z 2025-12-04T11:07:13.1317637Z [W1204 10:57:51.580578635 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1317642Z 2025-12-04T11:07:13.1318149Z [W1204 10:57:51.712535265 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1318154Z 2025-12-04T11:07:13.1318674Z [W1204 10:57:51.714008869 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1318679Z 2025-12-04T11:07:13.1319186Z [W1204 10:57:51.714210535 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1319193Z 2025-12-04T11:07:13.1319707Z [W1204 10:57:51.718090837 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1319712Z 2025-12-04T11:07:13.1320221Z [W1204 10:57:51.718700219 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1320225Z 2025-12-04T11:07:13.1320742Z [W1204 10:57:51.718895050 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1320747Z 2025-12-04T11:07:13.1321254Z [W1204 10:57:51.724938535 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1321339Z 2025-12-04T11:07:13.1321856Z [W1204 10:57:51.725568415 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1321890Z 2025-12-04T11:07:13.1322399Z [W1204 10:57:51.725762176 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1322404Z 2025-12-04T11:07:13.1322537Z ('RERUN', {'yellow': True}) [19.7293s] [100%] 2025-12-04T11:07:13.1323823Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 [W1204 10:57:51.115595243 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1323829Z 2025-12-04T11:07:13.1324338Z [W1204 10:57:51.116298262 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1324349Z 2025-12-04T11:07:13.1324869Z [W1204 10:57:51.116494778 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1324877Z 2025-12-04T11:07:13.1325389Z [W1204 10:57:51.120497651 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1325393Z 2025-12-04T11:07:13.1325918Z [W1204 10:57:51.121113922 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1325923Z 2025-12-04T11:07:13.1326428Z [W1204 10:57:51.121304385 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1326433Z 2025-12-04T11:07:13.1326956Z [W1204 10:57:51.127330427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1326992Z 2025-12-04T11:07:13.1327506Z [W1204 10:57:51.127942639 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1327512Z 2025-12-04T11:07:13.1328019Z [W1204 10:57:51.128131310 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1328033Z 2025-12-04T11:07:13.1328545Z [W1204 10:57:52.215361443 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1328549Z 2025-12-04T11:07:13.1329056Z [W1204 10:57:52.216099768 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1329061Z 2025-12-04T11:07:13.1329577Z [W1204 10:57:52.216316080 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1329584Z 2025-12-04T11:07:13.1330096Z [W1204 10:57:52.220287594 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1330101Z 2025-12-04T11:07:13.1330618Z [W1204 10:57:52.220931012 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1330623Z 2025-12-04T11:07:13.1331129Z [W1204 10:57:52.221127692 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1331134Z 2025-12-04T11:07:13.1331647Z [W1204 10:57:52.227242807 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1331652Z 2025-12-04T11:07:13.1332159Z [W1204 10:57:52.228040840 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1332166Z 2025-12-04T11:07:13.1332739Z [W1204 10:57:52.228236370 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1332744Z 2025-12-04T11:07:13.1332876Z ('RERUN', {'yellow': True}) [0.4647s] [100%] 2025-12-04T11:07:13.1334179Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 [W1204 10:57:52.548512461 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1334195Z 2025-12-04T11:07:13.1334705Z [W1204 10:57:52.549193336 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1334709Z 2025-12-04T11:07:13.1335217Z [W1204 10:57:52.549387315 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1335224Z 2025-12-04T11:07:13.1335745Z [W1204 10:57:52.553353711 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1335750Z 2025-12-04T11:07:13.1336258Z [W1204 10:57:52.553968323 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1336265Z 2025-12-04T11:07:13.1336785Z [W1204 10:57:52.554157572 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1336790Z 2025-12-04T11:07:13.1337297Z [W1204 10:57:52.560213419 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1337302Z 2025-12-04T11:07:13.1337813Z [W1204 10:57:52.560825465 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1337850Z 2025-12-04T11:07:13.1338435Z [W1204 10:57:52.561013518 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1338440Z 2025-12-04T11:07:13.1338961Z [W1204 10:57:52.644470516 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1338968Z 2025-12-04T11:07:13.1339478Z [W1204 10:57:52.645102691 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1339482Z 2025-12-04T11:07:13.1339990Z [W1204 10:57:52.645298452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1339994Z 2025-12-04T11:07:13.1340509Z [W1204 10:57:52.649189049 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1340515Z 2025-12-04T11:07:13.1341030Z [W1204 10:57:52.649801412 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1341035Z 2025-12-04T11:07:13.1341551Z [W1204 10:57:52.649995801 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1341558Z 2025-12-04T11:07:13.1342066Z [W1204 10:57:52.656049202 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1342072Z 2025-12-04T11:07:13.1342589Z [W1204 10:57:52.656822838 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1342593Z 2025-12-04T11:07:13.1343101Z [W1204 10:57:52.657018608 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1343105Z 2025-12-04T11:07:13.1343213Z FAILED [0.4253s] [100%] 2025-12-04T11:07:13.1343217Z 2025-12-04T11:07:13.1343419Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1343931Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1344089Z Traceback (most recent call last): 2025-12-04T11:07:13.1344604Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1344834Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1345310Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1345469Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1346018Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1346221Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1346354Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1346359Z 2025-12-04T11:07:13.1346471Z Expected 1 but got 2. 2025-12-04T11:07:13.1346573Z Absolute difference: 1 2025-12-04T11:07:13.1346692Z Relative difference: 1.0 2025-12-04T11:07:13.1346698Z 2025-12-04T11:07:13.1346911Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1347820Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1347825Z 2025-12-04T11:07:13.1348099Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1348321Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1348449Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1349382Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1349611Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1349726Z graph_break [] 2025-12-04T11:07:13.1349943Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1351156Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1351287Z if out == self.unknown_value: 2025-12-04T11:07:13.1352025Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1352142Z warnings.warn( 2025-12-04T11:07:13.1352874Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1352975Z warnings.warn( 2025-12-04T11:07:13.1353500Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1353627Z Traceback (most recent call last): 2025-12-04T11:07:13.1354153Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1354387Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1354851Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1355032Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1355574Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1355857Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1355993Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1356030Z 2025-12-04T11:07:13.1356140Z Expected 1 but got 2. 2025-12-04T11:07:13.1356261Z Absolute difference: 1 2025-12-04T11:07:13.1356372Z Relative difference: 1.0 2025-12-04T11:07:13.1356377Z 2025-12-04T11:07:13.1356596Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1357523Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1357528Z 2025-12-04T11:07:13.1357798Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1358034Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1358156Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1359053Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1359297Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1359400Z graph_break [] 2025-12-04T11:07:13.1359632Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1360856Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1360975Z if out == self.unknown_value: 2025-12-04T11:07:13.1361731Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1361882Z warnings.warn( 2025-12-04T11:07:13.1362626Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1362736Z warnings.warn( 2025-12-04T11:07:13.1362955Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1363085Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1363314Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1364211Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1364324Z graph_break [] 2025-12-04T11:07:13.1364545Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1365300Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1365404Z warnings.warn( 2025-12-04T11:07:13.1366129Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1366243Z warnings.warn( 2025-12-04T11:07:13.1366389Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1366909Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1367030Z Traceback (most recent call last): 2025-12-04T11:07:13.1367545Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1367793Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1368313Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1368480Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1369064Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1369275Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1369418Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1369424Z 2025-12-04T11:07:13.1369528Z Expected 1 but got 2. 2025-12-04T11:07:13.1369638Z Absolute difference: 1 2025-12-04T11:07:13.1369759Z Relative difference: 1.0 2025-12-04T11:07:13.1369763Z 2025-12-04T11:07:13.1369980Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1370909Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1370917Z 2025-12-04T11:07:13.1371185Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1371406Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1371534Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1372432Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1372676Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1372775Z graph_break [] 2025-12-04T11:07:13.1372992Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1374253Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1374371Z if out == self.unknown_value: 2025-12-04T11:07:13.1375102Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1375218Z warnings.warn( 2025-12-04T11:07:13.1375942Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1376056Z warnings.warn( 2025-12-04T11:07:13.1376276Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1376392Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1376635Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1377535Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1377650Z graph_break [] 2025-12-04T11:07:13.1377865Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1378657Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1378774Z warnings.warn( 2025-12-04T11:07:13.1379498Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1379598Z warnings.warn( 2025-12-04T11:07:13.1379826Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1379947Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1380252Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1381147Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1381277Z graph_break [] 2025-12-04T11:07:13.1381504Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1382230Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1382343Z warnings.warn( 2025-12-04T11:07:13.1383064Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1383166Z warnings.warn( 2025-12-04T11:07:13.1384035Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-622e1501e18bdb77.xml - 2025-12-04T11:07:13.1384212Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1385165Z FAILED [0.4253s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1385171Z 2025-12-04T11:07:13.1385277Z Expected 1 but got 2. 2025-12-04T11:07:13.1385387Z Absolute difference: 1 2025-12-04T11:07:13.1385510Z Relative difference: 1.0 2025-12-04T11:07:13.1385515Z 2025-12-04T11:07:13.1385735Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1386662Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1386698Z 2025-12-04T11:07:13.1386971Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1387155Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1387367Z ================== 1 failed, 10 deselected, 2 rerun in 20.65s ================== 2025-12-04T11:07:13.1387467Z Got exit code 1 2025-12-04T11:07:13.1387575Z Retrying single test... 2025-12-04T11:07:13.1388314Z W1204 10:58:03.196000 85251 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1388989Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-af0abf9f8a1870a5.xml 2025-12-04T11:07:13.1389169Z ============================= test session starts ============================== 2025-12-04T11:07:13.1389535Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1389646Z cachedir: .pytest_cache 2025-12-04T11:07:13.1390191Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1390321Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1390433Z configfile: pytest.ini 2025-12-04T11:07:13.1390995Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1391215Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1392218Z stepcurrent: skipping 4 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1392336Z Running 1 items in this shard 2025-12-04T11:07:13.1392341Z 2025-12-04T11:07:13.1393740Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 [W1204 10:58:08.757255853 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1393783Z 2025-12-04T11:07:13.1394308Z [W1204 10:58:23.982540359 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1394313Z 2025-12-04T11:07:13.1394830Z [W1204 10:58:23.982814320 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1394848Z 2025-12-04T11:07:13.1395362Z [W1204 10:58:23.990124646 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1395367Z 2025-12-04T11:07:13.1395887Z [W1204 10:58:23.990799685 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1395893Z 2025-12-04T11:07:13.1396417Z [W1204 10:58:23.990988830 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1396425Z 2025-12-04T11:07:13.1396935Z [W1204 10:58:23.997759608 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1396940Z 2025-12-04T11:07:13.1397463Z [W1204 10:58:23.998508054 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1397468Z 2025-12-04T11:07:13.1397980Z [W1204 10:58:23.998693981 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1397985Z 2025-12-04T11:07:13.1398515Z [W1204 10:58:23.130436972 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1398560Z 2025-12-04T11:07:13.1399075Z [W1204 10:58:23.132053788 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1399082Z 2025-12-04T11:07:13.1399605Z [W1204 10:58:23.132257066 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1399610Z 2025-12-04T11:07:13.1400120Z [W1204 10:58:23.136114799 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1400125Z 2025-12-04T11:07:13.1400635Z [W1204 10:58:23.136735419 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1400652Z 2025-12-04T11:07:13.1401160Z [W1204 10:58:23.136930791 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1401170Z 2025-12-04T11:07:13.1401681Z [W1204 10:58:24.142998598 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1401689Z 2025-12-04T11:07:13.1402211Z [W1204 10:58:24.143636270 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1402216Z 2025-12-04T11:07:13.1402726Z [W1204 10:58:24.143829916 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1402731Z 2025-12-04T11:07:13.1402878Z ('RERUN', {'yellow': True}) [19.0842s] [100%] 2025-12-04T11:07:13.1404157Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 [W1204 10:58:24.534120867 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1404223Z 2025-12-04T11:07:13.1404754Z [W1204 10:58:24.534834636 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1404804Z 2025-12-04T11:07:13.1405315Z [W1204 10:58:24.535035120 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1405320Z 2025-12-04T11:07:13.1405843Z [W1204 10:58:24.538930000 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1405848Z 2025-12-04T11:07:13.1406356Z [W1204 10:58:24.539562035 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1406361Z 2025-12-04T11:07:13.1406871Z [W1204 10:58:24.539753427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1406878Z 2025-12-04T11:07:13.1407407Z [W1204 10:58:24.545803368 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1407413Z 2025-12-04T11:07:13.1407919Z [W1204 10:58:24.546418905 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1407924Z 2025-12-04T11:07:13.1408448Z [W1204 10:58:24.546607443 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1408453Z 2025-12-04T11:07:13.1408963Z [W1204 10:58:24.630522789 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1408968Z 2025-12-04T11:07:13.1409493Z [W1204 10:58:24.631181885 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1409533Z 2025-12-04T11:07:13.1410051Z [W1204 10:58:24.631380150 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1410056Z 2025-12-04T11:07:13.1410583Z [W1204 10:58:24.635220043 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1410587Z 2025-12-04T11:07:13.1411096Z [W1204 10:58:24.635835683 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1411100Z 2025-12-04T11:07:13.1411612Z [W1204 10:58:24.636031854 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1411629Z 2025-12-04T11:07:13.1412140Z [W1204 10:58:24.641963304 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1412147Z 2025-12-04T11:07:13.1412664Z [W1204 10:58:24.642742541 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1412669Z 2025-12-04T11:07:13.1413194Z [W1204 10:58:24.642938025 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1413201Z 2025-12-04T11:07:13.1413331Z ('RERUN', {'yellow': True}) [0.4597s] [100%] 2025-12-04T11:07:13.1414619Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 [W1204 10:58:24.960945138 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1414624Z 2025-12-04T11:07:13.1415135Z [W1204 10:58:24.961633052 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1415142Z 2025-12-04T11:07:13.1415723Z [W1204 10:58:24.961831909 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1415728Z 2025-12-04T11:07:13.1416239Z [W1204 10:58:24.965712195 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1416273Z 2025-12-04T11:07:13.1416802Z [W1204 10:58:24.966316136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1416807Z 2025-12-04T11:07:13.1417318Z [W1204 10:58:24.966507247 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1417323Z 2025-12-04T11:07:13.1417833Z [W1204 10:58:24.972539168 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1417852Z 2025-12-04T11:07:13.1418439Z [W1204 10:58:24.973149630 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1418444Z 2025-12-04T11:07:13.1418957Z [W1204 10:58:24.973339791 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1418964Z 2025-12-04T11:07:13.1419495Z [W1204 10:58:24.056625780 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1419499Z 2025-12-04T11:07:13.1420013Z [W1204 10:58:24.057264668 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1420017Z 2025-12-04T11:07:13.1420542Z [W1204 10:58:24.057462978 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1420582Z 2025-12-04T11:07:13.1421102Z [W1204 10:58:24.061353301 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1421107Z 2025-12-04T11:07:13.1421629Z [W1204 10:58:24.061966690 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1421636Z 2025-12-04T11:07:13.1422146Z [W1204 10:58:24.062161286 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1422150Z 2025-12-04T11:07:13.1422675Z [W1204 10:58:24.068064465 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1422679Z 2025-12-04T11:07:13.1423191Z [W1204 10:58:24.068840133 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1423195Z 2025-12-04T11:07:13.1423713Z [W1204 10:58:24.069034776 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1423718Z 2025-12-04T11:07:13.1423834Z FAILED [0.4239s] [100%] 2025-12-04T11:07:13.1423839Z 2025-12-04T11:07:13.1423982Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1424510Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1424634Z Traceback (most recent call last): 2025-12-04T11:07:13.1425153Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1425398Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1425868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1426044Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1426646Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1426861Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1427037Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1427043Z 2025-12-04T11:07:13.1427150Z Expected 1 but got 2. 2025-12-04T11:07:13.1427257Z Absolute difference: 1 2025-12-04T11:07:13.1427380Z Relative difference: 1.0 2025-12-04T11:07:13.1427385Z 2025-12-04T11:07:13.1427607Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1428529Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1428535Z 2025-12-04T11:07:13.1428800Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1429023Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1429155Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1430050Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1430289Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1430387Z graph_break [] 2025-12-04T11:07:13.1430604Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1431834Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1431950Z if out == self.unknown_value: 2025-12-04T11:07:13.1432731Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1432833Z warnings.warn( 2025-12-04T11:07:13.1433557Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1433670Z warnings.warn( 2025-12-04T11:07:13.1434181Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1434317Z Traceback (most recent call last): 2025-12-04T11:07:13.1434836Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1435066Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1435538Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1435704Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1436243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1436460Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1436592Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1436597Z 2025-12-04T11:07:13.1436715Z Expected 1 but got 2. 2025-12-04T11:07:13.1436821Z Absolute difference: 1 2025-12-04T11:07:13.1436931Z Relative difference: 1.0 2025-12-04T11:07:13.1436936Z 2025-12-04T11:07:13.1437157Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1438073Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1438080Z 2025-12-04T11:07:13.1438419Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1438637Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1438753Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1439691Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1439921Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1440018Z graph_break [] 2025-12-04T11:07:13.1440246Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1441463Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1441599Z if out == self.unknown_value: 2025-12-04T11:07:13.1442335Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1442437Z warnings.warn( 2025-12-04T11:07:13.1443174Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1443275Z warnings.warn( 2025-12-04T11:07:13.1443502Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1443614Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1443841Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1444755Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1444881Z graph_break [] 2025-12-04T11:07:13.1445108Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1445834Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1445932Z warnings.warn( 2025-12-04T11:07:13.1446661Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1446765Z warnings.warn( 2025-12-04T11:07:13.1446910Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1447424Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1447550Z Traceback (most recent call last): 2025-12-04T11:07:13.1448073Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1448302Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1448766Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1448941Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1449479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1449698Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1449831Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1449837Z 2025-12-04T11:07:13.1449941Z Expected 1 but got 2. 2025-12-04T11:07:13.1450059Z Absolute difference: 1 2025-12-04T11:07:13.1450164Z Relative difference: 1.0 2025-12-04T11:07:13.1450171Z 2025-12-04T11:07:13.1450464Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1451386Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1451422Z 2025-12-04T11:07:13.1451690Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1451917Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1452029Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1452919Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1453151Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1453252Z graph_break [] 2025-12-04T11:07:13.1453480Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1454695Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1454815Z if out == self.unknown_value: 2025-12-04T11:07:13.1455552Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1455652Z warnings.warn( 2025-12-04T11:07:13.1456390Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1456487Z warnings.warn( 2025-12-04T11:07:13.1456735Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1456859Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1457086Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1457983Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1458092Z graph_break [] 2025-12-04T11:07:13.1458384Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1459123Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1459222Z warnings.warn( 2025-12-04T11:07:13.1459939Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1460055Z warnings.warn( 2025-12-04T11:07:13.1460274Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1460405Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1460633Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1461536Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1461647Z graph_break [] 2025-12-04T11:07:13.1461863Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1462591Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1462706Z warnings.warn( 2025-12-04T11:07:13.1463489Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1463602Z warnings.warn( 2025-12-04T11:07:13.1464451Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-af0abf9f8a1870a5.xml - 2025-12-04T11:07:13.1464652Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1465600Z FAILED [0.4239s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1465606Z 2025-12-04T11:07:13.1465709Z Expected 1 but got 2. 2025-12-04T11:07:13.1465828Z Absolute difference: 1 2025-12-04T11:07:13.1465933Z Relative difference: 1.0 2025-12-04T11:07:13.1465941Z 2025-12-04T11:07:13.1466156Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1467078Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1467086Z 2025-12-04T11:07:13.1467352Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1467544Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1467740Z ================== 1 failed, 10 deselected, 2 rerun in 20.00s ================== 2025-12-04T11:07:13.1467836Z Got exit code 1 2025-12-04T11:07:13.1468671Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1469082Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.1469574Z W1204 10:58:35.831000 85425 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1470239Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a29cc44636754e7c.xml 2025-12-04T11:07:13.1470402Z ============================= test session starts ============================== 2025-12-04T11:07:13.1470768Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1470878Z cachedir: .pytest_cache 2025-12-04T11:07:13.1471414Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1471536Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1471644Z configfile: pytest.ini 2025-12-04T11:07:13.1472198Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1472418Z collecting ... collected 58 items / 5 deselected / 53 selected 2025-12-04T11:07:13.1472561Z stepcurrent: skipping 5 already run items. 2025-12-04T11:07:13.1472684Z Running 6 items in this shard 2025-12-04T11:07:13.1472690Z 2025-12-04T11:07:13.1473565Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.7689s] [ 16%] 2025-12-04T11:07:13.1474443Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.3837s] [ 16%] 2025-12-04T11:07:13.1475226Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 FAILED [0.3749s] [ 16%] 2025-12-04T11:07:13.1475234Z 2025-12-04T11:07:13.1475442Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1475952Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1476103Z Traceback (most recent call last): 2025-12-04T11:07:13.1476628Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1476857Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1477324Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1477498Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1478037Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1478258Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1478389Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1478394Z 2025-12-04T11:07:13.1478499Z Expected 1 but got 2. 2025-12-04T11:07:13.1478614Z Absolute difference: 1 2025-12-04T11:07:13.1478723Z Relative difference: 1.0 2025-12-04T11:07:13.1478728Z 2025-12-04T11:07:13.1478945Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1479868Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1479874Z 2025-12-04T11:07:13.1480137Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1480363Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1480476Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1481045Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1481287Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1481383Z graph_break [] 2025-12-04T11:07:13.1481615Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1482348Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1482446Z warnings.warn( 2025-12-04T11:07:13.1483185Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1483282Z warnings.warn( 2025-12-04T11:07:13.1483804Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1483926Z Traceback (most recent call last): 2025-12-04T11:07:13.1484438Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1484682Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1485142Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1485307Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1485856Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1486061Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1486206Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1486212Z 2025-12-04T11:07:13.1486313Z Expected 1 but got 2. 2025-12-04T11:07:13.1486417Z Absolute difference: 1 2025-12-04T11:07:13.1486539Z Relative difference: 1.0 2025-12-04T11:07:13.1486544Z 2025-12-04T11:07:13.1486824Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1487935Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1488017Z 2025-12-04T11:07:13.1488341Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1488559Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1488682Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1489214Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1489452Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1489550Z graph_break [] 2025-12-04T11:07:13.1489770Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1490520Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1490625Z warnings.warn( 2025-12-04T11:07:13.1491350Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1491460Z warnings.warn( 2025-12-04T11:07:13.1491676Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1491800Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1492023Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1492553Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1492711Z graph_break [] 2025-12-04T11:07:13.1492930Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1493660Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1493773Z warnings.warn( 2025-12-04T11:07:13.1494494Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1494606Z warnings.warn( 2025-12-04T11:07:13.1494749Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1495254Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1495390Z Traceback (most recent call last): 2025-12-04T11:07:13.1495909Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1496141Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1496611Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1496778Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1497328Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1497536Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1497670Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1497675Z 2025-12-04T11:07:13.1497790Z Expected 1 but got 2. 2025-12-04T11:07:13.1497896Z Absolute difference: 1 2025-12-04T11:07:13.1498018Z Relative difference: 1.0 2025-12-04T11:07:13.1498023Z 2025-12-04T11:07:13.1498240Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1499545Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1499584Z 2025-12-04T11:07:13.1499871Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1500092Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1500219Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1500751Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1500982Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1501093Z graph_break [] 2025-12-04T11:07:13.1501309Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1502052Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1502167Z warnings.warn( 2025-12-04T11:07:13.1502890Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1503001Z warnings.warn( 2025-12-04T11:07:13.1503217Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1503332Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1503570Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1504101Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1504200Z graph_break [] 2025-12-04T11:07:13.1504418Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1505180Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1505289Z warnings.warn( 2025-12-04T11:07:13.1506009Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1506110Z warnings.warn( 2025-12-04T11:07:13.1506334Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1506444Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1506679Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1507206Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1507304Z graph_break [] 2025-12-04T11:07:13.1507531Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1508255Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1508358Z warnings.warn( 2025-12-04T11:07:13.1509092Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1509189Z warnings.warn( 2025-12-04T11:07:13.1510041Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a29cc44636754e7c.xml - 2025-12-04T11:07:13.1510210Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1511214Z FAILED [0.3749s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1511237Z 2025-12-04T11:07:13.1511342Z Expected 1 but got 2. 2025-12-04T11:07:13.1511447Z Absolute difference: 1 2025-12-04T11:07:13.1511600Z Relative difference: 1.0 2025-12-04T11:07:13.1511605Z 2025-12-04T11:07:13.1511823Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1512740Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1512746Z 2025-12-04T11:07:13.1513018Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1513195Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1513402Z =================== 1 failed, 5 deselected, 2 rerun in 4.56s =================== 2025-12-04T11:07:13.1513501Z Got exit code 1 2025-12-04T11:07:13.1513612Z Retrying single test... 2025-12-04T11:07:13.1514074Z W1204 10:58:55.288000 85601 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1514744Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ad8948ec33d2b35b.xml 2025-12-04T11:07:13.1514908Z ============================= test session starts ============================== 2025-12-04T11:07:13.1515270Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1515376Z cachedir: .pytest_cache 2025-12-04T11:07:13.1515906Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1516029Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1516135Z configfile: pytest.ini 2025-12-04T11:07:13.1516730Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1516949Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1517955Z stepcurrent: skipping 5 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1518066Z Running 1 items in this shard 2025-12-04T11:07:13.1518071Z 2025-12-04T11:07:13.1519354Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 [W1204 10:58:58.911930105 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1519371Z 2025-12-04T11:07:13.1519893Z [W1204 10:59:14.425928060 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1519901Z 2025-12-04T11:07:13.1520410Z [W1204 10:59:14.426188229 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1520418Z 2025-12-04T11:07:13.1520942Z [W1204 10:59:14.433528170 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1520948Z 2025-12-04T11:07:13.1521453Z [W1204 10:59:14.434191588 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1521457Z 2025-12-04T11:07:13.1521976Z [W1204 10:59:14.434379188 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1521981Z 2025-12-04T11:07:13.1522489Z [W1204 10:59:14.441274609 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1522557Z 2025-12-04T11:07:13.1523077Z [W1204 10:59:14.441899632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1523113Z 2025-12-04T11:07:13.1523624Z [W1204 10:59:14.442085581 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1523628Z 2025-12-04T11:07:13.1524147Z [W1204 10:59:16.436837488 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1524152Z 2025-12-04T11:07:13.1524660Z [W1204 10:59:16.438434153 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1524665Z 2025-12-04T11:07:13.1525171Z [W1204 10:59:16.438641628 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1525189Z 2025-12-04T11:07:13.1525704Z [W1204 10:59:16.442570822 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1525708Z 2025-12-04T11:07:13.1526218Z [W1204 10:59:16.443214593 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1526223Z 2025-12-04T11:07:13.1526739Z [W1204 10:59:16.443407714 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1526744Z 2025-12-04T11:07:13.1527255Z [W1204 10:59:16.449479371 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1527259Z 2025-12-04T11:07:13.1527779Z [W1204 10:59:16.450114232 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1527816Z 2025-12-04T11:07:13.1528332Z [W1204 10:59:16.450311002 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1528336Z 2025-12-04T11:07:13.1528480Z ('RERUN', {'yellow': True}) [19.2991s] [100%] 2025-12-04T11:07:13.1529770Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 [W1204 10:59:16.784656904 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1529776Z 2025-12-04T11:07:13.1530287Z [W1204 10:59:16.785351038 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1530302Z 2025-12-04T11:07:13.1530810Z [W1204 10:59:16.785548953 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1530817Z 2025-12-04T11:07:13.1531333Z [W1204 10:59:16.789470754 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1531338Z 2025-12-04T11:07:13.1531859Z [W1204 10:59:16.790254840 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1531864Z 2025-12-04T11:07:13.1532377Z [W1204 10:59:16.790453949 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1532382Z 2025-12-04T11:07:13.1532907Z [W1204 10:59:16.796460077 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1532911Z 2025-12-04T11:07:13.1533421Z [W1204 10:59:16.797073187 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1533428Z 2025-12-04T11:07:13.1534004Z [W1204 10:59:16.797263148 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1534009Z 2025-12-04T11:07:13.1534518Z [W1204 10:59:16.880873345 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1534552Z 2025-12-04T11:07:13.1535072Z [W1204 10:59:16.881562108 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1535076Z 2025-12-04T11:07:13.1535584Z [W1204 10:59:16.881761997 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1535589Z 2025-12-04T11:07:13.1536094Z [W1204 10:59:16.885667640 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1536111Z 2025-12-04T11:07:13.1536621Z [W1204 10:59:16.886287581 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1536625Z 2025-12-04T11:07:13.1537133Z [W1204 10:59:16.886480703 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1537140Z 2025-12-04T11:07:13.1537654Z [W1204 10:59:16.892498600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1537659Z 2025-12-04T11:07:13.1538169Z [W1204 10:59:16.893285632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1538174Z 2025-12-04T11:07:13.1538767Z [W1204 10:59:16.893479777 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1538773Z 2025-12-04T11:07:13.1538953Z ('RERUN', {'yellow': True}) [0.4051s] [100%] 2025-12-04T11:07:13.1540259Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 [W1204 10:59:17.166417332 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1540267Z 2025-12-04T11:07:13.1540778Z [W1204 10:59:17.167123826 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1540782Z 2025-12-04T11:07:13.1541307Z [W1204 10:59:17.167329968 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1541312Z 2025-12-04T11:07:13.1541819Z [W1204 10:59:17.171283339 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1541826Z 2025-12-04T11:07:13.1542340Z [W1204 10:59:17.172051913 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1542345Z 2025-12-04T11:07:13.1542861Z [W1204 10:59:17.172242796 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1542868Z 2025-12-04T11:07:13.1543373Z [W1204 10:59:17.178226180 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1543377Z 2025-12-04T11:07:13.1543896Z [W1204 10:59:17.178828891 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1543900Z 2025-12-04T11:07:13.1544412Z [W1204 10:59:17.179016750 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1544417Z 2025-12-04T11:07:13.1544998Z [W1204 10:59:17.262113522 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1545003Z 2025-12-04T11:07:13.1545516Z [W1204 10:59:17.262744307 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1545549Z 2025-12-04T11:07:13.1546068Z [W1204 10:59:17.262940502 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1546073Z 2025-12-04T11:07:13.1546581Z [W1204 10:59:17.266821190 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1546586Z 2025-12-04T11:07:13.1547096Z [W1204 10:59:17.267440610 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1547111Z 2025-12-04T11:07:13.1547617Z [W1204 10:59:17.267634977 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1547629Z 2025-12-04T11:07:13.1548137Z [W1204 10:59:17.273583629 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1548145Z 2025-12-04T11:07:13.1548664Z [W1204 10:59:17.274356234 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1548670Z 2025-12-04T11:07:13.1549176Z [W1204 10:59:17.274550155 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1549181Z 2025-12-04T11:07:13.1549295Z FAILED [0.3781s] [100%] 2025-12-04T11:07:13.1549300Z 2025-12-04T11:07:13.1549440Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1549958Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1550114Z Traceback (most recent call last): 2025-12-04T11:07:13.1550630Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1550874Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1551343Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1551507Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1552057Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1552259Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1552403Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1552408Z 2025-12-04T11:07:13.1552508Z Expected 1 but got 2. 2025-12-04T11:07:13.1552615Z Absolute difference: 1 2025-12-04T11:07:13.1552735Z Relative difference: 1.0 2025-12-04T11:07:13.1552740Z 2025-12-04T11:07:13.1552959Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1553872Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1553892Z 2025-12-04T11:07:13.1554154Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1554379Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1554502Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1555035Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1555264Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1555375Z graph_break [] 2025-12-04T11:07:13.1555650Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1556876Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1557025Z if out == self.unknown_value: 2025-12-04T11:07:13.1557756Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1557865Z warnings.warn( 2025-12-04T11:07:13.1558593Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1558702Z warnings.warn( 2025-12-04T11:07:13.1559215Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1559341Z Traceback (most recent call last): 2025-12-04T11:07:13.1559865Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1560096Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1560554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1560728Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1561265Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1561479Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1561606Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1561612Z 2025-12-04T11:07:13.1561747Z Expected 1 but got 2. 2025-12-04T11:07:13.1561862Z Absolute difference: 1 2025-12-04T11:07:13.1561976Z Relative difference: 1.0 2025-12-04T11:07:13.1561981Z 2025-12-04T11:07:13.1562205Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1563118Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1563126Z 2025-12-04T11:07:13.1563394Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1563622Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1563734Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1564275Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1564502Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1564608Z graph_break [] 2025-12-04T11:07:13.1564840Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1566055Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1566174Z if out == self.unknown_value: 2025-12-04T11:07:13.1566920Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1567021Z warnings.warn( 2025-12-04T11:07:13.1567756Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1567858Z warnings.warn( 2025-12-04T11:07:13.1568077Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1568263Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1568494Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1569066Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1569166Z graph_break [] 2025-12-04T11:07:13.1569384Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1570128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1570230Z warnings.warn( 2025-12-04T11:07:13.1570955Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1571070Z warnings.warn( 2025-12-04T11:07:13.1571225Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1571751Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1571877Z Traceback (most recent call last): 2025-12-04T11:07:13.1572392Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1572639Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1573102Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1573276Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1573815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1574056Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1574205Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1574210Z 2025-12-04T11:07:13.1574316Z Expected 1 but got 2. 2025-12-04T11:07:13.1574422Z Absolute difference: 1 2025-12-04T11:07:13.1574546Z Relative difference: 1.0 2025-12-04T11:07:13.1574551Z 2025-12-04T11:07:13.1574769Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1575698Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1575704Z 2025-12-04T11:07:13.1575970Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1576186Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1576314Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1576853Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1577094Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1577194Z graph_break [] 2025-12-04T11:07:13.1577414Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1578710Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1578830Z if out == self.unknown_value: 2025-12-04T11:07:13.1579573Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1579675Z warnings.warn( 2025-12-04T11:07:13.1580469Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1580585Z warnings.warn( 2025-12-04T11:07:13.1580801Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1580961Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1581199Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1581736Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1581845Z graph_break [] 2025-12-04T11:07:13.1582061Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1582790Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1582906Z warnings.warn( 2025-12-04T11:07:13.1583633Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1583735Z warnings.warn( 2025-12-04T11:07:13.1583966Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1584082Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1584316Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1584846Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1584942Z graph_break [] 2025-12-04T11:07:13.1585164Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1585892Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1586042Z warnings.warn( 2025-12-04T11:07:13.1586760Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1586862Z warnings.warn( 2025-12-04T11:07:13.1587882Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ad8948ec33d2b35b.xml - 2025-12-04T11:07:13.1588152Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1589105Z FAILED [0.3781s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1589124Z 2025-12-04T11:07:13.1589231Z Expected 1 but got 2. 2025-12-04T11:07:13.1589342Z Absolute difference: 1 2025-12-04T11:07:13.1589466Z Relative difference: 1.0 2025-12-04T11:07:13.1589475Z 2025-12-04T11:07:13.1589693Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1590609Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1590629Z 2025-12-04T11:07:13.1590900Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1591080Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1591289Z ================== 1 failed, 10 deselected, 2 rerun in 20.11s ================== 2025-12-04T11:07:13.1591385Z Got exit code 1 2025-12-04T11:07:13.1591493Z Retrying single test... 2025-12-04T11:07:13.1591951Z W1204 10:59:27.842000 85782 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1592718Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-33a982dae5077c46.xml 2025-12-04T11:07:13.1592897Z ============================= test session starts ============================== 2025-12-04T11:07:13.1593292Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1593403Z cachedir: .pytest_cache 2025-12-04T11:07:13.1593941Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1594066Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1594176Z configfile: pytest.ini 2025-12-04T11:07:13.1594736Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1594960Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1595972Z stepcurrent: skipping 5 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1596089Z Running 1 items in this shard 2025-12-04T11:07:13.1596094Z 2025-12-04T11:07:13.1597382Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 [W1204 10:59:31.467681882 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1597400Z 2025-12-04T11:07:13.1597921Z [W1204 10:59:46.692010040 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1597926Z 2025-12-04T11:07:13.1598440Z [W1204 10:59:46.692262613 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1598498Z 2025-12-04T11:07:13.1599016Z [W1204 10:59:46.699508876 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1599021Z 2025-12-04T11:07:13.1599534Z [W1204 10:59:46.700163979 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1599539Z 2025-12-04T11:07:13.1600067Z [W1204 10:59:46.700356268 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1600071Z 2025-12-04T11:07:13.1600584Z [W1204 10:59:46.707202982 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1600588Z 2025-12-04T11:07:13.1601111Z [W1204 10:59:46.707809508 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1601119Z 2025-12-04T11:07:13.1601633Z [W1204 10:59:46.707993534 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1601637Z 2025-12-04T11:07:13.1602158Z [W1204 10:59:48.705443298 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1602165Z 2025-12-04T11:07:13.1602677Z [W1204 10:59:48.707074957 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1602682Z 2025-12-04T11:07:13.1603192Z [W1204 10:59:48.707281772 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1603208Z 2025-12-04T11:07:13.1603718Z [W1204 10:59:48.711196713 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1603725Z 2025-12-04T11:07:13.1604291Z [W1204 10:59:48.711809761 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1604296Z 2025-12-04T11:07:13.1604818Z [W1204 10:59:48.712001085 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1604851Z 2025-12-04T11:07:13.1605365Z [W1204 10:59:48.717984467 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1605369Z 2025-12-04T11:07:13.1605885Z [W1204 10:59:48.718591052 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1605889Z 2025-12-04T11:07:13.1606398Z [W1204 10:59:48.718788090 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1606405Z 2025-12-04T11:07:13.1606551Z ('RERUN', {'yellow': True}) [19.0208s] [100%] 2025-12-04T11:07:13.1607834Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 [W1204 10:59:48.053109256 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1607842Z 2025-12-04T11:07:13.1608362Z [W1204 10:59:48.053790592 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1608367Z 2025-12-04T11:07:13.1608877Z [W1204 10:59:48.053991105 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1608882Z 2025-12-04T11:07:13.1609392Z [W1204 10:59:48.057878205 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1609440Z 2025-12-04T11:07:13.1609955Z [W1204 10:59:48.058619189 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1609959Z 2025-12-04T11:07:13.1610469Z [W1204 10:59:48.058810118 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1610476Z 2025-12-04T11:07:13.1610999Z [W1204 10:59:48.064821092 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1611004Z 2025-12-04T11:07:13.1611512Z [W1204 10:59:48.065428703 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1611517Z 2025-12-04T11:07:13.1612041Z [W1204 10:59:48.065615927 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1612046Z 2025-12-04T11:07:13.1612565Z [W1204 10:59:49.148030996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1612569Z 2025-12-04T11:07:13.1613092Z [W1204 10:59:49.148644635 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1613099Z 2025-12-04T11:07:13.1613608Z [W1204 10:59:49.148839353 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1613613Z 2025-12-04T11:07:13.1614135Z [W1204 10:59:49.152722855 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1614139Z 2025-12-04T11:07:13.1614652Z [W1204 10:59:49.153329206 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1614657Z 2025-12-04T11:07:13.1615226Z [W1204 10:59:49.153522003 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1615248Z 2025-12-04T11:07:13.1615762Z [W1204 10:59:49.159480214 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1615797Z 2025-12-04T11:07:13.1616309Z [W1204 10:59:49.160260907 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1616314Z 2025-12-04T11:07:13.1616833Z [W1204 10:59:49.160457277 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1616838Z 2025-12-04T11:07:13.1616967Z ('RERUN', {'yellow': True}) [0.4028s] [100%] 2025-12-04T11:07:13.1618329Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 [W1204 10:59:49.429408892 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1618342Z 2025-12-04T11:07:13.1618861Z [W1204 10:59:49.430109997 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1618868Z 2025-12-04T11:07:13.1619394Z [W1204 10:59:49.430311134 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1619399Z 2025-12-04T11:07:13.1619911Z [W1204 10:59:49.434236936 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1619916Z 2025-12-04T11:07:13.1620429Z [W1204 10:59:49.434991020 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1620445Z 2025-12-04T11:07:13.1620954Z [W1204 10:59:49.435191267 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1621003Z 2025-12-04T11:07:13.1621514Z [W1204 10:59:49.441231221 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1621521Z 2025-12-04T11:07:13.1622043Z [W1204 10:59:49.441842185 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1622048Z 2025-12-04T11:07:13.1622558Z [W1204 10:59:49.442027705 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1622563Z 2025-12-04T11:07:13.1623088Z [W1204 10:59:49.527362710 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1623093Z 2025-12-04T11:07:13.1623600Z [W1204 10:59:49.528149987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1623607Z 2025-12-04T11:07:13.1624136Z [W1204 10:59:49.528356783 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1624143Z 2025-12-04T11:07:13.1624654Z [W1204 10:59:49.532368645 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1624659Z 2025-12-04T11:07:13.1625181Z [W1204 10:59:49.533014577 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1625186Z 2025-12-04T11:07:13.1625694Z [W1204 10:59:49.533209334 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1625699Z 2025-12-04T11:07:13.1626209Z [W1204 10:59:49.539271348 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1626226Z 2025-12-04T11:07:13.1626816Z [W1204 10:59:49.540104434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1626821Z 2025-12-04T11:07:13.1627359Z [W1204 10:59:49.540302787 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1627364Z 2025-12-04T11:07:13.1627479Z FAILED [0.3799s] [100%] 2025-12-04T11:07:13.1627484Z 2025-12-04T11:07:13.1627626Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1628151Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1628274Z Traceback (most recent call last): 2025-12-04T11:07:13.1628788Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1629031Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1629502Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1629666Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1630221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1630428Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1630575Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1630580Z 2025-12-04T11:07:13.1630690Z Expected 1 but got 2. 2025-12-04T11:07:13.1630797Z Absolute difference: 1 2025-12-04T11:07:13.1630916Z Relative difference: 1.0 2025-12-04T11:07:13.1630921Z 2025-12-04T11:07:13.1631137Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1632076Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1632115Z 2025-12-04T11:07:13.1632385Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1632612Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1632737Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1633276Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1633515Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1633614Z graph_break [] 2025-12-04T11:07:13.1633832Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1635067Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1635186Z if out == self.unknown_value: 2025-12-04T11:07:13.1635912Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1636031Z warnings.warn( 2025-12-04T11:07:13.1636759Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1636872Z warnings.warn( 2025-12-04T11:07:13.1637383Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1637508Z Traceback (most recent call last): 2025-12-04T11:07:13.1645360Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1645827Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1646332Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1646540Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1647087Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1647311Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1647449Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1647456Z 2025-12-04T11:07:13.1647576Z Expected 1 but got 2. 2025-12-04T11:07:13.1647683Z Absolute difference: 1 2025-12-04T11:07:13.1647797Z Relative difference: 1.0 2025-12-04T11:07:13.1647802Z 2025-12-04T11:07:13.1648038Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1648962Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1648969Z 2025-12-04T11:07:13.1649255Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1649486Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1649604Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1650149Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1650381Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1650479Z graph_break [] 2025-12-04T11:07:13.1650708Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1651931Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1652098Z if out == self.unknown_value: 2025-12-04T11:07:13.1652833Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1652938Z warnings.warn( 2025-12-04T11:07:13.1653673Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1653772Z warnings.warn( 2025-12-04T11:07:13.1653999Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1654114Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1654345Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1654895Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1654993Z graph_break [] 2025-12-04T11:07:13.1655210Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1655951Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1656053Z warnings.warn( 2025-12-04T11:07:13.1656789Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1656889Z warnings.warn( 2025-12-04T11:07:13.1657037Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1657559Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 _ 2025-12-04T11:07:13.1657743Z Traceback (most recent call last): 2025-12-04T11:07:13.1658330Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1658616Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1659078Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1659249Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1659788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1659994Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1660138Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1660144Z 2025-12-04T11:07:13.1660248Z Expected 1 but got 2. 2025-12-04T11:07:13.1660368Z Absolute difference: 1 2025-12-04T11:07:13.1660478Z Relative difference: 1.0 2025-12-04T11:07:13.1660488Z 2025-12-04T11:07:13.1660707Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1661637Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1661645Z 2025-12-04T11:07:13.1661915Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1662150Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1662264Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1662805Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1663045Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1663175Z graph_break [] 2025-12-04T11:07:13.1663400Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1664626Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1664745Z if out == self.unknown_value: 2025-12-04T11:07:13.1665486Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1665585Z warnings.warn( 2025-12-04T11:07:13.1666301Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1666413Z warnings.warn( 2025-12-04T11:07:13.1666634Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1666759Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1666990Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1667524Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1667632Z graph_break [] 2025-12-04T11:07:13.1667850Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1668577Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1668687Z warnings.warn( 2025-12-04T11:07:13.1669415Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1669528Z warnings.warn( 2025-12-04T11:07:13.1669807Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1669923Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1670161Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1670727Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1670823Z graph_break [] 2025-12-04T11:07:13.1671050Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1671779Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1671888Z warnings.warn( 2025-12-04T11:07:13.1672612Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1672721Z warnings.warn( 2025-12-04T11:07:13.1673579Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-33a982dae5077c46.xml - 2025-12-04T11:07:13.1673755Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1674716Z FAILED [0.3799s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1674723Z 2025-12-04T11:07:13.1674831Z Expected 1 but got 2. 2025-12-04T11:07:13.1674940Z Absolute difference: 1 2025-12-04T11:07:13.1675060Z Relative difference: 1.0 2025-12-04T11:07:13.1675065Z 2025-12-04T11:07:13.1675280Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1676244Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1676250Z 2025-12-04T11:07:13.1676523Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1676704Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1676917Z ================== 1 failed, 10 deselected, 2 rerun in 19.84s ================== 2025-12-04T11:07:13.1677019Z Got exit code 1 2025-12-04T11:07:13.1677859Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16 2025-12-04T11:07:13.1678273Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.1678720Z W1204 11:00:00.205000 85963 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1679409Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bd8b5496f00b6cd3.xml 2025-12-04T11:07:13.1679573Z ============================= test session starts ============================== 2025-12-04T11:07:13.1679939Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1680050Z cachedir: .pytest_cache 2025-12-04T11:07:13.1680575Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1680711Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1680817Z configfile: pytest.ini 2025-12-04T11:07:13.1681361Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1681586Z collecting ... collected 58 items / 6 deselected / 52 selected 2025-12-04T11:07:13.1681795Z stepcurrent: skipping 6 already run items. 2025-12-04T11:07:13.1681921Z Running 5 items in this shard 2025-12-04T11:07:13.1681926Z 2025-12-04T11:07:13.1682813Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8582s] [ 20%] 2025-12-04T11:07:13.1683737Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4712s] [ 20%] 2025-12-04T11:07:13.1684524Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 FAILED [0.4640s] [ 20%] 2025-12-04T11:07:13.1684530Z 2025-12-04T11:07:13.1684671Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1685207Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1685331Z Traceback (most recent call last): 2025-12-04T11:07:13.1685847Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1686091Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1686553Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1686726Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1687269Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1687473Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1687612Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1687649Z 2025-12-04T11:07:13.1687958Z Expected 1 but got 2. 2025-12-04T11:07:13.1688143Z Absolute difference: 1 2025-12-04T11:07:13.1688311Z Relative difference: 1.0 2025-12-04T11:07:13.1688316Z 2025-12-04T11:07:13.1688534Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1689461Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1689467Z 2025-12-04T11:07:13.1689734Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1689957Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1690087Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1690621Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1690872Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1690971Z graph_break [] 2025-12-04T11:07:13.1691187Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1691937Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1692042Z warnings.warn( 2025-12-04T11:07:13.1692777Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1692877Z warnings.warn( 2025-12-04T11:07:13.1693391Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1693521Z Traceback (most recent call last): 2025-12-04T11:07:13.1694033Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1694380Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1694856Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1695064Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1695615Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1695819Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1695950Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1695955Z 2025-12-04T11:07:13.1696068Z Expected 1 but got 2. 2025-12-04T11:07:13.1696173Z Absolute difference: 1 2025-12-04T11:07:13.1696279Z Relative difference: 1.0 2025-12-04T11:07:13.1696294Z 2025-12-04T11:07:13.1696512Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1697437Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1697445Z 2025-12-04T11:07:13.1697725Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1697944Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1698058Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1698669Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1698895Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1699006Z graph_break [] 2025-12-04T11:07:13.1699220Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1700014Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1700128Z warnings.warn( 2025-12-04T11:07:13.1700853Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1700969Z warnings.warn( 2025-12-04T11:07:13.1701183Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1701298Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1701537Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1702067Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1702166Z graph_break [] 2025-12-04T11:07:13.1702395Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1703128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1703243Z warnings.warn( 2025-12-04T11:07:13.1703963Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1704061Z warnings.warn( 2025-12-04T11:07:13.1704219Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1704734Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1704856Z Traceback (most recent call last): 2025-12-04T11:07:13.1705381Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1705613Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1706146Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1706308Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1706876Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1707091Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1707220Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1707226Z 2025-12-04T11:07:13.1707340Z Expected 1 but got 2. 2025-12-04T11:07:13.1707445Z Absolute difference: 1 2025-12-04T11:07:13.1707550Z Relative difference: 1.0 2025-12-04T11:07:13.1707555Z 2025-12-04T11:07:13.1707781Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1708702Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1708710Z 2025-12-04T11:07:13.1708988Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1709208Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1709322Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1709862Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1710086Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1710183Z graph_break [] 2025-12-04T11:07:13.1710409Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1711139Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1711285Z warnings.warn( 2025-12-04T11:07:13.1712013Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1712115Z warnings.warn( 2025-12-04T11:07:13.1712341Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1712453Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1712677Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1713218Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1713316Z graph_break [] 2025-12-04T11:07:13.1713542Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1714276Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1714376Z warnings.warn( 2025-12-04T11:07:13.1715108Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1715209Z warnings.warn( 2025-12-04T11:07:13.1715435Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1715544Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1715769Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1716307Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1716400Z graph_break [] 2025-12-04T11:07:13.1716613Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1717406Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1717504Z warnings.warn( 2025-12-04T11:07:13.1718236Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1718365Z warnings.warn( 2025-12-04T11:07:13.1719209Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bd8b5496f00b6cd3.xml - 2025-12-04T11:07:13.1719393Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1720342Z FAILED [0.4640s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1720350Z 2025-12-04T11:07:13.1720471Z Expected 1 but got 2. 2025-12-04T11:07:13.1720573Z Absolute difference: 1 2025-12-04T11:07:13.1720683Z Relative difference: 1.0 2025-12-04T11:07:13.1720688Z 2025-12-04T11:07:13.1720916Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1721829Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1721834Z 2025-12-04T11:07:13.1722115Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1722292Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1722488Z =================== 1 failed, 6 deselected, 2 rerun in 4.83s =================== 2025-12-04T11:07:13.1722601Z Got exit code 1 2025-12-04T11:07:13.1722738Z Retrying single test... 2025-12-04T11:07:13.1723191Z W1204 11:00:19.548000 86139 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1723868Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ef5bae0bde57e96b.xml 2025-12-04T11:07:13.1724029Z ============================= test session starts ============================== 2025-12-04T11:07:13.1724393Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1724499Z cachedir: .pytest_cache 2025-12-04T11:07:13.1725026Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1725161Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1725268Z configfile: pytest.ini 2025-12-04T11:07:13.1725827Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1726052Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1727054Z stepcurrent: skipping 6 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1727184Z Running 1 items in this shard 2025-12-04T11:07:13.1727189Z 2025-12-04T11:07:13.1728480Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 [W1204 11:00:23.250276614 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1728486Z 2025-12-04T11:07:13.1729021Z [W1204 11:00:38.131402994 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1729028Z 2025-12-04T11:07:13.1729615Z [W1204 11:00:38.131662172 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1729621Z 2025-12-04T11:07:13.1730144Z [W1204 11:00:38.138824216 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1730177Z 2025-12-04T11:07:13.1730691Z [W1204 11:00:38.139502410 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1730697Z 2025-12-04T11:07:13.1731204Z [W1204 11:00:38.139697757 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1731220Z 2025-12-04T11:07:13.1731729Z [W1204 11:00:39.146558965 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1731736Z 2025-12-04T11:07:13.1732248Z [W1204 11:00:39.147201583 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1732253Z 2025-12-04T11:07:13.1732771Z [W1204 11:00:39.147387433 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1732778Z 2025-12-04T11:07:13.1733290Z [W1204 11:00:41.141179578 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1733295Z 2025-12-04T11:07:13.1733815Z [W1204 11:00:41.142815701 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1733820Z 2025-12-04T11:07:13.1734332Z [W1204 11:00:41.143023777 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1734337Z 2025-12-04T11:07:13.1734889Z [W1204 11:00:41.146862141 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1734895Z 2025-12-04T11:07:13.1735406Z [W1204 11:00:41.147500418 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1735413Z 2025-12-04T11:07:13.1735938Z [W1204 11:00:41.147695408 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1735943Z 2025-12-04T11:07:13.1736456Z [W1204 11:00:41.153731377 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1736461Z 2025-12-04T11:07:13.1736971Z [W1204 11:00:41.154346281 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1736988Z 2025-12-04T11:07:13.1737500Z [W1204 11:00:41.154539885 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1737511Z 2025-12-04T11:07:13.1737641Z ('RERUN', {'yellow': True}) [19.7530s] [100%] 2025-12-04T11:07:13.1739015Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 [W1204 11:00:41.565303273 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1739025Z 2025-12-04T11:07:13.1739543Z [W1204 11:00:41.565984921 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1739547Z 2025-12-04T11:07:13.1740069Z [W1204 11:00:41.566185856 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1740074Z 2025-12-04T11:07:13.1741145Z [W1204 11:00:41.570093491 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1741153Z 2025-12-04T11:07:13.1741682Z [W1204 11:00:41.570855384 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1741747Z 2025-12-04T11:07:13.1742258Z [W1204 11:00:41.571059272 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1742263Z 2025-12-04T11:07:13.1742786Z [W1204 11:00:41.577049520 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1742790Z 2025-12-04T11:07:13.1743301Z [W1204 11:00:41.577648430 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1743306Z 2025-12-04T11:07:13.1743813Z [W1204 11:00:41.577838299 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1743824Z 2025-12-04T11:07:13.1744348Z [W1204 11:00:41.659993331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1744355Z 2025-12-04T11:07:13.1744864Z [W1204 11:00:41.660634086 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1744868Z 2025-12-04T11:07:13.1745392Z [W1204 11:00:41.660830987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1745397Z 2025-12-04T11:07:13.1745911Z [W1204 11:00:41.664662909 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1745915Z 2025-12-04T11:07:13.1746435Z [W1204 11:00:41.665263067 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1746471Z 2025-12-04T11:07:13.1746983Z [W1204 11:00:41.665458581 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1746988Z 2025-12-04T11:07:13.1747511Z [W1204 11:00:41.671393765 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1747516Z 2025-12-04T11:07:13.1748026Z [W1204 11:00:41.672159283 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1748030Z 2025-12-04T11:07:13.1748540Z [W1204 11:00:41.672354112 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1748553Z 2025-12-04T11:07:13.1748682Z ('RERUN', {'yellow': True}) [0.4782s] [100%] 2025-12-04T11:07:13.1749980Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 [W1204 11:00:41.024611406 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1749988Z 2025-12-04T11:07:13.1750512Z [W1204 11:00:41.025308382 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1750516Z 2025-12-04T11:07:13.1751029Z [W1204 11:00:41.025514751 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1751034Z 2025-12-04T11:07:13.1751555Z [W1204 11:00:41.029403402 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1751560Z 2025-12-04T11:07:13.1752066Z [W1204 11:00:41.030186055 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1752073Z 2025-12-04T11:07:13.1752649Z [W1204 11:00:41.030386134 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1752654Z 2025-12-04T11:07:13.1753160Z [W1204 11:00:41.036394361 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1753196Z 2025-12-04T11:07:13.1753717Z [W1204 11:00:41.036997602 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1753721Z 2025-12-04T11:07:13.1754228Z [W1204 11:00:41.037188083 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1754232Z 2025-12-04T11:07:13.1754743Z [W1204 11:00:41.120521815 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1754761Z 2025-12-04T11:07:13.1755271Z [W1204 11:00:41.121158078 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1755276Z 2025-12-04T11:07:13.1755783Z [W1204 11:00:41.121354311 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1755790Z 2025-12-04T11:07:13.1756316Z [W1204 11:00:41.125218575 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1756320Z 2025-12-04T11:07:13.1756824Z [W1204 11:00:41.125818524 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1756829Z 2025-12-04T11:07:13.1757352Z [W1204 11:00:41.126011437 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1757389Z 2025-12-04T11:07:13.1757904Z [W1204 11:00:41.131962644 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1757908Z 2025-12-04T11:07:13.1758433Z [W1204 11:00:41.132737671 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1758440Z 2025-12-04T11:07:13.1758945Z [W1204 11:00:41.132931566 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1758949Z 2025-12-04T11:07:13.1759051Z FAILED [0.4596s] [100%] 2025-12-04T11:07:13.1759068Z 2025-12-04T11:07:13.1759208Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1759728Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1759860Z Traceback (most recent call last): 2025-12-04T11:07:13.1760384Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1760617Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1761092Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1761258Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1761808Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1762015Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1762145Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1762150Z 2025-12-04T11:07:13.1762265Z Expected 1 but got 2. 2025-12-04T11:07:13.1762374Z Absolute difference: 1 2025-12-04T11:07:13.1762481Z Relative difference: 1.0 2025-12-04T11:07:13.1762485Z 2025-12-04T11:07:13.1762713Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1763688Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1763723Z 2025-12-04T11:07:13.1764001Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1764222Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1764335Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1764876Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1765104Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1765215Z graph_break [] 2025-12-04T11:07:13.1765430Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1766652Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1766782Z if out == self.unknown_value: 2025-12-04T11:07:13.1767514Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1767624Z warnings.warn( 2025-12-04T11:07:13.1768350Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1768450Z warnings.warn( 2025-12-04T11:07:13.1768974Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1769124Z Traceback (most recent call last): 2025-12-04T11:07:13.1769639Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1769880Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1770339Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1770511Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1771045Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1771247Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1771388Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1771394Z 2025-12-04T11:07:13.1771497Z Expected 1 but got 2. 2025-12-04T11:07:13.1771612Z Absolute difference: 1 2025-12-04T11:07:13.1771720Z Relative difference: 1.0 2025-12-04T11:07:13.1771726Z 2025-12-04T11:07:13.1771944Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1772871Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1772880Z 2025-12-04T11:07:13.1773145Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1773371Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1773483Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1774017Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1774257Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1774355Z graph_break [] 2025-12-04T11:07:13.1774573Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1775868Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1776014Z if out == self.unknown_value: 2025-12-04T11:07:13.1776756Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1776856Z warnings.warn( 2025-12-04T11:07:13.1777578Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1777688Z warnings.warn( 2025-12-04T11:07:13.1777904Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1778032Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1778334Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1778871Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1778988Z graph_break [] 2025-12-04T11:07:13.1779201Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1779930Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1780045Z warnings.warn( 2025-12-04T11:07:13.1780766Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1780879Z warnings.warn( 2025-12-04T11:07:13.1781023Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1781582Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1781714Z Traceback (most recent call last): 2025-12-04T11:07:13.1782232Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1782473Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1782933Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1783095Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1783641Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1783846Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1783981Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1783987Z 2025-12-04T11:07:13.1784102Z Expected 1 but got 2. 2025-12-04T11:07:13.1784208Z Absolute difference: 1 2025-12-04T11:07:13.1784330Z Relative difference: 1.0 2025-12-04T11:07:13.1784335Z 2025-12-04T11:07:13.1784547Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1785466Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1785472Z 2025-12-04T11:07:13.1785748Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1785970Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1786097Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1786631Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1786919Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1787029Z graph_break [] 2025-12-04T11:07:13.1787244Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1788791Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1788924Z if out == self.unknown_value: 2025-12-04T11:07:13.1789651Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1789764Z warnings.warn( 2025-12-04T11:07:13.1790484Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1790593Z warnings.warn( 2025-12-04T11:07:13.1790818Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1790933Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1791174Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1791706Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1791803Z graph_break [] 2025-12-04T11:07:13.1792026Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1792754Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1792852Z warnings.warn( 2025-12-04T11:07:13.1793588Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1793777Z warnings.warn( 2025-12-04T11:07:13.1794000Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1794115Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1794340Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1794878Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1794976Z graph_break [] 2025-12-04T11:07:13.1795193Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1795932Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1796033Z warnings.warn( 2025-12-04T11:07:13.1796770Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1796870Z warnings.warn( 2025-12-04T11:07:13.1797721Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ef5bae0bde57e96b.xml - 2025-12-04T11:07:13.1797906Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1798857Z FAILED [0.4596s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1798863Z 2025-12-04T11:07:13.1798976Z Expected 1 but got 2. 2025-12-04T11:07:13.1799081Z Absolute difference: 1 2025-12-04T11:07:13.1799193Z Relative difference: 1.0 2025-12-04T11:07:13.1799201Z 2025-12-04T11:07:13.1799431Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1800426Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1800471Z 2025-12-04T11:07:13.1800756Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1800935Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1801133Z ================== 1 failed, 10 deselected, 2 rerun in 20.72s ================== 2025-12-04T11:07:13.1801240Z Got exit code 1 2025-12-04T11:07:13.1801343Z Retrying single test... 2025-12-04T11:07:13.1801792Z W1204 11:00:53.269000 86320 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1802464Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c80e75c43f69568f.xml 2025-12-04T11:07:13.1802635Z ============================= test session starts ============================== 2025-12-04T11:07:13.1802998Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1803110Z cachedir: .pytest_cache 2025-12-04T11:07:13.1803632Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1803769Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1803876Z configfile: pytest.ini 2025-12-04T11:07:13.1804420Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1804651Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1805653Z stepcurrent: skipping 6 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1805809Z Running 1 items in this shard 2025-12-04T11:07:13.1805814Z 2025-12-04T11:07:13.1807107Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 [W1204 11:00:56.992318404 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1807116Z 2025-12-04T11:07:13.1807649Z [W1204 11:01:12.253417807 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1807654Z 2025-12-04T11:07:13.1808166Z [W1204 11:01:12.253674480 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1808171Z 2025-12-04T11:07:13.1808699Z [W1204 11:01:12.260927086 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1808704Z 2025-12-04T11:07:13.1809218Z [W1204 11:01:12.261604030 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1809225Z 2025-12-04T11:07:13.1809734Z [W1204 11:01:12.261793431 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1809751Z 2025-12-04T11:07:13.1810261Z [W1204 11:01:12.268569800 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1810266Z 2025-12-04T11:07:13.1810775Z [W1204 11:01:12.269187907 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1810780Z 2025-12-04T11:07:13.1811358Z [W1204 11:01:12.269372659 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1811365Z 2025-12-04T11:07:13.1811878Z [W1204 11:01:14.264832110 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1811914Z 2025-12-04T11:07:13.1812435Z [W1204 11:01:14.266446089 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1812440Z 2025-12-04T11:07:13.1812949Z [W1204 11:01:14.266653174 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1812954Z 2025-12-04T11:07:13.1813471Z [W1204 11:01:14.270550478 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1813476Z 2025-12-04T11:07:13.1813983Z [W1204 11:01:14.271188631 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1813994Z 2025-12-04T11:07:13.1814514Z [W1204 11:01:14.271387808 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1814521Z 2025-12-04T11:07:13.1815033Z [W1204 11:01:14.277383104 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1815038Z 2025-12-04T11:07:13.1815548Z [W1204 11:01:14.277995076 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1815564Z 2025-12-04T11:07:13.1816073Z [W1204 11:01:14.278187272 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1816078Z 2025-12-04T11:07:13.1816211Z ('RERUN', {'yellow': True}) [19.1453s] [100%] 2025-12-04T11:07:13.1817564Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 [W1204 11:01:14.696764279 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1817572Z 2025-12-04T11:07:13.1818086Z [W1204 11:01:14.697471483 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1818091Z 2025-12-04T11:07:13.1818703Z [W1204 11:01:14.697673392 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1818709Z 2025-12-04T11:07:13.1819219Z [W1204 11:01:14.701621742 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1819224Z 2025-12-04T11:07:13.1819749Z [W1204 11:01:14.702381983 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1819756Z 2025-12-04T11:07:13.1820271Z [W1204 11:01:14.702575171 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1820278Z 2025-12-04T11:07:13.1820790Z [W1204 11:01:14.708508976 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1820809Z 2025-12-04T11:07:13.1821319Z [W1204 11:01:14.709110071 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1821324Z 2025-12-04T11:07:13.1821830Z [W1204 11:01:14.709298830 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1821835Z 2025-12-04T11:07:13.1822359Z [W1204 11:01:14.792222006 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1822367Z 2025-12-04T11:07:13.1822939Z [W1204 11:01:14.792842150 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1822945Z 2025-12-04T11:07:13.1823497Z [W1204 11:01:14.793037563 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1823502Z 2025-12-04T11:07:13.1824012Z [W1204 11:01:14.796878834 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1824016Z 2025-12-04T11:07:13.1824538Z [W1204 11:01:14.797480320 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1824543Z 2025-12-04T11:07:13.1825054Z [W1204 11:01:14.797680987 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1825061Z 2025-12-04T11:07:13.1825585Z [W1204 11:01:14.803622071 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1825590Z 2025-12-04T11:07:13.1826097Z [W1204 11:01:14.804392238 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1826104Z 2025-12-04T11:07:13.1826617Z [W1204 11:01:14.804586737 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1826632Z 2025-12-04T11:07:13.1826764Z ('RERUN', {'yellow': True}) [0.4863s] [100%] 2025-12-04T11:07:13.1828056Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 [W1204 11:01:15.154647663 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1828094Z 2025-12-04T11:07:13.1828622Z [W1204 11:01:15.155357236 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1828627Z 2025-12-04T11:07:13.1829136Z [W1204 11:01:15.155558331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1829143Z 2025-12-04T11:07:13.1829661Z [W1204 11:01:15.159436770 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1829666Z 2025-12-04T11:07:13.1830178Z [W1204 11:01:15.160211977 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1830183Z 2025-12-04T11:07:13.1830704Z [W1204 11:01:15.160410184 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1830711Z 2025-12-04T11:07:13.1831223Z [W1204 11:01:15.166362632 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1831228Z 2025-12-04T11:07:13.1831753Z [W1204 11:01:15.166964297 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1831760Z 2025-12-04T11:07:13.1832267Z [W1204 11:01:15.167165098 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1832272Z 2025-12-04T11:07:13.1832783Z [W1204 11:01:15.250016735 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1832787Z 2025-12-04T11:07:13.1833306Z [W1204 11:01:15.250648969 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1833313Z 2025-12-04T11:07:13.1833880Z [W1204 11:01:15.250845844 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1833885Z 2025-12-04T11:07:13.1834408Z [W1204 11:01:15.254677837 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1834443Z 2025-12-04T11:07:13.1834950Z [W1204 11:01:15.255294109 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1834955Z 2025-12-04T11:07:13.1835473Z [W1204 11:01:15.255489720 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1835478Z 2025-12-04T11:07:13.1835993Z [W1204 11:01:15.261391222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1835998Z 2025-12-04T11:07:13.1836531Z [W1204 11:01:15.262160195 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1836536Z 2025-12-04T11:07:13.1837040Z [W1204 11:01:15.262354463 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1837047Z 2025-12-04T11:07:13.1837149Z FAILED [0.4561s] [100%] 2025-12-04T11:07:13.1837154Z 2025-12-04T11:07:13.1837307Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1837823Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1837957Z Traceback (most recent call last): 2025-12-04T11:07:13.1838476Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1838709Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1839221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1839385Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1839937Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1840149Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1840285Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1840290Z 2025-12-04T11:07:13.1840406Z Expected 1 but got 2. 2025-12-04T11:07:13.1840514Z Absolute difference: 1 2025-12-04T11:07:13.1840622Z Relative difference: 1.0 2025-12-04T11:07:13.1840626Z 2025-12-04T11:07:13.1840857Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1841768Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1841776Z 2025-12-04T11:07:13.1842059Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1842276Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1842394Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1842942Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1843166Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1843274Z graph_break [] 2025-12-04T11:07:13.1843488Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1844714Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1844904Z if out == self.unknown_value: 2025-12-04T11:07:13.1845640Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1845779Z warnings.warn( 2025-12-04T11:07:13.1846508Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1846607Z warnings.warn( 2025-12-04T11:07:13.1847132Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1847257Z Traceback (most recent call last): 2025-12-04T11:07:13.1847766Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1848007Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1848475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1848647Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1849185Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1849387Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1849531Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1849536Z 2025-12-04T11:07:13.1849641Z Expected 1 but got 2. 2025-12-04T11:07:13.1849758Z Absolute difference: 1 2025-12-04T11:07:13.1849867Z Relative difference: 1.0 2025-12-04T11:07:13.1849873Z 2025-12-04T11:07:13.1850087Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1851021Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1851060Z 2025-12-04T11:07:13.1851324Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1851551Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1851669Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1852204Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1852442Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1852541Z graph_break [] 2025-12-04T11:07:13.1852758Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1853981Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1854100Z if out == self.unknown_value: 2025-12-04T11:07:13.1854841Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1854944Z warnings.warn( 2025-12-04T11:07:13.1855668Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1855777Z warnings.warn( 2025-12-04T11:07:13.1855992Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1856118Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1856350Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1856882Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1857053Z graph_break [] 2025-12-04T11:07:13.1857272Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1858001Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1858158Z warnings.warn( 2025-12-04T11:07:13.1858955Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1859069Z warnings.warn( 2025-12-04T11:07:13.1859214Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1859732Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.1859869Z Traceback (most recent call last): 2025-12-04T11:07:13.1860386Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1860619Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1861092Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1861258Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1861810Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1862017Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1862150Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1862156Z 2025-12-04T11:07:13.1862270Z Expected 1 but got 2. 2025-12-04T11:07:13.1862376Z Absolute difference: 1 2025-12-04T11:07:13.1862495Z Relative difference: 1.0 2025-12-04T11:07:13.1862537Z 2025-12-04T11:07:13.1862752Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1863673Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1863681Z 2025-12-04T11:07:13.1863960Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1864181Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1864308Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1864842Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1865068Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1865175Z graph_break [] 2025-12-04T11:07:13.1865392Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1866611Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.1866740Z if out == self.unknown_value: 2025-12-04T11:07:13.1867470Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1867581Z warnings.warn( 2025-12-04T11:07:13.1868306Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1868406Z warnings.warn( 2025-12-04T11:07:13.1868630Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1868748Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1869065Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1869600Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1869730Z graph_break [] 2025-12-04T11:07:13.1869955Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1870687Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1870784Z warnings.warn( 2025-12-04T11:07:13.1871515Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1871615Z warnings.warn( 2025-12-04T11:07:13.1871844Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1871957Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1872188Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1872730Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.1872831Z graph_break [] 2025-12-04T11:07:13.1873046Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1873778Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1873879Z warnings.warn( 2025-12-04T11:07:13.1874609Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1874739Z warnings.warn( 2025-12-04T11:07:13.1875586Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c80e75c43f69568f.xml - 2025-12-04T11:07:13.1875766Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1876717Z FAILED [0.4561s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1876724Z 2025-12-04T11:07:13.1876837Z Expected 1 but got 2. 2025-12-04T11:07:13.1876943Z Absolute difference: 1 2025-12-04T11:07:13.1877051Z Relative difference: 1.0 2025-12-04T11:07:13.1877055Z 2025-12-04T11:07:13.1877281Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1878196Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1878208Z 2025-12-04T11:07:13.1878485Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1878664Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1878866Z ================== 1 failed, 10 deselected, 2 rerun in 20.12s ================== 2025-12-04T11:07:13.1878977Z Got exit code 1 2025-12-04T11:07:13.1879804Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.1880226Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.1880673Z W1204 11:01:26.077000 86501 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1881394Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-528c2efc29bf6df4.xml 2025-12-04T11:07:13.1881571Z ============================= test session starts ============================== 2025-12-04T11:07:13.1881923Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1882077Z cachedir: .pytest_cache 2025-12-04T11:07:13.1882599Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1882723Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1882844Z configfile: pytest.ini 2025-12-04T11:07:13.1883390Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1883605Z collecting ... collected 58 items / 7 deselected / 51 selected 2025-12-04T11:07:13.1883756Z stepcurrent: skipping 7 already run items. 2025-12-04T11:07:13.1883869Z Running 4 items in this shard 2025-12-04T11:07:13.1883878Z 2025-12-04T11:07:13.1884756Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8299s] [ 25%] 2025-12-04T11:07:13.1885620Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4302s] [ 25%] 2025-12-04T11:07:13.1886404Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 FAILED [0.4246s] [ 25%] 2025-12-04T11:07:13.1886422Z 2025-12-04T11:07:13.1886558Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.1887062Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1887230Z Traceback (most recent call last): 2025-12-04T11:07:13.1887961Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1888267Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1888744Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1888911Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1889462Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1889666Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1889801Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1889806Z 2025-12-04T11:07:13.1889921Z Expected 1 but got 2. 2025-12-04T11:07:13.1890033Z Absolute difference: 1 2025-12-04T11:07:13.1890142Z Relative difference: 1.0 2025-12-04T11:07:13.1890165Z 2025-12-04T11:07:13.1890378Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1891285Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1891293Z 2025-12-04T11:07:13.1891570Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1891791Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1891908Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1892817Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1893047Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1893261Z graph_break [] 2025-12-04T11:07:13.1893479Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1894219Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1894375Z warnings.warn( 2025-12-04T11:07:13.1895103Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1895214Z warnings.warn( 2025-12-04T11:07:13.1895720Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1895844Z Traceback (most recent call last): 2025-12-04T11:07:13.1896370Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1896607Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1897077Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1897238Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1897780Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1898783Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1899268Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1899520Z 2025-12-04T11:07:13.1899623Z Expected 1 but got 2. 2025-12-04T11:07:13.1899910Z Absolute difference: 1 2025-12-04T11:07:13.1900200Z Relative difference: 1.0 2025-12-04T11:07:13.1900389Z 2025-12-04T11:07:13.1900612Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1901943Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1902997Z 2025-12-04T11:07:13.1903265Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1903910Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1904387Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1905489Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1906818Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1907282Z graph_break [] 2025-12-04T11:07:13.1907640Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1908738Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1909701Z warnings.warn( 2025-12-04T11:07:13.1910572Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1911523Z warnings.warn( 2025-12-04T11:07:13.1911893Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1912362Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1912797Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1917629Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1919022Z graph_break [] 2025-12-04T11:07:13.1919498Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1920597Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1921602Z warnings.warn( 2025-12-04T11:07:13.1922489Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1923456Z warnings.warn( 2025-12-04T11:07:13.1923753Z =================================== FAILURES =================================== 2025-12-04T11:07:13.1924552Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.1925316Z Traceback (most recent call last): 2025-12-04T11:07:13.1926091Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.1926972Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.1927783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.1928546Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.1929376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.1930254Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.1930717Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1930977Z 2025-12-04T11:07:13.1931082Z Expected 1 but got 2. 2025-12-04T11:07:13.1931368Z Absolute difference: 1 2025-12-04T11:07:13.1931644Z Relative difference: 1.0 2025-12-04T11:07:13.1931840Z 2025-12-04T11:07:13.1932056Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1933358Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1934400Z 2025-12-04T11:07:13.1934677Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1935292Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1935772Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1936884Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1938143Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1938682Z graph_break [] 2025-12-04T11:07:13.1939059Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1940167Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1941127Z warnings.warn( 2025-12-04T11:07:13.1942019Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1942982Z warnings.warn( 2025-12-04T11:07:13.1943355Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1943818Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1944255Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1945591Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1946727Z graph_break [] 2025-12-04T11:07:13.1947120Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1948220Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1949218Z warnings.warn( 2025-12-04T11:07:13.1950094Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1951060Z warnings.warn( 2025-12-04T11:07:13.1951438Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.1951916Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.1952343Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.1953614Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.1954740Z graph_break [] 2025-12-04T11:07:13.1955099Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.1956182Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1957149Z warnings.warn( 2025-12-04T11:07:13.1958025Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.1958969Z warnings.warn( 2025-12-04T11:07:13.1959968Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-528c2efc29bf6df4.xml - 2025-12-04T11:07:13.1961162Z =========================== short test summary info ============================ 2025-12-04T11:07:13.1962419Z FAILED [0.4246s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.1963485Z 2025-12-04T11:07:13.1963604Z Expected 1 but got 2. 2025-12-04T11:07:13.1963880Z Absolute difference: 1 2025-12-04T11:07:13.1964170Z Relative difference: 1.0 2025-12-04T11:07:13.1964358Z 2025-12-04T11:07:13.1964580Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.1965829Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1966881Z 2025-12-04T11:07:13.1967150Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.1967742Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.1968259Z =================== 1 failed, 7 deselected, 2 rerun in 4.72s =================== 2025-12-04T11:07:13.1968815Z Got exit code 1 2025-12-04T11:07:13.1969076Z Retrying single test... 2025-12-04T11:07:13.1969706Z W1204 11:01:45.470000 86670 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.1970942Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ed81db7834398b8b.xml 2025-12-04T11:07:13.1971911Z ============================= test session starts ============================== 2025-12-04T11:07:13.1972572Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.1973166Z cachedir: .pytest_cache 2025-12-04T11:07:13.1973926Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.1974753Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.1975104Z configfile: pytest.ini 2025-12-04T11:07:13.1975828Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.1976770Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.1978126Z stepcurrent: skipping 7 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.1979441Z Running 1 items in this shard 2025-12-04T11:07:13.1979657Z 2025-12-04T11:07:13.1980966Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:01:50.024350801 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1982385Z 2025-12-04T11:07:13.1982911Z [W1204 11:02:06.334812744 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1983591Z 2025-12-04T11:07:13.1984110Z [W1204 11:02:06.335098635 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1984779Z 2025-12-04T11:07:13.1985292Z [W1204 11:02:06.342300735 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1985941Z 2025-12-04T11:07:13.1986469Z [W1204 11:02:06.342966564 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1987121Z 2025-12-04T11:07:13.1987651Z [W1204 11:02:06.343169345 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1988657Z 2025-12-04T11:07:13.1989174Z [W1204 11:02:06.349939295 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1989849Z 2025-12-04T11:07:13.1990361Z [W1204 11:02:06.350689693 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1991029Z 2025-12-04T11:07:13.1991545Z [W1204 11:02:06.350881560 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1992215Z 2025-12-04T11:07:13.1992728Z [W1204 11:02:06.481607474 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1993381Z 2025-12-04T11:07:13.1993909Z [W1204 11:02:06.483111091 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1994559Z 2025-12-04T11:07:13.1995076Z [W1204 11:02:06.483315948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1995738Z 2025-12-04T11:07:13.1996250Z [W1204 11:02:06.487142128 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1996920Z 2025-12-04T11:07:13.1997432Z [W1204 11:02:06.487761967 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1998102Z 2025-12-04T11:07:13.1998613Z [W1204 11:02:06.487955933 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.1999270Z 2025-12-04T11:07:13.1999884Z [W1204 11:02:06.493917305 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2000542Z 2025-12-04T11:07:13.2001111Z [W1204 11:02:06.494535335 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2001765Z 2025-12-04T11:07:13.2002278Z [W1204 11:02:06.494728809 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2002987Z 2025-12-04T11:07:13.2003121Z ('RERUN', {'yellow': True}) [19.1575s] [100%] 2025-12-04T11:07:13.2004680Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:02:06.868455766 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2006075Z 2025-12-04T11:07:13.2006605Z [W1204 11:02:06.869177575 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2007262Z 2025-12-04T11:07:13.2007794Z [W1204 11:02:06.869375559 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2008444Z 2025-12-04T11:07:13.2008963Z [W1204 11:02:06.873314381 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2009628Z 2025-12-04T11:07:13.2010141Z [W1204 11:02:06.873929101 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2010804Z 2025-12-04T11:07:13.2011321Z [W1204 11:02:06.874118397 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2011971Z 2025-12-04T11:07:13.2012501Z [W1204 11:02:06.880168579 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2013200Z 2025-12-04T11:07:13.2013730Z [W1204 11:02:06.880778161 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2014376Z 2025-12-04T11:07:13.2014892Z [W1204 11:02:06.880967799 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2015561Z 2025-12-04T11:07:13.2016074Z [W1204 11:02:06.966690222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2016738Z 2025-12-04T11:07:13.2017250Z [W1204 11:02:06.967381477 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2017898Z 2025-12-04T11:07:13.2018525Z [W1204 11:02:06.967586031 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2019186Z 2025-12-04T11:07:13.2019714Z [W1204 11:02:06.971456051 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2020366Z 2025-12-04T11:07:13.2020880Z [W1204 11:02:06.972071580 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2021546Z 2025-12-04T11:07:13.2022060Z [W1204 11:02:06.972266919 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2022724Z 2025-12-04T11:07:13.2023234Z [W1204 11:02:06.978181859 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2023885Z 2025-12-04T11:07:13.2024412Z [W1204 11:02:06.978948212 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2025060Z 2025-12-04T11:07:13.2025676Z [W1204 11:02:06.979154676 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2026336Z 2025-12-04T11:07:13.2026468Z ('RERUN', {'yellow': True}) [0.4457s] [100%] 2025-12-04T11:07:13.2028021Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:02:07.290413930 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2029477Z 2025-12-04T11:07:13.2029999Z [W1204 11:02:07.291109257 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2030667Z 2025-12-04T11:07:13.2031185Z [W1204 11:02:07.291309638 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2031842Z 2025-12-04T11:07:13.2032374Z [W1204 11:02:07.295193501 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2033025Z 2025-12-04T11:07:13.2033558Z [W1204 11:02:07.295795626 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2034217Z 2025-12-04T11:07:13.2034733Z [W1204 11:02:07.295985300 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2035396Z 2025-12-04T11:07:13.2035911Z [W1204 11:02:07.302000587 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2036582Z 2025-12-04T11:07:13.2037095Z [W1204 11:02:07.302610326 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2037747Z 2025-12-04T11:07:13.2038311Z [W1204 11:02:07.302800928 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2038962Z 2025-12-04T11:07:13.2039484Z [W1204 11:02:07.386582317 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2040141Z 2025-12-04T11:07:13.2040657Z [W1204 11:02:07.387229770 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2041320Z 2025-12-04T11:07:13.2041835Z [W1204 11:02:07.387427019 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2042497Z 2025-12-04T11:07:13.2043012Z [W1204 11:02:07.391287162 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2043670Z 2025-12-04T11:07:13.2044200Z [W1204 11:02:07.391896237 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2044860Z 2025-12-04T11:07:13.2045384Z [W1204 11:02:07.392092765 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2046038Z 2025-12-04T11:07:13.2046555Z [W1204 11:02:07.397942810 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2047219Z 2025-12-04T11:07:13.2047735Z [W1204 11:02:07.398705620 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2048396Z 2025-12-04T11:07:13.2048907Z [W1204 11:02:07.398901091 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2049560Z 2025-12-04T11:07:13.2049675Z FAILED [0.4176s] [100%] 2025-12-04T11:07:13.2049856Z 2025-12-04T11:07:13.2050057Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2050879Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2051658Z Traceback (most recent call last): 2025-12-04T11:07:13.2052449Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2053331Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2054173Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2054944Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2055789Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2056670Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2057149Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2057401Z 2025-12-04T11:07:13.2057522Z Expected 1 but got 2. 2025-12-04T11:07:13.2057804Z Absolute difference: 1 2025-12-04T11:07:13.2058101Z Relative difference: 1.0 2025-12-04T11:07:13.2058378Z 2025-12-04T11:07:13.2058601Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2059876Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2060920Z 2025-12-04T11:07:13.2061188Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2061822Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2062310Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2063436Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2064732Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2065208Z graph_break [] 2025-12-04T11:07:13.2065594Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2067180Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2068636Z if out == self.unknown_value: 2025-12-04T11:07:13.2069593Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2070570Z warnings.warn( 2025-12-04T11:07:13.2071469Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2072427Z warnings.warn( 2025-12-04T11:07:13.2073105Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2073886Z Traceback (most recent call last): 2025-12-04T11:07:13.2074632Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2075518Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2076351Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2077116Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2077992Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2078918Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2079384Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2079650Z 2025-12-04T11:07:13.2079759Z Expected 1 but got 2. 2025-12-04T11:07:13.2080084Z Absolute difference: 1 2025-12-04T11:07:13.2080373Z Relative difference: 1.0 2025-12-04T11:07:13.2080576Z 2025-12-04T11:07:13.2080793Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2082069Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2083114Z 2025-12-04T11:07:13.2083393Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2084016Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2084510Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2085636Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2086903Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2087363Z graph_break [] 2025-12-04T11:07:13.2087930Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2089630Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2091102Z if out == self.unknown_value: 2025-12-04T11:07:13.2092048Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2093102Z warnings.warn( 2025-12-04T11:07:13.2094003Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2094976Z warnings.warn( 2025-12-04T11:07:13.2095344Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2095820Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2096265Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2097522Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2098731Z graph_break [] 2025-12-04T11:07:13.2099103Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2100208Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2101166Z warnings.warn( 2025-12-04T11:07:13.2102052Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2103024Z warnings.warn( 2025-12-04T11:07:13.2103323Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2104124Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2104899Z Traceback (most recent call last): 2025-12-04T11:07:13.2105650Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2106592Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2107472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2108240Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2109127Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2110004Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2110475Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2110726Z 2025-12-04T11:07:13.2110840Z Expected 1 but got 2. 2025-12-04T11:07:13.2111113Z Absolute difference: 1 2025-12-04T11:07:13.2111410Z Relative difference: 1.0 2025-12-04T11:07:13.2111600Z 2025-12-04T11:07:13.2111829Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2113101Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2114147Z 2025-12-04T11:07:13.2114414Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2115046Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2115527Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2116646Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2117894Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2118363Z graph_break [] 2025-12-04T11:07:13.2118737Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2120324Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2121834Z if out == self.unknown_value: 2025-12-04T11:07:13.2122787Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2123762Z warnings.warn( 2025-12-04T11:07:13.2124634Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2125606Z warnings.warn( 2025-12-04T11:07:13.2125989Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2126465Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2126902Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2128169Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2129297Z graph_break [] 2025-12-04T11:07:13.2129673Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2130759Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2131725Z warnings.warn( 2025-12-04T11:07:13.2132610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2133561Z warnings.warn( 2025-12-04T11:07:13.2133937Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2134456Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2134934Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2136186Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2137345Z graph_break [] 2025-12-04T11:07:13.2137717Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2138883Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2139847Z warnings.warn( 2025-12-04T11:07:13.2140735Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2141692Z warnings.warn( 2025-12-04T11:07:13.2142691Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ed81db7834398b8b.xml - 2025-12-04T11:07:13.2143839Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2145101Z FAILED [0.4176s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2146168Z 2025-12-04T11:07:13.2146287Z Expected 1 but got 2. 2025-12-04T11:07:13.2146565Z Absolute difference: 1 2025-12-04T11:07:13.2146859Z Relative difference: 1.0 2025-12-04T11:07:13.2147061Z 2025-12-04T11:07:13.2147275Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2148533Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2149611Z 2025-12-04T11:07:13.2149882Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2150475Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2151001Z ================== 1 failed, 10 deselected, 2 rerun in 20.05s ================== 2025-12-04T11:07:13.2151442Z Got exit code 1 2025-12-04T11:07:13.2151698Z Retrying single test... 2025-12-04T11:07:13.2152333Z W1204 11:02:18.307000 86844 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2153580Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-261b1bf878c6e8ce.xml 2025-12-04T11:07:13.2154550Z ============================= test session starts ============================== 2025-12-04T11:07:13.2155205Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2155819Z cachedir: .pytest_cache 2025-12-04T11:07:13.2156536Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2157328Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2157688Z configfile: pytest.ini 2025-12-04T11:07:13.2158417Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2159319Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2160659Z stepcurrent: skipping 7 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2161909Z Running 1 items in this shard 2025-12-04T11:07:13.2162117Z 2025-12-04T11:07:13.2163485Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:02:23.849971965 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2164925Z 2025-12-04T11:07:13.2165461Z [W1204 11:02:39.553096213 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2166123Z 2025-12-04T11:07:13.2166648Z [W1204 11:02:39.553361118 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2167304Z 2025-12-04T11:07:13.2167817Z [W1204 11:02:39.560639831 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2168478Z 2025-12-04T11:07:13.2168993Z [W1204 11:02:39.561323711 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2169661Z 2025-12-04T11:07:13.2170180Z [W1204 11:02:39.561516463 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2170838Z 2025-12-04T11:07:13.2171354Z [W1204 11:02:39.568306585 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2172004Z 2025-12-04T11:07:13.2172522Z [W1204 11:02:39.569042522 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2173173Z 2025-12-04T11:07:13.2173688Z [W1204 11:02:39.569226004 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2174347Z 2025-12-04T11:07:13.2174859Z [W1204 11:02:39.700580703 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2175558Z 2025-12-04T11:07:13.2176073Z [W1204 11:02:39.702188464 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2176725Z 2025-12-04T11:07:13.2177250Z [W1204 11:02:39.702391306 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2177903Z 2025-12-04T11:07:13.2178501Z [W1204 11:02:39.706273338 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2179155Z 2025-12-04T11:07:13.2179667Z [W1204 11:02:39.706888553 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2180332Z 2025-12-04T11:07:13.2180847Z [W1204 11:02:39.707102123 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2181512Z 2025-12-04T11:07:13.2182022Z [W1204 11:02:39.713082353 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2182676Z 2025-12-04T11:07:13.2183198Z [W1204 11:02:39.713703859 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2183851Z 2025-12-04T11:07:13.2184375Z [W1204 11:02:39.713895103 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2185027Z 2025-12-04T11:07:13.2185163Z ('RERUN', {'yellow': True}) [19.5462s] [100%] 2025-12-04T11:07:13.2186744Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:02:39.089246135 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2188460Z 2025-12-04T11:07:13.2189047Z [W1204 11:02:39.089960469 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2189718Z 2025-12-04T11:07:13.2190228Z [W1204 11:02:39.090184642 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2190920Z 2025-12-04T11:07:13.2191445Z [W1204 11:02:39.094169648 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2192092Z 2025-12-04T11:07:13.2192616Z [W1204 11:02:39.094773694 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2193263Z 2025-12-04T11:07:13.2193779Z [W1204 11:02:39.094961946 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2194444Z 2025-12-04T11:07:13.2194961Z [W1204 11:02:39.100988082 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2195618Z 2025-12-04T11:07:13.2196127Z [W1204 11:02:39.101589136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2196777Z 2025-12-04T11:07:13.2197298Z [W1204 11:02:39.101774768 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2197947Z 2025-12-04T11:07:13.2198470Z [W1204 11:02:40.186266701 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2199119Z 2025-12-04T11:07:13.2199631Z [W1204 11:02:40.186936257 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2200332Z 2025-12-04T11:07:13.2200850Z [W1204 11:02:40.187146507 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2201512Z 2025-12-04T11:07:13.2202023Z [W1204 11:02:40.191017109 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2202676Z 2025-12-04T11:07:13.2203201Z [W1204 11:02:40.191645733 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2203852Z 2025-12-04T11:07:13.2204375Z [W1204 11:02:40.191838582 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2205024Z 2025-12-04T11:07:13.2205540Z [W1204 11:02:40.197733439 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2206206Z 2025-12-04T11:07:13.2206726Z [W1204 11:02:40.198503011 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2207392Z 2025-12-04T11:07:13.2207901Z [W1204 11:02:40.198694713 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2208549Z 2025-12-04T11:07:13.2208693Z ('RERUN', {'yellow': True}) [0.4460s] [100%] 2025-12-04T11:07:13.2210234Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:02:40.514436730 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2211642Z 2025-12-04T11:07:13.2212157Z [W1204 11:02:40.515128300 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2212818Z 2025-12-04T11:07:13.2213407Z [W1204 11:02:40.515325151 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2214070Z 2025-12-04T11:07:13.2214583Z [W1204 11:02:40.519216087 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2215280Z 2025-12-04T11:07:13.2215805Z [W1204 11:02:40.519830079 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2216453Z 2025-12-04T11:07:13.2216971Z [W1204 11:02:40.520070697 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2217617Z 2025-12-04T11:07:13.2218127Z [W1204 11:02:40.526035432 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2218863Z 2025-12-04T11:07:13.2219381Z [W1204 11:02:40.526632935 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2220045Z 2025-12-04T11:07:13.2220556Z [W1204 11:02:40.526819941 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2221204Z 2025-12-04T11:07:13.2221722Z [W1204 11:02:40.611613234 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2222371Z 2025-12-04T11:07:13.2222893Z [W1204 11:02:40.612305408 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2223544Z 2025-12-04T11:07:13.2224053Z [W1204 11:02:40.612504053 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2224714Z 2025-12-04T11:07:13.2225230Z [W1204 11:02:40.616363504 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2225933Z 2025-12-04T11:07:13.2226445Z [W1204 11:02:40.616979137 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2227108Z 2025-12-04T11:07:13.2227621Z [W1204 11:02:40.617171319 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2228268Z 2025-12-04T11:07:13.2228789Z [W1204 11:02:40.623131918 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2229437Z 2025-12-04T11:07:13.2229958Z [W1204 11:02:40.623902251 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2230607Z 2025-12-04T11:07:13.2231122Z [W1204 11:02:40.624097258 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2231791Z 2025-12-04T11:07:13.2231896Z FAILED [0.4235s] [100%] 2025-12-04T11:07:13.2232069Z 2025-12-04T11:07:13.2232224Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2233030Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2233788Z Traceback (most recent call last): 2025-12-04T11:07:13.2234543Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2235423Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2236249Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2237016Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2237890Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2238810Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2239275Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2239564Z 2025-12-04T11:07:13.2239672Z Expected 1 but got 2. 2025-12-04T11:07:13.2239958Z Absolute difference: 1 2025-12-04T11:07:13.2240243Z Relative difference: 1.0 2025-12-04T11:07:13.2240440Z 2025-12-04T11:07:13.2240655Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2241921Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2242962Z 2025-12-04T11:07:13.2243236Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2243859Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2244344Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2245462Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2246727Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2247187Z graph_break [] 2025-12-04T11:07:13.2247562Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2249150Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2250620Z if out == self.unknown_value: 2025-12-04T11:07:13.2251569Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2252584Z warnings.warn( 2025-12-04T11:07:13.2253475Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2254452Z warnings.warn( 2025-12-04T11:07:13.2255116Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2255902Z Traceback (most recent call last): 2025-12-04T11:07:13.2256657Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2257531Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2258437Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2259217Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2260064Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2260941Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2261425Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2261678Z 2025-12-04T11:07:13.2261801Z Expected 1 but got 2. 2025-12-04T11:07:13.2262090Z Absolute difference: 1 2025-12-04T11:07:13.2262387Z Relative difference: 1.0 2025-12-04T11:07:13.2262593Z 2025-12-04T11:07:13.2262807Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2264075Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2265113Z 2025-12-04T11:07:13.2265427Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2266097Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2266588Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2267702Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2268978Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2269452Z graph_break [] 2025-12-04T11:07:13.2269823Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2271400Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2272861Z if out == self.unknown_value: 2025-12-04T11:07:13.2273826Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2274793Z warnings.warn( 2025-12-04T11:07:13.2275686Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2276650Z warnings.warn( 2025-12-04T11:07:13.2277030Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2277508Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2277935Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2279204Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2280370Z graph_break [] 2025-12-04T11:07:13.2280746Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2281836Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2282806Z warnings.warn( 2025-12-04T11:07:13.2283691Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2284656Z warnings.warn( 2025-12-04T11:07:13.2284956Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2285763Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2286535Z Traceback (most recent call last): 2025-12-04T11:07:13.2287288Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2288488Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2289331Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2290104Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2290936Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2291830Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2292305Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2292554Z 2025-12-04T11:07:13.2292674Z Expected 1 but got 2. 2025-12-04T11:07:13.2292950Z Absolute difference: 1 2025-12-04T11:07:13.2293245Z Relative difference: 1.0 2025-12-04T11:07:13.2293432Z 2025-12-04T11:07:13.2293746Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2295121Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2296224Z 2025-12-04T11:07:13.2296490Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2297121Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2297601Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2298764Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2300033Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2300511Z graph_break [] 2025-12-04T11:07:13.2300894Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2302468Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2303930Z if out == self.unknown_value: 2025-12-04T11:07:13.2304882Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2305854Z warnings.warn( 2025-12-04T11:07:13.2306735Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2307709Z warnings.warn( 2025-12-04T11:07:13.2308096Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2308618Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2309063Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2310322Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2311461Z graph_break [] 2025-12-04T11:07:13.2311819Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2312912Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2313880Z warnings.warn( 2025-12-04T11:07:13.2314770Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2315734Z warnings.warn( 2025-12-04T11:07:13.2316123Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2316604Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2317035Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2318302Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2319440Z graph_break [] 2025-12-04T11:07:13.2319810Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2320896Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2321861Z warnings.warn( 2025-12-04T11:07:13.2322830Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2323801Z warnings.warn( 2025-12-04T11:07:13.2324798Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-261b1bf878c6e8ce.xml - 2025-12-04T11:07:13.2335097Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2336484Z FAILED [0.4235s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2337552Z 2025-12-04T11:07:13.2337665Z Expected 1 but got 2. 2025-12-04T11:07:13.2337963Z Absolute difference: 1 2025-12-04T11:07:13.2338364Z Relative difference: 1.0 2025-12-04T11:07:13.2338560Z 2025-12-04T11:07:13.2338791Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2340075Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2341133Z 2025-12-04T11:07:13.2341404Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2341995Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2342522Z ================== 1 failed, 10 deselected, 2 rerun in 20.45s ================== 2025-12-04T11:07:13.2342958Z Got exit code 1 2025-12-04T11:07:13.2343945Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2345321Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.2346438Z W1204 11:02:51.414000 87018 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2347690Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-753aefcc31d0e7df.xml 2025-12-04T11:07:13.2348797Z ============================= test session starts ============================== 2025-12-04T11:07:13.2349467Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2350060Z cachedir: .pytest_cache 2025-12-04T11:07:13.2350779Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2351573Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2351927Z configfile: pytest.ini 2025-12-04T11:07:13.2352648Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2353554Z collecting ... collected 58 items / 8 deselected / 50 selected 2025-12-04T11:07:13.2354049Z stepcurrent: skipping 8 already run items. 2025-12-04T11:07:13.2354419Z Running 3 items in this shard 2025-12-04T11:07:13.2354643Z 2025-12-04T11:07:13.2355520Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8539s] [ 33%] 2025-12-04T11:07:13.2357392Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4331s] [ 33%] 2025-12-04T11:07:13.2359169Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 FAILED [0.4317s] [ 33%] 2025-12-04T11:07:13.2360079Z 2025-12-04T11:07:13.2360301Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2361132Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2361908Z Traceback (most recent call last): 2025-12-04T11:07:13.2362691Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2363561Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2364390Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2365156Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2365997Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2366204Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2366339Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2366345Z 2025-12-04T11:07:13.2366464Z Expected 1 but got 2. 2025-12-04T11:07:13.2366571Z Absolute difference: 1 2025-12-04T11:07:13.2366691Z Relative difference: 1.0 2025-12-04T11:07:13.2366696Z 2025-12-04T11:07:13.2366918Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2367830Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2367836Z 2025-12-04T11:07:13.2368111Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2368335Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2368463Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2369374Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2369639Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2369752Z graph_break [] 2025-12-04T11:07:13.2369972Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2370714Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2370832Z warnings.warn( 2025-12-04T11:07:13.2371557Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2371670Z warnings.warn( 2025-12-04T11:07:13.2372183Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2372307Z Traceback (most recent call last): 2025-12-04T11:07:13.2372833Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2373067Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2373542Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2373706Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2374246Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2374460Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2374593Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2374599Z 2025-12-04T11:07:13.2374705Z Expected 1 but got 2. 2025-12-04T11:07:13.2374823Z Absolute difference: 1 2025-12-04T11:07:13.2375000Z Relative difference: 1.0 2025-12-04T11:07:13.2375006Z 2025-12-04T11:07:13.2375266Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2376180Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2376219Z 2025-12-04T11:07:13.2376487Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2376719Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2376838Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2377751Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2377985Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2378088Z graph_break [] 2025-12-04T11:07:13.2378400Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2379146Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2379260Z warnings.warn( 2025-12-04T11:07:13.2379995Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2380099Z warnings.warn( 2025-12-04T11:07:13.2380330Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2380447Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2380675Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2381587Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2381728Z graph_break [] 2025-12-04T11:07:13.2381951Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2382683Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2382783Z warnings.warn( 2025-12-04T11:07:13.2383515Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2383615Z warnings.warn( 2025-12-04T11:07:13.2383771Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2384281Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2384410Z Traceback (most recent call last): 2025-12-04T11:07:13.2384933Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2385170Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2385631Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2385804Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2386342Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2386556Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2386686Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2386692Z 2025-12-04T11:07:13.2386795Z Expected 1 but got 2. 2025-12-04T11:07:13.2386946Z Absolute difference: 1 2025-12-04T11:07:13.2387056Z Relative difference: 1.0 2025-12-04T11:07:13.2387061Z 2025-12-04T11:07:13.2387306Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2388518Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2388598Z 2025-12-04T11:07:13.2388871Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2389104Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2389220Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2390115Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2390358Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2390460Z graph_break [] 2025-12-04T11:07:13.2390688Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2391426Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2391529Z warnings.warn( 2025-12-04T11:07:13.2392267Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2392366Z warnings.warn( 2025-12-04T11:07:13.2392593Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2392705Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2392932Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2393888Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2393989Z graph_break [] 2025-12-04T11:07:13.2394206Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2394947Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2395047Z warnings.warn( 2025-12-04T11:07:13.2395780Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2395877Z warnings.warn( 2025-12-04T11:07:13.2396089Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2396217Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2396447Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2397357Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2397460Z graph_break [] 2025-12-04T11:07:13.2397674Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2398408Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2398506Z warnings.warn( 2025-12-04T11:07:13.2399231Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2399341Z warnings.warn( 2025-12-04T11:07:13.2400302Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-753aefcc31d0e7df.xml - 2025-12-04T11:07:13.2400488Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2401467Z FAILED [0.4317s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2401473Z 2025-12-04T11:07:13.2401596Z Expected 1 but got 2. 2025-12-04T11:07:13.2401702Z Absolute difference: 1 2025-12-04T11:07:13.2401808Z Relative difference: 1.0 2025-12-04T11:07:13.2401813Z 2025-12-04T11:07:13.2402042Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2402958Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2402966Z 2025-12-04T11:07:13.2403246Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2403425Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2403624Z =================== 1 failed, 8 deselected, 2 rerun in 4.75s =================== 2025-12-04T11:07:13.2403735Z Got exit code 1 2025-12-04T11:07:13.2403841Z Retrying single test... 2025-12-04T11:07:13.2404288Z W1204 11:03:10.914000 87194 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2404963Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-adc6d5470587661c.xml 2025-12-04T11:07:13.2405127Z ============================= test session starts ============================== 2025-12-04T11:07:13.2405490Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2405627Z cachedir: .pytest_cache 2025-12-04T11:07:13.2406153Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2406288Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2406393Z configfile: pytest.ini 2025-12-04T11:07:13.2406936Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2407165Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2408152Z stepcurrent: skipping 8 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2408275Z Running 1 items in this shard 2025-12-04T11:07:13.2408281Z 2025-12-04T11:07:13.2409581Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 [W1204 11:03:16.496870964 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2409590Z 2025-12-04T11:07:13.2410121Z [W1204 11:03:32.393313616 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2410127Z 2025-12-04T11:07:13.2410640Z [W1204 11:03:32.393640425 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2410645Z 2025-12-04T11:07:13.2411154Z [W1204 11:03:32.400982851 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2411171Z 2025-12-04T11:07:13.2411712Z [W1204 11:03:32.401669797 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2411720Z 2025-12-04T11:07:13.2412260Z [W1204 11:03:32.401862390 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2412265Z 2025-12-04T11:07:13.2412813Z [W1204 11:03:32.408738017 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2412818Z 2025-12-04T11:07:13.2413329Z [W1204 11:03:32.409477749 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2413333Z 2025-12-04T11:07:13.2413854Z [W1204 11:03:32.409661757 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2413860Z 2025-12-04T11:07:13.2414374Z [W1204 11:03:32.542446504 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2414381Z 2025-12-04T11:07:13.2414902Z [W1204 11:03:32.544128600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2414907Z 2025-12-04T11:07:13.2415417Z [W1204 11:03:32.544352513 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2415425Z 2025-12-04T11:07:13.2415945Z [W1204 11:03:32.548288543 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2415949Z 2025-12-04T11:07:13.2416459Z [W1204 11:03:32.548916948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2416464Z 2025-12-04T11:07:13.2416971Z [W1204 11:03:32.549111025 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2417020Z 2025-12-04T11:07:13.2417531Z [W1204 11:03:32.555201690 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2417536Z 2025-12-04T11:07:13.2418044Z [W1204 11:03:32.555835711 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2418051Z 2025-12-04T11:07:13.2418663Z [W1204 11:03:32.556029654 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2418670Z 2025-12-04T11:07:13.2418803Z ('RERUN', {'yellow': True}) [19.7618s] [100%] 2025-12-04T11:07:13.2420099Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 [W1204 11:03:32.936444169 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2420108Z 2025-12-04T11:07:13.2420622Z [W1204 11:03:32.937136046 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2420627Z 2025-12-04T11:07:13.2421152Z [W1204 11:03:32.937331784 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2421160Z 2025-12-04T11:07:13.2421668Z [W1204 11:03:32.941358791 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2421673Z 2025-12-04T11:07:13.2422191Z [W1204 11:03:32.941980537 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2422195Z 2025-12-04T11:07:13.2422705Z [W1204 11:03:32.942170688 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2422751Z 2025-12-04T11:07:13.2423290Z [W1204 11:03:32.948285247 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2423295Z 2025-12-04T11:07:13.2423816Z [W1204 11:03:32.948897469 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2423853Z 2025-12-04T11:07:13.2424362Z [W1204 11:03:32.949084636 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2424366Z 2025-12-04T11:07:13.2424889Z [W1204 11:03:32.034047944 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2424894Z 2025-12-04T11:07:13.2425402Z [W1204 11:03:32.034746490 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2425406Z 2025-12-04T11:07:13.2425933Z [W1204 11:03:32.034946986 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2425937Z 2025-12-04T11:07:13.2426447Z [W1204 11:03:32.038888107 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2426454Z 2025-12-04T11:07:13.2426974Z [W1204 11:03:32.039519444 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2426978Z 2025-12-04T11:07:13.2427487Z [W1204 11:03:32.039714748 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2427491Z 2025-12-04T11:07:13.2428000Z [W1204 11:03:32.045739022 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2428015Z 2025-12-04T11:07:13.2428562Z [W1204 11:03:32.046525751 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2428567Z 2025-12-04T11:07:13.2429077Z [W1204 11:03:32.046721292 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2429084Z 2025-12-04T11:07:13.2429227Z ('RERUN', {'yellow': True}) [0.4517s] [100%] 2025-12-04T11:07:13.2430510Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 [W1204 11:03:33.361719031 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2430516Z 2025-12-04T11:07:13.2431043Z [W1204 11:03:33.362407526 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2431047Z 2025-12-04T11:07:13.2431565Z [W1204 11:03:33.362604027 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2431570Z 2025-12-04T11:07:13.2432101Z [W1204 11:03:33.366557986 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2432108Z 2025-12-04T11:07:13.2432620Z [W1204 11:03:33.367183815 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2432625Z 2025-12-04T11:07:13.2433153Z [W1204 11:03:33.367372945 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2433158Z 2025-12-04T11:07:13.2433669Z [W1204 11:03:33.373518149 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2433674Z 2025-12-04T11:07:13.2434214Z [W1204 11:03:33.374133187 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2434261Z 2025-12-04T11:07:13.2434775Z [W1204 11:03:33.374320938 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2434811Z 2025-12-04T11:07:13.2435320Z [W1204 11:03:33.457940582 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2435325Z 2025-12-04T11:07:13.2435850Z [W1204 11:03:33.458593658 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2435854Z 2025-12-04T11:07:13.2436367Z [W1204 11:03:33.458791094 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2436371Z 2025-12-04T11:07:13.2436896Z [W1204 11:03:33.462700344 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2436903Z 2025-12-04T11:07:13.2437412Z [W1204 11:03:33.463329461 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2437419Z 2025-12-04T11:07:13.2437938Z [W1204 11:03:33.463523129 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2437943Z 2025-12-04T11:07:13.2438452Z [W1204 11:03:33.469493772 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2438457Z 2025-12-04T11:07:13.2438977Z [W1204 11:03:33.470296812 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2438982Z 2025-12-04T11:07:13.2439494Z [W1204 11:03:33.470495603 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2439542Z 2025-12-04T11:07:13.2439648Z FAILED [0.4211s] [100%] 2025-12-04T11:07:13.2439653Z 2025-12-04T11:07:13.2439810Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2440318Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2440451Z Traceback (most recent call last): 2025-12-04T11:07:13.2440968Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2441199Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2441677Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2441840Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2442384Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2442602Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2442736Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2442741Z 2025-12-04T11:07:13.2442858Z Expected 1 but got 2. 2025-12-04T11:07:13.2442965Z Absolute difference: 1 2025-12-04T11:07:13.2443073Z Relative difference: 1.0 2025-12-04T11:07:13.2443078Z 2025-12-04T11:07:13.2443307Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2444221Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2444227Z 2025-12-04T11:07:13.2444504Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2444756Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2444876Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2445820Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2446076Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2446184Z graph_break [] 2025-12-04T11:07:13.2446401Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2447620Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2447749Z if out == self.unknown_value: 2025-12-04T11:07:13.2448481Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2448600Z warnings.warn( 2025-12-04T11:07:13.2449326Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2449429Z warnings.warn( 2025-12-04T11:07:13.2449950Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2450073Z Traceback (most recent call last): 2025-12-04T11:07:13.2450586Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2450826Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2451287Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2451495Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2452036Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2452241Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2452383Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2452388Z 2025-12-04T11:07:13.2452491Z Expected 1 but got 2. 2025-12-04T11:07:13.2452597Z Absolute difference: 1 2025-12-04T11:07:13.2452716Z Relative difference: 1.0 2025-12-04T11:07:13.2452721Z 2025-12-04T11:07:13.2452935Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2453858Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2453863Z 2025-12-04T11:07:13.2454129Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2454353Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2454481Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2455379Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2455619Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2455719Z graph_break [] 2025-12-04T11:07:13.2455937Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2457163Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2457313Z if out == self.unknown_value: 2025-12-04T11:07:13.2458088Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2458188Z warnings.warn( 2025-12-04T11:07:13.2459028Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2459147Z warnings.warn( 2025-12-04T11:07:13.2459365Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2459495Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2459721Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2460621Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2460735Z graph_break [] 2025-12-04T11:07:13.2460952Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2461683Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2461796Z warnings.warn( 2025-12-04T11:07:13.2462521Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2462633Z warnings.warn( 2025-12-04T11:07:13.2462775Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2463284Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2463419Z Traceback (most recent call last): 2025-12-04T11:07:13.2463973Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2464219Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2464679Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2464845Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2465401Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2465610Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2465742Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2465748Z 2025-12-04T11:07:13.2465861Z Expected 1 but got 2. 2025-12-04T11:07:13.2465967Z Absolute difference: 1 2025-12-04T11:07:13.2466089Z Relative difference: 1.0 2025-12-04T11:07:13.2466093Z 2025-12-04T11:07:13.2466313Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2467229Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2467237Z 2025-12-04T11:07:13.2467515Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2467735Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2467863Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2468764Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2468993Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2469106Z graph_break [] 2025-12-04T11:07:13.2469351Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2470609Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2470756Z if out == self.unknown_value: 2025-12-04T11:07:13.2471482Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2471592Z warnings.warn( 2025-12-04T11:07:13.2472317Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2472416Z warnings.warn( 2025-12-04T11:07:13.2472642Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2472762Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2473003Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2473907Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2474007Z graph_break [] 2025-12-04T11:07:13.2474233Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2474959Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2475072Z warnings.warn( 2025-12-04T11:07:13.2475792Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2475927Z warnings.warn( 2025-12-04T11:07:13.2476156Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2476270Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2476495Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2477407Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2477505Z graph_break [] 2025-12-04T11:07:13.2477731Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2478457Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2478559Z warnings.warn( 2025-12-04T11:07:13.2479297Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2479398Z warnings.warn( 2025-12-04T11:07:13.2480247Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-adc6d5470587661c.xml - 2025-12-04T11:07:13.2480422Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2481365Z FAILED [0.4211s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2481371Z 2025-12-04T11:07:13.2481485Z Expected 1 but got 2. 2025-12-04T11:07:13.2481590Z Absolute difference: 1 2025-12-04T11:07:13.2481713Z Relative difference: 1.0 2025-12-04T11:07:13.2481718Z 2025-12-04T11:07:13.2481964Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2482911Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2482916Z 2025-12-04T11:07:13.2483244Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2483427Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2483635Z ================== 1 failed, 10 deselected, 2 rerun in 20.67s ================== 2025-12-04T11:07:13.2483735Z Got exit code 1 2025-12-04T11:07:13.2483842Z Retrying single test... 2025-12-04T11:07:13.2484301Z W1204 11:03:44.496000 87375 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2484969Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bbb1474dafa92442.xml 2025-12-04T11:07:13.2485138Z ============================= test session starts ============================== 2025-12-04T11:07:13.2485503Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2485613Z cachedir: .pytest_cache 2025-12-04T11:07:13.2486151Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2486277Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2486387Z configfile: pytest.ini 2025-12-04T11:07:13.2486946Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2487165Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2488481Z stepcurrent: skipping 8 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2488671Z Running 1 items in this shard 2025-12-04T11:07:13.2488677Z 2025-12-04T11:07:13.2489968Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 [W1204 11:03:49.043822352 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2489977Z 2025-12-04T11:07:13.2490518Z [W1204 11:04:05.539091996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2490523Z 2025-12-04T11:07:13.2491035Z [W1204 11:04:05.539345559 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2491040Z 2025-12-04T11:07:13.2491571Z [W1204 11:04:05.546565755 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2491578Z 2025-12-04T11:07:13.2492092Z [W1204 11:04:05.547264922 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2492097Z 2025-12-04T11:07:13.2492619Z [W1204 11:04:05.547452592 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2492624Z 2025-12-04T11:07:13.2493135Z [W1204 11:04:05.554280781 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2493140Z 2025-12-04T11:07:13.2493659Z [W1204 11:04:05.555027251 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2493664Z 2025-12-04T11:07:13.2494216Z [W1204 11:04:05.555224694 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2494223Z 2025-12-04T11:07:13.2494776Z [W1204 11:04:05.685020087 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2494790Z 2025-12-04T11:07:13.2495299Z [W1204 11:04:05.686594674 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2495344Z 2025-12-04T11:07:13.2495856Z [W1204 11:04:05.686801073 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2495860Z 2025-12-04T11:07:13.2496383Z [W1204 11:04:05.690700816 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2496388Z 2025-12-04T11:07:13.2496897Z [W1204 11:04:05.691326020 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2496907Z 2025-12-04T11:07:13.2497432Z [W1204 11:04:05.691517541 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2497437Z 2025-12-04T11:07:13.2497948Z [W1204 11:04:05.697480159 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2497955Z 2025-12-04T11:07:13.2498542Z [W1204 11:04:05.698099451 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2498548Z 2025-12-04T11:07:13.2499059Z [W1204 11:04:05.698290858 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2499064Z 2025-12-04T11:07:13.2499210Z ('RERUN', {'yellow': True}) [19.3482s] [100%] 2025-12-04T11:07:13.2500499Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 [W1204 11:04:05.083468954 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2500542Z 2025-12-04T11:07:13.2501056Z [W1204 11:04:05.084164164 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2501078Z 2025-12-04T11:07:13.2501589Z [W1204 11:04:05.084366654 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2501594Z 2025-12-04T11:07:13.2502105Z [W1204 11:04:05.088296098 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2502111Z 2025-12-04T11:07:13.2502631Z [W1204 11:04:05.088911357 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2502638Z 2025-12-04T11:07:13.2503154Z [W1204 11:04:05.089102545 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2503170Z 2025-12-04T11:07:13.2503681Z [W1204 11:04:05.095286211 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2503689Z 2025-12-04T11:07:13.2504200Z [W1204 11:04:05.095892787 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2504204Z 2025-12-04T11:07:13.2504728Z [W1204 11:04:05.096079648 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2504734Z 2025-12-04T11:07:13.2505241Z [W1204 11:04:06.180198244 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2505245Z 2025-12-04T11:07:13.2505822Z [W1204 11:04:06.180855694 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2505827Z 2025-12-04T11:07:13.2506339Z [W1204 11:04:06.181055536 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2506375Z 2025-12-04T11:07:13.2506896Z [W1204 11:04:06.184884002 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2506900Z 2025-12-04T11:07:13.2507410Z [W1204 11:04:06.185491925 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2507415Z 2025-12-04T11:07:13.2507923Z [W1204 11:04:06.185686451 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2507939Z 2025-12-04T11:07:13.2508449Z [W1204 11:04:06.191661636 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2508456Z 2025-12-04T11:07:13.2508968Z [W1204 11:04:06.192432102 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2508975Z 2025-12-04T11:07:13.2509494Z [W1204 11:04:06.192626331 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2509499Z 2025-12-04T11:07:13.2509629Z ('RERUN', {'yellow': True}) [0.4556s] [100%] 2025-12-04T11:07:13.2510921Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 [W1204 11:04:06.505654212 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2510927Z 2025-12-04T11:07:13.2511443Z [W1204 11:04:06.506340559 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2511486Z 2025-12-04T11:07:13.2512006Z [W1204 11:04:06.506537934 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2512013Z 2025-12-04T11:07:13.2512526Z [W1204 11:04:06.510490727 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2512530Z 2025-12-04T11:07:13.2513052Z [W1204 11:04:06.511115827 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2513056Z 2025-12-04T11:07:13.2513569Z [W1204 11:04:06.511304302 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2513573Z 2025-12-04T11:07:13.2514087Z [W1204 11:04:06.517285638 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2514105Z 2025-12-04T11:07:13.2514614Z [W1204 11:04:06.517888581 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2514622Z 2025-12-04T11:07:13.2515131Z [W1204 11:04:06.518079061 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2515135Z 2025-12-04T11:07:13.2515662Z [W1204 11:04:06.601025626 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2515667Z 2025-12-04T11:07:13.2516177Z [W1204 11:04:06.601681138 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2516182Z 2025-12-04T11:07:13.2516737Z [W1204 11:04:06.601887225 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2516745Z 2025-12-04T11:07:13.2517287Z [W1204 11:04:06.605711107 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2517321Z 2025-12-04T11:07:13.2517844Z [W1204 11:04:06.606319181 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2517849Z 2025-12-04T11:07:13.2518360Z [W1204 11:04:06.606512507 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2518365Z 2025-12-04T11:07:13.2518889Z [W1204 11:04:06.612470486 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2518893Z 2025-12-04T11:07:13.2519408Z [W1204 11:04:06.613241786 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2519415Z 2025-12-04T11:07:13.2519923Z [W1204 11:04:06.613435365 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2519927Z 2025-12-04T11:07:13.2520044Z FAILED [0.4188s] [100%] 2025-12-04T11:07:13.2520049Z 2025-12-04T11:07:13.2520193Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2520710Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2520833Z Traceback (most recent call last): 2025-12-04T11:07:13.2521349Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2521592Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2522061Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2522273Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2522816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2523024Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2523169Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2523174Z 2025-12-04T11:07:13.2523277Z Expected 1 but got 2. 2025-12-04T11:07:13.2523385Z Absolute difference: 1 2025-12-04T11:07:13.2523505Z Relative difference: 1.0 2025-12-04T11:07:13.2523510Z 2025-12-04T11:07:13.2523726Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2524648Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2524656Z 2025-12-04T11:07:13.2524926Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2525148Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2525273Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2526172Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2526412Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2526512Z graph_break [] 2025-12-04T11:07:13.2526728Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2528066Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2528187Z if out == self.unknown_value: 2025-12-04T11:07:13.2528975Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2529110Z warnings.warn( 2025-12-04T11:07:13.2529836Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2529950Z warnings.warn( 2025-12-04T11:07:13.2530460Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2530595Z Traceback (most recent call last): 2025-12-04T11:07:13.2531104Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2531338Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2531816Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2531983Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2532521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2532737Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2532868Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2532873Z 2025-12-04T11:07:13.2532987Z Expected 1 but got 2. 2025-12-04T11:07:13.2533094Z Absolute difference: 1 2025-12-04T11:07:13.2533203Z Relative difference: 1.0 2025-12-04T11:07:13.2533208Z 2025-12-04T11:07:13.2533433Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2534347Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2534384Z 2025-12-04T11:07:13.2534661Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2534879Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2534997Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2535905Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2536133Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2536229Z graph_break [] 2025-12-04T11:07:13.2536458Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2537680Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2537815Z if out == self.unknown_value: 2025-12-04T11:07:13.2538616Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2538723Z warnings.warn( 2025-12-04T11:07:13.2539460Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2539559Z warnings.warn( 2025-12-04T11:07:13.2539788Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2539906Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2540135Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2541111Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2541211Z graph_break [] 2025-12-04T11:07:13.2541435Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2542201Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2542301Z warnings.warn( 2025-12-04T11:07:13.2543035Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2543135Z warnings.warn( 2025-12-04T11:07:13.2543279Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2543802Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2543931Z Traceback (most recent call last): 2025-12-04T11:07:13.2544457Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2544694Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2545153Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2545330Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2545869Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2546087Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2546217Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2546222Z 2025-12-04T11:07:13.2546328Z Expected 1 but got 2. 2025-12-04T11:07:13.2546481Z Absolute difference: 1 2025-12-04T11:07:13.2546591Z Relative difference: 1.0 2025-12-04T11:07:13.2546598Z 2025-12-04T11:07:13.2546813Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2547738Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2547746Z 2025-12-04T11:07:13.2548017Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2548248Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2548363Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2549253Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2549496Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2549597Z graph_break [] 2025-12-04T11:07:13.2549822Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2551039Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2551157Z if out == self.unknown_value: 2025-12-04T11:07:13.2551897Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2551999Z warnings.warn( 2025-12-04T11:07:13.2552769Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2552874Z warnings.warn( 2025-12-04T11:07:13.2553119Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2553244Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2553473Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2554399Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2554510Z graph_break [] 2025-12-04T11:07:13.2554722Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2555463Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2555567Z warnings.warn( 2025-12-04T11:07:13.2556296Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2556409Z warnings.warn( 2025-12-04T11:07:13.2556628Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2556755Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2556984Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2557881Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2557992Z graph_break [] 2025-12-04T11:07:13.2558207Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2558938Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2559084Z warnings.warn( 2025-12-04T11:07:13.2559804Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2559921Z warnings.warn( 2025-12-04T11:07:13.2560777Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bbb1474dafa92442.xml - 2025-12-04T11:07:13.2560945Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2561904Z FAILED [0.4188s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2561910Z 2025-12-04T11:07:13.2562015Z Expected 1 but got 2. 2025-12-04T11:07:13.2562139Z Absolute difference: 1 2025-12-04T11:07:13.2562249Z Relative difference: 1.0 2025-12-04T11:07:13.2562255Z 2025-12-04T11:07:13.2562473Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2563391Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2563399Z 2025-12-04T11:07:13.2563664Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2563854Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2564056Z ================== 1 failed, 10 deselected, 2 rerun in 20.25s ================== 2025-12-04T11:07:13.2564154Z Got exit code 1 2025-12-04T11:07:13.2565032Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2565481Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.2565939Z W1204 11:04:17.482000 87556 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2566642Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-e5cba0d160bf2991.xml 2025-12-04T11:07:13.2566808Z ============================= test session starts ============================== 2025-12-04T11:07:13.2567169Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2567280Z cachedir: .pytest_cache 2025-12-04T11:07:13.2567810Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2567934Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2568051Z configfile: pytest.ini 2025-12-04T11:07:13.2568611Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2568829Z collecting ... collected 58 items / 9 deselected / 49 selected 2025-12-04T11:07:13.2568974Z stepcurrent: skipping 9 already run items. 2025-12-04T11:07:13.2569100Z Running 2 items in this shard 2025-12-04T11:07:13.2569105Z 2025-12-04T11:07:13.2569990Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 ('RERUN', {'yellow': True}) [3.8698s] [ 50%] 2025-12-04T11:07:13.2570869Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4660s] [ 50%] 2025-12-04T11:07:13.2571662Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 FAILED [0.4618s] [ 50%] 2025-12-04T11:07:13.2571701Z 2025-12-04T11:07:13.2571857Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2572375Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2572503Z Traceback (most recent call last): 2025-12-04T11:07:13.2573026Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2573258Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2573723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2573898Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2574442Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2574664Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2574800Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2574806Z 2025-12-04T11:07:13.2574916Z Expected 1 but got 2. 2025-12-04T11:07:13.2575030Z Absolute difference: 1 2025-12-04T11:07:13.2575142Z Relative difference: 1.0 2025-12-04T11:07:13.2575148Z 2025-12-04T11:07:13.2575365Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2576295Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2576300Z 2025-12-04T11:07:13.2576568Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2576853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2576972Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2577534Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2577775Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2577906Z graph_break [] 2025-12-04T11:07:13.2578134Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2578949Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2579068Z warnings.warn( 2025-12-04T11:07:13.2579810Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2579913Z warnings.warn( 2025-12-04T11:07:13.2580446Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2580568Z Traceback (most recent call last): 2025-12-04T11:07:13.2581081Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2581326Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2581789Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2581954Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2582507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2582716Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2582855Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2582901Z 2025-12-04T11:07:13.2583008Z Expected 1 but got 2. 2025-12-04T11:07:13.2583115Z Absolute difference: 1 2025-12-04T11:07:13.2583239Z Relative difference: 1.0 2025-12-04T11:07:13.2583244Z 2025-12-04T11:07:13.2583458Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2584392Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2584397Z 2025-12-04T11:07:13.2584663Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2584879Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2585010Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2585544Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2585786Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2585885Z graph_break [] 2025-12-04T11:07:13.2586102Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2586847Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2586949Z warnings.warn( 2025-12-04T11:07:13.2587677Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2588064Z warnings.warn( 2025-12-04T11:07:13.2588320Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2588449Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2588681Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2589329Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2589445Z graph_break [] 2025-12-04T11:07:13.2589662Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2590434Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2590548Z warnings.warn( 2025-12-04T11:07:13.2591274Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2591388Z warnings.warn( 2025-12-04T11:07:13.2591535Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2592054Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2592193Z Traceback (most recent call last): 2025-12-04T11:07:13.2592710Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2592952Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2593417Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2593580Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2594133Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2594338Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2594470Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2594476Z 2025-12-04T11:07:13.2594590Z Expected 1 but got 2. 2025-12-04T11:07:13.2594695Z Absolute difference: 1 2025-12-04T11:07:13.2594858Z Relative difference: 1.0 2025-12-04T11:07:13.2594863Z 2025-12-04T11:07:13.2595082Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2596001Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2596008Z 2025-12-04T11:07:13.2596281Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2596495Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2596619Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2597149Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2597373Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2597487Z graph_break [] 2025-12-04T11:07:13.2597702Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2598443Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2598560Z warnings.warn( 2025-12-04T11:07:13.2599287Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2599401Z warnings.warn( 2025-12-04T11:07:13.2599618Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2599730Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2599967Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2600532Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2600635Z graph_break [] 2025-12-04T11:07:13.2600892Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2601625Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2601765Z warnings.warn( 2025-12-04T11:07:13.2602491Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2602592Z warnings.warn( 2025-12-04T11:07:13.2602817Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2602932Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2603171Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2603705Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2603807Z graph_break [] 2025-12-04T11:07:13.2604032Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2604754Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2604855Z warnings.warn( 2025-12-04T11:07:13.2605587Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2605684Z warnings.warn( 2025-12-04T11:07:13.2606542Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-e5cba0d160bf2991.xml - 2025-12-04T11:07:13.2606710Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2607693Z FAILED [0.4618s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2607711Z 2025-12-04T11:07:13.2607818Z Expected 1 but got 2. 2025-12-04T11:07:13.2607924Z Absolute difference: 1 2025-12-04T11:07:13.2608047Z Relative difference: 1.0 2025-12-04T11:07:13.2608052Z 2025-12-04T11:07:13.2608271Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2609187Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2609193Z 2025-12-04T11:07:13.2609467Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2609647Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2609857Z =================== 1 failed, 9 deselected, 2 rerun in 4.83s =================== 2025-12-04T11:07:13.2609953Z Got exit code 1 2025-12-04T11:07:13.2610061Z Retrying single test... 2025-12-04T11:07:13.2610521Z W1204 11:04:37.152000 87732 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2611194Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-df18bdebce6af33a.xml 2025-12-04T11:07:13.2611371Z ============================= test session starts ============================== 2025-12-04T11:07:13.2611723Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2611835Z cachedir: .pytest_cache 2025-12-04T11:07:13.2612373Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2612529Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2612676Z configfile: pytest.ini 2025-12-04T11:07:13.2613233Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2613480Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2614494Z stepcurrent: skipping 9 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2614605Z Running 1 items in this shard 2025-12-04T11:07:13.2614611Z 2025-12-04T11:07:13.2615913Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 [W1204 11:04:40.868316461 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2615933Z 2025-12-04T11:07:13.2616454Z [W1204 11:04:56.384680461 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2616461Z 2025-12-04T11:07:13.2616975Z [W1204 11:04:56.384946963 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2616982Z 2025-12-04T11:07:13.2617505Z [W1204 11:04:56.392238379 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2617510Z 2025-12-04T11:07:13.2618020Z [W1204 11:04:56.392907847 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2618026Z 2025-12-04T11:07:13.2618621Z [W1204 11:04:56.393097200 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2618688Z 2025-12-04T11:07:13.2619205Z [W1204 11:04:56.399894130 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2619209Z 2025-12-04T11:07:13.2619728Z [W1204 11:04:56.400536121 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2619735Z 2025-12-04T11:07:13.2620244Z [W1204 11:04:56.400726573 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2620249Z 2025-12-04T11:07:13.2620771Z [W1204 11:04:58.396523645 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2620777Z 2025-12-04T11:07:13.2621285Z [W1204 11:04:58.398152137 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2621293Z 2025-12-04T11:07:13.2621808Z [W1204 11:04:58.398360816 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2621826Z 2025-12-04T11:07:13.2622334Z [W1204 11:04:58.402305952 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2622341Z 2025-12-04T11:07:13.2622849Z [W1204 11:04:58.402941956 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2622855Z 2025-12-04T11:07:13.2623373Z [W1204 11:04:58.403151320 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2623378Z 2025-12-04T11:07:13.2623885Z [W1204 11:04:58.409169130 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2623889Z 2025-12-04T11:07:13.2624469Z [W1204 11:04:58.409785647 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2624474Z 2025-12-04T11:07:13.2624986Z [W1204 11:04:58.409980050 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2625023Z 2025-12-04T11:07:13.2625159Z ('RERUN', {'yellow': True}) [19.3876s] [100%] 2025-12-04T11:07:13.2626447Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 [W1204 11:04:58.827921970 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2626453Z 2025-12-04T11:07:13.2626974Z [W1204 11:04:58.828631486 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2626979Z 2025-12-04T11:07:13.2627496Z [W1204 11:04:58.828835460 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2627501Z 2025-12-04T11:07:13.2628009Z [W1204 11:04:58.832778366 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2628024Z 2025-12-04T11:07:13.2628530Z [W1204 11:04:58.833544292 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2628535Z 2025-12-04T11:07:13.2629045Z [W1204 11:04:58.833735326 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2629050Z 2025-12-04T11:07:13.2629565Z [W1204 11:04:58.839718024 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2629569Z 2025-12-04T11:07:13.2630116Z [W1204 11:04:58.840344905 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2630120Z 2025-12-04T11:07:13.2630638Z [W1204 11:04:58.840537600 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2630645Z 2025-12-04T11:07:13.2631151Z [W1204 11:04:58.924659408 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2631156Z 2025-12-04T11:07:13.2631679Z [W1204 11:04:58.925332438 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2631683Z 2025-12-04T11:07:13.2632192Z [W1204 11:04:58.925530491 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2632197Z 2025-12-04T11:07:13.2632703Z [W1204 11:04:58.929392619 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2632724Z 2025-12-04T11:07:13.2633234Z [W1204 11:04:58.930018008 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2633241Z 2025-12-04T11:07:13.2633747Z [W1204 11:04:58.930220391 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2633752Z 2025-12-04T11:07:13.2634271Z [W1204 11:04:58.936158584 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2634277Z 2025-12-04T11:07:13.2634784Z [W1204 11:04:58.936925809 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2634789Z 2025-12-04T11:07:13.2635339Z [W1204 11:04:58.937119131 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2635346Z 2025-12-04T11:07:13.2635508Z ('RERUN', {'yellow': True}) [0.4880s] [100%] 2025-12-04T11:07:13.2636808Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 [W1204 11:04:59.288712135 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2636845Z 2025-12-04T11:07:13.2637355Z [W1204 11:04:59.289411681 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2637360Z 2025-12-04T11:07:13.2637878Z [W1204 11:04:59.289611341 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2637883Z 2025-12-04T11:07:13.2638394Z [W1204 11:04:59.293540121 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2638401Z 2025-12-04T11:07:13.2638911Z [W1204 11:04:59.294307222 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2638928Z 2025-12-04T11:07:13.2639431Z [W1204 11:04:59.294502305 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2639436Z 2025-12-04T11:07:13.2639948Z [W1204 11:04:59.300525189 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2639953Z 2025-12-04T11:07:13.2640475Z [W1204 11:04:59.301136495 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2640480Z 2025-12-04T11:07:13.2640987Z [W1204 11:04:59.301324901 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2641021Z 2025-12-04T11:07:13.2641547Z [W1204 11:04:59.385050381 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2641552Z 2025-12-04T11:07:13.2642063Z [W1204 11:04:59.385711283 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2642068Z 2025-12-04T11:07:13.2642585Z [W1204 11:04:59.385909346 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2642590Z 2025-12-04T11:07:13.2643098Z [W1204 11:04:59.389812684 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2643104Z 2025-12-04T11:07:13.2643625Z [W1204 11:04:59.390447621 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2643632Z 2025-12-04T11:07:13.2644142Z [W1204 11:04:59.390650788 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2644147Z 2025-12-04T11:07:13.2644655Z [W1204 11:04:59.396619599 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2644670Z 2025-12-04T11:07:13.2645179Z [W1204 11:04:59.397389906 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2645184Z 2025-12-04T11:07:13.2645692Z [W1204 11:04:59.397584268 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2645696Z 2025-12-04T11:07:13.2645805Z FAILED [0.4586s] [100%] 2025-12-04T11:07:13.2645809Z 2025-12-04T11:07:13.2645982Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2646547Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2646668Z Traceback (most recent call last): 2025-12-04T11:07:13.2647183Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2647451Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2647914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2648076Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2648624Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2648826Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2648969Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2648979Z 2025-12-04T11:07:13.2649080Z Expected 1 but got 2. 2025-12-04T11:07:13.2649185Z Absolute difference: 1 2025-12-04T11:07:13.2649299Z Relative difference: 1.0 2025-12-04T11:07:13.2649304Z 2025-12-04T11:07:13.2649513Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2650450Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2650456Z 2025-12-04T11:07:13.2650720Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2650938Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2651066Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2651597Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2651867Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2651970Z graph_break [] 2025-12-04T11:07:13.2652181Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2653408Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2653525Z if out == self.unknown_value: 2025-12-04T11:07:13.2654253Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2654362Z warnings.warn( 2025-12-04T11:07:13.2655082Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2655193Z warnings.warn( 2025-12-04T11:07:13.2655709Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2655827Z Traceback (most recent call last): 2025-12-04T11:07:13.2656349Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2656578Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2657048Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2657212Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2657750Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2657970Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2658134Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2658140Z 2025-12-04T11:07:13.2658340Z Expected 1 but got 2. 2025-12-04T11:07:13.2658464Z Absolute difference: 1 2025-12-04T11:07:13.2658576Z Relative difference: 1.0 2025-12-04T11:07:13.2658581Z 2025-12-04T11:07:13.2658850Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2659774Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2659779Z 2025-12-04T11:07:13.2660047Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2660273Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2660388Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2660932Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2661164Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2661261Z graph_break [] 2025-12-04T11:07:13.2661483Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2662697Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2662821Z if out == self.unknown_value: 2025-12-04T11:07:13.2663551Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2663652Z warnings.warn( 2025-12-04T11:07:13.2664396Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2664528Z warnings.warn( 2025-12-04T11:07:13.2664748Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2664871Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2665100Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2665643Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2665740Z graph_break [] 2025-12-04T11:07:13.2665954Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2666693Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2666794Z warnings.warn( 2025-12-04T11:07:13.2667523Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2667633Z warnings.warn( 2025-12-04T11:07:13.2667775Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2668298Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2668418Z Traceback (most recent call last): 2025-12-04T11:07:13.2668928Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2669166Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2669626Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2669797Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2670396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2670604Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2670748Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2670781Z 2025-12-04T11:07:13.2670883Z Expected 1 but got 2. 2025-12-04T11:07:13.2670985Z Absolute difference: 1 2025-12-04T11:07:13.2671102Z Relative difference: 1.0 2025-12-04T11:07:13.2671107Z 2025-12-04T11:07:13.2671323Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2672256Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2672261Z 2025-12-04T11:07:13.2672525Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2672740Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2672872Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2673403Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2673639Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2673733Z graph_break [] 2025-12-04T11:07:13.2673946Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2675167Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2675280Z if out == self.unknown_value: 2025-12-04T11:07:13.2676018Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2676152Z warnings.warn( 2025-12-04T11:07:13.2676872Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2676982Z warnings.warn( 2025-12-04T11:07:13.2677197Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2677309Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2677548Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2678080Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2678185Z graph_break [] 2025-12-04T11:07:13.2678401Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2679133Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2679244Z warnings.warn( 2025-12-04T11:07:13.2679966Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2680077Z warnings.warn( 2025-12-04T11:07:13.2680292Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2680406Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2680637Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2681168Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2681263Z graph_break [] 2025-12-04T11:07:13.2681482Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2682271Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2682379Z warnings.warn( 2025-12-04T11:07:13.2683101Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2683230Z warnings.warn( 2025-12-04T11:07:13.2684085Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-df18bdebce6af33a.xml - 2025-12-04T11:07:13.2684257Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2685214Z FAILED [0.4586s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2685222Z 2025-12-04T11:07:13.2685330Z Expected 1 but got 2. 2025-12-04T11:07:13.2685438Z Absolute difference: 1 2025-12-04T11:07:13.2685561Z Relative difference: 1.0 2025-12-04T11:07:13.2685566Z 2025-12-04T11:07:13.2685785Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2686706Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2686725Z 2025-12-04T11:07:13.2686993Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2687173Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2687384Z ================== 1 failed, 10 deselected, 2 rerun in 20.37s ================== 2025-12-04T11:07:13.2687487Z Got exit code 1 2025-12-04T11:07:13.2687630Z Retrying single test... 2025-12-04T11:07:13.2688388Z W1204 11:05:10.228000 87913 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2689061Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d98beca05e867fac.xml 2025-12-04T11:07:13.2689241Z ============================= test session starts ============================== 2025-12-04T11:07:13.2689598Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2689709Z cachedir: .pytest_cache 2025-12-04T11:07:13.2690250Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2690377Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2690488Z configfile: pytest.ini 2025-12-04T11:07:13.2691055Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2691282Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2692301Z stepcurrent: skipping 9 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2692422Z Running 1 items in this shard 2025-12-04T11:07:13.2692427Z 2025-12-04T11:07:13.2693737Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 [W1204 11:05:13.963149657 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2693743Z 2025-12-04T11:07:13.2694268Z [W1204 11:05:29.196964521 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2694342Z 2025-12-04T11:07:13.2694900Z [W1204 11:05:29.197227936 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2694920Z 2025-12-04T11:07:13.2695433Z [W1204 11:05:29.204578810 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2695477Z 2025-12-04T11:07:13.2695990Z [W1204 11:05:29.205285292 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2695995Z 2025-12-04T11:07:13.2696521Z [W1204 11:05:29.205486623 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2696525Z 2025-12-04T11:07:13.2697040Z [W1204 11:05:29.212360053 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2697047Z 2025-12-04T11:07:13.2697578Z [W1204 11:05:29.212991722 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2697583Z 2025-12-04T11:07:13.2698095Z [W1204 11:05:29.213183005 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2698101Z 2025-12-04T11:07:13.2698697Z [W1204 11:05:31.216872121 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2698703Z 2025-12-04T11:07:13.2699218Z [W1204 11:05:31.218519916 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2699222Z 2025-12-04T11:07:13.2699746Z [W1204 11:05:31.218725278 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2699751Z 2025-12-04T11:07:13.2700325Z [W1204 11:05:31.222623238 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2700329Z 2025-12-04T11:07:13.2700840Z [W1204 11:05:31.223288462 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2700861Z 2025-12-04T11:07:13.2701373Z [W1204 11:05:31.223486100 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2701378Z 2025-12-04T11:07:13.2701887Z [W1204 11:05:31.229481213 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2701892Z 2025-12-04T11:07:13.2702414Z [W1204 11:05:31.230121010 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2702419Z 2025-12-04T11:07:13.2702933Z [W1204 11:05:31.230322803 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2702943Z 2025-12-04T11:07:13.2703090Z ('RERUN', {'yellow': True}) [19.1318s] [100%] 2025-12-04T11:07:13.2704382Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 [W1204 11:05:31.648400921 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2704391Z 2025-12-04T11:07:13.2704917Z [W1204 11:05:31.649111908 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2704922Z 2025-12-04T11:07:13.2705437Z [W1204 11:05:31.649311435 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2705441Z 2025-12-04T11:07:13.2706049Z [W1204 11:05:31.653239457 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2706056Z 2025-12-04T11:07:13.2706570Z [W1204 11:05:31.654018242 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2706603Z 2025-12-04T11:07:13.2707115Z [W1204 11:05:31.654213340 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2707120Z 2025-12-04T11:07:13.2707641Z [W1204 11:05:31.660208224 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2707646Z 2025-12-04T11:07:13.2708159Z [W1204 11:05:31.660829484 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2708164Z 2025-12-04T11:07:13.2708690Z [W1204 11:05:31.661019687 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2708699Z 2025-12-04T11:07:13.2709210Z [W1204 11:05:31.744931405 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2709217Z 2025-12-04T11:07:13.2709742Z [W1204 11:05:31.745626770 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2709747Z 2025-12-04T11:07:13.2710256Z [W1204 11:05:31.745832653 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2710261Z 2025-12-04T11:07:13.2710785Z [W1204 11:05:31.749702717 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2710790Z 2025-12-04T11:07:13.2711302Z [W1204 11:05:31.750360979 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2711405Z 2025-12-04T11:07:13.2711918Z [W1204 11:05:31.750564805 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2711935Z 2025-12-04T11:07:13.2712452Z [W1204 11:05:31.756575806 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2712457Z 2025-12-04T11:07:13.2712965Z [W1204 11:05:31.757367612 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2712970Z 2025-12-04T11:07:13.2713491Z [W1204 11:05:31.757565227 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2713495Z 2025-12-04T11:07:13.2713625Z ('RERUN', {'yellow': True}) [0.4878s] [100%] 2025-12-04T11:07:13.2714930Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 [W1204 11:05:31.116208808 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2714938Z 2025-12-04T11:07:13.2715453Z [W1204 11:05:31.116921833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2715458Z 2025-12-04T11:07:13.2715981Z [W1204 11:05:31.117125877 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2715986Z 2025-12-04T11:07:13.2716496Z [W1204 11:05:31.121066333 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2716501Z 2025-12-04T11:07:13.2717056Z [W1204 11:05:31.121845373 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2717063Z 2025-12-04T11:07:13.2717605Z [W1204 11:05:31.122043053 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2717611Z 2025-12-04T11:07:13.2718123Z [W1204 11:05:31.128030085 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2718170Z 2025-12-04T11:07:13.2718682Z [W1204 11:05:31.128646000 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2718687Z 2025-12-04T11:07:13.2719196Z [W1204 11:05:31.128836314 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2719200Z 2025-12-04T11:07:13.2719723Z [W1204 11:05:32.212779502 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2719732Z 2025-12-04T11:07:13.2720244Z [W1204 11:05:32.213452845 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2720248Z 2025-12-04T11:07:13.2720771Z [W1204 11:05:32.213654667 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2720778Z 2025-12-04T11:07:13.2721289Z [W1204 11:05:32.217513933 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2721294Z 2025-12-04T11:07:13.2721819Z [W1204 11:05:32.218131063 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2721823Z 2025-12-04T11:07:13.2722334Z [W1204 11:05:32.218327372 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2722371Z 2025-12-04T11:07:13.2722887Z [W1204 11:05:32.224312688 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2722905Z 2025-12-04T11:07:13.2723412Z [W1204 11:05:32.225089753 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2723419Z 2025-12-04T11:07:13.2723930Z [W1204 11:05:32.225286888 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2723934Z 2025-12-04T11:07:13.2724050Z FAILED [0.4655s] [100%] 2025-12-04T11:07:13.2724054Z 2025-12-04T11:07:13.2724195Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2724726Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2724849Z Traceback (most recent call last): 2025-12-04T11:07:13.2725373Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2725618Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2726086Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2726268Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2726809Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2727017Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2727162Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2727168Z 2025-12-04T11:07:13.2727273Z Expected 1 but got 2. 2025-12-04T11:07:13.2727383Z Absolute difference: 1 2025-12-04T11:07:13.2727507Z Relative difference: 1.0 2025-12-04T11:07:13.2727512Z 2025-12-04T11:07:13.2727765Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2728734Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2728767Z 2025-12-04T11:07:13.2729039Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2729262Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2729393Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2729927Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2730172Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2730271Z graph_break [] 2025-12-04T11:07:13.2730493Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2731730Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2731850Z if out == self.unknown_value: 2025-12-04T11:07:13.2732596Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2732697Z warnings.warn( 2025-12-04T11:07:13.2733423Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2733540Z warnings.warn( 2025-12-04T11:07:13.2734057Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2734216Z Traceback (most recent call last): 2025-12-04T11:07:13.2734743Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2734977Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2735454Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2735619Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2736157Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2736375Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2736511Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2736517Z 2025-12-04T11:07:13.2736636Z Expected 1 but got 2. 2025-12-04T11:07:13.2736741Z Absolute difference: 1 2025-12-04T11:07:13.2736852Z Relative difference: 1.0 2025-12-04T11:07:13.2736860Z 2025-12-04T11:07:13.2737093Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2738012Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2738019Z 2025-12-04T11:07:13.2738371Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2738595Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2738713Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2739264Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2739494Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2739595Z graph_break [] 2025-12-04T11:07:13.2739869Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2741117Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2741277Z if out == self.unknown_value: 2025-12-04T11:07:13.2742007Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2742109Z warnings.warn( 2025-12-04T11:07:13.2742852Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2742953Z warnings.warn( 2025-12-04T11:07:13.2743184Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2743307Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2743537Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2744083Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2744183Z graph_break [] 2025-12-04T11:07:13.2744400Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2745141Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2745241Z warnings.warn( 2025-12-04T11:07:13.2745975Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2746073Z warnings.warn( 2025-12-04T11:07:13.2746220Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2746787Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 _ 2025-12-04T11:07:13.2746909Z Traceback (most recent call last): 2025-12-04T11:07:13.2747423Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2747667Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2748129Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2748306Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2748845Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2749048Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2749197Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2749203Z 2025-12-04T11:07:13.2749309Z Expected 1 but got 2. 2025-12-04T11:07:13.2749430Z Absolute difference: 1 2025-12-04T11:07:13.2749541Z Relative difference: 1.0 2025-12-04T11:07:13.2749546Z 2025-12-04T11:07:13.2749764Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2750696Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2750702Z 2025-12-04T11:07:13.2750971Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2751203Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2751321Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2751918Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2752196Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2752297Z graph_break [] 2025-12-04T11:07:13.2752517Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2753783Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2753902Z if out == self.unknown_value: 2025-12-04T11:07:13.2754642Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2754743Z warnings.warn( 2025-12-04T11:07:13.2755471Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2755591Z warnings.warn( 2025-12-04T11:07:13.2755808Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2755940Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2756169Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2756701Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2756813Z graph_break [] 2025-12-04T11:07:13.2757029Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2757755Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2757869Z warnings.warn( 2025-12-04T11:07:13.2758597Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2758743Z warnings.warn( 2025-12-04T11:07:13.2758958Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2759074Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2759316Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2759849Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2)] 2025-12-04T11:07:13.2759947Z graph_break [] 2025-12-04T11:07:13.2760174Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2760903Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2761015Z warnings.warn( 2025-12-04T11:07:13.2761742Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2761838Z warnings.warn( 2025-12-04T11:07:13.2762702Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d98beca05e867fac.xml - 2025-12-04T11:07:13.2762873Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2763831Z FAILED [0.4655s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2763837Z 2025-12-04T11:07:13.2763945Z Expected 1 but got 2. 2025-12-04T11:07:13.2764054Z Absolute difference: 1 2025-12-04T11:07:13.2764176Z Relative difference: 1.0 2025-12-04T11:07:13.2764183Z 2025-12-04T11:07:13.2764441Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2765395Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2765442Z 2025-12-04T11:07:13.2765711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2765892Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2766102Z ================== 1 failed, 10 deselected, 2 rerun in 20.12s ================== 2025-12-04T11:07:13.2766204Z Got exit code 1 2025-12-04T11:07:13.2767049Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16 2025-12-04T11:07:13.2767465Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.2767916Z W1204 11:05:43.106000 88094 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2768592Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-3111b2456c98ee88.xml 2025-12-04T11:07:13.2768761Z ============================= test session starts ============================== 2025-12-04T11:07:13.2769132Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2769244Z cachedir: .pytest_cache 2025-12-04T11:07:13.2769769Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2769907Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2770019Z configfile: pytest.ini 2025-12-04T11:07:13.2770566Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2770835Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2770977Z stepcurrent: skipping 10 already run items. 2025-12-04T11:07:13.2771106Z Running 1 items in this shard 2025-12-04T11:07:13.2771111Z 2025-12-04T11:07:13.2771979Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [4.1213s] [100%] 2025-12-04T11:07:13.2772845Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.4376s] [100%] 2025-12-04T11:07:13.2773637Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 FAILED [0.4335s] [100%] 2025-12-04T11:07:13.2773645Z 2025-12-04T11:07:13.2773789Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2774305Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2774433Z Traceback (most recent call last): 2025-12-04T11:07:13.2774950Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2775190Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2775650Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2775829Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2776363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2776599Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2776777Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2776783Z 2025-12-04T11:07:13.2776886Z Expected 1 but got 2. 2025-12-04T11:07:13.2776989Z Absolute difference: 1 2025-12-04T11:07:13.2777142Z Relative difference: 1.0 2025-12-04T11:07:13.2777147Z 2025-12-04T11:07:13.2777361Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2778356Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2778363Z 2025-12-04T11:07:13.2778629Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2778847Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2779025Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2779970Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2780210Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2780312Z graph_break [] 2025-12-04T11:07:13.2780525Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2781279Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2781383Z warnings.warn( 2025-12-04T11:07:13.2782116Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2782217Z warnings.warn( 2025-12-04T11:07:13.2782726Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2782904Z Traceback (most recent call last): 2025-12-04T11:07:13.2783413Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2783645Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2784118Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2784283Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2784835Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2785042Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2785174Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2785179Z 2025-12-04T11:07:13.2785294Z Expected 1 but got 2. 2025-12-04T11:07:13.2785404Z Absolute difference: 1 2025-12-04T11:07:13.2785522Z Relative difference: 1.0 2025-12-04T11:07:13.2785529Z 2025-12-04T11:07:13.2785744Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2786653Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2786660Z 2025-12-04T11:07:13.2786937Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2787154Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2787277Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2788503Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2788811Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2788959Z graph_break [] 2025-12-04T11:07:13.2789175Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2789910Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2790061Z warnings.warn( 2025-12-04T11:07:13.2790785Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2790894Z warnings.warn( 2025-12-04T11:07:13.2791105Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2791220Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2791455Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2792357Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2792468Z graph_break [] 2025-12-04T11:07:13.2792683Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2793415Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2793530Z warnings.warn( 2025-12-04T11:07:13.2794255Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2794355Z warnings.warn( 2025-12-04T11:07:13.2794511Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2795084Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2795220Z Traceback (most recent call last): 2025-12-04T11:07:13.2795730Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2795961Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2796432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2796597Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2797150Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2797355Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2797487Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2797495Z 2025-12-04T11:07:13.2797617Z Expected 1 but got 2. 2025-12-04T11:07:13.2797726Z Absolute difference: 1 2025-12-04T11:07:13.2797838Z Relative difference: 1.0 2025-12-04T11:07:13.2797843Z 2025-12-04T11:07:13.2798074Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2798985Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2798990Z 2025-12-04T11:07:13.2799269Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2799485Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2799598Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2800534Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2800796Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2800907Z graph_break [] 2025-12-04T11:07:13.2801124Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2801893Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2802006Z warnings.warn( 2025-12-04T11:07:13.2802734Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2802842Z warnings.warn( 2025-12-04T11:07:13.2803057Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2803171Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2803407Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2804307Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2804405Z graph_break [] 2025-12-04T11:07:13.2804631Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2805354Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2805467Z warnings.warn( 2025-12-04T11:07:13.2806194Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2806295Z warnings.warn( 2025-12-04T11:07:13.2806522Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2806668Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2806899Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2807799Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2807900Z graph_break [] 2025-12-04T11:07:13.2808121Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2808852Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2808951Z warnings.warn( 2025-12-04T11:07:13.2809678Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2809781Z warnings.warn( 2025-12-04T11:07:13.2810638Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-3111b2456c98ee88.xml - 2025-12-04T11:07:13.2810810Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2811753Z FAILED [0.4335s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2811769Z 2025-12-04T11:07:13.2811872Z Expected 1 but got 2. 2025-12-04T11:07:13.2811979Z Absolute difference: 1 2025-12-04T11:07:13.2812095Z Relative difference: 1.0 2025-12-04T11:07:13.2812100Z 2025-12-04T11:07:13.2812319Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2813284Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2813292Z 2025-12-04T11:07:13.2813568Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2813775Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2813984Z ================== 1 failed, 10 deselected, 2 rerun in 5.02s =================== 2025-12-04T11:07:13.2814081Z Got exit code 1 2025-12-04T11:07:13.2814185Z Retrying single test... 2025-12-04T11:07:13.2814643Z W1204 11:06:02.663000 88290 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2815311Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d129ff52aa3d5acd.xml 2025-12-04T11:07:13.2815475Z ============================= test session starts ============================== 2025-12-04T11:07:13.2815847Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2815957Z cachedir: .pytest_cache 2025-12-04T11:07:13.2816494Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2816623Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2816731Z configfile: pytest.ini 2025-12-04T11:07:13.2817290Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2817508Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2818600Z stepcurrent: skipping 10 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2818714Z Running 1 items in this shard 2025-12-04T11:07:13.2818775Z 2025-12-04T11:07:13.2820068Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:06:08.505260674 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2820089Z 2025-12-04T11:07:13.2820610Z [W1204 11:06:24.395165818 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2820616Z 2025-12-04T11:07:13.2821129Z [W1204 11:06:24.395434076 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2821134Z 2025-12-04T11:07:13.2821656Z [W1204 11:06:24.402742751 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2821662Z 2025-12-04T11:07:13.2822180Z [W1204 11:06:24.403443488 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2822186Z 2025-12-04T11:07:13.2822705Z [W1204 11:06:24.403636400 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2822712Z 2025-12-04T11:07:13.2823224Z [W1204 11:06:24.410583801 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2823229Z 2025-12-04T11:07:13.2823749Z [W1204 11:06:24.411224264 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2823754Z 2025-12-04T11:07:13.2824262Z [W1204 11:06:24.411410284 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2824267Z 2025-12-04T11:07:13.2824821Z [W1204 11:06:24.543906377 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2824856Z 2025-12-04T11:07:13.2825367Z [W1204 11:06:24.545547507 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2825401Z 2025-12-04T11:07:13.2825912Z [W1204 11:06:24.545754983 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2825917Z 2025-12-04T11:07:13.2826440Z [W1204 11:06:24.549650699 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2826444Z 2025-12-04T11:07:13.2826958Z [W1204 11:06:24.550312577 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2826962Z 2025-12-04T11:07:13.2827493Z [W1204 11:06:24.550523515 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2827500Z 2025-12-04T11:07:13.2828010Z [W1204 11:06:24.556530595 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2828017Z 2025-12-04T11:07:13.2828539Z [W1204 11:06:24.557155132 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2828544Z 2025-12-04T11:07:13.2829055Z [W1204 11:06:24.557349903 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2829060Z 2025-12-04T11:07:13.2829203Z ('RERUN', {'yellow': True}) [20.0280s] [100%] 2025-12-04T11:07:13.2830489Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:06:24.937483327 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2830532Z 2025-12-04T11:07:13.2831046Z [W1204 11:06:24.938197539 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2831064Z 2025-12-04T11:07:13.2831577Z [W1204 11:06:24.938394730 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2831582Z 2025-12-04T11:07:13.2832092Z [W1204 11:06:24.942347882 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2832096Z 2025-12-04T11:07:13.2832615Z [W1204 11:06:24.942955971 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2832619Z 2025-12-04T11:07:13.2833136Z [W1204 11:06:24.943158598 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2833143Z 2025-12-04T11:07:13.2833671Z [W1204 11:06:24.949154214 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2833676Z 2025-12-04T11:07:13.2834191Z [W1204 11:06:24.949756994 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2834196Z 2025-12-04T11:07:13.2834717Z [W1204 11:06:24.949945479 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2834722Z 2025-12-04T11:07:13.2835232Z [W1204 11:06:24.034693115 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2835236Z 2025-12-04T11:07:13.2835791Z [W1204 11:06:24.035376183 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2835799Z 2025-12-04T11:07:13.2836337Z [W1204 11:06:24.035576724 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2836342Z 2025-12-04T11:07:13.2836851Z [W1204 11:06:24.039414973 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2836907Z 2025-12-04T11:07:13.2837417Z [W1204 11:06:24.040060152 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2837422Z 2025-12-04T11:07:13.2837931Z [W1204 11:06:24.040263513 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2837936Z 2025-12-04T11:07:13.2838460Z [W1204 11:06:24.046210531 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2838469Z 2025-12-04T11:07:13.2838983Z [W1204 11:06:24.046978015 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2838988Z 2025-12-04T11:07:13.2839510Z [W1204 11:06:24.047189528 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2839517Z 2025-12-04T11:07:13.2839646Z ('RERUN', {'yellow': True}) [0.4505s] [100%] 2025-12-04T11:07:13.2840938Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:06:25.363791249 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2840944Z 2025-12-04T11:07:13.2841459Z [W1204 11:06:25.364492116 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2841499Z 2025-12-04T11:07:13.2842022Z [W1204 11:06:25.364689719 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2842027Z 2025-12-04T11:07:13.2842536Z [W1204 11:06:25.368628953 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2842544Z 2025-12-04T11:07:13.2843054Z [W1204 11:06:25.369233329 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2843059Z 2025-12-04T11:07:13.2843583Z [W1204 11:06:25.369423473 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2843588Z 2025-12-04T11:07:13.2844098Z [W1204 11:06:25.375521765 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2844105Z 2025-12-04T11:07:13.2844628Z [W1204 11:06:25.376132434 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2844633Z 2025-12-04T11:07:13.2845143Z [W1204 11:06:25.376319754 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2845150Z 2025-12-04T11:07:13.2845668Z [W1204 11:06:25.461587034 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2845672Z 2025-12-04T11:07:13.2846184Z [W1204 11:06:25.462264497 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2846189Z 2025-12-04T11:07:13.2846707Z [W1204 11:06:25.462465931 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2846712Z 2025-12-04T11:07:13.2847277Z [W1204 11:06:25.466324083 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2847282Z 2025-12-04T11:07:13.2847789Z [W1204 11:06:25.466930833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2847838Z 2025-12-04T11:07:13.2848345Z [W1204 11:06:25.467141796 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2848350Z 2025-12-04T11:07:13.2848856Z [W1204 11:06:25.473134240 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2848860Z 2025-12-04T11:07:13.2849381Z [W1204 11:06:25.473907179 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2849385Z 2025-12-04T11:07:13.2849898Z [W1204 11:06:25.474101501 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2849904Z 2025-12-04T11:07:13.2850020Z FAILED [0.4251s] [100%] 2025-12-04T11:07:13.2850025Z 2025-12-04T11:07:13.2850164Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2850675Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2850810Z Traceback (most recent call last): 2025-12-04T11:07:13.2851326Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2851573Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2852038Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2852202Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2852792Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2852997Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2853142Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2853149Z 2025-12-04T11:07:13.2853253Z Expected 1 but got 2. 2025-12-04T11:07:13.2853358Z Absolute difference: 1 2025-12-04T11:07:13.2853476Z Relative difference: 1.0 2025-12-04T11:07:13.2853481Z 2025-12-04T11:07:13.2853694Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2854614Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2854631Z 2025-12-04T11:07:13.2854895Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2855118Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2855250Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2856146Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2856373Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2856483Z graph_break [] 2025-12-04T11:07:13.2856698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2857922Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2858037Z if out == self.unknown_value: 2025-12-04T11:07:13.2858903Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2859020Z warnings.warn( 2025-12-04T11:07:13.2859751Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2859894Z warnings.warn( 2025-12-04T11:07:13.2860401Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2860523Z Traceback (most recent call last): 2025-12-04T11:07:13.2861047Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2861278Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2861748Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2861914Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2862453Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2862670Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2862801Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2862806Z 2025-12-04T11:07:13.2862911Z Expected 1 but got 2. 2025-12-04T11:07:13.2863023Z Absolute difference: 1 2025-12-04T11:07:13.2863132Z Relative difference: 1.0 2025-12-04T11:07:13.2863137Z 2025-12-04T11:07:13.2863360Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2864276Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2864312Z 2025-12-04T11:07:13.2864586Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2864815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2864929Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2865837Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2866066Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2866165Z graph_break [] 2025-12-04T11:07:13.2866390Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2867612Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2867738Z if out == self.unknown_value: 2025-12-04T11:07:13.2868474Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2868578Z warnings.warn( 2025-12-04T11:07:13.2869314Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2869415Z warnings.warn( 2025-12-04T11:07:13.2869631Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2869758Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2869987Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2870923Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2871056Z graph_break [] 2025-12-04T11:07:13.2871272Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2872012Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2872145Z warnings.warn( 2025-12-04T11:07:13.2872883Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2872983Z warnings.warn( 2025-12-04T11:07:13.2873130Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2873645Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2873772Z Traceback (most recent call last): 2025-12-04T11:07:13.2874282Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2874522Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2874980Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2875154Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2875691Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2875894Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2876038Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2876044Z 2025-12-04T11:07:13.2876148Z Expected 1 but got 2. 2025-12-04T11:07:13.2876264Z Absolute difference: 1 2025-12-04T11:07:13.2876373Z Relative difference: 1.0 2025-12-04T11:07:13.2876412Z 2025-12-04T11:07:13.2876631Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2877550Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2877557Z 2025-12-04T11:07:13.2877820Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2878049Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2878162Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2879056Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2879294Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2879396Z graph_break [] 2025-12-04T11:07:13.2879618Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2880839Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2880955Z if out == self.unknown_value: 2025-12-04T11:07:13.2881698Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2881794Z warnings.warn( 2025-12-04T11:07:13.2882516Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2882630Z warnings.warn( 2025-12-04T11:07:13.2882878Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2883051Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2883278Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2884172Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2884312Z graph_break [] 2025-12-04T11:07:13.2884525Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2885253Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2885365Z warnings.warn( 2025-12-04T11:07:13.2886087Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2886202Z warnings.warn( 2025-12-04T11:07:13.2886417Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2886530Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2886768Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2887667Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2887990Z graph_break [] 2025-12-04T11:07:13.2888284Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2889010Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2889121Z warnings.warn( 2025-12-04T11:07:13.2889922Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2890021Z warnings.warn( 2025-12-04T11:07:13.2890878Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d129ff52aa3d5acd.xml - 2025-12-04T11:07:13.2891053Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2892013Z FAILED [0.4251s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2892020Z 2025-12-04T11:07:13.2892127Z Expected 1 but got 2. 2025-12-04T11:07:13.2892233Z Absolute difference: 1 2025-12-04T11:07:13.2892356Z Relative difference: 1.0 2025-12-04T11:07:13.2892363Z 2025-12-04T11:07:13.2892585Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2893504Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2893512Z 2025-12-04T11:07:13.2893781Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2893963Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2894171Z ================== 1 failed, 10 deselected, 2 rerun in 20.94s ================== 2025-12-04T11:07:13.2894270Z Got exit code 1 2025-12-04T11:07:13.2894382Z Retrying single test... 2025-12-04T11:07:13.2894831Z W1204 11:06:36.358000 88491 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2895536Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-89ebab0d3f4369e8.xml 2025-12-04T11:07:13.2895755Z ============================= test session starts ============================== 2025-12-04T11:07:13.2896112Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2896353Z cachedir: .pytest_cache 2025-12-04T11:07:13.2896876Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2897001Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2897122Z configfile: pytest.ini 2025-12-04T11:07:13.2897665Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2897884Z collecting ... collected 58 items / 10 deselected / 48 selected 2025-12-04T11:07:13.2898957Z stepcurrent: skipping 10 already run items. Running only test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2899075Z Running 1 items in this shard 2025-12-04T11:07:13.2899080Z 2025-12-04T11:07:13.2900374Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:06:42.210617833 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2900382Z 2025-12-04T11:07:13.2900911Z [W1204 11:06:57.087935248 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2900917Z 2025-12-04T11:07:13.2901444Z [W1204 11:06:57.088191514 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2901449Z 2025-12-04T11:07:13.2902006Z [W1204 11:06:57.095524221 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2902011Z 2025-12-04T11:07:13.2902535Z [W1204 11:06:57.096209451 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2902542Z 2025-12-04T11:07:13.2903053Z [W1204 11:06:57.096399115 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2903058Z 2025-12-04T11:07:13.2903568Z [W1204 11:06:57.103373881 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2903586Z 2025-12-04T11:07:13.2904095Z [W1204 11:06:57.103994654 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2904100Z 2025-12-04T11:07:13.2904612Z [W1204 11:06:57.104181266 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2904622Z 2025-12-04T11:07:13.2905144Z [W1204 11:06:58.236597198 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2905152Z 2025-12-04T11:07:13.2905665Z [W1204 11:06:58.238177024 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2905670Z 2025-12-04T11:07:13.2906190Z [W1204 11:06:58.238383695 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2906194Z 2025-12-04T11:07:13.2906705Z [W1204 11:06:58.242308651 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2906710Z 2025-12-04T11:07:13.2907263Z [W1204 11:06:58.242943885 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2907270Z 2025-12-04T11:07:13.2907810Z [W1204 11:06:58.243149910 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2907843Z 2025-12-04T11:07:13.2908367Z [W1204 11:06:58.249160154 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2908372Z 2025-12-04T11:07:13.2908885Z [W1204 11:06:58.249794064 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2908890Z 2025-12-04T11:07:13.2909404Z [W1204 11:06:58.249985718 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2909408Z 2025-12-04T11:07:13.2909560Z ('RERUN', {'yellow': True}) [20.0065s] [100%] 2025-12-04T11:07:13.2910845Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:06:58.630585535 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2910855Z 2025-12-04T11:07:13.2911382Z [W1204 11:06:58.631323231 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2911388Z 2025-12-04T11:07:13.2911899Z [W1204 11:06:58.631519284 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2911904Z 2025-12-04T11:07:13.2912428Z [W1204 11:06:58.635437898 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2912433Z 2025-12-04T11:07:13.2912945Z [W1204 11:06:58.636048986 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2912986Z 2025-12-04T11:07:13.2913513Z [W1204 11:06:58.636238697 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2913517Z 2025-12-04T11:07:13.2914032Z [W1204 11:06:58.642306106 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2914037Z 2025-12-04T11:07:13.2914549Z [W1204 11:06:58.642916536 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2914568Z 2025-12-04T11:07:13.2915087Z [W1204 11:06:58.643119421 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2915091Z 2025-12-04T11:07:13.2915606Z [W1204 11:06:58.728536115 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2915612Z 2025-12-04T11:07:13.2916140Z [W1204 11:06:58.729215151 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2916145Z 2025-12-04T11:07:13.2916656Z [W1204 11:06:58.729416640 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2916663Z 2025-12-04T11:07:13.2917186Z [W1204 11:06:58.733329737 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2917190Z 2025-12-04T11:07:13.2917700Z [W1204 11:06:58.733951801 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2917705Z 2025-12-04T11:07:13.2918225Z [W1204 11:06:58.734147142 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2918265Z 2025-12-04T11:07:13.2918807Z [W1204 11:06:58.740157539 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2918811Z 2025-12-04T11:07:13.2919330Z [W1204 11:06:58.740935904 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2919860Z 2025-12-04T11:07:13.2920374Z [W1204 11:06:58.741130122 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2920379Z 2025-12-04T11:07:13.2920510Z ('RERUN', {'yellow': True}) [0.4521s] [100%] 2025-12-04T11:07:13.2921801Z inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 [W1204 11:06:58.056846950 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2921812Z 2025-12-04T11:07:13.2922327Z [W1204 11:06:58.057537136 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2922332Z 2025-12-04T11:07:13.2922864Z [W1204 11:06:58.057735335 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2922871Z 2025-12-04T11:07:13.2923380Z [W1204 11:06:58.061672414 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2923385Z 2025-12-04T11:07:13.2923909Z [W1204 11:06:58.062288557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2923914Z 2025-12-04T11:07:13.2924423Z [W1204 11:06:58.062477614 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2924460Z 2025-12-04T11:07:13.2924988Z [W1204 11:06:58.068510791 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2924992Z 2025-12-04T11:07:13.2925501Z [W1204 11:06:58.069118452 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2925508Z 2025-12-04T11:07:13.2926017Z [W1204 11:06:58.069305199 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2926034Z 2025-12-04T11:07:13.2926544Z [W1204 11:06:59.154463563 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2926549Z 2025-12-04T11:07:13.2927058Z [W1204 11:06:59.155139518 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2927063Z 2025-12-04T11:07:13.2927587Z [W1204 11:06:59.155337630 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2927592Z 2025-12-04T11:07:13.2928103Z [W1204 11:06:59.159216126 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2928110Z 2025-12-04T11:07:13.2928630Z [W1204 11:06:59.159828116 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2928635Z 2025-12-04T11:07:13.2929142Z [W1204 11:06:59.160046548 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2929147Z 2025-12-04T11:07:13.2929666Z [W1204 11:06:59.165989129 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2929671Z 2025-12-04T11:07:13.2930253Z [W1204 11:06:59.166761683 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2930260Z 2025-12-04T11:07:13.2930788Z [W1204 11:06:59.166953922 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:07:13.2930823Z 2025-12-04T11:07:13.2930927Z FAILED [0.4237s] [100%] 2025-12-04T11:07:13.2930932Z 2025-12-04T11:07:13.2931074Z ==================================== RERUNS ==================================== 2025-12-04T11:07:13.2931600Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2931725Z Traceback (most recent call last): 2025-12-04T11:07:13.2932252Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2932487Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2932960Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2933136Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2933678Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2933888Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2934032Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2934037Z 2025-12-04T11:07:13.2934146Z Expected 1 but got 2. 2025-12-04T11:07:13.2934264Z Absolute difference: 1 2025-12-04T11:07:13.2934374Z Relative difference: 1.0 2025-12-04T11:07:13.2934379Z 2025-12-04T11:07:13.2934594Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2935524Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2935560Z 2025-12-04T11:07:13.2935834Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2936068Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2936185Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2937089Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2937334Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2937435Z graph_break [] 2025-12-04T11:07:13.2937664Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2938972Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2939092Z if out == self.unknown_value: 2025-12-04T11:07:13.2939837Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2939942Z warnings.warn( 2025-12-04T11:07:13.2940680Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2940780Z warnings.warn( 2025-12-04T11:07:13.2941294Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2941431Z Traceback (most recent call last): 2025-12-04T11:07:13.2941943Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2942241Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2942754Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2942922Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2943506Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2943714Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2943848Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2943854Z 2025-12-04T11:07:13.2943976Z Expected 1 but got 2. 2025-12-04T11:07:13.2944084Z Absolute difference: 1 2025-12-04T11:07:13.2944195Z Relative difference: 1.0 2025-12-04T11:07:13.2944201Z 2025-12-04T11:07:13.2944432Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2945348Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2945356Z 2025-12-04T11:07:13.2945637Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2945861Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2945976Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2946886Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2947116Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2947226Z graph_break [] 2025-12-04T11:07:13.2947445Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2948694Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2948823Z if out == self.unknown_value: 2025-12-04T11:07:13.2949559Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2949677Z warnings.warn( 2025-12-04T11:07:13.2950402Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2950500Z warnings.warn( 2025-12-04T11:07:13.2950729Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2950844Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2951075Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2951992Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2952093Z graph_break [] 2025-12-04T11:07:13.2952320Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2953049Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2953150Z warnings.warn( 2025-12-04T11:07:13.2953885Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2953987Z warnings.warn( 2025-12-04T11:07:13.2954174Z =================================== FAILURES =================================== 2025-12-04T11:07:13.2954718Z _ TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 _ 2025-12-04T11:07:13.2954843Z Traceback (most recent call last): 2025-12-04T11:07:13.2955369Z File "/var/lib/jenkins/workspace/test/inductor/test_cuda_select_algorithm.py", line 130, in test_int8_woq_mm_cuda 2025-12-04T11:07:13.2955633Z self.assertEqual(counters["inductor"]["woq_matcher_count"], 1) 2025-12-04T11:07:13.2956108Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T11:07:13.2956272Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T11:07:13.2956813Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T11:07:13.2957031Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T11:07:13.2957163Z AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2957172Z 2025-12-04T11:07:13.2957274Z Expected 1 but got 2. 2025-12-04T11:07:13.2957395Z Absolute difference: 1 2025-12-04T11:07:13.2957503Z Relative difference: 1.0 2025-12-04T11:07:13.2957508Z 2025-12-04T11:07:13.2957737Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2958655Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2958661Z 2025-12-04T11:07:13.2958929Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2959160Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2959276Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2960186Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2960449Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2960550Z graph_break [] 2025-12-04T11:07:13.2960776Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2962003Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/constant_folding.py:256: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T11:07:13.2962131Z if out == self.unknown_value: 2025-12-04T11:07:13.2962862Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2962965Z warnings.warn( 2025-12-04T11:07:13.2963708Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2963813Z warnings.warn( 2025-12-04T11:07:13.2964028Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2964155Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2964386Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2965293Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2965390Z graph_break [] 2025-12-04T11:07:13.2965605Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2966350Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2966486Z warnings.warn( 2025-12-04T11:07:13.2967253Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2967354Z warnings.warn( 2025-12-04T11:07:13.2967601Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:07:13.2967726Z stats [('calls_captured', 6)] 2025-12-04T11:07:13.2967951Z aot_autograd [('total', 2), ('autograd_cache_bypass', 2), ('not_ok', 2)] 2025-12-04T11:07:13.2968850Z inductor [('pattern_matcher_nodes', 16), ('woq_matcher_nodes', 12), ('pattern_matcher_count', 6), ('benchmarking.InductorBenchmarker.benchmark_gpu', 2), ('fxgraph_cache_miss', 2), ('woq_matcher_count', 2), ('pad_mm_bench', 1)] 2025-12-04T11:07:13.2968962Z graph_break [] 2025-12-04T11:07:13.2969180Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:07:13.2969931Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2970036Z warnings.warn( 2025-12-04T11:07:13.2970761Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T11:07:13.2970876Z warnings.warn( 2025-12-04T11:07:13.2971729Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-89ebab0d3f4369e8.xml - 2025-12-04T11:07:13.2971913Z =========================== short test summary info ============================ 2025-12-04T11:07:13.2972854Z FAILED [0.4237s] inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 - AssertionError: Scalars are not equal! 2025-12-04T11:07:13.2972907Z 2025-12-04T11:07:13.2973019Z Expected 1 but got 2. 2025-12-04T11:07:13.2973140Z Absolute difference: 1 2025-12-04T11:07:13.2973252Z Relative difference: 1.0 2025-12-04T11:07:13.2973257Z 2025-12-04T11:07:13.2973487Z To execute this test, run the following from the base repo dir: 2025-12-04T11:07:13.2974397Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cuda_select_algorithm.py TestSelectAlgorithmCudaCUDA.test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2974402Z 2025-12-04T11:07:13.2974671Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:07:13.2974862Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:07:13.2975063Z ================== 1 failed, 10 deselected, 2 rerun in 20.91s ================== 2025-12-04T11:07:13.2975177Z Got exit code 1 2025-12-04T11:07:13.2976007Z FAILED CONSISTENTLY: test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16 2025-12-04T11:07:13.2976428Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:07:13.2976889Z W1204 11:07:10.044000 88692 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:07:13.2986069Z Test results will be stored in test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-0dc61241fef4b8bc.xml 2025-12-04T11:07:13.2986322Z ============================= test session starts ============================== 2025-12-04T11:07:13.2986706Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:07:13.2986819Z cachedir: .pytest_cache 2025-12-04T11:07:13.2987352Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:07:13.2987588Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:07:13.2987996Z configfile: pytest.ini 2025-12-04T11:07:13.2988622Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:07:13.2988913Z collecting ... collected 58 items / 11 deselected / 47 selected 2025-12-04T11:07:13.2989058Z stepcurrent: skipping 11 already run items. 2025-12-04T11:07:13.2989188Z Running 0 items in this shard 2025-12-04T11:07:13.2989195Z 2025-12-04T11:07:13.2990058Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-0dc61241fef4b8bc.xml - 2025-12-04T11:07:13.2990230Z ============================ 11 deselected in 0.02s ============================ 2025-12-04T11:07:13.2999028Z The following tests failed consistently: ['test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_1_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_concat_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_64_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_128_out_features_65_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_64_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_1_in_features_144_out_features_65_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_1024_out_features_64_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_1024_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_17_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_1_mid_dim_1_in_features_1024_out_features_65_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_1_in_features_128_out_features_1024_cuda_bfloat16', 'test/inductor/test_cuda_select_algorithm.py::TestSelectAlgorithmCudaCUDA::test_int8_woq_mm_cuda_batch_size_32_mid_dim_8_in_features_144_out_features_65_cuda_bfloat16'] 2025-12-04T11:07:13.2999105Z 2025-12-04T11:07:13.2999756Z FINISHED PRINTING LOG FILE of inductor/test_cuda_select_algorithm 1/5 (test/test-reports/inductor.test_cuda_select_algorithm_1.5_fb546dd2d7cc6229_.log) 2025-12-04T11:07:13.2999762Z 2025-12-04T11:07:13.3000182Z Finished inductor/test_cuda_select_algorithm 1/5 ... [2025-12-04 11:07:12.964145][6837.085357978], took 15.91min 2025-12-04T11:07:13.3001099Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b8b19f7b3ae0b98c.xml 2025-12-04T11:07:13.3002052Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-cdcb3ea635f515ef.xml 2025-12-04T11:07:13.3002951Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-2c53b8c4a22bc4ac.xml 2025-12-04T11:07:13.3003848Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c3b4fd201ecf8b1e.xml 2025-12-04T11:07:13.3004821Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7b498694d0586007.xml 2025-12-04T11:07:13.3005718Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-f04dfce55afddfa2.xml 2025-12-04T11:07:13.3006653Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a20deb48b5165834.xml 2025-12-04T11:07:13.3007542Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-688c2e6fbd08d334.xml 2025-12-04T11:07:13.3094818Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a49a4c623eea732f.xml 2025-12-04T11:07:13.3534312Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-9d29992f2be81c73.xml 2025-12-04T11:07:13.3882768Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a570f380a97ffdf.xml 2025-12-04T11:07:13.4256066Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b091dc2605282b32.xml 2025-12-04T11:07:13.4585983Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a84aa230dcc2c72.xml 2025-12-04T11:07:13.4933866Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-622e1501e18bdb77.xml 2025-12-04T11:07:13.5257745Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-af0abf9f8a1870a5.xml 2025-12-04T11:07:13.5582053Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a29cc44636754e7c.xml 2025-12-04T11:07:13.6038416Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ad8948ec33d2b35b.xml 2025-12-04T11:07:13.6343225Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-33a982dae5077c46.xml 2025-12-04T11:07:13.6639426Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bd8b5496f00b6cd3.xml 2025-12-04T11:07:13.6931920Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ef5bae0bde57e96b.xml 2025-12-04T11:07:13.7282643Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c80e75c43f69568f.xml 2025-12-04T11:07:13.7549435Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-528c2efc29bf6df4.xml 2025-12-04T11:07:13.7840617Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ed81db7834398b8b.xml 2025-12-04T11:07:13.8152450Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-261b1bf878c6e8ce.xml 2025-12-04T11:07:13.8457268Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-753aefcc31d0e7df.xml 2025-12-04T11:07:13.8771661Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-adc6d5470587661c.xml 2025-12-04T11:07:13.9140222Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bbb1474dafa92442.xml 2025-12-04T11:07:13.9466654Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-e5cba0d160bf2991.xml 2025-12-04T11:07:13.9818844Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-df18bdebce6af33a.xml 2025-12-04T11:07:14.0140829Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d98beca05e867fac.xml 2025-12-04T11:07:14.0473425Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-3111b2456c98ee88.xml 2025-12-04T11:07:14.0815996Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d129ff52aa3d5acd.xml 2025-12-04T11:07:14.1141728Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-89ebab0d3f4369e8.xml 2025-12-04T11:07:14.1463166Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-0dc61241fef4b8bc.xml 2025-12-04T11:07:14.5380695Z Uploading logs for 57119749259 to S3 2025-12-04T11:07:14.6099617Z Uploading artifacts took 0.38 seconds 2025-12-04T11:07:14.6100077Z inductor/test_cuda_select_algorithm 1/5 failed! 2025-12-04T11:07:14.6105161Z Running inductor/test_compile_subprocess 1/3 ... [2025-12-04 11:07:14.610331][6838.731550285] 2025-12-04T11:07:14.6105852Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T11:07:14.6110303Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_compile_subprocess.py', '--shard-id=1', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:07:14.610772] 2025-12-04T11:14:00.0448144Z 2025-12-04T11:14:00.0449601Z PRINTING LOG FILE of inductor/test_compile_subprocess 1/3 (test/test-reports/inductor.test_compile_subprocess_1.3_2d3d7d60d67c3c74_.log) 2025-12-04T11:14:00.0451994Z W1204 11:07:23.471000 88809 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.0454141Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-5bc7c9553ad37db8.xml 2025-12-04T11:14:00.0455659Z ============================= test session starts ============================== 2025-12-04T11:14:00.0456849Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:14:00.0457837Z cachedir: .pytest_cache 2025-12-04T11:14:00.0459107Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:14:00.0460422Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:14:00.0461386Z configfile: pytest.ini 2025-12-04T11:14:00.0462715Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:14:00.0463996Z collecting ... collected 879 items 2025-12-04T11:14:00.0464566Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T11:14:00.0589524Z Running 279 items in this shard: test/inductor/test_compile_subprocess.py::TestSubprocess::test_async, test/inductor/test_compile_subprocess.py::TestSubprocess::test_progressive, test/inductor/test_compile_subprocess.py::GPUTests::test__dyn_quant_matmul_4bit_fp32_input_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool1d_argmax_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool_with_output_size_0_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_adaptive_max_pool2d1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_adaptive_max_pool2d2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex6_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex_strided_fallback_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_const_float_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_addmm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_angle_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_cache_hit_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_arange4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_argmax_argmin2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_assert_alignment_op_name_fail_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_assert_alignment_op_name_pass_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_assert_size_stride_op_name_fail_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_async, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d7_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d_backward3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool3d_backward4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bmm2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_add_autotune_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int16_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int32_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_nd_tiling_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_buffer_copied_in_graph_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_int_ndigits_pos_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_empty_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_extern_kernel_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_inplace_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_unbacked_2d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_unbacked_empty_1d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_check_stack_no_cycles_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_clamp_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_clamp_type_promotion_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_clamp_type_promotion_non_tensor_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_complex_fallback_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_complex_memory_overlap_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_config_option_dont_assume_alignment_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_config_option_dont_assume_alignment_cudagraphs_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_config_option_dont_assume_alignment_recompiles_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_consecutive_split_cumsum_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_3d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_conv3d_channels_last_use_block_ptr_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_conv_backward_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_conv_bn_fuse_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_conv_shape_check_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_convolution2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_convolution3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_convolution5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cos_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_cpu_tensor_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cudnn_rnn_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cumsum_zero_dim_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_op_1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_op_3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_op_default_layout_constraint_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_scan_op_compiled_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_deterministic_codegen_with_suffix_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_device_assert_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_diagonal_copy_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dist_bf16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dist_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div_presicion_accuracy_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div_prim_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div_softmax_symfloat_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float16_float32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float64_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_float32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_embedding_bag_byte_unpack_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_embedding_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_empty1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_empty2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_empty_strided_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_emulate_precision_triton_fp_fusion_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fallback_mutable_op_list_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fft_real_input_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_float16_to_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_float32_to_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fmin_fmax_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fractional_max_pool2d1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fractional_max_pool2d4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_full_like_transposed_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fuse_tiled_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_gelu_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_generated_code_has_size_stride_assert_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_getitem_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_gpu_scalar_with_cpu_tensor_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_gpu_scalar_with_gpu_tensor_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_arange1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_arange2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_argmax_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_constant_tensor1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_unbacked_symint_as_output_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_grid_sampler_expand_preserves_view_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_hardsigmoid_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_hardswish_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_hardtanh_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_horizonal_fusion1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_horizonal_fusion2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_device_assert_masked_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_flip_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_put3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_put_fallback2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_put_reinplace_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inductor_assert_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inplace_activations_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inplace_flip_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inplace_mixed_dtype_ops_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inplace_resize_as_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_input_mutation1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_invalid_operand_issue1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_isin_tensor_scalar_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_large_grid_use_block_ptr_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_large_strided_reduction_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_large_tensor_reduction_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lerp_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_like_channels_last_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_like_rands_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_like_rands_sliced_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_linear2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_linear_dynamic_maxautotune_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_linear_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_linspace3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lite_dynamic_shape_assertion_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lite_mode_fallback_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lite_mode_not_decompose_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_log1p_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mark_unbacked_with_hint_override_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_masked_fill_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_masked_scatter_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d6_dilation_1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d6_dilation_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward6_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mixed_mm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mm_views_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multi_device_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multi_gpu_recompile_on_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multilayer_prime_size_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multilayer_var_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mutable_custom_op_fixed_layout_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_nan_assert_inside_triton_kernel_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_nan_sort_stable_True_descending_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_nan_to_num_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_narrow_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_no_specization_over_symbolic_value_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_output_strides_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pad_cast_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pad_view_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pattern_matcher_multi_user_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_permute2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_philox_rand_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pixel_shuffle_channels_last_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_airy_ai_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_j1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_chebyshev_polynomial_u_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_erf_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_erfinv_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_exp2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_expit_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_i1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_i1e_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_legendre_polynomial_p_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_modified_bessel_k0_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_polygamma_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_round_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_t_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_v_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_spherical_bessel_j0_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_xlogy_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_polar_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pow2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pow_by_natural_log2_dynamic_shapes_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pow_int_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_profiler_mark_wrapper_call_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_randint_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_randn_with_dtype_and_device_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_reduction1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_reduction3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_reduction4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_reinterpret_dtypeview_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_clone_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_copy_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_view_default_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_view_dtype_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_replication_pad_errors_with_bool_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_round_correctness_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scalar_cpu_tensor_arg_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scalar_input_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scalar_output_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scatter1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scatter_bf16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_searchsorted_broadcast_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_searchsorted_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_setitem_with_int_parameter_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_shape_prop_torch_ones_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_should_pad_bench_for_bmm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sigmoid_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_simplify_loops_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_single_elem_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_size_asserts_for_multi_output_fallback_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_mutation3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter_reinplace_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_view_with_graph_break_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_softmax_one_kernel_persist_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sort_bool_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_cumsum_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_cumsum_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_failed_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_reduction_dynamic_shape_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_with_sizes_with_unbacked_symints_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sum_int_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_tensor1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_tensor3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_tmp_not_defined_issue2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_topk_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_transpose_add_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_transpose_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_transposed_propagates_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unbacked_float_item_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unsigned_constant_tensors_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_float32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unsqueeze_inplace_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_upsample_bilinear2d_a_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_upsample_nearest2d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_vectorized_ops_masked_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_vectorized_ops_masked_var_novec_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_view_as_real_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_view_detach_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_view_uint8_through_differing_bitwidths_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_views1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_views3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_views7_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_zeros_cuda 2025-12-04T11:14:00.0701439Z 2025-12-04T11:14:00.0701823Z inductor/test_compile_subprocess.py::TestSubprocess::test_async PASSED [32.9030s] [ 0%] 2025-12-04T11:14:00.0703038Z inductor/test_compile_subprocess.py::TestSubprocess::test_progressive SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 0%] 2025-12-04T11:14:00.0704757Z inductor/test_compile_subprocess.py::GPUTests::test__dyn_quant_matmul_4bit_fp32_input_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (No _dyn_quant_matmul_4bit implementation on CUDA) [ 1%] 2025-12-04T11:14:00.0706381Z inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool1d_argmax_cuda <- test/inductor/test_torchinductor.py PASSED [2.4114s] [ 1%] 2025-12-04T11:14:00.0707804Z inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool_with_output_size_0_cuda <- test/inductor/test_torchinductor.py PASSED [0.5873s] [ 1%] 2025-12-04T11:14:00.0709766Z inductor/test_compile_subprocess.py::GPUTests::test_adaptive_max_pool2d1_cuda <- test/inductor/test_torchinductor.py W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.0711514Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.0713022Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.0714465Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.0715835Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.0717394Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.0718908Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.0720242Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.0721615Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.0723145Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.0724663Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.0726133Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.0727511Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.0728984Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.0730449Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.0731932Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.0733406Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.0734915Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.0736407Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.0737910Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.0739539Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.0741221Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.0742969Z W1204 11:08:04.917000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.0744623Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.0745716Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.0747214Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.0748630Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.0750004Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.0751559Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.0753076Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.0754409Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.0755805Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.0757381Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.0758891Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.0760304Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.0761691Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.0763158Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.0764613Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.0766072Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.0767538Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.0769028Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.0770520Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.0772075Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.0773603Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.0775223Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.0776960Z W1204 11:08:05.286000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.0778244Z PASSED [4.2371s] [ 2%] 2025-12-04T11:14:00.0779032Z inductor/test_compile_subprocess.py::GPUTests::test_adaptive_max_pool2d2_cuda <- test/inductor/test_torchinductor.py PASSED [0.1418s] [ 2%] 2025-12-04T11:14:00.0780334Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex3_cuda <- test/inductor/test_torchinductor.py PASSED [0.5418s] [ 2%] 2025-12-04T11:14:00.0781584Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex6_cuda <- test/inductor/test_torchinductor.py PASSED [0.6994s] [ 3%] 2025-12-04T11:14:00.0782914Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex_strided_fallback_cuda <- test/inductor/test_torchinductor.py PASSED [0.1331s] [ 3%] 2025-12-04T11:14:00.0784239Z inductor/test_compile_subprocess.py::GPUTests::test_add_const_float_cuda <- test/inductor/test_torchinductor.py PASSED [0.8414s] [ 3%] 2025-12-04T11:14:00.0785914Z inductor/test_compile_subprocess.py::GPUTests::test_addmm_cuda <- test/inductor/test_torchinductor.py W1204 11:08:08.865000 89018 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.0787128Z PASSED [2.2221s] [ 4%] 2025-12-04T11:14:00.0788099Z inductor/test_compile_subprocess.py::GPUTests::test_angle_cuda <- test/inductor/test_torchinductor.py PASSED [1.1823s] [ 4%] 2025-12-04T11:14:00.0789419Z inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_cache_hit_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (Requires sm80) [ 5%] 2025-12-04T11:14:00.0791259Z inductor/test_compile_subprocess.py::GPUTests::test_arange4_cuda <- test/inductor/test_torchinductor.py W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.0792888Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.0794384Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.0795797Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.0797150Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.0798710Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.0800226Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.0801564Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.0803004Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.0804594Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.0806125Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.0807584Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.0808980Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.0810439Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.0811908Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.0813372Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.0814836Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.0816332Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.0817812Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.0819453Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.0820982Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.0822567Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.0824155Z W1204 11:08:11.400000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.0825641Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.0826755Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.0828250Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.0829656Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.0831005Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.0832560Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.0834153Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.0835484Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.0836889Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.0838422Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.0839942Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.0841354Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.0842753Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.0844203Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.0845679Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.0847145Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.0848609Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.0850140Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.0851614Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.0853122Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.0854647Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.0856222Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.0857820Z W1204 11:08:11.672000 88809 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.0858954Z PASSED [0.5547s] [ 5%] 2025-12-04T11:14:00.0859719Z inductor/test_compile_subprocess.py::GPUTests::test_argmax_argmin2_cuda <- test/inductor/test_torchinductor.py PASSED [1.1638s] [ 5%] 2025-12-04T11:14:00.0861107Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1485s] [ 6%] 2025-12-04T11:14:00.0862595Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1098s] [ 6%] 2025-12-04T11:14:00.0863979Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py FAILED [0.1088s] [ 6%] 2025-12-04T11:14:00.0864764Z 2025-12-04T11:14:00.0864944Z ==================================== RERUNS ==================================== 2025-12-04T11:14:00.0865482Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.0865994Z Traceback (most recent call last): 2025-12-04T11:14:00.0866673Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.0867332Z return value(self) 2025-12-04T11:14:00.0868007Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.0868792Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.0869377Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.0869920Z return func(*args, **kwds) 2025-12-04T11:14:00.0870560Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.0871250Z check_model( 2025-12-04T11:14:00.0871827Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.0872518Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.0873269Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.0874161Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.0875074Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.0875932Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.0876768Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.0877579Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.0878403Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.0879438Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.0880448Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.0881343Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.0882200Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.0882943Z return f.result() 2025-12-04T11:14:00.0883487Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.0884106Z return self.__get_result() 2025-12-04T11:14:00.0884716Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.0885353Z raise self._exception 2025-12-04T11:14:00.0885946Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.0886466Z 2025-12-04T11:14:00.0886582Z Name= 2025-12-04T11:14:00.0886850Z Traceback (most recent call last): 2025-12-04T11:14:00.0887640Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.0888653Z result = job() 2025-12-04T11:14:00.0889410Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.0890298Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.0891119Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.0891978Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.0892920Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.0893771Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.0894577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.0895444Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.0896170Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.0896943Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.0897384Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.0897685Z 2025-12-04T11:14:00.0897690Z 2025-12-04T11:14:00.0898487Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.0899344Z 2025-12-04T11:14:00.0899352Z 2025-12-04T11:14:00.0899574Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.0900448Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.0901107Z 2025-12-04T11:14:00.0901382Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.0902029Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.0902508Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.0902951Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.0903655Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.0904369Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.0905468Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.0906693Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.0907354Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.0907849Z Traceback (most recent call last): 2025-12-04T11:14:00.0908497Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.0909157Z return value(self) 2025-12-04T11:14:00.0909827Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.0910603Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.0911185Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.0911720Z return func(*args, **kwds) 2025-12-04T11:14:00.0912359Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.0913041Z check_model( 2025-12-04T11:14:00.0913608Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.0914295Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.0915033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.0915914Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.0916823Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.0917660Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.0918498Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.0919301Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.0920195Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.0921193Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.0922198Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.0923126Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.0923982Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.0924731Z return f.result() 2025-12-04T11:14:00.0925281Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.0925907Z return self.__get_result() 2025-12-04T11:14:00.0926502Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.0927155Z raise self._exception 2025-12-04T11:14:00.0927733Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.0928252Z 2025-12-04T11:14:00.0928364Z Name= 2025-12-04T11:14:00.0928637Z Traceback (most recent call last): 2025-12-04T11:14:00.0929417Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.0930202Z result = job() 2025-12-04T11:14:00.0930941Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.0931840Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.0932655Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.0933506Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.0934422Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.0935220Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.0936026Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.0936859Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.0937580Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.0938436Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.0938882Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.0939184Z 2025-12-04T11:14:00.0939189Z 2025-12-04T11:14:00.0939921Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.0940778Z 2025-12-04T11:14:00.0940783Z 2025-12-04T11:14:00.0941004Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.0941875Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.0942533Z 2025-12-04T11:14:00.0942805Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.0943444Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.0943920Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.0944368Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.0945078Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.0945774Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.0946945Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.0948150Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.0948798Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.0949301Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.0949746Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.0950449Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.0951153Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.0952239Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.0953401Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.0953977Z =================================== FAILURES =================================== 2025-12-04T11:14:00.0954497Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.0954999Z Traceback (most recent call last): 2025-12-04T11:14:00.0955646Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.0956308Z return value(self) 2025-12-04T11:14:00.0956972Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.0957764Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.0958347Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.0958872Z return func(*args, **kwds) 2025-12-04T11:14:00.0959523Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.0960202Z check_model( 2025-12-04T11:14:00.0960837Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.0961504Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.0962257Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.0963142Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.0964039Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.0964892Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.0965729Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.0966532Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.0967343Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.0968358Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.0969363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.0970254Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.0971102Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.0971867Z return f.result() 2025-12-04T11:14:00.0972409Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.0973017Z return self.__get_result() 2025-12-04T11:14:00.0973627Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.0974276Z raise self._exception 2025-12-04T11:14:00.0974906Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.0975467Z 2025-12-04T11:14:00.0975566Z Name= 2025-12-04T11:14:00.0975848Z Traceback (most recent call last): 2025-12-04T11:14:00.0976634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.0977443Z result = job() 2025-12-04T11:14:00.0978259Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.0979166Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.0979986Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.0980828Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.0981703Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.0982514Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.0983317Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.0984126Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.0984856Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.0985625Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.0986052Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.0986362Z 2025-12-04T11:14:00.0986367Z 2025-12-04T11:14:00.0987080Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.0988129Z 2025-12-04T11:14:00.0988134Z 2025-12-04T11:14:00.0988524Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.0989395Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.0990038Z 2025-12-04T11:14:00.0990317Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.0990940Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.0991423Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.0991868Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.0992560Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.0993268Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.0994382Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.0995556Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.0996187Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.0996668Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.0997115Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.0997803Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.0998505Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.0999606Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1000764Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1001452Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1001939Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1002431Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1003134Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1003867Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1004967Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1006126Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1007381Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-5bc7c9553ad37db8.xml - 2025-12-04T11:14:00.1008518Z =========================== short test summary info ============================ 2025-12-04T11:14:00.1009680Z FAILED [0.1088s] inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda - torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1010645Z 2025-12-04T11:14:00.1010749Z Name= 2025-12-04T11:14:00.1011044Z Traceback (most recent call last): 2025-12-04T11:14:00.1011827Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1012621Z result = job() 2025-12-04T11:14:00.1013381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1014279Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1015109Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1015969Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1016888Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1017674Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1018554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1019380Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1020122Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1020887Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1021329Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1021632Z 2025-12-04T11:14:00.1021637Z 2025-12-04T11:14:00.1022371Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1023219Z 2025-12-04T11:14:00.1023223Z 2025-12-04T11:14:00.1023459Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1024308Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1024960Z 2025-12-04T11:14:00.1025229Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1025821Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:14:00.1026357Z ============== 1 failed, 13 passed, 3 skipped, 2 rerun in 48.11s =============== 2025-12-04T11:14:00.1026800Z Got exit code 1 2025-12-04T11:14:00.1027067Z Retrying single test... 2025-12-04T11:14:00.1027703Z W1204 11:08:26.648000 89998 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.1028969Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-c06a9a1a186bb263.xml 2025-12-04T11:14:00.1029955Z ============================= test session starts ============================== 2025-12-04T11:14:00.1030622Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:14:00.1031267Z cachedir: .pytest_cache 2025-12-04T11:14:00.1031974Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:14:00.1032766Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:14:00.1033117Z configfile: pytest.ini 2025-12-04T11:14:00.1033844Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:14:00.1034758Z collecting ... collected 879 items / 278 deselected / 601 selected 2025-12-04T11:14:00.1035720Z stepcurrent: skipping 16 already run items. Running only test/inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda 2025-12-04T11:14:00.1036573Z Running 1 items in this shard 2025-12-04T11:14:00.1036786Z 2025-12-04T11:14:00.1037470Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [14.8985s] [100%] 2025-12-04T11:14:00.1038960Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1126s] [100%] 2025-12-04T11:14:00.1040352Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py FAILED [0.1117s] [100%] 2025-12-04T11:14:00.1041062Z 2025-12-04T11:14:00.1041217Z ==================================== RERUNS ==================================== 2025-12-04T11:14:00.1041735Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.1042240Z Traceback (most recent call last): 2025-12-04T11:14:00.1042934Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.1043593Z return value(self) 2025-12-04T11:14:00.1044255Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.1045047Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.1045628Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.1046156Z return func(*args, **kwds) 2025-12-04T11:14:00.1046804Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.1047480Z check_model( 2025-12-04T11:14:00.1048048Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.1048716Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.1049473Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.1050367Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.1051260Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.1052114Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.1052958Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.1053762Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.1054566Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.1055570Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.1056616Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.1057547Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.1058464Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.1059320Z return f.result() 2025-12-04T11:14:00.1059866Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.1060469Z return self.__get_result() 2025-12-04T11:14:00.1061079Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.1061730Z raise self._exception 2025-12-04T11:14:00.1062307Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1062823Z 2025-12-04T11:14:00.1062921Z Name= 2025-12-04T11:14:00.1063200Z Traceback (most recent call last): 2025-12-04T11:14:00.1063998Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1064772Z result = job() 2025-12-04T11:14:00.1065518Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1066419Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1067239Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1068079Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1068954Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1069757Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1070567Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1071413Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1072154Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1072927Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1073357Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1073672Z 2025-12-04T11:14:00.1073677Z 2025-12-04T11:14:00.1074393Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1075256Z 2025-12-04T11:14:00.1075260Z 2025-12-04T11:14:00.1075479Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1076340Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1076987Z 2025-12-04T11:14:00.1077270Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1077897Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1078378Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1078953Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1079653Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1080243Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1081354Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1082523Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1083176Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.1083727Z Traceback (most recent call last): 2025-12-04T11:14:00.1084412Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.1085078Z return value(self) 2025-12-04T11:14:00.1085743Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.1086582Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.1087168Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.1087694Z return func(*args, **kwds) 2025-12-04T11:14:00.1088550Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.1089238Z check_model( 2025-12-04T11:14:00.1089796Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.1090489Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.1091251Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.1092130Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.1093031Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.1093893Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.1094734Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.1095537Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.1096347Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.1097346Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.1098421Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.1099398Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.1100245Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.1101007Z return f.result() 2025-12-04T11:14:00.1101555Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.1102167Z return self.__get_result() 2025-12-04T11:14:00.1102781Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.1103428Z raise self._exception 2025-12-04T11:14:00.1104006Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1104529Z 2025-12-04T11:14:00.1104628Z Name= 2025-12-04T11:14:00.1104912Z Traceback (most recent call last): 2025-12-04T11:14:00.1105708Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1106481Z result = job() 2025-12-04T11:14:00.1107228Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1108129Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1108945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1109784Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1110652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1111458Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1112300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1113168Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1113911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1114739Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1115173Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1115488Z 2025-12-04T11:14:00.1115493Z 2025-12-04T11:14:00.1116205Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1117065Z 2025-12-04T11:14:00.1117070Z 2025-12-04T11:14:00.1117289Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1118158Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1118804Z 2025-12-04T11:14:00.1119088Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1119706Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1120191Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1120754Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1121450Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1122033Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1123140Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1124300Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1124931Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1125452Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1125895Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1126591Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1127297Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1128400Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1129559Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1130117Z =================================== FAILURES =================================== 2025-12-04T11:14:00.1130644Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.1131146Z Traceback (most recent call last): 2025-12-04T11:14:00.1131795Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.1132432Z return value(self) 2025-12-04T11:14:00.1133105Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.1133895Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.1134461Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.1134994Z return func(*args, **kwds) 2025-12-04T11:14:00.1135640Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.1136304Z check_model( 2025-12-04T11:14:00.1136868Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.1137557Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.1138435Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.1139344Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.1140258Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.1141143Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.1141987Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.1142778Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.1143598Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.1144598Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.1145595Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.1146490Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.1147344Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.1148108Z return f.result() 2025-12-04T11:14:00.1148638Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.1149258Z return self.__get_result() 2025-12-04T11:14:00.1149867Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.1150515Z raise self._exception 2025-12-04T11:14:00.1178452Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1179005Z 2025-12-04T11:14:00.1179124Z Name= 2025-12-04T11:14:00.1179407Z Traceback (most recent call last): 2025-12-04T11:14:00.1180233Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1181167Z result = job() 2025-12-04T11:14:00.1181912Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1182827Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1183652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1184513Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1185369Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1186167Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1186979Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1187995Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1188731Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1189510Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1189960Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1190264Z 2025-12-04T11:14:00.1190269Z 2025-12-04T11:14:00.1190987Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1191852Z 2025-12-04T11:14:00.1191856Z 2025-12-04T11:14:00.1192077Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1192947Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1193593Z 2025-12-04T11:14:00.1193975Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1194681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1195159Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1195727Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1196485Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1197061Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1198169Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1199333Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1199974Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1200450Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1200895Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1201602Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1202297Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1203401Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1204566Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1205206Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1205673Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1206107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1206800Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1207554Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1208640Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1209801Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1211058Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-c06a9a1a186bb263.xml - 2025-12-04T11:14:00.1212203Z =========================== short test summary info ============================ 2025-12-04T11:14:00.1213342Z FAILED [0.1117s] inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda - torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1214308Z 2025-12-04T11:14:00.1214420Z Name= 2025-12-04T11:14:00.1214704Z Traceback (most recent call last): 2025-12-04T11:14:00.1215487Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1216277Z result = job() 2025-12-04T11:14:00.1217027Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1217935Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1218871Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1219727Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1220596Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1221396Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1222270Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1223093Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1223831Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1224639Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1225080Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1225395Z 2025-12-04T11:14:00.1225399Z 2025-12-04T11:14:00.1226119Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1226968Z 2025-12-04T11:14:00.1226973Z 2025-12-04T11:14:00.1227200Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1228064Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1228708Z 2025-12-04T11:14:00.1228979Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1229571Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:14:00.1230104Z ================= 1 failed, 278 deselected, 2 rerun in 15.20s ================== 2025-12-04T11:14:00.1230542Z Got exit code 1 2025-12-04T11:14:00.1230811Z Retrying single test... 2025-12-04T11:14:00.1231450Z W1204 11:08:56.173000 90210 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.1232685Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-3aa7e69ca6b255e3.xml 2025-12-04T11:14:00.1233626Z ============================= test session starts ============================== 2025-12-04T11:14:00.1234301Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:14:00.1234946Z cachedir: .pytest_cache 2025-12-04T11:14:00.1235667Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:14:00.1236449Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:14:00.1236805Z configfile: pytest.ini 2025-12-04T11:14:00.1237542Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:14:00.1238442Z collecting ... collected 879 items / 278 deselected / 601 selected 2025-12-04T11:14:00.1239400Z stepcurrent: skipping 16 already run items. Running only test/inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda 2025-12-04T11:14:00.1240241Z Running 1 items in this shard 2025-12-04T11:14:00.1240451Z 2025-12-04T11:14:00.1241148Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [15.1275s] [100%] 2025-12-04T11:14:00.1242637Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.1132s] [100%] 2025-12-04T11:14:00.1244025Z inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda <- test/inductor/test_torchinductor.py FAILED [0.1098s] [100%] 2025-12-04T11:14:00.1244747Z 2025-12-04T11:14:00.1244891Z ==================================== RERUNS ==================================== 2025-12-04T11:14:00.1245419Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.1245912Z Traceback (most recent call last): 2025-12-04T11:14:00.1246558Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.1247205Z return value(self) 2025-12-04T11:14:00.1247911Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.1248728Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.1249308Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.1249844Z return func(*args, **kwds) 2025-12-04T11:14:00.1250512Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.1251192Z check_model( 2025-12-04T11:14:00.1251761Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.1252443Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.1253181Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.1254061Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.1254969Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.1255812Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.1256653Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.1257457Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.1258359Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.1259356Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.1260370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.1261267Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.1262125Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.1262922Z return f.result() 2025-12-04T11:14:00.1263461Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.1264076Z return self.__get_result() 2025-12-04T11:14:00.1264682Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.1265333Z raise self._exception 2025-12-04T11:14:00.1265912Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1266434Z 2025-12-04T11:14:00.1266544Z Name= 2025-12-04T11:14:00.1266811Z Traceback (most recent call last): 2025-12-04T11:14:00.1267593Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1268376Z result = job() 2025-12-04T11:14:00.1269112Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1270011Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1270829Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1271680Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1272534Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1273328Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1274132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1274947Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1275670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1276482Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1276983Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1277288Z 2025-12-04T11:14:00.1277293Z 2025-12-04T11:14:00.1278012Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1278912Z 2025-12-04T11:14:00.1278917Z 2025-12-04T11:14:00.1279137Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1280004Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1280642Z 2025-12-04T11:14:00.1280923Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1281563Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1282037Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1282599Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1283311Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1283884Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1284993Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1286156Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1286820Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.1287315Z Traceback (most recent call last): 2025-12-04T11:14:00.1288152Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.1288814Z return value(self) 2025-12-04T11:14:00.1289474Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.1290351Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.1290931Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.1291473Z return func(*args, **kwds) 2025-12-04T11:14:00.1292110Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.1292788Z check_model( 2025-12-04T11:14:00.1293360Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.1294035Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.1294791Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.1295670Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.1296583Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.1297424Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.1298337Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.1299150Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.1299972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.1300961Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.1301974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.1302868Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.1303790Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.1304602Z return f.result() 2025-12-04T11:14:00.1305145Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.1305764Z return self.__get_result() 2025-12-04T11:14:00.1306409Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.1307066Z raise self._exception 2025-12-04T11:14:00.1307645Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1308165Z 2025-12-04T11:14:00.1308266Z Name= 2025-12-04T11:14:00.1308550Z Traceback (most recent call last): 2025-12-04T11:14:00.1309343Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1310134Z result = job() 2025-12-04T11:14:00.1310871Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1311774Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1312592Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1313450Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1314305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1315101Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1315907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1316713Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1317445Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1318252Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1318695Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1318998Z 2025-12-04T11:14:00.1319004Z 2025-12-04T11:14:00.1319722Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1320584Z 2025-12-04T11:14:00.1320589Z 2025-12-04T11:14:00.1320807Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1321670Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1322311Z 2025-12-04T11:14:00.1322593Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1323216Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1323701Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1324265Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1324971Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1325536Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1326647Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1327801Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1328440Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1328909Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1329350Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1330097Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1330832Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1331935Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1333132Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1333697Z =================================== FAILURES =================================== 2025-12-04T11:14:00.1334212Z ____________________ GPUTests.test_as_strided_on_views_cuda ____________________ 2025-12-04T11:14:00.1334719Z Traceback (most recent call last): 2025-12-04T11:14:00.1335365Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T11:14:00.1336007Z return value(self) 2025-12-04T11:14:00.1336680Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 5667, in test_as_strided_on_views 2025-12-04T11:14:00.1337477Z self.common(fn, (a,), reference_in_float=False) 2025-12-04T11:14:00.1338056Z File "/opt/conda/envs/py_3.10/lib/python3.10/contextlib.py", line 79, in inner 2025-12-04T11:14:00.1338682Z return func(*args, **kwds) 2025-12-04T11:14:00.1339341Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 692, in check_model_gpu 2025-12-04T11:14:00.1340023Z check_model( 2025-12-04T11:14:00.1340580Z File "/var/lib/jenkins/workspace/test/inductor/test_torchinductor.py", line 514, in check_model 2025-12-04T11:14:00.1341273Z actual = run(*example_inputs, **kwargs) 2025-12-04T11:14:00.1342027Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T11:14:00.1342909Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T11:14:00.1343810Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1019, in _compile_fx_inner 2025-12-04T11:14:00.1344709Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T11:14:00.1345548Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1003, in _compile_fx_inner 2025-12-04T11:14:00.1346359Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T11:14:00.1347173Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T11:14:00.1348178Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T11:14:00.1349195Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 422, in codegen_and_compile 2025-12-04T11:14:00.1350087Z output = self._send_to_child(inputs).deserialize(constants) 2025-12-04T11:14:00.1350929Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 596, in _send_to_child 2025-12-04T11:14:00.1351693Z return f.result() 2025-12-04T11:14:00.1352238Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 458, in result 2025-12-04T11:14:00.1352848Z return self.__get_result() 2025-12-04T11:14:00.1353463Z File "/opt/conda/envs/py_3.10/lib/python3.10/concurrent/futures/_base.py", line 403, in __get_result 2025-12-04T11:14:00.1354118Z raise self._exception 2025-12-04T11:14:00.1354699Z torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1355217Z 2025-12-04T11:14:00.1355316Z Name= 2025-12-04T11:14:00.1355597Z Traceback (most recent call last): 2025-12-04T11:14:00.1356384Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1357160Z result = job() 2025-12-04T11:14:00.1357953Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1358892Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1359718Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1360593Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1361457Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1362249Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1363037Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1363855Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1364587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1365362Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1365794Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1366108Z 2025-12-04T11:14:00.1366113Z 2025-12-04T11:14:00.1366831Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1367695Z 2025-12-04T11:14:00.1367700Z 2025-12-04T11:14:00.1367915Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1368777Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1369418Z 2025-12-04T11:14:00.1369697Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1370319Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1370799Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1371397Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1372098Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1372678Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1373787Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1374949Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1375577Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1376055Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1376496Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1377180Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1377888Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1379062Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1380225Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1380855Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T11:14:00.1381328Z stats [('calls_captured', 5)] 2025-12-04T11:14:00.1381768Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T11:14:00.1382472Z inductor [('pattern_matcher_count', 1), ('pattern_matcher_nodes', 1), ('fxgraph_cache_miss', 1)] 2025-12-04T11:14:00.1383164Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T11:14:00.1384326Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T11:14:00.1385522Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T11:14:00.1386776Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-3aa7e69ca6b255e3.xml - 2025-12-04T11:14:00.1388121Z =========================== short test summary info ============================ 2025-12-04T11:14:00.1389263Z FAILED [0.1098s] inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda - torch._inductor.exc.InductorError: SubprocException: An exception occurred in a subprocess: 2025-12-04T11:14:00.1390206Z 2025-12-04T11:14:00.1390312Z Name= 2025-12-04T11:14:00.1390571Z Traceback (most recent call last): 2025-12-04T11:14:00.1391362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_worker/subproc_pool.py", line 457, in do_job 2025-12-04T11:14:00.1392140Z result = job() 2025-12-04T11:14:00.1392885Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_subproc.py", line 92, in _run_in_child_subprocess 2025-12-04T11:14:00.1393769Z result = cls._run_in_child(pickled_input, extra_env) 2025-12-04T11:14:00.1394577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 543, in _run_in_child 2025-12-04T11:14:00.1395420Z output_graph = _InProcessFxCompile().codegen_and_compile( 2025-12-04T11:14:00.1396275Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T11:14:00.1397056Z _check_triton_bf16_support(graph) 2025-12-04T11:14:00.1397853Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T11:14:00.1398655Z warn_and_skip(node.get_device()) 2025-12-04T11:14:00.1399371Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T11:14:00.1400217Z raise SkipFrame("BF16 is not supported") 2025-12-04T11:14:00.1400658Z torch._dynamo.exc.SkipFrame: BF16 is not supported 2025-12-04T11:14:00.1400958Z 2025-12-04T11:14:00.1400963Z 2025-12-04T11:14:00.1401699Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T11:14:00.1402549Z 2025-12-04T11:14:00.1402554Z 2025-12-04T11:14:00.1402783Z To execute this test, run the following from the base repo dir: 2025-12-04T11:14:00.1403633Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_as_strided_on_views_cuda 2025-12-04T11:14:00.1404284Z 2025-12-04T11:14:00.1404549Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:14:00.1405138Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:14:00.1405674Z ================= 1 failed, 278 deselected, 2 rerun in 15.43s ================== 2025-12-04T11:14:00.1406116Z Got exit code 1 2025-12-04T11:14:00.1406711Z FAILED CONSISTENTLY: test/inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda 2025-12-04T11:14:00.1407695Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:14:00.1408684Z W1204 11:09:25.758000 90422 site-packages/torch/_inductor/utils.py:1703] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.1409910Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-696ac340ce65f280.xml 2025-12-04T11:14:00.1410847Z ============================= test session starts ============================== 2025-12-04T11:14:00.1411508Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:14:00.1412153Z cachedir: .pytest_cache 2025-12-04T11:14:00.1412911Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:14:00.1413697Z rootdir: /var/lib/jenkins/workspace 2025-12-04T11:14:00.1414038Z configfile: pytest.ini 2025-12-04T11:14:00.1414809Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:14:00.1415719Z collecting ... collected 879 items / 17 deselected / 862 selected 2025-12-04T11:14:00.1416222Z stepcurrent: skipping 17 already run items. 2025-12-04T11:14:00.1416594Z Running 262 items in this shard 2025-12-04T11:14:00.1416813Z 2025-12-04T11:14:00.1417450Z inductor/test_compile_subprocess.py::GPUTests::test_assert_alignment_op_name_fail_cuda <- test/inductor/test_torchinductor.py PASSED [0.0046s] [ 0%] 2025-12-04T11:14:00.1418942Z inductor/test_compile_subprocess.py::GPUTests::test_assert_alignment_op_name_pass_cuda <- test/inductor/test_torchinductor.py PASSED [0.0031s] [ 0%] 2025-12-04T11:14:00.1420351Z inductor/test_compile_subprocess.py::GPUTests::test_assert_size_stride_op_name_fail_cuda <- test/inductor/test_torchinductor.py PASSED [0.0030s] [ 1%] 2025-12-04T11:14:00.1421438Z inductor/test_compile_subprocess.py::GPUTests::test_async PASSED [33.1981s] [ 1%] 2025-12-04T11:14:00.1422443Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d1_cuda <- test/inductor/test_torchinductor.py PASSED [2.8958s] [ 1%] 2025-12-04T11:14:00.1423664Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d4_cuda <- test/inductor/test_torchinductor.py PASSED [0.9181s] [ 2%] 2025-12-04T11:14:00.1424876Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d7_cuda <- test/inductor/test_torchinductor.py PASSED [0.2697s] [ 2%] 2025-12-04T11:14:00.1426082Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d8_cuda <- test/inductor/test_torchinductor.py PASSED [1.5390s] [ 3%] 2025-12-04T11:14:00.1427391Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d_backward3_cuda <- test/inductor/test_torchinductor.py PASSED [1.9482s] [ 3%] 2025-12-04T11:14:00.1428699Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool3d_backward4_cuda <- test/inductor/test_torchinductor.py PASSED [0.1497s] [ 3%] 2025-12-04T11:14:00.1430897Z inductor/test_compile_subprocess.py::GPUTests::test_bmm2_cuda <- test/inductor/test_torchinductor.py W1204 11:10:09.573000 90631 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.1432091Z PASSED [0.9933s] [ 4%] 2025-12-04T11:14:00.1432862Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_add_autotune_cuda <- test/inductor/test_torchinductor.py PASSED [0.9825s] [ 4%] 2025-12-04T11:14:00.1434208Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int16_int64_cuda <- test/inductor/test_torchinductor.py PASSED [2.1922s] [ 4%] 2025-12-04T11:14:00.1435561Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int32_int16_cuda <- test/inductor/test_torchinductor.py PASSED [2.2009s] [ 5%] 2025-12-04T11:14:00.1436909Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int64_cuda <- test/inductor/test_torchinductor.py PASSED [2.1962s] [ 5%] 2025-12-04T11:14:00.1438243Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int8_cuda <- test/inductor/test_torchinductor.py PASSED [1.4683s] [ 6%] 2025-12-04T11:14:00.1439575Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int16_cuda <- test/inductor/test_torchinductor.py PASSED [1.5252s] [ 6%] 2025-12-04T11:14:00.1440907Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int8_cuda <- test/inductor/test_torchinductor.py PASSED [1.4368s] [ 6%] 2025-12-04T11:14:00.1442243Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [1.4995s] [ 7%] 2025-12-04T11:14:00.1443646Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int64_cuda <- test/inductor/test_torchinductor.py PASSED [1.4576s] [ 7%] 2025-12-04T11:14:00.1444999Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_nd_tiling_False_cuda <- test/inductor/test_torchinductor.py PASSED [0.5681s] [ 8%] 2025-12-04T11:14:00.1446369Z inductor/test_compile_subprocess.py::GPUTests::test_buffer_copied_in_graph_cuda <- test/inductor/test_torchinductor.py PASSED [0.2762s] [ 8%] 2025-12-04T11:14:00.1447657Z inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_cuda <- test/inductor/test_torchinductor.py PASSED [0.7087s] [ 8%] 2025-12-04T11:14:00.1448977Z inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_int_ndigits_pos_cuda <- test/inductor/test_torchinductor.py PASSED [0.2254s] [ 9%] 2025-12-04T11:14:00.1450756Z inductor/test_compile_subprocess.py::GPUTests::test_cat_cuda <- test/inductor/test_torchinductor.py W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1452373Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1453862Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1455267Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1456609Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1458248Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1459810Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1461147Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1462539Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1464049Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1465565Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1466978Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1468364Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1469823Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1471286Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1472746Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1474344Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1475849Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1477353Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1478868Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1480391Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1481980Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1483663Z W1204 11:10:26.501000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1485212Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1486317Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1488011Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1489428Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1490875Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1492435Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1493941Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1495276Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1496664Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1498251Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1499772Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1501190Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1502592Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1504058Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1505604Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1507074Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1508588Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1510089Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1511560Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1513071Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1514601Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1516180Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1517853Z W1204 11:10:27.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1519399Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1520506Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1522035Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1523452Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1524814Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1526361Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1527867Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1529204Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1530584Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1532106Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1533625Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1535029Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1536474Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1538599Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1540113Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1541577Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1543037Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1544543Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1546022Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1547512Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1549033Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1550603Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1552285Z W1204 11:10:27.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1553880Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1554969Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1556464Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1557860Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1559223Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1560770Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1562276Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1563601Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1564983Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1566540Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1568080Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1569491Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1570916Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1572379Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1573828Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1575296Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1576755Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1578330Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1579823Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1581319Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1582901Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1584476Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1586156Z W1204 11:10:27.958000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1587310Z PASSED [1.9034s] [ 9%] 2025-12-04T11:14:00.1588264Z inductor/test_compile_subprocess.py::GPUTests::test_cat_empty_index_cuda <- test/inductor/test_torchinductor.py PASSED [0.5074s] [ 9%] 2025-12-04T11:14:00.1589552Z inductor/test_compile_subprocess.py::GPUTests::test_cat_extern_kernel_cuda <- test/inductor/test_torchinductor.py PASSED [0.3397s] [ 10%] 2025-12-04T11:14:00.1590823Z inductor/test_compile_subprocess.py::GPUTests::test_cat_inplace_cuda <- test/inductor/test_torchinductor.py PASSED [0.7548s] [ 10%] 2025-12-04T11:14:00.1592079Z inductor/test_compile_subprocess.py::GPUTests::test_cat_unbacked_2d_cuda <- test/inductor/test_torchinductor.py PASSED [0.9319s] [ 11%] 2025-12-04T11:14:00.1593355Z inductor/test_compile_subprocess.py::GPUTests::test_cat_unbacked_empty_1d_cuda <- test/inductor/test_torchinductor.py PASSED [0.8715s] [ 11%] 2025-12-04T11:14:00.1594666Z inductor/test_compile_subprocess.py::GPUTests::test_check_stack_no_cycles_cuda <- test/inductor/test_torchinductor.py PASSED [0.1915s] [ 11%] 2025-12-04T11:14:00.1596413Z inductor/test_compile_subprocess.py::GPUTests::test_clamp_cuda <- test/inductor/test_torchinductor.py W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1598127Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1599660Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1601132Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1602496Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1604049Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1605553Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1606876Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1608254Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1609786Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1611301Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1612710Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1614088Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1615597Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1617062Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1618602Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1620055Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1621558Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1623047Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1624558Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1626086Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1627644Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1629383Z W1204 11:10:32.288000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1630543Z PASSED [0.5457s] [ 12%] 2025-12-04T11:14:00.1631820Z inductor/test_compile_subprocess.py::GPUTests::test_clamp_type_promotion_cuda <- test/inductor/test_torchinductor.py W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1633545Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1635023Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1636429Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1637799Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1639353Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1640856Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1642191Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1643581Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1645156Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1646681Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1648085Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1649482Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1650963Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1652440Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1653897Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1655368Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1656870Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1658431Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1660019Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1661535Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1663150Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1664828Z W1204 11:10:32.553000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1665990Z PASSED [0.2894s] [ 12%] 2025-12-04T11:14:00.1667324Z inductor/test_compile_subprocess.py::GPUTests::test_clamp_type_promotion_non_tensor_cuda <- test/inductor/test_torchinductor.py W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1669060Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1670556Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1671964Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1673326Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1674874Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1676427Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1677762Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1679150Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1680677Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1682180Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1683592Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1684983Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1686454Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1688182Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1689639Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1691211Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1692711Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1694242Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1695737Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1697263Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1698926Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1700609Z W1204 11:10:32.826000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1701774Z PASSED [0.2580s] [ 12%] 2025-12-04T11:14:00.1703031Z inductor/test_compile_subprocess.py::GPUTests::test_complex_fallback_cuda <- test/inductor/test_torchinductor.py W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1704703Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1706199Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1707666Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1709028Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1710577Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1712091Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1713425Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1714819Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1716337Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1717865Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1719278Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1720664Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1722874Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1724349Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1725845Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1727315Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1728811Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1730303Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1730878Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1731694Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1732332Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1733246Z W1204 11:10:33.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1733371Z PASSED [0.4050s] [ 13%] 2025-12-04T11:14:00.1734029Z inductor/test_compile_subprocess.py::GPUTests::test_complex_memory_overlap_cuda <- test/inductor/test_torchinductor.py PASSED [0.0035s] [ 13%] 2025-12-04T11:14:00.1734696Z inductor/test_compile_subprocess.py::GPUTests::test_config_option_dont_assume_alignment_cuda <- test/inductor/test_torchinductor.py PASSED [0.6937s] [ 14%] 2025-12-04T11:14:00.1735430Z inductor/test_compile_subprocess.py::GPUTests::test_config_option_dont_assume_alignment_cudagraphs_cuda <- test/inductor/test_torchinductor.py PASSED [1.4645s] [ 14%] 2025-12-04T11:14:00.1736147Z inductor/test_compile_subprocess.py::GPUTests::test_config_option_dont_assume_alignment_recompiles_cuda <- test/inductor/test_torchinductor.py PASSED [1.0038s] [ 14%] 2025-12-04T11:14:00.1736774Z inductor/test_compile_subprocess.py::GPUTests::test_consecutive_split_cumsum_cuda <- test/inductor/test_torchinductor.py PASSED [0.4312s] [ 15%] 2025-12-04T11:14:00.1737335Z inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_3d_cuda <- test/inductor/test_torchinductor.py PASSED [0.7677s] [ 15%] 2025-12-04T11:14:00.1738292Z inductor/test_compile_subprocess.py::GPUTests::test_conv3d_channels_last_use_block_ptr_True_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (only support cpu conv3d channels_last) [ 16%] 2025-12-04T11:14:00.1738856Z inductor/test_compile_subprocess.py::GPUTests::test_conv_backward_cuda <- test/inductor/test_torchinductor.py PASSED [0.3868s] [ 16%] 2025-12-04T11:14:00.1739681Z inductor/test_compile_subprocess.py::GPUTests::test_conv_bn_fuse_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 16%] 2025-12-04T11:14:00.1740261Z inductor/test_compile_subprocess.py::GPUTests::test_conv_shape_check_cuda <- test/inductor/test_torchinductor.py PASSED [0.0824s] [ 17%] 2025-12-04T11:14:00.1740816Z inductor/test_compile_subprocess.py::GPUTests::test_convolution2_cuda <- test/inductor/test_torchinductor.py PASSED [1.7226s] [ 17%] 2025-12-04T11:14:00.1741452Z inductor/test_compile_subprocess.py::GPUTests::test_convolution3_cuda <- test/inductor/test_torchinductor.py PASSED [0.8648s] [ 17%] 2025-12-04T11:14:00.1742009Z inductor/test_compile_subprocess.py::GPUTests::test_convolution5_cuda <- test/inductor/test_torchinductor.py PASSED [0.4649s] [ 18%] 2025-12-04T11:14:00.1743227Z inductor/test_compile_subprocess.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_False_cuda <- test/inductor/test_torchinductor.py W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1743701Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1744600Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1745000Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1745843Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1746437Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1747226Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1747629Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1748481Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1749069Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1749918Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1750362Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1751173Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1751695Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1752499Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1753034Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1753837Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1754406Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1755245Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1755863Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1756677Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1757334Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1758203Z W1204 11:10:42.392000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.device_put.default 2025-12-04T11:14:00.1758599Z W1204 11:10:42.564000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1759014Z W1204 11:10:42.565000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1759403Z W1204 11:10:42.566000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1759790Z W1204 11:10:42.567000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1760193Z W1204 11:10:42.568000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1760581Z W1204 11:10:42.568000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1760981Z W1204 11:10:42.569000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1761370Z W1204 11:10:42.570000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1761759Z W1204 11:10:42.571000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1762193Z W1204 11:10:42.572000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1762580Z W1204 11:10:42.572000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1762981Z W1204 11:10:42.573000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1763366Z W1204 11:10:42.574000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1763763Z W1204 11:10:42.575000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1764163Z W1204 11:10:42.576000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1764550Z W1204 11:10:42.576000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1764957Z W1204 11:10:42.577000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1765350Z W1204 11:10:42.578000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1765740Z W1204 11:10:42.579000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1766145Z W1204 11:10:42.580000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1766533Z W1204 11:10:42.580000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1766919Z W1204 11:10:42.581000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1767319Z W1204 11:10:42.582000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1767740Z W1204 11:10:42.583000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1768173Z W1204 11:10:42.584000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1768564Z W1204 11:10:42.584000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1769003Z W1204 11:10:42.585000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1769407Z W1204 11:10:42.586000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1769799Z W1204 11:10:42.587000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1770200Z W1204 11:10:42.588000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1770590Z W1204 11:10:42.589000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1770983Z W1204 11:10:42.589000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1771390Z W1204 11:10:42.590000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1771780Z W1204 11:10:42.591000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1772183Z W1204 11:10:42.592000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1772572Z W1204 11:10:42.593000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1772959Z W1204 11:10:42.593000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1773361Z W1204 11:10:42.594000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1773753Z W1204 11:10:42.595000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1774194Z W1204 11:10:42.596000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1774583Z W1204 11:10:42.597000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1774974Z W1204 11:10:42.597000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1775379Z W1204 11:10:42.598000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1775763Z W1204 11:10:42.599000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1776165Z W1204 11:10:42.600000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1776553Z W1204 11:10:42.601000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1776947Z W1204 11:10:42.601000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1777348Z W1204 11:10:42.602000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1777738Z W1204 11:10:42.603000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1778212Z W1204 11:10:42.604000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1778602Z W1204 11:10:42.604000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1778985Z W1204 11:10:42.605000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1779388Z W1204 11:10:42.606000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1779828Z W1204 11:10:42.607000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1780263Z W1204 11:10:42.608000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1780652Z W1204 11:10:42.608000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1781071Z W1204 11:10:42.609000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1781469Z W1204 11:10:42.610000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1781855Z W1204 11:10:42.611000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1782241Z W1204 11:10:42.612000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1782640Z W1204 11:10:42.612000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1783029Z W1204 11:10:42.613000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1783430Z W1204 11:10:42.614000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1783816Z W1204 11:10:42.615000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1784203Z W1204 11:10:42.616000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1784600Z W1204 11:10:42.616000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1784986Z W1204 11:10:42.617000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1785388Z W1204 11:10:42.618000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1785781Z W1204 11:10:42.619000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1786204Z W1204 11:10:42.620000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1786602Z W1204 11:10:42.620000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1786989Z W1204 11:10:42.621000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1787387Z W1204 11:10:42.622000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1787983Z W1204 11:10:42.623000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1788475Z W1204 11:10:42.624000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1788882Z W1204 11:10:42.624000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1789271Z W1204 11:10:42.625000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1789678Z W1204 11:10:42.626000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1790064Z W1204 11:10:42.627000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1790451Z W1204 11:10:42.628000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1790855Z W1204 11:10:42.628000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1791242Z W1204 11:10:42.629000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1791642Z W1204 11:10:42.630000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1792032Z W1204 11:10:42.631000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1792536Z W1204 11:10:42.632000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1792945Z W1204 11:10:42.632000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1793372Z W1204 11:10:42.633000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1793774Z W1204 11:10:42.634000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1794161Z W1204 11:10:42.635000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1794550Z W1204 11:10:42.635000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1794954Z W1204 11:10:42.636000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1795344Z W1204 11:10:42.637000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1795737Z W1204 11:10:42.638000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1796136Z W1204 11:10:42.639000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1796525Z W1204 11:10:42.640000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1796925Z W1204 11:10:42.640000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1797313Z W1204 11:10:42.641000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1797700Z W1204 11:10:42.642000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1798099Z W1204 11:10:42.643000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1798531Z W1204 11:10:42.643000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.1798650Z PASSED [1.5161s] [ 18%] 2025-12-04T11:14:00.1799172Z inductor/test_compile_subprocess.py::GPUTests::test_cos_cuda <- test/inductor/test_torchinductor.py PASSED [0.9112s] [ 19%] 2025-12-04T11:14:00.1799792Z inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_cpu_tensor_cuda <- test/inductor/test_torchinductor.py PASSED [0.7369s] [ 19%] 2025-12-04T11:14:00.1800338Z inductor/test_compile_subprocess.py::GPUTests::test_cudnn_rnn_cuda <- test/inductor/test_torchinductor.py PASSED [3.8623s] [ 19%] 2025-12-04T11:14:00.1800904Z inductor/test_compile_subprocess.py::GPUTests::test_cumsum_zero_dim_cuda <- test/inductor/test_torchinductor.py PASSED [0.3640s] [ 20%] 2025-12-04T11:14:00.1801963Z inductor/test_compile_subprocess.py::GPUTests::test_custom_op_1_cuda <- test/inductor/test_torchinductor.py W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1802429Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1803343Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1803725Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1804566Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1805158Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1806012Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1806427Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1807295Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1807845Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1808684Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1809134Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1809950Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1810473Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1811283Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1811806Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1812605Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1813202Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1813999Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1814582Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1815393Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1816031Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1816853Z W1204 11:10:48.827000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.test.foo.default 2025-12-04T11:14:00.1816964Z PASSED [0.4547s] [ 20%] 2025-12-04T11:14:00.1818021Z inductor/test_compile_subprocess.py::GPUTests::test_custom_op_3_cuda <- test/inductor/test_torchinductor.py W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1818564Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1819509Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1819926Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1820764Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1821392Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1822184Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1822600Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1823440Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1824005Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1824832Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1825276Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1826089Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1826611Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1827485Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1828008Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1828814Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1829367Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1830159Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1830749Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1831568Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1832208Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1833026Z W1204 11:10:49.281000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.test.foo3.default 2025-12-04T11:14:00.1833147Z PASSED [0.1316s] [ 20%] 2025-12-04T11:14:00.1834445Z inductor/test_compile_subprocess.py::GPUTests::test_custom_op_default_layout_constraint_cuda <- test/inductor/test_torchinductor.py W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1834907Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1835844Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1836224Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1837075Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1837661Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1838445Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1838862Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1839699Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 96, in reducer_override 2025-12-04T11:14:00.1840223Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _OpPickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1841057Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 418, in reduce_helper 2025-12-04T11:14:00.1841598Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] result = cls.pickle(op, pickler.options) 2025-12-04T11:14:00.1842391Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1842966Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1843791Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1844422Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1845271Z W1204 11:10:49.420000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.copy_.default 2025-12-04T11:14:00.1845380Z PASSED [0.2427s] [ 21%] 2025-12-04T11:14:00.1846003Z inductor/test_compile_subprocess.py::GPUTests::test_custom_scan_op_compiled_cuda <- test/inductor/test_torchinductor.py PASSED [1.2707s] [ 21%] 2025-12-04T11:14:00.1846664Z inductor/test_compile_subprocess.py::GPUTests::test_deterministic_codegen_with_suffix_cuda <- test/inductor/test_torchinductor.py PASSED [0.8650s] [ 22%] 2025-12-04T11:14:00.1847726Z inductor/test_compile_subprocess.py::GPUTests::test_device_assert_cuda <- test/inductor/test_torchinductor.py W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1848260Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1849160Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1849591Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1850430Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1851009Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1851815Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1852219Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1853070Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1853625Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1854463Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1854907Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1855747Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1856283Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1857086Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1857619Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1858488Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1859066Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1859859Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1860430Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1861265Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1861930Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1862899Z W1204 11:10:51.818000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1863409Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1863911Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Traceback (most recent call last): 2025-12-04T11:14:00.1864800Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1865181Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] ).serialize() 2025-12-04T11:14:00.1866040Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1866621Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1867424Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1867827Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] pickler.dump(obj) 2025-12-04T11:14:00.1868664Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1869233Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1870098Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1870557Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] cls(obj, pickler.options), 2025-12-04T11:14:00.1871361Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1871898Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1872701Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1873229Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1874053Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1874608Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1875416Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1876022Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1876883Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1877513Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1878680Z W1204 11:10:52.137000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1878918Z PASSED [0.6260s] [ 22%] 2025-12-04T11:14:00.1879752Z inductor/test_compile_subprocess.py::GPUTests::test_diagonal_copy_cuda <- test/inductor/test_torchinductor.py PASSED [1.3179s] [ 22%] 2025-12-04T11:14:00.1880387Z inductor/test_compile_subprocess.py::GPUTests::test_dist_bf16_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (Requires sm80) [ 23%] 2025-12-04T11:14:00.1881408Z inductor/test_compile_subprocess.py::GPUTests::test_dist_cuda <- test/inductor/test_torchinductor.py W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1881889Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1882779Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1883159Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1884013Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1884636Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1885437Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1885840Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1886684Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1887252Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1888373Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1888833Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1889639Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1890175Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1890973Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1891605Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1892419Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1893012Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1893816Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1894389Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1895214Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1895839Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1896754Z W1204 11:10:54.118000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1896875Z PASSED [0.6750s] [ 23%] 2025-12-04T11:14:00.1897395Z inductor/test_compile_subprocess.py::GPUTests::test_div3_cuda <- test/inductor/test_torchinductor.py PASSED [0.3946s] [ 24%] 2025-12-04T11:14:00.1897917Z inductor/test_compile_subprocess.py::GPUTests::test_div5_cuda <- test/inductor/test_torchinductor.py PASSED [0.3475s] [ 24%] 2025-12-04T11:14:00.1898599Z inductor/test_compile_subprocess.py::GPUTests::test_div_presicion_accuracy_cuda <- test/inductor/test_torchinductor.py PASSED [0.4202s] [ 24%] 2025-12-04T11:14:00.1899682Z inductor/test_compile_subprocess.py::GPUTests::test_div_prim_cuda <- test/inductor/test_torchinductor.py W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1900164Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1901056Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1901450Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1902293Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1902891Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1903681Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1904085Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1904932Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1905520Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1906391Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1906866Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1907685Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1908202Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1909008Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1909544Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1910346Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1910917Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1911713Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1912289Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1913159Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1913785Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1914625Z W1204 11:10:55.603000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.div.default 2025-12-04T11:14:00.1915133Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1915602Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1916493Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1916876Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1917727Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1918305Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1919101Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1919538Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1920432Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1921019Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1921846Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1922303Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1923108Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1923647Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1924443Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1924978Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1925775Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1926328Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1927176Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1927749Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1928580Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1929205Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1930047Z W1204 11:10:55.770000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.div.default 2025-12-04T11:14:00.1930556Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1931011Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.1931914Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1932292Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.1933138Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1933744Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1934588Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1935028Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.1935857Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1936420Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1937247Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1937705Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.1938605Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1939130Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1939951Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1940472Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1941336Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1941887Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1942697Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1943270Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1944087Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1944728Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1945557Z W1204 11:10:55.941000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.div.default 2025-12-04T11:14:00.1945679Z PASSED [0.5867s] [ 25%] 2025-12-04T11:14:00.1946779Z inductor/test_compile_subprocess.py::GPUTests::test_div_softmax_symfloat_cuda <- test/inductor/test_torchinductor.py W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1947245Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Traceback (most recent call last): 2025-12-04T11:14:00.1948218Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1948611Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] ).serialize() 2025-12-04T11:14:00.1949494Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1950077Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1950878Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1951292Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] pickler.dump(obj) 2025-12-04T11:14:00.1952138Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1952709Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1953537Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1953996Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] cls(obj, pickler.options), 2025-12-04T11:14:00.1954806Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1955380Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1956184Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1956713Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1957526Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1958082Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1958896Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1959476Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1960307Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1960935Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1961890Z W1204 11:10:56.374000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1962447Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1962905Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Traceback (most recent call last): 2025-12-04T11:14:00.1963843Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1964221Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] ).serialize() 2025-12-04T11:14:00.1965069Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1965660Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1966448Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1966865Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] pickler.dump(obj) 2025-12-04T11:14:00.1967701Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1968268Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1969103Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1969582Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] cls(obj, pickler.options), 2025-12-04T11:14:00.1970397Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.1970920Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.1971731Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.1972257Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.1973072Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.1973630Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.1974423Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.1975008Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.1975874Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.1976550Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.1977465Z W1204 11:10:57.789000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.1977615Z PASSED [2.7544s] [ 25%] 2025-12-04T11:14:00.1978500Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float16_float32_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (uses bfloat16 which requires SM >= 80) [ 25%] 2025-12-04T11:14:00.1979302Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_uint8_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (uses bfloat16 which requires SM >= 80) [ 26%] 2025-12-04T11:14:00.1980123Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float64_float64_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 26%] 2025-12-04T11:14:00.1980905Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_int64_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 27%] 2025-12-04T11:14:00.1981716Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_float64_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (uses bfloat16 which requires SM >= 80) [ 27%] 2025-12-04T11:14:00.1982495Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_int32_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 27%] 2025-12-04T11:14:00.1983290Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_int64_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 28%] 2025-12-04T11:14:00.1984128Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_uint8_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 28%] 2025-12-04T11:14:00.1984923Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_float32_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 29%] 2025-12-04T11:14:00.1985716Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_int16_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 29%] 2025-12-04T11:14:00.1986490Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_int64_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (uses bfloat16 which requires SM >= 80) [ 29%] 2025-12-04T11:14:00.1987276Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_int8_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (uses bfloat16 which requires SM >= 80) [ 30%] 2025-12-04T11:14:00.1988367Z inductor/test_compile_subprocess.py::GPUTests::test_embedding_bag_byte_unpack_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (No cuda implementation (it returns empty)) [ 30%] 2025-12-04T11:14:00.1988929Z inductor/test_compile_subprocess.py::GPUTests::test_embedding_cuda <- test/inductor/test_torchinductor.py PASSED [0.6849s] [ 30%] 2025-12-04T11:14:00.1989453Z inductor/test_compile_subprocess.py::GPUTests::test_empty1_cuda <- test/inductor/test_torchinductor.py PASSED [0.1728s] [ 31%] 2025-12-04T11:14:00.1989975Z inductor/test_compile_subprocess.py::GPUTests::test_empty2_cuda <- test/inductor/test_torchinductor.py PASSED [0.1703s] [ 31%] 2025-12-04T11:14:00.1990547Z inductor/test_compile_subprocess.py::GPUTests::test_empty_strided_cuda <- test/inductor/test_torchinductor.py PASSED [0.1731s] [ 32%] 2025-12-04T11:14:00.1991297Z inductor/test_compile_subprocess.py::GPUTests::test_emulate_precision_triton_fp_fusion_cuda <- test/inductor/test_torchinductor.py PASSED [0.2732s] [ 32%] 2025-12-04T11:14:00.1992446Z inductor/test_compile_subprocess.py::GPUTests::test_fallback_mutable_op_list_cuda <- test/inductor/test_torchinductor.py W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.1992928Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] Traceback (most recent call last): 2025-12-04T11:14:00.1993818Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.1994189Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] ).serialize() 2025-12-04T11:14:00.1995029Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.1995615Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.1996398Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.1996790Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] pickler.dump(obj) 2025-12-04T11:14:00.1997622Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.1998167Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.1999005Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.1999476Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] cls(obj, pickler.options), 2025-12-04T11:14:00.2000291Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2000806Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2001613Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2002130Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2002923Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2003479Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2004264Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2004840Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2005674Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2006325Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2007178Z W1204 11:11:00.452000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.inplace_.default 2025-12-04T11:14:00.2007718Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2008189Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2009084Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2009484Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2010328Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2010917Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2011717Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2012118Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2012971Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 96, in reducer_override 2025-12-04T11:14:00.2013514Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _OpPickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2014352Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 418, in reduce_helper 2025-12-04T11:14:00.2014841Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] result = cls.pickle(op, pickler.options) 2025-12-04T11:14:00.2015634Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2016219Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2017035Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2017675Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2018594Z W1204 11:11:00.549000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.sin_out.default 2025-12-04T11:14:00.2018702Z PASSED [0.1960s] [ 32%] 2025-12-04T11:14:00.2019779Z inductor/test_compile_subprocess.py::GPUTests::test_fft_real_input_cuda <- test/inductor/test_torchinductor.py W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2020318Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2021223Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2021633Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2022741Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2025354Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2027835Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2030315Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2032986Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2035936Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2038528Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2040013Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2041523Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2043006Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2044457Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2045927Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2047395Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2048893Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2050383Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2051877Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2053406Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2055024Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2056755Z W1204 11:11:00.702000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2057959Z PASSED [0.1328s] [ 33%] 2025-12-04T11:14:00.2058811Z inductor/test_compile_subprocess.py::GPUTests::test_float16_to_int16_cuda <- test/inductor/test_torchinductor.py PASSED [0.2096s] [ 33%] 2025-12-04T11:14:00.2060090Z inductor/test_compile_subprocess.py::GPUTests::test_float32_to_int32_cuda <- test/inductor/test_torchinductor.py PASSED [0.2225s] [ 33%] 2025-12-04T11:14:00.2061329Z inductor/test_compile_subprocess.py::GPUTests::test_fmin_fmax_cuda <- test/inductor/test_torchinductor.py PASSED [0.6727s] [ 34%] 2025-12-04T11:14:00.2062615Z inductor/test_compile_subprocess.py::GPUTests::test_fractional_max_pool2d1_cuda <- test/inductor/test_torchinductor.py PASSED [1.1289s] [ 34%] 2025-12-04T11:14:00.2063950Z inductor/test_compile_subprocess.py::GPUTests::test_fractional_max_pool2d4_cuda <- test/inductor/test_torchinductor.py PASSED [1.5817s] [ 35%] 2025-12-04T11:14:00.2065281Z inductor/test_compile_subprocess.py::GPUTests::test_full_like_transposed_cuda <- test/inductor/test_torchinductor.py PASSED [0.3853s] [ 35%] 2025-12-04T11:14:00.2066551Z inductor/test_compile_subprocess.py::GPUTests::test_fuse_tiled_cuda <- test/inductor/test_torchinductor.py PASSED [0.6300s] [ 35%] 2025-12-04T11:14:00.2068242Z inductor/test_compile_subprocess.py::GPUTests::test_gelu_cuda <- test/inductor/test_torchinductor.py W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2069854Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2071353Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2072826Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2074194Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2075758Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2077254Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2078590Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2079990Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2081521Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2083032Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2084449Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2110001Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2111741Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2113215Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2114744Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2116215Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2117724Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2119230Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2120733Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2122266Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2123850Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2125528Z W1204 11:11:06.131000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2126744Z PASSED [0.8586s] [ 36%] 2025-12-04T11:14:00.2128100Z inductor/test_compile_subprocess.py::GPUTests::test_generated_code_has_size_stride_assert_cuda <- test/inductor/test_torchinductor.py W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2129877Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2131376Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2132785Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2134144Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2135713Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2137225Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2138653Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2140038Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2141555Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2143146Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2144559Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2145985Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2147434Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2148892Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2150360Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2151828Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2153324Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2154798Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2156319Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2157843Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2159455Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2161044Z W1204 11:11:06.541000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.test.foo.default 2025-12-04T11:14:00.2162095Z PASSED [0.2504s] [ 36%] 2025-12-04T11:14:00.2162819Z inductor/test_compile_subprocess.py::GPUTests::test_getitem_cuda <- test/inductor/test_torchinductor.py PASSED [0.0243s] [ 37%] 2025-12-04T11:14:00.2164103Z inductor/test_compile_subprocess.py::GPUTests::test_gpu_scalar_with_cpu_tensor_cuda <- test/inductor/test_torchinductor.py PASSED [0.0186s] [ 37%] 2025-12-04T11:14:00.2165473Z inductor/test_compile_subprocess.py::GPUTests::test_gpu_scalar_with_gpu_tensor_cuda <- test/inductor/test_torchinductor.py PASSED [0.1952s] [ 37%] 2025-12-04T11:14:00.2167324Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_arange1_cuda <- test/inductor/test_torchinductor.py W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2169043Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2170542Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2171950Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2173377Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2174928Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2176466Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2177798Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2179276Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2180814Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2182325Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2183741Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2185127Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2186595Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2188391Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2189947Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2191415Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2192915Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2194410Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2195900Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2197429Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2199009Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2200608Z W1204 11:11:07.030000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2202089Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2203186Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Traceback (most recent call last): 2025-12-04T11:14:00.2204778Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2206194Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] ).serialize() 2025-12-04T11:14:00.2207598Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2209148Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2210654Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2211987Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] pickler.dump(obj) 2025-12-04T11:14:00.2213375Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2214906Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2216413Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2217825Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] cls(obj, pickler.options), 2025-12-04T11:14:00.2219488Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2221387Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2222841Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2224309Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2225777Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2227280Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2228772Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2230263Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2231786Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2233362Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2234997Z W1204 11:11:07.748000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2236082Z PASSED [1.4482s] [ 38%] 2025-12-04T11:14:00.2237411Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_arange2_cuda <- test/inductor/test_torchinductor.py W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2239163Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2240656Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2242069Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2243426Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2244986Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2246495Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2247829Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2249221Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2250742Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2252308Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2253723Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2255118Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2256579Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2258034Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2259574Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2261040Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2262540Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2264023Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2265527Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2267119Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2268702Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2270335Z W1204 11:11:08.489000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2271797Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2272899Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Traceback (most recent call last): 2025-12-04T11:14:00.2274396Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2275799Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] ).serialize() 2025-12-04T11:14:00.2277158Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2278694Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2280195Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2281523Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] pickler.dump(obj) 2025-12-04T11:14:00.2282943Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2284458Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2285976Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2287386Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] cls(obj, pickler.options), 2025-12-04T11:14:00.2289072Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2290546Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2292004Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2293470Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2294937Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2296429Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2298034Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2299604Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2301182Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2302764Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2304357Z W1204 11:11:08.646000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2305409Z PASSED [0.3158s] [ 38%] 2025-12-04T11:14:00.2306700Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_argmax_cuda <- test/inductor/test_torchinductor.py W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2308414Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2309914Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2311321Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2312680Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2314312Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2315829Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2317160Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2318537Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2320072Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2321599Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2323016Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2324403Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2325848Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2327427Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2328925Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2330396Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2331920Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2333391Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2334896Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2336430Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2338010Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2339775Z W1204 11:11:08.812000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2340924Z PASSED [0.7597s] [ 38%] 2025-12-04T11:14:00.2342270Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_constant_tensor1_cuda <- test/inductor/test_torchinductor.py W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2344027Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2345571Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2346967Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2348333Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2349889Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2351396Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2352728Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2354104Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2355639Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2357160Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2358565Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2360013Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2361484Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2362982Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2364447Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2365913Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2367403Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2368889Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2370402Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2371931Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2373506Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2375535Z W1204 11:11:09.573000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2376699Z PASSED [0.7034s] [ 39%] 2025-12-04T11:14:00.2378099Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_unbacked_symint_as_output_cuda <- test/inductor/test_torchinductor.py W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2379985Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Traceback (most recent call last): 2025-12-04T11:14:00.2381481Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2382907Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] ).serialize() 2025-12-04T11:14:00.2384289Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2385856Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2387374Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2388982Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] pickler.dump(obj) 2025-12-04T11:14:00.2390463Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2392072Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2393644Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2395059Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] cls(obj, pickler.options), 2025-12-04T11:14:00.2396439Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2397921Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2399441Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2400908Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2402370Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2403874Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2405373Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2406937Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2408471Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2410048Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2411654Z W1204 11:11:10.299000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2412735Z PASSED [0.5880s] [ 39%] 2025-12-04T11:14:00.2414086Z inductor/test_compile_subprocess.py::GPUTests::test_grid_sampler_expand_preserves_view_cuda <- test/inductor/test_torchinductor.py W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2415842Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2417327Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2418796Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2420160Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2421794Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2423285Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2424656Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2426043Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2427566Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2429088Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2430481Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2431874Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2433333Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2434790Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2436244Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2437733Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2439225Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2440713Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2442207Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2443718Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2445293Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2446886Z W1204 11:11:12.001000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2447965Z PASSED [8.8851s] [ 40%] 2025-12-04T11:14:00.2449206Z inductor/test_compile_subprocess.py::GPUTests::test_hardsigmoid_cuda <- test/inductor/test_torchinductor.py W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2450842Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2452396Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2453810Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2455197Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2456750Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2458315Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2459651Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2461040Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2462575Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2464080Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2465496Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2466894Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2468407Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2469864Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2471310Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2472770Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2474261Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2475754Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2477248Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2478758Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2480337Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2482052Z W1204 11:11:20.084000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2483207Z PASSED [0.5849s] [ 40%] 2025-12-04T11:14:00.2484531Z inductor/test_compile_subprocess.py::GPUTests::test_hardswish_cuda <- test/inductor/test_torchinductor.py W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2486200Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2487689Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2489380Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2490749Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2492302Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2493814Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2495140Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2496522Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2498038Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2499698Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2501110Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2502503Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2503967Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2505414Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2506881Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2508350Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2509848Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2511343Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2512835Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2514460Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2516418Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2518374Z W1204 11:11:20.678000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2519525Z PASSED [0.6060s] [ 40%] 2025-12-04T11:14:00.2520762Z inductor/test_compile_subprocess.py::GPUTests::test_hardtanh_cuda <- test/inductor/test_torchinductor.py W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2522409Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2523918Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2525330Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2526687Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2528256Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2529774Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2531156Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2532550Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2534076Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2535599Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2537016Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2538526Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2539984Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2541453Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2542922Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2544390Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2546000Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2547481Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2549022Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2550551Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2552138Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2553826Z W1204 11:11:21.240000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2554973Z PASSED [0.5209s] [ 41%] 2025-12-04T11:14:00.2555747Z inductor/test_compile_subprocess.py::GPUTests::test_horizonal_fusion1_cuda <- test/inductor/test_torchinductor.py PASSED [0.6796s] [ 41%] 2025-12-04T11:14:00.2557072Z inductor/test_compile_subprocess.py::GPUTests::test_horizonal_fusion2_cuda <- test/inductor/test_torchinductor.py PASSED [0.5960s] [ 41%] 2025-12-04T11:14:00.2558323Z inductor/test_compile_subprocess.py::GPUTests::test_index1_cuda <- test/inductor/test_torchinductor.py PASSED [1.4018s] [ 42%] 2025-12-04T11:14:00.2559499Z inductor/test_compile_subprocess.py::GPUTests::test_index3_cuda <- test/inductor/test_torchinductor.py PASSED [0.6104s] [ 42%] 2025-12-04T11:14:00.2561353Z inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_device_assert_masked_cuda <- test/inductor/test_torchinductor.py W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2563184Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2564687Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2566086Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2567450Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2569011Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2570527Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2571864Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2573238Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2574773Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2576330Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2577793Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2579246Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2580742Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2582214Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2583681Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2585163Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2586653Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2588454Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2589972Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2591507Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2593174Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2594764Z W1204 11:11:24.788000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2596254Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2597369Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2598869Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2600283Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2601644Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2603202Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2604707Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2606034Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2607465Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2609042Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2610572Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2612038Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2613431Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2614885Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2616362Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2617823Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2619374Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2620869Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2622355Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2623911Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2625437Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2627012Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2628596Z W1204 11:11:25.162000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2629681Z PASSED [0.7513s] [ 43%] 2025-12-04T11:14:00.2630987Z inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_flip_cuda <- test/inductor/test_torchinductor.py W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2632709Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2634208Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2635609Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2636976Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2638584Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2640139Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2641492Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2642883Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2644415Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2645934Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2647352Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2648728Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2650210Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2651680Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2653156Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2654677Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2656170Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2657668Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2659266Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2660797Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2662387Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2663985Z W1204 11:11:25.528000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.2665076Z PASSED [0.1817s] [ 43%] 2025-12-04T11:14:00.2665821Z inductor/test_compile_subprocess.py::GPUTests::test_index_put3_cuda <- test/inductor/test_torchinductor.py PASSED [1.0532s] [ 43%] 2025-12-04T11:14:00.2667102Z inductor/test_compile_subprocess.py::GPUTests::test_index_put_fallback2_cuda <- test/inductor/test_torchinductor.py PASSED [0.7652s] [ 44%] 2025-12-04T11:14:00.2668406Z inductor/test_compile_subprocess.py::GPUTests::test_index_put_reinplace_cuda <- test/inductor/test_torchinductor.py PASSED [0.5133s] [ 44%] 2025-12-04T11:14:00.2669784Z inductor/test_compile_subprocess.py::GPUTests::test_inductor_assert_cuda <- test/inductor/test_torchinductor.py PASSED [1.0290s] [ 45%] 2025-12-04T11:14:00.2671623Z inductor/test_compile_subprocess.py::GPUTests::test_inplace_activations_cuda <- test/inductor/test_torchinductor.py W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2673350Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2674842Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2676247Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2677618Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2679185Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2680695Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2682015Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2683402Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2684936Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2686506Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2688198Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2689580Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2691041Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2692514Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2693985Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2695434Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2696936Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2698506Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2700018Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2701673Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2703242Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2704960Z W1204 11:11:29.637000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2706121Z PASSED [0.9284s] [ 45%] 2025-12-04T11:14:00.2707369Z inductor/test_compile_subprocess.py::GPUTests::test_inplace_flip_cuda <- test/inductor/test_torchinductor.py W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2709014Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2710510Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2711915Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2713282Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2714841Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2716336Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2717721Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2719105Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2720638Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2722165Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2723562Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2724968Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2726435Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2727901Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2729348Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2730811Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2732371Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2733860Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2735395Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2736912Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2738560Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2740160Z W1204 11:11:30.366000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T11:14:00.2741634Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2742743Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2744221Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2745631Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2746995Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2748591Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2750089Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2751418Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2752805Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2754336Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2755859Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2757256Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2758645Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2760107Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2761608Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2763104Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2764559Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2766100Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2767590Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2769094Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2770611Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2772192Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2773785Z W1204 11:11:30.968000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T11:14:00.2774852Z PASSED [1.5041s] [ 45%] 2025-12-04T11:14:00.2776142Z inductor/test_compile_subprocess.py::GPUTests::test_inplace_mixed_dtype_ops_cuda <- test/inductor/test_torchinductor.py W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2777840Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2779452Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2780867Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2782237Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2783800Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2785303Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2786645Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2788320Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2789863Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2791379Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2792796Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2793733Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2794260Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2795108Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2795645Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2796445Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2797020Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2797818Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2798412Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2799226Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2799854Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2800833Z W1204 11:11:31.538000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2800940Z PASSED [0.2133s] [ 46%] 2025-12-04T11:14:00.2801542Z inductor/test_compile_subprocess.py::GPUTests::test_inplace_resize_as_cuda <- test/inductor/test_torchinductor.py PASSED [0.0564s] [ 46%] 2025-12-04T11:14:00.2802113Z inductor/test_compile_subprocess.py::GPUTests::test_input_mutation1_cuda <- test/inductor/test_torchinductor.py PASSED [0.2824s] [ 46%] 2025-12-04T11:14:00.2802714Z inductor/test_compile_subprocess.py::GPUTests::test_invalid_operand_issue1_cuda <- test/inductor/test_torchinductor.py PASSED [1.3384s] [ 47%] 2025-12-04T11:14:00.2803303Z inductor/test_compile_subprocess.py::GPUTests::test_isin_tensor_scalar_cuda <- test/inductor/test_torchinductor.py PASSED [1.0244s] [ 47%] 2025-12-04T11:14:00.2803923Z inductor/test_compile_subprocess.py::GPUTests::test_large_grid_use_block_ptr_True_cuda <- test/inductor/test_torchinductor.py PASSED [0.8301s] [ 48%] 2025-12-04T11:14:00.2804552Z inductor/test_compile_subprocess.py::GPUTests::test_large_strided_reduction_cuda <- test/inductor/test_torchinductor.py PASSED [0.7997s] [ 48%] 2025-12-04T11:14:00.2805148Z inductor/test_compile_subprocess.py::GPUTests::test_large_tensor_reduction_cuda <- test/inductor/test_torchinductor.py PASSED [0.5912s] [ 48%] 2025-12-04T11:14:00.2806184Z inductor/test_compile_subprocess.py::GPUTests::test_lerp_cuda <- test/inductor/test_torchinductor.py W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2806643Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2807565Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2807995Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2808838Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2809464Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2810251Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2810665Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2811510Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2812068Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2812908Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2813351Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2814162Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2814690Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2815538Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2816063Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2816865Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2817434Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2818292Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2818893Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2819709Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2820338Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2821220Z W1204 11:11:36.705000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.copy_strided.default 2025-12-04T11:14:00.2821764Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2822267Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2823156Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2823581Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2824423Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2825005Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2825809Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2826212Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2827068Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2827619Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2828464Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2828946Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2829752Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2830286Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2831089Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2831622Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2832427Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2832996Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2833793Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2834366Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2835193Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2835881Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2836816Z W1204 11:11:37.027000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2837375Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2837831Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2838729Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2839112Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2839968Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2840549Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2841353Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2841755Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2842596Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2843195Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2844024Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2844482Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2845280Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2845811Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2846616Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2847139Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2847950Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2848504Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2849307Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2849934Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2850758Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2851416Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2852325Z W1204 11:11:37.647000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.2852443Z PASSED [1.2411s] [ 49%] 2025-12-04T11:14:00.2853524Z inductor/test_compile_subprocess.py::GPUTests::test_like_channels_last_cuda <- test/inductor/test_torchinductor.py W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2854003Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2854890Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2855275Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2856125Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2856700Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2857536Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2857938Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2858851Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2859406Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2860229Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2860695Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2861496Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2862038Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2862837Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2863374Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2864243Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2864804Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2865638Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2866211Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2867035Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2867665Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2868559Z W1204 11:11:37.944000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T11:14:00.2868665Z PASSED [0.7002s] [ 49%] 2025-12-04T11:14:00.2869210Z inductor/test_compile_subprocess.py::GPUTests::test_like_rands_cuda <- test/inductor/test_torchinductor.py PASSED [0.3062s] [ 50%] 2025-12-04T11:14:00.2869803Z inductor/test_compile_subprocess.py::GPUTests::test_like_rands_sliced_cuda <- test/inductor/test_torchinductor.py PASSED [0.3194s] [ 50%] 2025-12-04T11:14:00.2870331Z inductor/test_compile_subprocess.py::GPUTests::test_linear2_cuda <- test/inductor/test_torchinductor.py PASSED [1.6336s] [ 50%] 2025-12-04T11:14:00.2871288Z inductor/test_compile_subprocess.py::GPUTests::test_linear_dynamic_maxautotune_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 51%] 2025-12-04T11:14:00.2872045Z inductor/test_compile_subprocess.py::GPUTests::test_linear_float64_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (cuda failed for float64 linear) [ 51%] 2025-12-04T11:14:00.2872591Z inductor/test_compile_subprocess.py::GPUTests::test_linspace3_cuda <- test/inductor/test_torchinductor.py PASSED [0.1880s] [ 51%] 2025-12-04T11:14:00.2873236Z inductor/test_compile_subprocess.py::GPUTests::test_lite_dynamic_shape_assertion_cuda <- test/inductor/test_torchinductor.py PASSED [0.2224s] [ 52%] 2025-12-04T11:14:00.2873812Z inductor/test_compile_subprocess.py::GPUTests::test_lite_mode_fallback_cuda <- test/inductor/test_torchinductor.py PASSED [0.1335s] [ 52%] 2025-12-04T11:14:00.2874424Z inductor/test_compile_subprocess.py::GPUTests::test_lite_mode_not_decompose_cuda <- test/inductor/test_torchinductor.py PASSED [0.4851s] [ 53%] 2025-12-04T11:14:00.2874946Z inductor/test_compile_subprocess.py::GPUTests::test_log1p_cuda <- test/inductor/test_torchinductor.py PASSED [3.7817s] [ 53%] 2025-12-04T11:14:00.2876101Z inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_3_cuda <- test/inductor/test_torchinductor.py W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2876564Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2877457Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2877850Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2878829Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2879426Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2880249Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2880651Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2881502Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2882056Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2882897Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2883343Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2884155Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2884675Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2885476Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2886046Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2886845Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2887413Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2888478Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2889067Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2889883Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2890509Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2891492Z W1204 11:11:45.796000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2891998Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2892468Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2893477Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2893869Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2894749Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2895326Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2896126Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2896535Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2897385Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2897938Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2898830Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2899287Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2900091Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2900675Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2901481Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2902017Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2902818Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2903374Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2904185Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2904758Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2905583Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2906208Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2907265Z W1204 11:11:48.607000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2907372Z PASSED [5.6612s] [ 53%] 2025-12-04T11:14:00.2908526Z inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_2_cuda <- test/inductor/test_torchinductor.py W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2909034Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2909925Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2910321Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2911170Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2911762Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2912561Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2912965Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2913823Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2914413Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2915254Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2915704Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2916525Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2917048Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2917855Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2918392Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2919193Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2919763Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2920558Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2921160Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2922016Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2922670Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2923652Z W1204 11:11:51.433000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2924161Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2924635Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2925534Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2925915Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2926769Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2927348Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2928154Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2928588Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2929438Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2929991Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2930813Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2931274Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2932080Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2932618Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2933420Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2933952Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2934749Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2935367Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2936173Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2936778Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2937598Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2938287Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2939274Z W1204 11:11:52.311000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2939382Z PASSED [1.7453s] [ 54%] 2025-12-04T11:14:00.2940354Z inductor/test_compile_subprocess.py::GPUTests::test_mark_unbacked_with_hint_override_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipping triton backend only since not big GPU (not enough SM)) [ 54%] 2025-12-04T11:14:00.2940921Z inductor/test_compile_subprocess.py::GPUTests::test_masked_fill_cuda <- test/inductor/test_torchinductor.py PASSED [0.6584s] [ 54%] 2025-12-04T11:14:00.2941483Z inductor/test_compile_subprocess.py::GPUTests::test_masked_scatter_cuda <- test/inductor/test_torchinductor.py PASSED [0.6219s] [ 55%] 2025-12-04T11:14:00.2942540Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d1_cuda <- test/inductor/test_torchinductor.py W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2943041Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2943931Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2944325Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2945161Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2945753Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2946551Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2946968Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2947817Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2948371Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2949216Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2949737Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2950553Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2951105Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2951918Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2952438Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2953246Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2953811Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2954608Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2955190Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2956003Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2956644Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2957649Z W1204 11:11:54.461000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2958158Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2958627Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2959515Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2959907Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2960751Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2961328Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2962131Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2962532Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2963383Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2963994Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2964837Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2965312Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2966109Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2966642Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2967447Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2967983Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2968792Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2969357Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2970154Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2970728Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2971595Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2972222Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2973200Z W1204 11:11:55.339000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2973307Z PASSED [1.7388s] [ 55%] 2025-12-04T11:14:00.2974366Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d4_cuda <- test/inductor/test_torchinductor.py W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2974831Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2975713Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2976106Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2976948Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2977570Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2978466Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2978871Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2979760Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2980313Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2981154Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2981602Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2982417Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2982940Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2983742Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.2984276Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.2985081Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.2985684Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.2986477Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.2987059Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.2988145Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.2988788Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.2989772Z W1204 11:11:56.204000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.2990279Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.2990752Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.2991638Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.2992099Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.2992984Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.2993599Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.2994402Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.2994806Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.2995657Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.2996213Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.2997036Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.2997495Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.2998294Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.2998828Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.2999692Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3000228Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3001032Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3001581Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3002386Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3002962Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3003783Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3004408Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3005383Z W1204 11:11:57.575000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.3005488Z PASSED [2.3450s] [ 56%] 2025-12-04T11:14:00.3006127Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d6_dilation_1_cuda <- test/inductor/test_torchinductor.py PASSED [2.7511s] [ 56%] 2025-12-04T11:14:00.3006775Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d6_dilation_2_cuda <- test/inductor/test_torchinductor.py PASSED [3.2412s] [ 56%] 2025-12-04T11:14:00.3007820Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d8_cuda <- test/inductor/test_torchinductor.py W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3008326Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3009216Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3009607Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3010459Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3011037Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3011844Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3012246Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3013094Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3013684Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3014513Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3014972Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3015778Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3016309Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3017114Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3017654Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3018526Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3019081Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3019888Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3020497Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3021361Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3022018Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3022997Z W1204 11:12:04.552000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.3023505Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3023964Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3024867Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3025249Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3026098Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3026675Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3027476Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3027914Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3028752Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3029319Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3030145Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3030602Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3031410Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3031932Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3032752Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3033274Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3034091Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3034705Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3035519Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3036124Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3036937Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3037577Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3038554Z W1204 11:12:05.223000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T11:14:00.3038678Z PASSED [1.3286s] [ 57%] 2025-12-04T11:14:00.3039339Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward6_cuda <- test/inductor/test_torchinductor.py PASSED [0.3052s] [ 57%] 2025-12-04T11:14:00.3039965Z inductor/test_compile_subprocess.py::GPUTests::test_mixed_mm_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (Requires sm80) [ 58%] 2025-12-04T11:14:00.3040498Z inductor/test_compile_subprocess.py::GPUTests::test_mm_views_cuda <- test/inductor/test_torchinductor.py PASSED [0.1548s] [ 58%] 2025-12-04T11:14:00.3041547Z inductor/test_compile_subprocess.py::GPUTests::test_multi_device_cuda <- test/inductor/test_torchinductor.py W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3042057Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3042951Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3043346Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3044187Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3044780Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3045580Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3045982Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3046838Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3047392Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3048234Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3048705Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3049547Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3050127Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3050929Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3051468Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3052275Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3052845Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3053639Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3054212Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3055039Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3055667Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3056572Z W1204 11:12:06.357000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.device_put.default 2025-12-04T11:14:00.3056971Z W1204 11:12:06.376000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.3057375Z W1204 11:12:06.379000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.3057765Z W1204 11:12:06.382000 90422 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T11:14:00.3057870Z PASSED [0.8368s] [ 58%] 2025-12-04T11:14:00.3058746Z inductor/test_compile_subprocess.py::GPUTests::test_multi_gpu_recompile_on_index_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (requires multiple cuda devices) [ 59%] 2025-12-04T11:14:00.3059348Z inductor/test_compile_subprocess.py::GPUTests::test_multilayer_prime_size_cuda <- test/inductor/test_torchinductor.py PASSED [0.4163s] [ 59%] 2025-12-04T11:14:00.3059928Z inductor/test_compile_subprocess.py::GPUTests::test_multilayer_var_cuda <- test/inductor/test_torchinductor.py PASSED [1.4775s] [ 59%] 2025-12-04T11:14:00.3061064Z inductor/test_compile_subprocess.py::GPUTests::test_mutable_custom_op_fixed_layout_cuda <- test/inductor/test_torchinductor.py W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3061528Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3062431Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3062848Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3063735Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3064344Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3065150Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3065552Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3066395Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 96, in reducer_override 2025-12-04T11:14:00.3066921Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _OpPickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3067745Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 418, in reduce_helper 2025-12-04T11:14:00.3068250Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] result = cls.pickle(op, pickler.options) 2025-12-04T11:14:00.3069045Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3069622Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3070486Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3071111Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3071962Z W1204 11:12:09.059000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.copy_.default 2025-12-04T11:14:00.3072070Z PASSED [0.7375s] [ 60%] 2025-12-04T11:14:00.3072732Z inductor/test_compile_subprocess.py::GPUTests::test_nan_assert_inside_triton_kernel_cuda <- test/inductor/test_torchinductor.py PASSED [0.3905s] [ 60%] 2025-12-04T11:14:00.3073400Z inductor/test_compile_subprocess.py::GPUTests::test_nan_sort_stable_True_descending_False_cuda <- test/inductor/test_torchinductor.py PASSED [1.0174s] [ 61%] 2025-12-04T11:14:00.3073939Z inductor/test_compile_subprocess.py::GPUTests::test_nan_to_num_cuda <- test/inductor/test_torchinductor.py PASSED [0.4892s] [ 61%] 2025-12-04T11:14:00.3074478Z inductor/test_compile_subprocess.py::GPUTests::test_narrow_cuda <- test/inductor/test_torchinductor.py PASSED [0.6678s] [ 61%] 2025-12-04T11:14:00.3075138Z inductor/test_compile_subprocess.py::GPUTests::test_no_specization_over_symbolic_value_cuda <- test/inductor/test_torchinductor.py PASSED [0.7762s] [ 62%] 2025-12-04T11:14:00.3075712Z inductor/test_compile_subprocess.py::GPUTests::test_output_strides_cuda <- test/inductor/test_torchinductor.py PASSED [0.5646s] [ 62%] 2025-12-04T11:14:00.3076738Z inductor/test_compile_subprocess.py::GPUTests::test_pad_cast_cuda <- test/inductor/test_torchinductor.py W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3077233Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3078167Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3078578Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3079431Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3080009Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3080814Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3081220Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3082055Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3082621Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3083448Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3083904Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3084738Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3085275Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3086074Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3086595Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3087413Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3088258Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3089070Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3089647Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3090478Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3091106Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3092119Z W1204 11:12:13.719000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3092647Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3093147Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3094049Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3094427Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3095274Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3095872Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3096665Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3097084Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3097925Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3098553Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3099502Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3099947Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3100764Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3101289Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3102108Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3102636Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3103456Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3104014Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3104811Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3105400Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3106289Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3106929Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3107884Z W1204 11:12:13.975000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3108002Z PASSED [0.5125s] [ 62%] 2025-12-04T11:14:00.3108540Z inductor/test_compile_subprocess.py::GPUTests::test_pad_view_cuda <- test/inductor/test_torchinductor.py PASSED [1.6191s] [ 63%] 2025-12-04T11:14:00.3109655Z inductor/test_compile_subprocess.py::GPUTests::test_pattern_matcher_multi_user_cuda <- test/inductor/test_torchinductor.py W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3110133Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3111027Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3111426Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3112267Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3112848Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3113686Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3114088Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3114948Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3115501Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3116338Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3116788Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3117593Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3118127Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3118928Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3119461Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3120314Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3120884Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3121704Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3122278Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3123103Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3123732Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3124658Z W1204 11:12:16.029000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3125165Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3125635Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3126527Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3126907Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3127791Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3128371Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3129170Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3129570Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3130407Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3130978Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3131804Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3132263Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3133063Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3133599Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3134465Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3134988Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3135834Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3136388Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3137191Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3137770Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3138677Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3139309Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3140215Z W1204 11:12:16.976000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3140334Z PASSED [1.9284s] [ 63%] 2025-12-04T11:14:00.3140877Z inductor/test_compile_subprocess.py::GPUTests::test_permute2_cuda <- test/inductor/test_torchinductor.py PASSED [0.2819s] [ 64%] 2025-12-04T11:14:00.3141976Z inductor/test_compile_subprocess.py::GPUTests::test_philox_rand_cuda <- test/inductor/test_torchinductor.py W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3142434Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3143321Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3143708Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3144545Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3145146Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3145934Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3146348Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3147187Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3147740Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3148714Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3149471Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3150348Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3150962Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3151796Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3152357Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3153283Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3153898Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3154778Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3155389Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3156299Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3156975Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3157999Z W1204 11:12:18.093000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.rngprims.philox_rand.default 2025-12-04T11:14:00.3158543Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3159039Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Traceback (most recent call last): 2025-12-04T11:14:00.3160023Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3160510Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] ).serialize() 2025-12-04T11:14:00.3161452Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3162093Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3162969Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3163413Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] pickler.dump(obj) 2025-12-04T11:14:00.3164357Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3164976Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3165904Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3166453Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] cls(obj, pickler.options), 2025-12-04T11:14:00.3167305Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3167912Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3168758Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3169293Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3170236Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3170839Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3171722Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3172374Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3173247Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3173941Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3174890Z W1204 11:12:18.913000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.rngprims.philox_rand.default 2025-12-04T11:14:00.3175156Z PASSED [1.6424s] [ 64%] 2025-12-04T11:14:00.3175835Z inductor/test_compile_subprocess.py::GPUTests::test_pixel_shuffle_channels_last_cuda <- test/inductor/test_torchinductor.py PASSED [0.9670s] [ 64%] 2025-12-04T11:14:00.3176505Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_airy_ai_cuda <- test/inductor/test_torchinductor.py PASSED [0.1348s] [ 65%] 2025-12-04T11:14:00.3177107Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_j1_cuda <- test/inductor/test_torchinductor.py PASSED [0.3967s] [ 65%] 2025-12-04T11:14:00.3177855Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_chebyshev_polynomial_u_cuda <- test/inductor/test_torchinductor.py PASSED [0.1338s] [ 66%] 2025-12-04T11:14:00.3179199Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_erf_cuda <- test/inductor/test_torchinductor.py PASSED [0.4637s] [ 66%] 2025-12-04T11:14:00.3179822Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_erfinv_cuda <- test/inductor/test_torchinductor.py PASSED [0.4785s] [ 66%] 2025-12-04T11:14:00.3180561Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_exp2_cuda <- test/inductor/test_torchinductor.py PASSED [0.4517s] [ 67%] 2025-12-04T11:14:00.3181167Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_expit_cuda <- test/inductor/test_torchinductor.py PASSED [0.3749s] [ 67%] 2025-12-04T11:14:00.3181888Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_i1_cuda <- test/inductor/test_torchinductor.py PASSED [0.2443s] [ 67%] 2025-12-04T11:14:00.3182501Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_i1e_cuda <- test/inductor/test_torchinductor.py PASSED [0.1315s] [ 68%] 2025-12-04T11:14:00.3183187Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_legendre_polynomial_p_cuda <- test/inductor/test_torchinductor.py PASSED [0.1312s] [ 68%] 2025-12-04T11:14:00.3183900Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_modified_bessel_k0_cuda <- test/inductor/test_torchinductor.py PASSED [0.6486s] [ 69%] 2025-12-04T11:14:00.3184537Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_polygamma_cuda <- test/inductor/test_torchinductor.py PASSED [0.2566s] [ 69%] 2025-12-04T11:14:00.3185176Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_round_cuda <- test/inductor/test_torchinductor.py PASSED [0.4726s] [ 69%] 2025-12-04T11:14:00.3185948Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_t_cuda <- test/inductor/test_torchinductor.py PASSED [0.4914s] [ 70%] 2025-12-04T11:14:00.3186699Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_v_cuda <- test/inductor/test_torchinductor.py PASSED [0.1316s] [ 70%] 2025-12-04T11:14:00.3187420Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_spherical_bessel_j0_cuda <- test/inductor/test_torchinductor.py PASSED [0.1305s] [ 70%] 2025-12-04T11:14:00.3188801Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_xlogy_cuda <- test/inductor/test_torchinductor.py W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3189436Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3190345Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3190949Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3191831Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3192451Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3193341Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3193783Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3194728Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3195341Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3196295Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3196830Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3197672Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3198297Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3199173Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3199813Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3200652Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3201295Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3202126Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3202717Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3203671Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3204346Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3205378Z W1204 11:12:26.085000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3205522Z PASSED [0.5575s] [ 71%] 2025-12-04T11:14:00.3206186Z inductor/test_compile_subprocess.py::GPUTests::test_polar_cuda <- test/inductor/test_torchinductor.py PASSED [0.2089s] [ 71%] 2025-12-04T11:14:00.3206774Z inductor/test_compile_subprocess.py::GPUTests::test_pow2_cuda <- test/inductor/test_torchinductor.py PASSED [0.8431s] [ 72%] 2025-12-04T11:14:00.3207495Z inductor/test_compile_subprocess.py::GPUTests::test_pow_by_natural_log2_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [1.1219s] [ 72%] 2025-12-04T11:14:00.3208106Z inductor/test_compile_subprocess.py::GPUTests::test_pow_int_cuda <- test/inductor/test_torchinductor.py PASSED [1.7306s] [ 72%] 2025-12-04T11:14:00.3208769Z inductor/test_compile_subprocess.py::GPUTests::test_profiler_mark_wrapper_call_cuda <- test/inductor/test_torchinductor.py PASSED [0.2058s] [ 73%] 2025-12-04T11:14:00.3209882Z inductor/test_compile_subprocess.py::GPUTests::test_randint_cuda <- test/inductor/test_torchinductor.py W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3210359Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3211413Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3211835Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3212779Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3213446Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3214311Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3214823Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3215716Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3216365Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3217224Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3217707Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3218669Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3219280Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3220193Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3220791Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3221759Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3222350Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3223158Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3223881Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3224737Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3225457Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3226374Z W1204 11:12:30.499000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T11:14:00.3226540Z PASSED [0.9668s] [ 73%] 2025-12-04T11:14:00.3227526Z inductor/test_compile_subprocess.py::GPUTests::test_randn_with_dtype_and_device_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (only support cpu randn_with_dtype_and_device test) [ 74%] 2025-12-04T11:14:00.3228165Z inductor/test_compile_subprocess.py::GPUTests::test_reduction1_cuda <- test/inductor/test_torchinductor.py PASSED [0.5851s] [ 74%] 2025-12-04T11:14:00.3228817Z inductor/test_compile_subprocess.py::GPUTests::test_reduction3_cuda <- test/inductor/test_torchinductor.py PASSED [0.5426s] [ 74%] 2025-12-04T11:14:00.3229426Z inductor/test_compile_subprocess.py::GPUTests::test_reduction4_cuda <- test/inductor/test_torchinductor.py PASSED [0.9001s] [ 75%] 2025-12-04T11:14:00.3230118Z inductor/test_compile_subprocess.py::GPUTests::test_reinterpret_dtypeview_cuda <- test/inductor/test_torchinductor.py PASSED [0.3546s] [ 75%] 2025-12-04T11:14:00.3230699Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_clone_cuda <- test/inductor/test_torchinductor.py PASSED [0.5442s] [ 75%] 2025-12-04T11:14:00.3231353Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_copy_cuda <- test/inductor/test_torchinductor.py PASSED [1.0463s] [ 76%] 2025-12-04T11:14:00.3233482Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice1_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0008s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151381 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 76%] 2025-12-04T11:14:00.3235522Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_view_default_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0008s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151511 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 77%] 2025-12-04T11:14:00.3237602Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_view_dtype_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0008s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151541 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 77%] 2025-12-04T11:14:00.3238799Z inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_2_cuda <- test/inductor/test_torchinductor.py W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3239345Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3240367Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3240837Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3241718Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3242390Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3243196Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3243750Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3244686Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3245280Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3246222Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3246714Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3247628Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3248211Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3249102Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3249667Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3250499Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3251134Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3252005Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3252716Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3253571Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3254287Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3255156Z W1204 11:12:35.454000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3255336Z PASSED [0.2858s] [ 77%] 2025-12-04T11:14:00.3256688Z inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_cuda <- test/inductor/test_torchinductor.py W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3257188Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3258268Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3258691Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3259641Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3260335Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3261185Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3261715Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3262589Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3263231Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3264078Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3264665Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3265520Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3266075Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3266964Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3267529Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3268465Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3269089Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3269965Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3270642Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3271494Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3272193Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3273098Z W1204 11:12:35.754000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3273319Z PASSED [0.2896s] [ 78%] 2025-12-04T11:14:00.3274555Z inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_Tensor_decomp_int64_nd_2_cuda <- test/inductor/test_torchinductor.py W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3275105Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3276113Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3276521Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3288149Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3288874Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3289691Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3290140Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3290984Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3291547Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3292389Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3292836Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3293659Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3294317Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3295142Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3295665Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3296469Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3297043Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3297839Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3298515Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3299332Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3299971Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3300865Z W1204 11:12:36.043000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3301024Z PASSED [0.2777s] [ 78%] 2025-12-04T11:14:00.3301643Z inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_cuda <- test/inductor/test_torchinductor.py PASSED [0.7657s] [ 79%] 2025-12-04T11:14:00.3302963Z inductor/test_compile_subprocess.py::GPUTests::test_replication_pad_errors_with_bool_cuda <- test/inductor/test_torchinductor.py E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] failed while attempting to run meta for aten.replication_pad1d.default 2025-12-04T11:14:00.3303443Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] Traceback (most recent call last): 2025-12-04T11:14:00.3304336Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T11:14:00.3304772Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] r = func(*args, **kwargs) 2025-12-04T11:14:00.3305524Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T11:14:00.3305990Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] return self._op(*args, **kwargs) 2025-12-04T11:14:00.3306817Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T11:14:00.3307259Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] result = fn(*args, **kwargs) 2025-12-04T11:14:00.3308186Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 1867, in meta_replication_pad1d 2025-12-04T11:14:00.3308604Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] torch._check( 2025-12-04T11:14:00.3309358Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T11:14:00.3310041Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T11:14:00.3310817Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T11:14:00.3311310Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] raise error_type(message_evaluated) 2025-12-04T11:14:00.3311925Z E1204 11:12:37.744000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/0] RuntimeError: "replication_pad1d" not implemented for 'torch.bool' 2025-12-04T11:14:00.3312565Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] failed while attempting to run meta for aten.replication_pad2d.default 2025-12-04T11:14:00.3313027Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] Traceback (most recent call last): 2025-12-04T11:14:00.3313907Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T11:14:00.3314354Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] r = func(*args, **kwargs) 2025-12-04T11:14:00.3315145Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T11:14:00.3315628Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] return self._op(*args, **kwargs) 2025-12-04T11:14:00.3316470Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T11:14:00.3316914Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] result = fn(*args, **kwargs) 2025-12-04T11:14:00.3317830Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 1978, in meta_replication_pad2d 2025-12-04T11:14:00.3318211Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] torch._check( 2025-12-04T11:14:00.3318977Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T11:14:00.3319647Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T11:14:00.3320436Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T11:14:00.3320907Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] raise error_type(message_evaluated) 2025-12-04T11:14:00.3321517Z E1204 11:12:37.905000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/1] RuntimeError: "replication_pad2d" not implemented for 'torch.bool' 2025-12-04T11:14:00.3322186Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] failed while attempting to run meta for aten.replication_pad3d.default 2025-12-04T11:14:00.3322647Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] Traceback (most recent call last): 2025-12-04T11:14:00.3323547Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T11:14:00.3323977Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] r = func(*args, **kwargs) 2025-12-04T11:14:00.3324710Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T11:14:00.3325184Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] return self._op(*args, **kwargs) 2025-12-04T11:14:00.3325999Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T11:14:00.3326456Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] result = fn(*args, **kwargs) 2025-12-04T11:14:00.3327359Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_meta_registrations.py", line 2104, in meta_replication_pad3d 2025-12-04T11:14:00.3327741Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] torch._check( 2025-12-04T11:14:00.3328495Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T11:14:00.3329224Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T11:14:00.3330014Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T11:14:00.3330539Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] raise error_type(message_evaluated) 2025-12-04T11:14:00.3331165Z E1204 11:12:38.063000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [1/2] RuntimeError: "replication_pad3d" not implemented for 'torch.bool' 2025-12-04T11:14:00.3331270Z PASSED [1.0556s] [ 79%] 2025-12-04T11:14:00.3332065Z inductor/test_compile_subprocess.py::GPUTests::test_round_correctness_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (need to debug tl.libdevice on A100/V100) [ 79%] 2025-12-04T11:14:00.3332675Z inductor/test_compile_subprocess.py::GPUTests::test_scalar_cpu_tensor_arg_cuda <- test/inductor/test_torchinductor.py PASSED [2.3173s] [ 80%] 2025-12-04T11:14:00.3333230Z inductor/test_compile_subprocess.py::GPUTests::test_scalar_input_cuda <- test/inductor/test_torchinductor.py PASSED [0.2629s] [ 80%] 2025-12-04T11:14:00.3333800Z inductor/test_compile_subprocess.py::GPUTests::test_scalar_output_cuda <- test/inductor/test_torchinductor.py PASSED [1.6945s] [ 80%] 2025-12-04T11:14:00.3334329Z inductor/test_compile_subprocess.py::GPUTests::test_scatter1_cuda <- test/inductor/test_torchinductor.py PASSED [0.7446s] [ 81%] 2025-12-04T11:14:00.3335083Z inductor/test_compile_subprocess.py::GPUTests::test_scatter_bf16_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0035s] (uses bfloat16 which requires SM >= 80) [ 81%] 2025-12-04T11:14:00.3335994Z inductor/test_compile_subprocess.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Does not support SDPA or pre-SM80 hardware) [ 82%] 2025-12-04T11:14:00.3336645Z inductor/test_compile_subprocess.py::GPUTests::test_searchsorted_broadcast_cuda <- test/inductor/test_torchinductor.py PASSED [0.4526s] [ 82%] 2025-12-04T11:14:00.3337221Z inductor/test_compile_subprocess.py::GPUTests::test_searchsorted_cuda <- test/inductor/test_torchinductor.py PASSED [12.9755s] [ 82%] 2025-12-04T11:14:00.3337839Z inductor/test_compile_subprocess.py::GPUTests::test_setitem_with_int_parameter_cuda <- test/inductor/test_torchinductor.py PASSED [0.6533s] [ 83%] 2025-12-04T11:14:00.3339025Z inductor/test_compile_subprocess.py::GPUTests::test_shape_prop_torch_ones_cuda <- test/inductor/test_torchinductor.py W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3339487Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3340404Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3340783Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3341625Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3342223Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3343012Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3343512Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3344285Z W1204 11:12:57.477000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] AttributeError: Can't pickle local object 'CommonTemplate.test_shape_prop_torch_ones..Model' 2025-12-04T11:14:00.3344827Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3345298Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3346188Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3346581Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3347430Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3348021Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3348816Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3349217Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3349996Z W1204 11:12:57.793000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] AttributeError: Can't pickle local object 'CommonTemplate.test_shape_prop_torch_ones..Model' 2025-12-04T11:14:00.3350134Z PASSED [0.8372s] [ 83%] 2025-12-04T11:14:00.3350759Z inductor/test_compile_subprocess.py::GPUTests::test_should_pad_bench_for_bmm_cuda <- test/inductor/test_torchinductor.py PASSED [0.0041s] [ 83%] 2025-12-04T11:14:00.3351289Z inductor/test_compile_subprocess.py::GPUTests::test_sigmoid_cuda <- test/inductor/test_torchinductor.py PASSED [0.4584s] [ 84%] 2025-12-04T11:14:00.3351853Z inductor/test_compile_subprocess.py::GPUTests::test_simplify_loops_cuda <- test/inductor/test_torchinductor.py PASSED [0.5934s] [ 84%] 2025-12-04T11:14:00.3352408Z inductor/test_compile_subprocess.py::GPUTests::test_single_elem_cuda <- test/inductor/test_torchinductor.py PASSED [0.3540s] [ 85%] 2025-12-04T11:14:00.3353078Z inductor/test_compile_subprocess.py::GPUTests::test_size_asserts_for_multi_output_fallback_cuda <- test/inductor/test_torchinductor.py PASSED [0.1322s] [ 85%] 2025-12-04T11:14:00.3353616Z inductor/test_compile_subprocess.py::GPUTests::test_slice3_cuda <- test/inductor/test_torchinductor.py PASSED [0.5996s] [ 85%] 2025-12-04T11:14:00.3354143Z inductor/test_compile_subprocess.py::GPUTests::test_slice4_cuda <- test/inductor/test_torchinductor.py PASSED [0.2953s] [ 86%] 2025-12-04T11:14:00.3354708Z inductor/test_compile_subprocess.py::GPUTests::test_slice_mutation3_cuda <- test/inductor/test_torchinductor.py PASSED [0.2171s] [ 86%] 2025-12-04T11:14:00.3355281Z inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter2_cuda <- test/inductor/test_torchinductor.py PASSED [0.5341s] [ 87%] 2025-12-04T11:14:00.3355840Z inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter4_cuda <- test/inductor/test_torchinductor.py PASSED [0.4639s] [ 87%] 2025-12-04T11:14:00.3356963Z inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter_reinplace_cuda <- test/inductor/test_torchinductor.py W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3357453Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3358390Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3358799Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3359642Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3360235Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3361023Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3361442Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3362200Z W1204 11:13:01.954000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] AttributeError: Can't pickle local object 'CommonTemplate.test_slice_scatter_reinplace..M' 2025-12-04T11:14:00.3362663Z W1204 11:13:01.986000 90422 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T11:14:00.3362779Z PASSED [0.5006s] [ 87%] 2025-12-04T11:14:00.3363399Z inductor/test_compile_subprocess.py::GPUTests::test_slice_view_with_graph_break_cuda <- test/inductor/test_torchinductor.py PASSED [0.4597s] [ 88%] 2025-12-04T11:14:00.3364028Z inductor/test_compile_subprocess.py::GPUTests::test_softmax_one_kernel_persist_cuda <- test/inductor/test_torchinductor.py PASSED [0.4805s] [ 88%] 2025-12-04T11:14:00.3365064Z inductor/test_compile_subprocess.py::GPUTests::test_sort_bool_cuda <- test/inductor/test_torchinductor.py W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3365572Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3366463Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3366839Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3367692Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3368274Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3369077Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3369483Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3370320Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3370883Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3371764Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3372225Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3373027Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3373596Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3374395Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3374916Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3375735Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3376287Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3377095Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3377666Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3378565Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3379236Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3380148Z W1204 11:13:03.245000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3380669Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3381125Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3382026Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3382408Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3383260Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3383839Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3384630Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3385043Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3385949Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3386520Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3387377Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3388506Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3389338Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3389865Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3390685Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3391208Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3392020Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3392574Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3393373Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3394034Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3394845Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3395486Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3396396Z W1204 11:13:10.938000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3396518Z PASSED [15.2358s] [ 88%] 2025-12-04T11:14:00.3397082Z inductor/test_compile_subprocess.py::GPUTests::test_split_cumsum_cuda <- test/inductor/test_torchinductor.py PASSED [5.2971s] [ 89%] 2025-12-04T11:14:00.3397669Z inductor/test_compile_subprocess.py::GPUTests::test_split_cumsum_index_cuda <- test/inductor/test_torchinductor.py PASSED [1.8340s] [ 89%] 2025-12-04T11:14:00.3398852Z inductor/test_compile_subprocess.py::GPUTests::test_split_failed_cuda <- test/inductor/test_torchinductor.py E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] failed while attempting to run meta for aten.split_with_sizes.default 2025-12-04T11:14:00.3399313Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3400215Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T11:14:00.3400689Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] r = func(*args, **kwargs) 2025-12-04T11:14:00.3401469Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T11:14:00.3401990Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] return self._op(*args, **kwargs) 2025-12-04T11:14:00.3402824Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 4359, in split_with_sizes 2025-12-04T11:14:00.3403243Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] torch._check_with( 2025-12-04T11:14:00.3404022Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T11:14:00.3404524Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] raise error_type(message_evaluated) 2025-12-04T11:14:00.3405127Z E1204 11:13:25.562000 90422 site-packages/torch/_subclasses/fake_tensor.py:2827] [0/0] ValueError: Split sizes add up to 4 but got the tensor's size of 5 2025-12-04T11:14:00.3405235Z PASSED [0.0179s] [ 90%] 2025-12-04T11:14:00.3405896Z inductor/test_compile_subprocess.py::GPUTests::test_split_reduction_dynamic_shape_cuda <- test/inductor/test_torchinductor.py PASSED [0.5248s] [ 90%] 2025-12-04T11:14:00.3407070Z inductor/test_compile_subprocess.py::GPUTests::test_split_with_sizes_with_unbacked_symints_cuda <- test/inductor/test_torchinductor.py W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3407549Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] Traceback (most recent call last): 2025-12-04T11:14:00.3408470Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3408852Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] ).serialize() 2025-12-04T11:14:00.3409707Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3410284Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3411090Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3411499Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] pickler.dump(obj) 2025-12-04T11:14:00.3412352Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3412908Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3413735Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3414193Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3415055Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3415593Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3416424Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3416960Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3417756Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3418392Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3419202Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3419779Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3420603Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3421229Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3422077Z W1204 11:13:26.487000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3422293Z PASSED [1.0754s] [ 90%] 2025-12-04T11:14:00.3422825Z inductor/test_compile_subprocess.py::GPUTests::test_sum_int_cuda <- test/inductor/test_torchinductor.py PASSED [0.7043s] [ 91%] 2025-12-04T11:14:00.3423372Z inductor/test_compile_subprocess.py::GPUTests::test_tensor1_cuda <- test/inductor/test_torchinductor.py PASSED [0.4251s] [ 91%] 2025-12-04T11:14:00.3423897Z inductor/test_compile_subprocess.py::GPUTests::test_tensor3_cuda <- test/inductor/test_torchinductor.py PASSED [0.5640s] [ 91%] 2025-12-04T11:14:00.3424502Z inductor/test_compile_subprocess.py::GPUTests::test_tmp_not_defined_issue2_cuda <- test/inductor/test_torchinductor.py PASSED [0.7996s] [ 92%] 2025-12-04T11:14:00.3425012Z inductor/test_compile_subprocess.py::GPUTests::test_topk_cuda <- test/inductor/test_torchinductor.py PASSED [0.2757s] [ 92%] 2025-12-04T11:14:00.3425572Z inductor/test_compile_subprocess.py::GPUTests::test_transpose_add_cuda <- test/inductor/test_torchinductor.py PASSED [0.4240s] [ 93%] 2025-12-04T11:14:00.3426128Z inductor/test_compile_subprocess.py::GPUTests::test_transpose_cuda <- test/inductor/test_torchinductor.py PASSED [0.8775s] [ 93%] 2025-12-04T11:14:00.3426738Z inductor/test_compile_subprocess.py::GPUTests::test_transposed_propagates_cuda <- test/inductor/test_torchinductor.py PASSED [0.2071s] [ 93%] 2025-12-04T11:14:00.3427838Z inductor/test_compile_subprocess.py::GPUTests::test_unbacked_float_item_cuda <- test/inductor/test_torchinductor.py W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3428301Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3429222Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3429647Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3430489Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3431110Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3431898Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3432310Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3433162Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3433709Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3434550Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3434992Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3435807Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3436366Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3437182Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3437705Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3438506Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3439073Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3439868Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3440462Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3441278Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3441916Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3442831Z W1204 11:13:31.865000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3442983Z PASSED [0.6138s] [ 94%] 2025-12-04T11:14:00.3443653Z inductor/test_compile_subprocess.py::GPUTests::test_unsigned_constant_tensors_cuda <- test/inductor/test_torchinductor.py PASSED [1.1067s] [ 94%] 2025-12-04T11:14:00.3444250Z inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_float32_cuda <- test/inductor/test_torchinductor.py PASSED [0.5776s] [ 95%] 2025-12-04T11:14:00.3444881Z inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [0.5155s] [ 95%] 2025-12-04T11:14:00.3445456Z inductor/test_compile_subprocess.py::GPUTests::test_unsqueeze_inplace_cuda <- test/inductor/test_torchinductor.py PASSED [0.4716s] [ 95%] 2025-12-04T11:14:00.3446556Z inductor/test_compile_subprocess.py::GPUTests::test_upsample_bilinear2d_a_cuda <- test/inductor/test_torchinductor.py W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3447034Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3447924Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3448320Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3449160Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3449752Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3450541Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3450977Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3451830Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3452382Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3453216Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3453662Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3454467Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3454999Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3455806Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3456342Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3457176Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3457778Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3458646Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3459263Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3460091Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3460717Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3461571Z W1204 11:13:35.054000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3462079Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3462553Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3463447Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3463827Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3464680Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3465290Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3466096Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3466496Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3467345Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3467898Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3468731Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3469191Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3469991Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3470523Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3471355Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3471925Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3472724Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3473310Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3474115Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3474686Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3475511Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3476136Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3477070Z W1204 11:13:36.300000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3477174Z PASSED [2.5334s] [ 96%] 2025-12-04T11:14:00.3478267Z inductor/test_compile_subprocess.py::GPUTests::test_upsample_nearest2d_cuda <- test/inductor/test_torchinductor.py W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3478780Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3479666Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3480059Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3480897Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3481472Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3482276Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3482681Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3483533Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3484089Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3484925Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3485408Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3486241Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3486781Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3487614Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3488410Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3489220Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3489798Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3490590Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3491164Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3491989Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3492612Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3493539Z W1204 11:13:37.546000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3494044Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3494501Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3495401Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3495778Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3496634Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3497213Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3498015Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3498492Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3499328Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3499945Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3500814Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3501317Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3502120Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3502655Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3503459Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3503984Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3504795Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3505358Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3506165Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3506735Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3507605Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3508230Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3509143Z W1204 11:13:38.317000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3509260Z PASSED [1.5049s] [ 96%] 2025-12-04T11:14:00.3510362Z inductor/test_compile_subprocess.py::GPUTests::test_vectorized_ops_masked_cuda <- test/inductor/test_torchinductor.py W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3510842Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3511737Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3512115Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3512968Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3513545Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3514380Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3514818Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3515669Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3516249Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3517072Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3517529Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3518339Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3518872Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3519674Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3520206Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3521010Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3521602Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3522408Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3522980Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3523805Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3524432Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3525283Z W1204 11:13:38.893000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3525804Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3526261Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3527160Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3527539Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3528450Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3529034Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3529865Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3530271Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3531108Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3531676Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3532504Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3532965Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3533771Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3534292Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3535115Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3535669Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3536481Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3537039Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3537850Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3538492Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3539314Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3539956Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3540790Z W1204 11:13:39.185000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3540910Z PASSED [0.5748s] [ 96%] 2025-12-04T11:14:00.3542048Z inductor/test_compile_subprocess.py::GPUTests::test_vectorized_ops_masked_var_novec_cuda <- test/inductor/test_torchinductor.py W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3542567Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3543490Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3543914Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3544767Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3545348Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3546151Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3546559Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3547393Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3547962Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3548789Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3549243Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3550082Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3550618Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3551416Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3551940Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3552749Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3553308Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3554113Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3554688Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3555509Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3556134Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3557025Z W1204 11:13:39.468000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3557546Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3558038Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3558939Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3559317Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3560172Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3560751Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3561545Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3561958Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3562797Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3563369Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3564227Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3564687Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3565486Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3566004Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3566816Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3567338Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3568148Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3568705Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3569496Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3570082Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3570961Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3571603Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3572460Z W1204 11:13:39.672000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T11:14:00.3572577Z PASSED [0.4079s] [ 97%] 2025-12-04T11:14:00.3573134Z inductor/test_compile_subprocess.py::GPUTests::test_view_as_real_cuda <- test/inductor/test_torchinductor.py PASSED [0.2597s] [ 97%] 2025-12-04T11:14:00.3573680Z inductor/test_compile_subprocess.py::GPUTests::test_view_detach_cuda <- test/inductor/test_torchinductor.py PASSED [0.2602s] [ 98%] 2025-12-04T11:14:00.3574385Z inductor/test_compile_subprocess.py::GPUTests::test_view_uint8_through_differing_bitwidths_cuda <- test/inductor/test_torchinductor.py PASSED [1.0896s] [ 98%] 2025-12-04T11:14:00.3574910Z inductor/test_compile_subprocess.py::GPUTests::test_views1_cuda <- test/inductor/test_torchinductor.py PASSED [10.4805s] [ 98%] 2025-12-04T11:14:00.3575446Z inductor/test_compile_subprocess.py::GPUTests::test_views3_cuda <- test/inductor/test_torchinductor.py PASSED [0.8466s] [ 99%] 2025-12-04T11:14:00.3576467Z inductor/test_compile_subprocess.py::GPUTests::test_views7_cuda <- test/inductor/test_torchinductor.py W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T11:14:00.3576926Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T11:14:00.3577832Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T11:14:00.3578305Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T11:14:00.3579163Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T11:14:00.3579743Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T11:14:00.3580540Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T11:14:00.3580943Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T11:14:00.3581787Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T11:14:00.3582353Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T11:14:00.3583181Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T11:14:00.3583637Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T11:14:00.3584436Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T11:14:00.3585040Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T11:14:00.3585842Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T11:14:00.3586395Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T11:14:00.3587209Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T11:14:00.3588032Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T11:14:00.3588889Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T11:14:00.3589469Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T11:14:00.3590300Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T11:14:00.3590929Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T11:14:00.3591840Z W1204 11:13:53.290000 90422 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T11:14:00.3592033Z PASSED [0.6702s] [ 99%] 2025-12-04T11:14:00.3592562Z inductor/test_compile_subprocess.py::GPUTests::test_zeros_cuda <- test/inductor/test_torchinductor.py PASSED [1.0597s] [100%] 2025-12-04T11:14:00.3592572Z 2025-12-04T11:14:00.3593423Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-696ac340ce65f280.xml - 2025-12-04T11:14:00.3593657Z ========== 234 passed, 28 skipped, 17 deselected in 266.88s (0:04:26) ========== 2025-12-04T11:14:00.3594199Z The following tests failed consistently: ['test/inductor/test_compile_subprocess.py::GPUTests::test_as_strided_on_views_cuda'] 2025-12-04T11:14:00.3594205Z 2025-12-04T11:14:00.3594842Z FINISHED PRINTING LOG FILE of inductor/test_compile_subprocess 1/3 (test/test-reports/inductor.test_compile_subprocess_1.3_2d3d7d60d67c3c74_.log) 2025-12-04T11:14:00.3594848Z 2025-12-04T11:14:00.3595239Z Finished inductor/test_compile_subprocess 1/3 ... [2025-12-04 11:14:00.049079][7244.170295519], took 6.76min 2025-12-04T11:14:00.3596141Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-5bc7c9553ad37db8.xml 2025-12-04T11:14:00.3597068Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-c06a9a1a186bb263.xml 2025-12-04T11:14:00.3597958Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-3aa7e69ca6b255e3.xml 2025-12-04T11:14:00.3598837Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-696ac340ce65f280.xml 2025-12-04T11:14:00.6913690Z Uploading logs for 57119749259 to S3 2025-12-04T11:14:00.8026788Z Uploading artifacts took 0.51 seconds 2025-12-04T11:14:00.8027685Z inductor/test_compile_subprocess 1/3 failed! 2025-12-04T11:14:00.8032157Z Running inductor/test_deterministic 3/8 ... [2025-12-04 11:14:00.802980][7244.924199193] 2025-12-04T11:14:00.8032865Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T11:14:00.8037079Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_deterministic.py', '--shard-id=3', '--num-shards=8', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:14:00.803424] 2025-12-04T11:16:22.7667402Z 2025-12-04T11:16:22.7668497Z inductor/test_deterministic 3/8 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_deterministic_3.8_b560ed3aac269ff0_.log 2025-12-04T11:16:22.7673429Z Running 5 items in this shard: test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_BertForMaskedLM_training_or_inference_inference_precision_float32, test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_BertForMaskedLM_training_or_inference_training_precision_float16, test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_DistillGPT2_training_or_inference_inference_precision_amp, test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_DistillGPT2_training_or_inference_training_precision_float32, test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_GoogleFnet_training_or_inference_training_precision_bfloat16 2025-12-04T11:16:22.7677577Z 2025-12-04T11:16:22.7677954Z Finished inductor/test_deterministic 3/8 ... [2025-12-04 11:16:22.766537][7386.887757553], took 2.37min 2025-12-04T11:16:22.7759910Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_deterministic/inductor.test_deterministic-b80250c7092ee9c7.xml 2025-12-04T11:16:22.8688984Z Running inductor/test_deterministic 8/8 ... [2025-12-04 11:16:22.868614][7386.989834052] 2025-12-04T11:16:22.8690610Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T11:16:22.8692376Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_deterministic.py', '--shard-id=8', '--num-shards=8', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:16:22.868995] 2025-12-04T11:17:27.6748873Z 2025-12-04T11:17:27.6749674Z inductor/test_deterministic 8/8 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_deterministic_8.8_75991ffd80c13502_.log 2025-12-04T11:17:27.6754068Z Running 6 items in this shard: test/inductor/test_deterministic.py::DeterministicTest::test_max_autotune_deterministic_False, test/inductor/test_deterministic.py::DeterministicTest::test_mm_padding_deterministic_False, test/inductor/test_deterministic.py::DeterministicTest::test_pointwise_coordesc_tuning, test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_GoogleFnet_training_or_inference_inference_precision_bfloat16, test/inductor/test_deterministic.py::DeterministicTest::test_run2run_determinism_model_name_GoogleFnet_training_or_inference_training_precision_float32, test/inductor/test_deterministic.py::DeterministicTest::test_use_deterministic_algorithsm 2025-12-04T11:17:27.6757708Z 2025-12-04T11:17:27.6758084Z Finished inductor/test_deterministic 8/8 ... [2025-12-04 11:17:27.674689][7451.795908175], took 1.08min 2025-12-04T11:17:27.6842746Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_deterministic/inductor.test_deterministic-4c3a96ba74540dee.xml 2025-12-04T11:17:27.7894967Z Running inductor/test_op_dtype_prop 1/2 ... [2025-12-04 11:17:27.789253][7451.910472607] 2025-12-04T11:17:27.7895564Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T11:17:27.7899461Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_op_dtype_prop.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:17:27.789643] 2025-12-04T12:05:01.2896695Z 2025-12-04T12:05:01.2898232Z PRINTING LOG FILE of inductor/test_op_dtype_prop 1/2 (test/test-reports/inductor.test_op_dtype_prop_1.2_88d9121bb179f5e4_.log) 2025-12-04T12:05:01.2901817Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-826c2562e296e7cd.xml 2025-12-04T12:05:01.2903317Z ============================= test session starts ============================== 2025-12-04T12:05:01.2904440Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.2905403Z cachedir: .pytest_cache 2025-12-04T12:05:01.2906513Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.2907936Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.2908531Z configfile: pytest.ini 2025-12-04T12:05:01.2909937Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.2911228Z collecting ... collected 571 items 2025-12-04T12:05:01.2911791Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T12:05:01.3127556Z Running 295 items in this shard: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erf_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_hypot_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sqrt_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_True_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_trunc_load_upcast_to_fp32_False_float16_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_or_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_right_shift_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_right_shift_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_xor_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_frexp_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_frexp_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gcd_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_hypot_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_igamma_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_nextafter_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_0_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_3_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_neg_3_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sub_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_bool, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_float64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_float32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_int32, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_int64, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_support_cuda, test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.3276165Z 2025-12-04T12:05:01.3276891Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9351s] [ 0%] 2025-12-04T12:05:01.3278396Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2760s] [ 0%] 2025-12-04T12:05:01.3279806Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2684s] [ 0%] 2025-12-04T12:05:01.3280530Z 2025-12-04T12:05:01.3280684Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3281341Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3281965Z Traceback (most recent call last): 2025-12-04T12:05:01.3282694Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3283457Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3284286Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3285078Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3285513Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3285846Z 2025-12-04T12:05:01.3286062Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3287135Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3288319Z 2025-12-04T12:05:01.3288606Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3289238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3289725Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3290064Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3290406Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3290858Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3291323Z graph_break [] 2025-12-04T12:05:01.3291698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3292794Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3293779Z warnings.warn( 2025-12-04T12:05:01.3294309Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3294937Z Traceback (most recent call last): 2025-12-04T12:05:01.3295639Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3296412Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3297226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3297998Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3298514Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3298848Z 2025-12-04T12:05:01.3299065Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3300230Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3301081Z 2025-12-04T12:05:01.3301402Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3302034Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3302571Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3302907Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3303234Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3303697Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3304160Z graph_break [] 2025-12-04T12:05:01.3304521Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3305627Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3306602Z warnings.warn( 2025-12-04T12:05:01.3306973Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3307445Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3307779Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3308209Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3308709Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3309040Z graph_break [] 2025-12-04T12:05:01.3309419Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3310521Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3311485Z warnings.warn( 2025-12-04T12:05:01.3311800Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3312460Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3313096Z Traceback (most recent call last): 2025-12-04T12:05:01.3313850Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3314628Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3315449Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3316226Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3316675Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3317010Z 2025-12-04T12:05:01.3317226Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3318300Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3319150Z 2025-12-04T12:05:01.3319416Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3320050Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3320533Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3320868Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3321194Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3321657Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3322115Z graph_break [] 2025-12-04T12:05:01.3322472Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3323571Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3324539Z warnings.warn( 2025-12-04T12:05:01.3324918Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3325376Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3325712Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3326188Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3326697Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3327041Z graph_break [] 2025-12-04T12:05:01.3327409Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3328530Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3329500Z warnings.warn( 2025-12-04T12:05:01.3329881Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3330355Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3330675Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3331108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3331595Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3331928Z graph_break [] 2025-12-04T12:05:01.3332300Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3333402Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3334374Z warnings.warn( 2025-12-04T12:05:01.3335284Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-826c2562e296e7cd.xml - 2025-12-04T12:05:01.3336348Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3337497Z FAILED [0.2684s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3338527Z 2025-12-04T12:05:01.3338763Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3339839Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3340753Z 2025-12-04T12:05:01.3341018Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3341611Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3342192Z ========================== 1 failed, 2 rerun in 5.18s ========================== 2025-12-04T12:05:01.3342590Z Got exit code 1 2025-12-04T12:05:01.3342860Z Retrying single test... 2025-12-04T12:05:01.3343629Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-738ee4f0dcb0d1ea.xml 2025-12-04T12:05:01.3344495Z ============================= test session starts ============================== 2025-12-04T12:05:01.3345158Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3345760Z cachedir: .pytest_cache 2025-12-04T12:05:01.3346484Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3347265Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3347615Z configfile: pytest.ini 2025-12-04T12:05:01.3348397Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3349347Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.3350512Z stepcurrent: skipping 0 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3351556Z Running 1 items in this shard 2025-12-04T12:05:01.3351765Z 2025-12-04T12:05:01.3352509Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9343s] [100%] 2025-12-04T12:05:01.3354034Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2779s] [100%] 2025-12-04T12:05:01.3355435Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2703s] [100%] 2025-12-04T12:05:01.3356227Z 2025-12-04T12:05:01.3356370Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3357019Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3357631Z Traceback (most recent call last): 2025-12-04T12:05:01.3358346Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3359120Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3359944Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3360724Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3361168Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3361488Z 2025-12-04T12:05:01.3361716Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3362782Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3363749Z 2025-12-04T12:05:01.3364017Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3364648Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3365128Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3365457Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3365799Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3366307Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3366771Z graph_break [] 2025-12-04T12:05:01.3367132Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3368238Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3369218Z warnings.warn( 2025-12-04T12:05:01.3369730Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3370357Z Traceback (most recent call last): 2025-12-04T12:05:01.3371072Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3371845Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3372641Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3373429Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3373879Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3374198Z 2025-12-04T12:05:01.3374413Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3375486Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3376348Z 2025-12-04T12:05:01.3376617Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3377250Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3377716Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3378056Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3378465Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3378933Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3379429Z graph_break [] 2025-12-04T12:05:01.3379839Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3380947Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3381951Z warnings.warn( 2025-12-04T12:05:01.3382339Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3382814Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3383154Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3383575Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3384064Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3384406Z graph_break [] 2025-12-04T12:05:01.3384764Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3385861Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3386844Z warnings.warn( 2025-12-04T12:05:01.3387141Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3388250Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3388889Z Traceback (most recent call last): 2025-12-04T12:05:01.3389611Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3390368Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3391180Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3391966Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3392398Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3392813Z 2025-12-04T12:05:01.3393031Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3394105Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3394958Z 2025-12-04T12:05:01.3395237Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3395856Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3396331Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3396666Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3397009Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3397463Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3397925Z graph_break [] 2025-12-04T12:05:01.3398298Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3399388Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3400373Z warnings.warn( 2025-12-04T12:05:01.3400753Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3401226Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3401549Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3401983Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3402469Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3402795Z graph_break [] 2025-12-04T12:05:01.3403167Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3404263Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3405233Z warnings.warn( 2025-12-04T12:05:01.3405653Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3406173Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3406514Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3406934Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3407469Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3407812Z graph_break [] 2025-12-04T12:05:01.3408177Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3409275Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3410244Z warnings.warn( 2025-12-04T12:05:01.3411168Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-738ee4f0dcb0d1ea.xml - 2025-12-04T12:05:01.3412227Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3413379Z FAILED [0.2703s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3414340Z 2025-12-04T12:05:01.3414556Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3415629Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3416472Z 2025-12-04T12:05:01.3416738Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3417326Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3417854Z ================== 1 failed, 294 deselected, 2 rerun in 5.18s ================== 2025-12-04T12:05:01.3418378Z Got exit code 1 2025-12-04T12:05:01.3418634Z Retrying single test... 2025-12-04T12:05:01.3419444Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a05156439a0d1496.xml 2025-12-04T12:05:01.3420318Z ============================= test session starts ============================== 2025-12-04T12:05:01.3420979Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3421581Z cachedir: .pytest_cache 2025-12-04T12:05:01.3422292Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3423087Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3423427Z configfile: pytest.ini 2025-12-04T12:05:01.3424212Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3425174Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.3426331Z stepcurrent: skipping 0 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3427383Z Running 1 items in this shard 2025-12-04T12:05:01.3427605Z 2025-12-04T12:05:01.3428288Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9255s] [100%] 2025-12-04T12:05:01.3429779Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2739s] [100%] 2025-12-04T12:05:01.3431193Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2712s] [100%] 2025-12-04T12:05:01.3431914Z 2025-12-04T12:05:01.3432056Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3432740Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3433402Z Traceback (most recent call last): 2025-12-04T12:05:01.3434122Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3434922Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3435739Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3436529Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3436962Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3437295Z 2025-12-04T12:05:01.3437512Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3438585Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3439440Z 2025-12-04T12:05:01.3439734Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3440354Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3440837Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3441183Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3441512Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3441977Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3442448Z graph_break [] 2025-12-04T12:05:01.3442923Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3444084Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3445207Z warnings.warn( 2025-12-04T12:05:01.3445778Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3446465Z Traceback (most recent call last): 2025-12-04T12:05:01.3447192Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3447972Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3448792Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3449570Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3450025Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3450346Z 2025-12-04T12:05:01.3450576Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3451651Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3452500Z 2025-12-04T12:05:01.3452768Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3453410Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3453892Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3454217Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3454554Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3455016Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3455478Z graph_break [] 2025-12-04T12:05:01.3455840Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3456936Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3457912Z warnings.warn( 2025-12-04T12:05:01.3458339Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3458818Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3459200Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3459692Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3460165Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3460507Z graph_break [] 2025-12-04T12:05:01.3460880Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3461999Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3462971Z warnings.warn( 2025-12-04T12:05:01.3463283Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3463938Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3464552Z Traceback (most recent call last): 2025-12-04T12:05:01.3465276Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3466055Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3466858Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3467667Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3468175Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3468539Z 2025-12-04T12:05:01.3468784Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3469919Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3470915Z 2025-12-04T12:05:01.3471185Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3471879Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3472470Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3472796Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3473172Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3473667Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3474181Z graph_break [] 2025-12-04T12:05:01.3474555Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3475788Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3476821Z warnings.warn( 2025-12-04T12:05:01.3477266Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3477743Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3478143Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3478602Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3479125Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3479464Z graph_break [] 2025-12-04T12:05:01.3479827Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3481063Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3482110Z warnings.warn( 2025-12-04T12:05:01.3482559Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3483061Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3483416Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3483910Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3484383Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3484784Z graph_break [] 2025-12-04T12:05:01.3485153Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3486457Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3487508Z warnings.warn( 2025-12-04T12:05:01.3488638Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a05156439a0d1496.xml - 2025-12-04T12:05:01.3489782Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3490912Z FAILED [0.2712s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3491879Z 2025-12-04T12:05:01.3492096Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3493170Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3494023Z 2025-12-04T12:05:01.3494307Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3494888Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3495413Z ================== 1 failed, 294 deselected, 2 rerun in 5.17s ================== 2025-12-04T12:05:01.3495861Z Got exit code 1 2025-12-04T12:05:01.3496659Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3497889Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.3499160Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a884406c7304f17c.xml 2025-12-04T12:05:01.3540805Z ============================= test session starts ============================== 2025-12-04T12:05:01.3541794Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3542422Z cachedir: .pytest_cache 2025-12-04T12:05:01.3543230Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3544033Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3544379Z configfile: pytest.ini 2025-12-04T12:05:01.3545166Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3546128Z collecting ... collected 571 items / 1 deselected / 570 selected 2025-12-04T12:05:01.3546622Z stepcurrent: skipping 1 already run items. 2025-12-04T12:05:01.3547012Z Running 294 items in this shard 2025-12-04T12:05:01.3547238Z 2025-12-04T12:05:01.3550980Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9260s] [ 0%] 2025-12-04T12:05:01.3552473Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2774s] [ 0%] 2025-12-04T12:05:01.3553874Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2725s] [ 0%] 2025-12-04T12:05:01.3554593Z 2025-12-04T12:05:01.3554737Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3555388Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3556018Z Traceback (most recent call last): 2025-12-04T12:05:01.3556744Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3557507Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3558414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3559266Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3559706Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3560036Z 2025-12-04T12:05:01.3560293Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3561357Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3562203Z 2025-12-04T12:05:01.3562484Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3563104Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3563582Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3563931Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3564260Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3564730Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3565193Z graph_break [] 2025-12-04T12:05:01.3565570Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3566663Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3567641Z warnings.warn( 2025-12-04T12:05:01.3568158Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3568768Z Traceback (most recent call last): 2025-12-04T12:05:01.3569488Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3570265Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3571083Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3571923Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3572376Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3572697Z 2025-12-04T12:05:01.3572928Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3573987Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3574846Z 2025-12-04T12:05:01.3575114Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3575753Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3576228Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3576554Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3576892Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3577351Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3577809Z graph_break [] 2025-12-04T12:05:01.3578269Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3579383Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3580361Z warnings.warn( 2025-12-04T12:05:01.3580731Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3581208Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3581541Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3581964Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3582449Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3582788Z graph_break [] 2025-12-04T12:05:01.3583157Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3584311Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3585290Z warnings.warn( 2025-12-04T12:05:01.3585601Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3586270Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3586891Z Traceback (most recent call last): 2025-12-04T12:05:01.3587607Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3588602Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3589406Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3590190Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3590635Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3590958Z 2025-12-04T12:05:01.3591176Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3592249Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3593107Z 2025-12-04T12:05:01.3593376Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3594004Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3594468Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3594809Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3595148Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3595608Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3596058Z graph_break [] 2025-12-04T12:05:01.3596432Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3597532Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3598572Z warnings.warn( 2025-12-04T12:05:01.3598956Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3599434Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3599757Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3600195Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3600685Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3601031Z graph_break [] 2025-12-04T12:05:01.3601393Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3602488Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3603462Z warnings.warn( 2025-12-04T12:05:01.3603836Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3604316Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3604653Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3605090Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3605568Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3605907Z graph_break [] 2025-12-04T12:05:01.3606282Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3607363Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3608342Z warnings.warn( 2025-12-04T12:05:01.3609260Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a884406c7304f17c.xml - 2025-12-04T12:05:01.3610381Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3611569Z FAILED [0.2725s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3612536Z 2025-12-04T12:05:01.3612801Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3613869Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3614717Z 2025-12-04T12:05:01.3614996Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3615578Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3616099Z =================== 1 failed, 1 deselected, 2 rerun in 5.18s =================== 2025-12-04T12:05:01.3616544Z Got exit code 1 2025-12-04T12:05:01.3616819Z Retrying single test... 2025-12-04T12:05:01.3617575Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c03b63e1d9e14442.xml 2025-12-04T12:05:01.3618529Z ============================= test session starts ============================== 2025-12-04T12:05:01.3619198Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3619789Z cachedir: .pytest_cache 2025-12-04T12:05:01.3620509Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3621301Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3621655Z configfile: pytest.ini 2025-12-04T12:05:01.3622429Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3623397Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.3624602Z stepcurrent: skipping 1 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3625649Z Running 1 items in this shard 2025-12-04T12:05:01.3625865Z 2025-12-04T12:05:01.3626535Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9319s] [100%] 2025-12-04T12:05:01.3628020Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2791s] [100%] 2025-12-04T12:05:01.3629417Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2798s] [100%] 2025-12-04T12:05:01.3630134Z 2025-12-04T12:05:01.3630289Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3630929Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3631552Z Traceback (most recent call last): 2025-12-04T12:05:01.3632267Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3633048Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3633854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3634637Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3635083Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3635404Z 2025-12-04T12:05:01.3635619Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3636691Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3637584Z 2025-12-04T12:05:01.3637884Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3638524Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3638989Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3639357Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3639696Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3640146Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3640611Z graph_break [] 2025-12-04T12:05:01.3640985Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3642084Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3643044Z warnings.warn( 2025-12-04T12:05:01.3643567Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3644192Z Traceback (most recent call last): 2025-12-04T12:05:01.3644897Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3645676Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3646488Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3647271Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3647706Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3648037Z 2025-12-04T12:05:01.3648254Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3649318Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3650167Z 2025-12-04T12:05:01.3650449Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3651100Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3651575Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3651910Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3652237Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3652695Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3653154Z graph_break [] 2025-12-04T12:05:01.3653516Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3654610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3655583Z warnings.warn( 2025-12-04T12:05:01.3655965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3656427Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3656772Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3657213Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3657689Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3658030Z graph_break [] 2025-12-04T12:05:01.3658494Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3659595Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3660550Z warnings.warn( 2025-12-04T12:05:01.3660863Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3661512Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3662122Z Traceback (most recent call last): 2025-12-04T12:05:01.3662879Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3663660Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3664508Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3665284Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3665788Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3666108Z 2025-12-04T12:05:01.3666340Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3667420Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3668264Z 2025-12-04T12:05:01.3668537Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3669154Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3669629Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3669953Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3670285Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3670742Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3671201Z graph_break [] 2025-12-04T12:05:01.3671561Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3672654Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3673621Z warnings.warn( 2025-12-04T12:05:01.3673987Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3674453Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3674783Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3675212Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3675684Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3676054Z graph_break [] 2025-12-04T12:05:01.3676418Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3677498Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3678465Z warnings.warn( 2025-12-04T12:05:01.3678843Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3679310Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3679628Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3680058Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3680533Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3680853Z graph_break [] 2025-12-04T12:05:01.3681214Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3682305Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3683260Z warnings.warn( 2025-12-04T12:05:01.3684176Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c03b63e1d9e14442.xml - 2025-12-04T12:05:01.3685235Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3686370Z FAILED [0.2798s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3687310Z 2025-12-04T12:05:01.3687524Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3688770Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3689699Z 2025-12-04T12:05:01.3690005Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3690592Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3691104Z ================== 1 failed, 294 deselected, 2 rerun in 5.19s ================== 2025-12-04T12:05:01.3691597Z Got exit code 1 2025-12-04T12:05:01.3691862Z Retrying single test... 2025-12-04T12:05:01.3692625Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a7e12d722ef68d61.xml 2025-12-04T12:05:01.3693583Z ============================= test session starts ============================== 2025-12-04T12:05:01.3694309Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3694965Z cachedir: .pytest_cache 2025-12-04T12:05:01.3695722Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3696577Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3696924Z configfile: pytest.ini 2025-12-04T12:05:01.3697765Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3698909Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.3700140Z stepcurrent: skipping 1 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3701244Z Running 1 items in this shard 2025-12-04T12:05:01.3701453Z 2025-12-04T12:05:01.3702136Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9197s] [100%] 2025-12-04T12:05:01.3703600Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2741s] [100%] 2025-12-04T12:05:01.3705051Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2739s] [100%] 2025-12-04T12:05:01.3705783Z 2025-12-04T12:05:01.3705921Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3706568Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3707227Z Traceback (most recent call last): 2025-12-04T12:05:01.3708006Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3708827Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3709640Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3710414Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3710854Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3711169Z 2025-12-04T12:05:01.3711393Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3712440Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3713293Z 2025-12-04T12:05:01.3713557Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3714182Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3714655Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3714969Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3715301Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3715759Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3716199Z graph_break [] 2025-12-04T12:05:01.3716608Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3717737Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3718735Z warnings.warn( 2025-12-04T12:05:01.3719232Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3719847Z Traceback (most recent call last): 2025-12-04T12:05:01.3720552Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3721317Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3722113Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3722886Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3723326Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3723646Z 2025-12-04T12:05:01.3723863Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3724916Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3725772Z 2025-12-04T12:05:01.3726038Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3726659Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3727119Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3727447Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3727777Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3728220Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3728672Z graph_break [] 2025-12-04T12:05:01.3729036Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3730165Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3731118Z warnings.warn( 2025-12-04T12:05:01.3731492Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3731955Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3732272Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3732700Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3733174Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3733497Z graph_break [] 2025-12-04T12:05:01.3733865Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3734956Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3735921Z warnings.warn( 2025-12-04T12:05:01.3736218Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3736855Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.3737468Z Traceback (most recent call last): 2025-12-04T12:05:01.3738241Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3739001Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3739810Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3740591Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3741023Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3741347Z 2025-12-04T12:05:01.3741561Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3742694Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3743532Z 2025-12-04T12:05:01.3743806Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3744449Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3744917Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3745250Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3745570Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3746024Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3746477Z graph_break [] 2025-12-04T12:05:01.3746842Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3747919Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3748886Z warnings.warn( 2025-12-04T12:05:01.3749264Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3749717Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3750044Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3750465Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3750940Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3751258Z graph_break [] 2025-12-04T12:05:01.3751614Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3752698Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3753652Z warnings.warn( 2025-12-04T12:05:01.3754021Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3754483Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3754839Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3755265Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3755738Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3756067Z graph_break [] 2025-12-04T12:05:01.3756424Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3757501Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3758461Z warnings.warn( 2025-12-04T12:05:01.3759362Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a7e12d722ef68d61.xml - 2025-12-04T12:05:01.3760419Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3761555Z FAILED [0.2739s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3762499Z 2025-12-04T12:05:01.3762722Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3763778Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3764620Z 2025-12-04T12:05:01.3764888Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3765470Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3765988Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.3766410Z Got exit code 1 2025-12-04T12:05:01.3767197Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.3768448Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.3769578Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-758c1dbc5b36fbbe.xml 2025-12-04T12:05:01.3770476Z ============================= test session starts ============================== 2025-12-04T12:05:01.3771142Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3771742Z cachedir: .pytest_cache 2025-12-04T12:05:01.3772446Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3773212Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3773554Z configfile: pytest.ini 2025-12-04T12:05:01.3774328Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3775269Z collecting ... collected 571 items / 2 deselected / 569 selected 2025-12-04T12:05:01.3775766Z stepcurrent: skipping 2 already run items. 2025-12-04T12:05:01.3776140Z Running 293 items in this shard 2025-12-04T12:05:01.3776351Z 2025-12-04T12:05:01.3776948Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_float16_cuda PASSED [3.3605s] [ 0%] 2025-12-04T12:05:01.3778396Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0940s] [ 0%] 2025-12-04T12:05:01.3779901Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2732s] [ 0%] 2025-12-04T12:05:01.3781315Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2641s] [ 0%] 2025-12-04T12:05:01.3782077Z 2025-12-04T12:05:01.3782234Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3782869Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3783494Z Traceback (most recent call last): 2025-12-04T12:05:01.3784205Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3784968Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3785761Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3786537Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3786978Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3787292Z 2025-12-04T12:05:01.3787513Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3788779Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3789649Z 2025-12-04T12:05:01.3789908Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3790533Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3791005Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3791321Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3791747Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3792225Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3792546Z graph_break [] 2025-12-04T12:05:01.3792908Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3794072Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3795030Z warnings.warn( 2025-12-04T12:05:01.3795588Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3796210Z Traceback (most recent call last): 2025-12-04T12:05:01.3796917Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3797714Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3798517Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3799281Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3799700Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3800016Z 2025-12-04T12:05:01.3800222Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3801277Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3802127Z 2025-12-04T12:05:01.3802393Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3803004Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3803463Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3803782Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3804199Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3804654Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3804973Z graph_break [] 2025-12-04T12:05:01.3805324Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3806390Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3807345Z warnings.warn( 2025-12-04T12:05:01.3807755Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3808209Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3808516Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3808930Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3809395Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3809708Z graph_break [] 2025-12-04T12:05:01.3810055Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3811128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3812072Z warnings.warn( 2025-12-04T12:05:01.3812358Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3812993Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3813604Z Traceback (most recent call last): 2025-12-04T12:05:01.3814296Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3815044Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3815836Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3816595Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3817011Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3817324Z 2025-12-04T12:05:01.3817532Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3818647Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3819492Z 2025-12-04T12:05:01.3819787Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3820432Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3820885Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3821200Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3821613Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3822111Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3822430Z graph_break [] 2025-12-04T12:05:01.3822773Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3823852Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3824805Z warnings.warn( 2025-12-04T12:05:01.3825161Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3825607Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3825927Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3826341Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3826799Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3827123Z graph_break [] 2025-12-04T12:05:01.3827482Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3828552Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3829636Z warnings.warn( 2025-12-04T12:05:01.3830032Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3830489Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3830804Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3831222Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3831688Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3832050Z graph_break [] 2025-12-04T12:05:01.3832410Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3833489Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3834445Z warnings.warn( 2025-12-04T12:05:01.3835341Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-758c1dbc5b36fbbe.xml - 2025-12-04T12:05:01.3836391Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3837526Z FAILED [0.2641s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3838479Z 2025-12-04T12:05:01.3838693Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3839749Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3840597Z 2025-12-04T12:05:01.3840855Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3841426Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3841944Z ============== 1 failed, 1 passed, 2 deselected, 2 rerun in 5.69s ============== 2025-12-04T12:05:01.3842373Z Got exit code 1 2025-12-04T12:05:01.3842621Z Retrying single test... 2025-12-04T12:05:01.3843368Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1356c8f9e8cb48ab.xml 2025-12-04T12:05:01.3844217Z ============================= test session starts ============================== 2025-12-04T12:05:01.3844892Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3845481Z cachedir: .pytest_cache 2025-12-04T12:05:01.3846211Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3846970Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3847332Z configfile: pytest.ini 2025-12-04T12:05:01.3848087Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3849023Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.3850167Z stepcurrent: skipping 3 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3851194Z Running 1 items in this shard 2025-12-04T12:05:01.3851397Z 2025-12-04T12:05:01.3852086Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9129s] [100%] 2025-12-04T12:05:01.3853575Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2772s] [100%] 2025-12-04T12:05:01.3854972Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2714s] [100%] 2025-12-04T12:05:01.3855700Z 2025-12-04T12:05:01.3855836Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3856468Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3857073Z Traceback (most recent call last): 2025-12-04T12:05:01.3857764Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3858593Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3859437Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3860199Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3860624Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3860944Z 2025-12-04T12:05:01.3861153Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3862207Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3863058Z 2025-12-04T12:05:01.3863316Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3863926Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3864382Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3864697Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3865013Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3865458Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3865902Z graph_break [] 2025-12-04T12:05:01.3866247Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3867332Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3868285Z warnings.warn( 2025-12-04T12:05:01.3868785Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3869396Z Traceback (most recent call last): 2025-12-04T12:05:01.3870084Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3870835Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3871674Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3872475Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3872904Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3873216Z 2025-12-04T12:05:01.3873464Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3874507Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3875358Z 2025-12-04T12:05:01.3875615Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3876222Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3876675Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3876981Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3877295Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3877739Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3878177Z graph_break [] 2025-12-04T12:05:01.3878529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3879606Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3880559Z warnings.warn( 2025-12-04T12:05:01.3880918Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3881371Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3881684Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3882101Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3882571Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3882894Z graph_break [] 2025-12-04T12:05:01.3883245Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3884360Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3885316Z warnings.warn( 2025-12-04T12:05:01.3885609Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3886237Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3886842Z Traceback (most recent call last): 2025-12-04T12:05:01.3887542Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3888524Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3889323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3890097Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3890546Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3890862Z 2025-12-04T12:05:01.3891076Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3892151Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3893006Z 2025-12-04T12:05:01.3893281Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3893903Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3894363Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3894698Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3895032Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3895477Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3895930Z graph_break [] 2025-12-04T12:05:01.3896372Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3897502Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3898544Z warnings.warn( 2025-12-04T12:05:01.3898991Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3899463Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3899779Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3900215Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3900697Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3901023Z graph_break [] 2025-12-04T12:05:01.3901394Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3902486Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3903461Z warnings.warn( 2025-12-04T12:05:01.3903827Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3904294Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3904625Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3905046Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3905528Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3905865Z graph_break [] 2025-12-04T12:05:01.3906217Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3907304Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3908268Z warnings.warn( 2025-12-04T12:05:01.3909191Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1356c8f9e8cb48ab.xml - 2025-12-04T12:05:01.3910298Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3911446Z FAILED [0.2714s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3912420Z 2025-12-04T12:05:01.3912634Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3913705Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3914553Z 2025-12-04T12:05:01.3914815Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3915400Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3915921Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.3916369Z Got exit code 1 2025-12-04T12:05:01.3916623Z Retrying single test... 2025-12-04T12:05:01.3917386Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3d33fda2cceb2cbd.xml 2025-12-04T12:05:01.3918260Z ============================= test session starts ============================== 2025-12-04T12:05:01.3918910Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3919510Z cachedir: .pytest_cache 2025-12-04T12:05:01.3920217Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3921012Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3921351Z configfile: pytest.ini 2025-12-04T12:05:01.3922132Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.3923133Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.3924336Z stepcurrent: skipping 3 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3925406Z Running 1 items in this shard 2025-12-04T12:05:01.3925632Z 2025-12-04T12:05:01.3926318Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9133s] [100%] 2025-12-04T12:05:01.3927831Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2697s] [100%] 2025-12-04T12:05:01.3929253Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2701s] [100%] 2025-12-04T12:05:01.3929981Z 2025-12-04T12:05:01.3930128Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.3930787Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3931425Z Traceback (most recent call last): 2025-12-04T12:05:01.3932143Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3932911Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3933733Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3934519Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3934953Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3935284Z 2025-12-04T12:05:01.3935503Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3936580Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3937472Z 2025-12-04T12:05:01.3937755Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3938444Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3938967Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3939303Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3939642Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3940089Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3940553Z graph_break [] 2025-12-04T12:05:01.3940927Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3942016Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3942996Z warnings.warn( 2025-12-04T12:05:01.3943533Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3944165Z Traceback (most recent call last): 2025-12-04T12:05:01.3944867Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3945638Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3946454Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3947228Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3947674Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3948010Z 2025-12-04T12:05:01.3948225Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3949338Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3950191Z 2025-12-04T12:05:01.3950485Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3951121Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3951597Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3951966Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3952290Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3952751Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3953210Z graph_break [] 2025-12-04T12:05:01.3953568Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3954669Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3955642Z warnings.warn( 2025-12-04T12:05:01.3956016Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3956490Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3956826Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3957264Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3957739Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3958082Z graph_break [] 2025-12-04T12:05:01.3958458Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3959541Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3960513Z warnings.warn( 2025-12-04T12:05:01.3960823Z =================================== FAILURES =================================== 2025-12-04T12:05:01.3961483Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.3962099Z Traceback (most recent call last): 2025-12-04T12:05:01.3962870Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.3963640Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.3964440Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.3965227Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.3965677Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3965996Z 2025-12-04T12:05:01.3966222Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3967285Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3968148Z 2025-12-04T12:05:01.3968411Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3969045Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3969522Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3969843Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3970179Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3970641Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3971091Z graph_break [] 2025-12-04T12:05:01.3971461Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3972559Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3973535Z warnings.warn( 2025-12-04T12:05:01.3973904Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3974378Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3974713Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3975173Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3975701Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3976046Z graph_break [] 2025-12-04T12:05:01.3976401Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3977532Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3978586Z warnings.warn( 2025-12-04T12:05:01.3978964Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.3979425Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.3979760Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.3980196Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.3980668Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.3981008Z graph_break [] 2025-12-04T12:05:01.3981382Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.3982468Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.3983442Z warnings.warn( 2025-12-04T12:05:01.3984371Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3d33fda2cceb2cbd.xml - 2025-12-04T12:05:01.3985448Z =========================== short test summary info ============================ 2025-12-04T12:05:01.3986590Z FAILED [0.2701s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.3987560Z 2025-12-04T12:05:01.3987932Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.3989024Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3989948Z 2025-12-04T12:05:01.3990233Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.3990829Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.3991350Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.3991802Z Got exit code 1 2025-12-04T12:05:01.3992608Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.3993787Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.3994929Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0982db42fabefc11.xml 2025-12-04T12:05:01.3995822Z ============================= test session starts ============================== 2025-12-04T12:05:01.3996500Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.3997090Z cachedir: .pytest_cache 2025-12-04T12:05:01.3997807Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.3998603Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.3998941Z configfile: pytest.ini 2025-12-04T12:05:01.3999723Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4000678Z collecting ... collected 571 items / 4 deselected / 567 selected 2025-12-04T12:05:01.4001178Z stepcurrent: skipping 4 already run items. 2025-12-04T12:05:01.4001553Z Running 291 items in this shard 2025-12-04T12:05:01.4001773Z 2025-12-04T12:05:01.4002447Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_True_float16_cuda PASSED [3.4384s] [ 0%] 2025-12-04T12:05:01.4003876Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0940s] [ 0%] 2025-12-04T12:05:01.4005437Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2651s] [ 0%] 2025-12-04T12:05:01.4006856Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2718s] [ 0%] 2025-12-04T12:05:01.4007599Z 2025-12-04T12:05:01.4007741Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4008403Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4009039Z Traceback (most recent call last): 2025-12-04T12:05:01.4009748Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4010507Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4011307Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4012082Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4012502Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4012822Z 2025-12-04T12:05:01.4013030Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4014091Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4014948Z 2025-12-04T12:05:01.4015215Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4015878Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4016347Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4016673Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4017096Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4017233Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4017332Z graph_break [] 2025-12-04T12:05:01.4017547Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4018381Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4018483Z warnings.warn( 2025-12-04T12:05:01.4018855Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4018970Z Traceback (most recent call last): 2025-12-04T12:05:01.4019439Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4019606Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4020121Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4020256Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4020449Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4020455Z 2025-12-04T12:05:01.4020671Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4021400Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4021405Z 2025-12-04T12:05:01.4021670Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4021894Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4022041Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4022179Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4022404Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4022522Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4022645Z graph_break [] 2025-12-04T12:05:01.4022873Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4023599Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4023697Z warnings.warn( 2025-12-04T12:05:01.4023922Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4024031Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4024154Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4024372Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4024493Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4024594Z graph_break [] 2025-12-04T12:05:01.4024807Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4025536Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4025641Z warnings.warn( 2025-12-04T12:05:01.4025786Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4026167Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4026286Z Traceback (most recent call last): 2025-12-04T12:05:01.4026755Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4026919Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4027467Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4027588Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4027786Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4027794Z 2025-12-04T12:05:01.4028002Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4028730Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4028735Z 2025-12-04T12:05:01.4028995Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4029205Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4029318Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4029427Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4029652Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4029770Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4029862Z graph_break [] 2025-12-04T12:05:01.4030077Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4030802Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4030901Z warnings.warn( 2025-12-04T12:05:01.4031117Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4031221Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4031338Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4031551Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4031667Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4031767Z graph_break [] 2025-12-04T12:05:01.4032006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4032761Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4032864Z warnings.warn( 2025-12-04T12:05:01.4033121Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4033231Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4033339Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4033553Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4033675Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4033765Z graph_break [] 2025-12-04T12:05:01.4033973Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4034702Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4034802Z warnings.warn( 2025-12-04T12:05:01.4035550Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0982db42fabefc11.xml - 2025-12-04T12:05:01.4035723Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4036554Z FAILED [0.2718s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4036560Z 2025-12-04T12:05:01.4036776Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4037499Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4037505Z 2025-12-04T12:05:01.4037770Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4037983Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4038188Z ============== 1 failed, 1 passed, 4 deselected, 2 rerun in 5.76s ============== 2025-12-04T12:05:01.4038295Z Got exit code 1 2025-12-04T12:05:01.4038399Z Retrying single test... 2025-12-04T12:05:01.4038969Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e365e000dec26b6e.xml 2025-12-04T12:05:01.4039131Z ============================= test session starts ============================== 2025-12-04T12:05:01.4039479Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4039586Z cachedir: .pytest_cache 2025-12-04T12:05:01.4040106Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4040226Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4040340Z configfile: pytest.ini 2025-12-04T12:05:01.4040931Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4041161Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4041965Z stepcurrent: skipping 5 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4042076Z Running 1 items in this shard 2025-12-04T12:05:01.4042080Z 2025-12-04T12:05:01.4042771Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9189s] [100%] 2025-12-04T12:05:01.4043454Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2673s] [100%] 2025-12-04T12:05:01.4044114Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2661s] [100%] 2025-12-04T12:05:01.4044120Z 2025-12-04T12:05:01.4044254Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4044650Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4044773Z Traceback (most recent call last): 2025-12-04T12:05:01.4045246Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4045403Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4045919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4046038Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4049473Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4049487Z 2025-12-04T12:05:01.4049710Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4050444Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4050454Z 2025-12-04T12:05:01.4050738Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4050961Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4051086Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4051202Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4051320Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4051554Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4051653Z graph_break [] 2025-12-04T12:05:01.4051869Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4052653Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4052767Z warnings.warn( 2025-12-04T12:05:01.4053137Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4053263Z Traceback (most recent call last): 2025-12-04T12:05:01.4053750Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4053906Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4054435Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4054555Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4054748Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4054756Z 2025-12-04T12:05:01.4054985Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4055714Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4055720Z 2025-12-04T12:05:01.4055999Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4056212Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4056321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4056447Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4056568Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4056790Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4056899Z graph_break [] 2025-12-04T12:05:01.4057113Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4057915Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4058062Z warnings.warn( 2025-12-04T12:05:01.4058376Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4058502Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4058657Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4058882Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4059014Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4059110Z graph_break [] 2025-12-04T12:05:01.4059324Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4060070Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4060171Z warnings.warn( 2025-12-04T12:05:01.4060410Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4060779Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4060900Z Traceback (most recent call last): 2025-12-04T12:05:01.4061386Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4061542Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4062061Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4062199Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4062391Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4062397Z 2025-12-04T12:05:01.4062625Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4063352Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4063360Z 2025-12-04T12:05:01.4063628Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4063857Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4063968Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4064093Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4064213Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4064432Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4064540Z graph_break [] 2025-12-04T12:05:01.4064754Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4065487Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4065596Z warnings.warn( 2025-12-04T12:05:01.4065815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4065942Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4066053Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4066273Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4066411Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4066509Z graph_break [] 2025-12-04T12:05:01.4066723Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4067469Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4067568Z warnings.warn( 2025-12-04T12:05:01.4067791Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4067900Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4068013Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4068287Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4068439Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4068536Z graph_break [] 2025-12-04T12:05:01.4068758Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4069519Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4069621Z warnings.warn( 2025-12-04T12:05:01.4070385Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e365e000dec26b6e.xml - 2025-12-04T12:05:01.4070557Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4071416Z FAILED [0.2661s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4071473Z 2025-12-04T12:05:01.4071695Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4072433Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4072441Z 2025-12-04T12:05:01.4072704Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4072882Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4073094Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.4073193Z Got exit code 1 2025-12-04T12:05:01.4073301Z Retrying single test... 2025-12-04T12:05:01.4073890Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c05ba049f6d7f244.xml 2025-12-04T12:05:01.4074054Z ============================= test session starts ============================== 2025-12-04T12:05:01.4074424Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4074533Z cachedir: .pytest_cache 2025-12-04T12:05:01.4075059Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4075196Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4075303Z configfile: pytest.ini 2025-12-04T12:05:01.4075902Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4076140Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4076946Z stepcurrent: skipping 5 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4077077Z Running 1 items in this shard 2025-12-04T12:05:01.4077082Z 2025-12-04T12:05:01.4077772Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9166s] [100%] 2025-12-04T12:05:01.4078472Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2726s] [100%] 2025-12-04T12:05:01.4079069Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2676s] [100%] 2025-12-04T12:05:01.4079075Z 2025-12-04T12:05:01.4079215Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4079596Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4079718Z Traceback (most recent call last): 2025-12-04T12:05:01.4080240Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4080448Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4080979Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4081150Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4081342Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4081348Z 2025-12-04T12:05:01.4081560Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4082290Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4082295Z 2025-12-04T12:05:01.4082562Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4082833Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4082950Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4083069Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4083204Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4083428Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4083543Z graph_break [] 2025-12-04T12:05:01.4083757Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4084492Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4084605Z warnings.warn( 2025-12-04T12:05:01.4084975Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4085096Z Traceback (most recent call last): 2025-12-04T12:05:01.4085584Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4085743Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4086274Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4086398Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4086590Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4086596Z 2025-12-04T12:05:01.4086825Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4087550Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4087555Z 2025-12-04T12:05:01.4088001Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4088220Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4088336Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4088464Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4088587Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4088810Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4088918Z graph_break [] 2025-12-04T12:05:01.4089136Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4089889Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4089990Z warnings.warn( 2025-12-04T12:05:01.4090202Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4090325Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4090437Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4090657Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4090858Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4090956Z graph_break [] 2025-12-04T12:05:01.4091220Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4091970Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4092109Z warnings.warn( 2025-12-04T12:05:01.4092263Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4092627Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4092750Z Traceback (most recent call last): 2025-12-04T12:05:01.4093241Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4093394Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4093913Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4094101Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4094291Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4094297Z 2025-12-04T12:05:01.4094524Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4095250Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4095256Z 2025-12-04T12:05:01.4095532Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4095748Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4095861Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4095987Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4096106Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4096329Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4096438Z graph_break [] 2025-12-04T12:05:01.4096653Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4097384Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4097501Z warnings.warn( 2025-12-04T12:05:01.4097712Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4113818Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4113936Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4114193Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4114317Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4114414Z graph_break [] 2025-12-04T12:05:01.4114653Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4115424Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4115525Z warnings.warn( 2025-12-04T12:05:01.4115755Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4115870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4115996Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4116218Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4116336Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4116446Z graph_break [] 2025-12-04T12:05:01.4116661Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4117386Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4117499Z warnings.warn( 2025-12-04T12:05:01.4118398Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c05ba049f6d7f244.xml - 2025-12-04T12:05:01.4118579Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4119460Z FAILED [0.2676s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4119467Z 2025-12-04T12:05:01.4119683Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4120409Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4120416Z 2025-12-04T12:05:01.4120682Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4120922Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4121122Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.4121217Z Got exit code 1 2025-12-04T12:05:01.4121871Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4122287Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4122875Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e64a3e20d6733c67.xml 2025-12-04T12:05:01.4123041Z ============================= test session starts ============================== 2025-12-04T12:05:01.4123395Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4123518Z cachedir: .pytest_cache 2025-12-04T12:05:01.4124047Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4124185Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4124293Z configfile: pytest.ini 2025-12-04T12:05:01.4124889Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4125124Z collecting ... collected 571 items / 6 deselected / 565 selected 2025-12-04T12:05:01.4125267Z stepcurrent: skipping 6 already run items. 2025-12-04T12:05:01.4125381Z Running 289 items in this shard 2025-12-04T12:05:01.4125387Z 2025-12-04T12:05:01.4126000Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_float16_cuda PASSED [3.4920s] [ 0%] 2025-12-04T12:05:01.4126685Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0929s] [ 0%] 2025-12-04T12:05:01.4127383Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2773s] [ 0%] 2025-12-04T12:05:01.4127977Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2647s] [ 0%] 2025-12-04T12:05:01.4127985Z 2025-12-04T12:05:01.4128128Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4128504Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4128624Z Traceback (most recent call last): 2025-12-04T12:05:01.4129114Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4129268Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4129824Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4129992Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4130186Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4130193Z 2025-12-04T12:05:01.4130423Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4131177Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4131183Z 2025-12-04T12:05:01.4131447Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4131679Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4131788Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4131912Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4132135Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4132299Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4132408Z graph_break [] 2025-12-04T12:05:01.4132629Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4133376Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4133479Z warnings.warn( 2025-12-04T12:05:01.4133844Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4133974Z Traceback (most recent call last): 2025-12-04T12:05:01.4134451Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4134607Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4135135Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4135264Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4135465Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4135471Z 2025-12-04T12:05:01.4135685Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4136406Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4136412Z 2025-12-04T12:05:01.4136689Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4136909Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4137031Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4137146Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4137367Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4137502Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4137601Z graph_break [] 2025-12-04T12:05:01.4137821Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4138663Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4138772Z warnings.warn( 2025-12-04T12:05:01.4138999Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4139105Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4139221Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4139455Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4139577Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4139676Z graph_break [] 2025-12-04T12:05:01.4139903Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4140693Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4140839Z warnings.warn( 2025-12-04T12:05:01.4140982Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4141379Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4141517Z Traceback (most recent call last): 2025-12-04T12:05:01.4141996Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4142151Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4142681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4142801Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4143003Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4143046Z 2025-12-04T12:05:01.4143264Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4143983Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4143991Z 2025-12-04T12:05:01.4144268Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4144482Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4144603Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4144717Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4144935Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4145068Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4145165Z graph_break [] 2025-12-04T12:05:01.4145378Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4146127Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4146225Z warnings.warn( 2025-12-04T12:05:01.4146432Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4146558Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4146674Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4146904Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4147025Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4147120Z graph_break [] 2025-12-04T12:05:01.4147343Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4148071Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4148172Z warnings.warn( 2025-12-04T12:05:01.4148395Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4148508Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4148628Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4148847Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4148965Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4149077Z graph_break [] 2025-12-04T12:05:01.4149286Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4150008Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4150115Z warnings.warn( 2025-12-04T12:05:01.4150867Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e64a3e20d6733c67.xml - 2025-12-04T12:05:01.4151090Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4151961Z FAILED [0.2647s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4151967Z 2025-12-04T12:05:01.4152214Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4152939Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4152945Z 2025-12-04T12:05:01.4153209Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4153399Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4153606Z ============== 1 failed, 1 passed, 6 deselected, 2 rerun in 5.81s ============== 2025-12-04T12:05:01.4153708Z Got exit code 1 2025-12-04T12:05:01.4153863Z Retrying single test... 2025-12-04T12:05:01.4154442Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-60656dccead73746.xml 2025-12-04T12:05:01.4154615Z ============================= test session starts ============================== 2025-12-04T12:05:01.4154971Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4155079Z cachedir: .pytest_cache 2025-12-04T12:05:01.4155611Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4155736Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4155843Z configfile: pytest.ini 2025-12-04T12:05:01.4156442Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4156671Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4157494Z stepcurrent: skipping 7 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4157607Z Running 1 items in this shard 2025-12-04T12:05:01.4157615Z 2025-12-04T12:05:01.4158299Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9331s] [100%] 2025-12-04T12:05:01.4158990Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2663s] [100%] 2025-12-04T12:05:01.4159583Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2672s] [100%] 2025-12-04T12:05:01.4159590Z 2025-12-04T12:05:01.4159744Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4160111Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4160232Z Traceback (most recent call last): 2025-12-04T12:05:01.4160723Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4160879Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4161415Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4161538Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4161730Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4161736Z 2025-12-04T12:05:01.4161961Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4162680Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4162719Z 2025-12-04T12:05:01.4163023Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4163243Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4163351Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4163511Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4163632Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4163853Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4163962Z graph_break [] 2025-12-04T12:05:01.4164179Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4164924Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4165026Z warnings.warn( 2025-12-04T12:05:01.4165392Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4165562Z Traceback (most recent call last): 2025-12-04T12:05:01.4166040Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4166196Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4166730Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4166854Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4167056Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4167062Z 2025-12-04T12:05:01.4167276Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4167991Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4167999Z 2025-12-04T12:05:01.4168279Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4168498Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4168621Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4168736Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4168859Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4169091Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4169189Z graph_break [] 2025-12-04T12:05:01.4169400Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4170147Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4170250Z warnings.warn( 2025-12-04T12:05:01.4170476Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4170586Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4170704Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4170941Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4171060Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4171158Z graph_break [] 2025-12-04T12:05:01.4171383Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4172109Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4172224Z warnings.warn( 2025-12-04T12:05:01.4172366Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4172726Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4172863Z Traceback (most recent call last): 2025-12-04T12:05:01.4173374Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4173562Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4174091Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4174240Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4174443Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4174448Z 2025-12-04T12:05:01.4174659Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4175377Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4175383Z 2025-12-04T12:05:01.4175662Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4175876Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4176055Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4176165Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4176287Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4176521Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4176617Z graph_break [] 2025-12-04T12:05:01.4176829Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4177572Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4177670Z warnings.warn( 2025-12-04T12:05:01.4177895Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4178006Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4178185Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4178417Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4178542Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4178635Z graph_break [] 2025-12-04T12:05:01.4178865Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4179592Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4179691Z warnings.warn( 2025-12-04T12:05:01.4179915Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4180024Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4180152Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4180370Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4180486Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4180594Z graph_break [] 2025-12-04T12:05:01.4180809Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4181539Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4181648Z warnings.warn( 2025-12-04T12:05:01.4182398Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-60656dccead73746.xml - 2025-12-04T12:05:01.4182580Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4183416Z FAILED [0.2672s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4183422Z 2025-12-04T12:05:01.4183637Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4184406Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4184414Z 2025-12-04T12:05:01.4184711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4184900Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4185132Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.4185230Z Got exit code 1 2025-12-04T12:05:01.4185348Z Retrying single test... 2025-12-04T12:05:01.4185926Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-74b07a56cd02ebb4.xml 2025-12-04T12:05:01.4186095Z ============================= test session starts ============================== 2025-12-04T12:05:01.4186446Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4186553Z cachedir: .pytest_cache 2025-12-04T12:05:01.4187085Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4187247Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4187352Z configfile: pytest.ini 2025-12-04T12:05:01.4188123Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4188357Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4189176Z stepcurrent: skipping 7 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4189291Z Running 1 items in this shard 2025-12-04T12:05:01.4189296Z 2025-12-04T12:05:01.4189980Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9143s] [100%] 2025-12-04T12:05:01.4190681Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2662s] [100%] 2025-12-04T12:05:01.4191280Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2532s] [100%] 2025-12-04T12:05:01.4191288Z 2025-12-04T12:05:01.4191437Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4191799Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4191921Z Traceback (most recent call last): 2025-12-04T12:05:01.4192405Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4192561Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4193092Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4193221Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4193414Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4193419Z 2025-12-04T12:05:01.4193646Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4194366Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4194372Z 2025-12-04T12:05:01.4194652Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4194869Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4194981Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4195107Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4195227Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4195450Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4195629Z graph_break [] 2025-12-04T12:05:01.4195887Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4196639Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4196785Z warnings.warn( 2025-12-04T12:05:01.4197145Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4197280Z Traceback (most recent call last): 2025-12-04T12:05:01.4197749Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4197902Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4198439Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4198612Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4198820Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4198825Z 2025-12-04T12:05:01.4199046Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4199760Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4199768Z 2025-12-04T12:05:01.4200044Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4200258Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4200381Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4200493Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4200613Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4200843Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4200941Z graph_break [] 2025-12-04T12:05:01.4201159Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4201905Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4202004Z warnings.warn( 2025-12-04T12:05:01.4202229Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4202340Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4202454Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4202684Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4202806Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4202897Z graph_break [] 2025-12-04T12:05:01.4203126Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4203858Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4203970Z warnings.warn( 2025-12-04T12:05:01.4204113Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4204474Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4204609Z Traceback (most recent call last): 2025-12-04T12:05:01.4205087Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4205240Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4205770Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4205892Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4206091Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4206096Z 2025-12-04T12:05:01.4206311Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4207088Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4207094Z 2025-12-04T12:05:01.4207368Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4207619Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4207741Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4207855Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4207977Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4208214Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4208311Z graph_break [] 2025-12-04T12:05:01.4208530Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4209276Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4209414Z warnings.warn( 2025-12-04T12:05:01.4209640Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4209745Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4209858Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4210092Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4210212Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4210308Z graph_break [] 2025-12-04T12:05:01.4210529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4211257Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4211357Z warnings.warn( 2025-12-04T12:05:01.4211580Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4211692Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4211821Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4212035Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4212154Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4212258Z graph_break [] 2025-12-04T12:05:01.4212468Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4213196Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4213304Z warnings.warn( 2025-12-04T12:05:01.4214057Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-74b07a56cd02ebb4.xml - 2025-12-04T12:05:01.4214237Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4215066Z FAILED [0.2532s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4215072Z 2025-12-04T12:05:01.4215288Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4216024Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4216030Z 2025-12-04T12:05:01.4216293Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4216482Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4216680Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.4216777Z Got exit code 1 2025-12-04T12:05:01.4217457Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4217904Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4218566Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97c1aae6502fe9a4.xml 2025-12-04T12:05:01.4218787Z ============================= test session starts ============================== 2025-12-04T12:05:01.4219139Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4219268Z cachedir: .pytest_cache 2025-12-04T12:05:01.4219786Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4219926Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4220034Z configfile: pytest.ini 2025-12-04T12:05:01.4220623Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4220896Z collecting ... collected 571 items / 8 deselected / 563 selected 2025-12-04T12:05:01.4221039Z stepcurrent: skipping 8 already run items. 2025-12-04T12:05:01.4221149Z Running 287 items in this shard 2025-12-04T12:05:01.4221157Z 2025-12-04T12:05:01.4221772Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_float16_cuda PASSED [3.4606s] [ 0%] 2025-12-04T12:05:01.4222354Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_False_float16_cuda PASSED [0.3187s] [ 0%] 2025-12-04T12:05:01.4222947Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asin_load_upcast_to_fp32_True_float16_cuda PASSED [0.3120s] [ 1%] 2025-12-04T12:05:01.4223632Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0922s] [ 1%] 2025-12-04T12:05:01.4224334Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2753s] [ 1%] 2025-12-04T12:05:01.4224932Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2581s] [ 1%] 2025-12-04T12:05:01.4224940Z 2025-12-04T12:05:01.4225076Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4225460Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4225580Z Traceback (most recent call last): 2025-12-04T12:05:01.4226065Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4226222Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4226745Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4226882Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4227074Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4227079Z 2025-12-04T12:05:01.4227294Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4228034Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4228040Z 2025-12-04T12:05:01.4228305Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4228533Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4228643Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4228757Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4229024Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4229148Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4229273Z graph_break [] 2025-12-04T12:05:01.4229499Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4230236Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4230373Z warnings.warn( 2025-12-04T12:05:01.4230744Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4230865Z Traceback (most recent call last): 2025-12-04T12:05:01.4231351Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4231510Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4232040Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4232190Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4232385Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4232390Z 2025-12-04T12:05:01.4232609Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4233336Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4233342Z 2025-12-04T12:05:01.4233613Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4233830Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4233944Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4234067Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4234285Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4234408Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4234514Z graph_break [] 2025-12-04T12:05:01.4234732Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4235467Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4235578Z warnings.warn( 2025-12-04T12:05:01.4235789Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4235905Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4236017Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4236236Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4236367Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4236459Z graph_break [] 2025-12-04T12:05:01.4236675Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4237415Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4237513Z warnings.warn( 2025-12-04T12:05:01.4237668Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4238038Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4238155Z Traceback (most recent call last): 2025-12-04T12:05:01.4238647Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4238802Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4239317Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4239448Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4239636Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4239673Z 2025-12-04T12:05:01.4239900Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4240652Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4240686Z 2025-12-04T12:05:01.4240950Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4241178Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4241286Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4241409Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4241625Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4241742Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4241851Z graph_break [] 2025-12-04T12:05:01.4242063Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4242828Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4242940Z warnings.warn( 2025-12-04T12:05:01.4243150Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4243270Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4243384Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4243601Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4243730Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4243827Z graph_break [] 2025-12-04T12:05:01.4244037Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4244776Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4244878Z warnings.warn( 2025-12-04T12:05:01.4245104Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4245216Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4245328Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4245555Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4245677Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4245768Z graph_break [] 2025-12-04T12:05:01.4245991Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4246718Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4246816Z warnings.warn( 2025-12-04T12:05:01.4247575Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97c1aae6502fe9a4.xml - 2025-12-04T12:05:01.4247751Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4248592Z FAILED [0.2581s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4248600Z 2025-12-04T12:05:01.4248816Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4249549Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4249555Z 2025-12-04T12:05:01.4249815Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4249993Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4250215Z ============== 1 failed, 3 passed, 8 deselected, 2 rerun in 6.40s ============== 2025-12-04T12:05:01.4250314Z Got exit code 1 2025-12-04T12:05:01.4250450Z Retrying single test... 2025-12-04T12:05:01.4251060Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9520e13942c1e127.xml 2025-12-04T12:05:01.4251223Z ============================= test session starts ============================== 2025-12-04T12:05:01.4251612Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4251719Z cachedir: .pytest_cache 2025-12-04T12:05:01.4252239Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4252372Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4252477Z configfile: pytest.ini 2025-12-04T12:05:01.4253072Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4253361Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4254173Z stepcurrent: skipping 11 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4254300Z Running 1 items in this shard 2025-12-04T12:05:01.4254307Z 2025-12-04T12:05:01.4254992Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9134s] [100%] 2025-12-04T12:05:01.4255689Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2688s] [100%] 2025-12-04T12:05:01.4256284Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2582s] [100%] 2025-12-04T12:05:01.4256290Z 2025-12-04T12:05:01.4256433Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4256817Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4256936Z Traceback (most recent call last): 2025-12-04T12:05:01.4257430Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4257586Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4258183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4258333Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4258522Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4258527Z 2025-12-04T12:05:01.4258738Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4259476Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4259483Z 2025-12-04T12:05:01.4259747Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4259978Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4260092Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4260204Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4260340Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4260559Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4260665Z graph_break [] 2025-12-04T12:05:01.4260878Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4261611Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4261722Z warnings.warn( 2025-12-04T12:05:01.4262133Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4262299Z Traceback (most recent call last): 2025-12-04T12:05:01.4262782Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4262966Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4263499Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4263618Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4263810Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4263816Z 2025-12-04T12:05:01.4264033Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4264751Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4264787Z 2025-12-04T12:05:01.4265064Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4265283Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4265394Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4265522Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4265645Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4265862Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4265968Z graph_break [] 2025-12-04T12:05:01.4266180Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4266930Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4267032Z warnings.warn( 2025-12-04T12:05:01.4267243Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4267368Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4267479Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4267702Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4267829Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4267925Z graph_break [] 2025-12-04T12:05:01.4268140Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4268872Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4268970Z warnings.warn( 2025-12-04T12:05:01.4269123Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4269488Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4269604Z Traceback (most recent call last): 2025-12-04T12:05:01.4270092Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4270246Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4270777Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4270900Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4271088Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4271093Z 2025-12-04T12:05:01.4271314Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4272041Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4272047Z 2025-12-04T12:05:01.4272319Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4272566Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4272677Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4272830Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4272953Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4273169Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4273309Z graph_break [] 2025-12-04T12:05:01.4273524Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4274259Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4274370Z warnings.warn( 2025-12-04T12:05:01.4274582Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4274702Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4274814Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4275031Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4275196Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4275292Z graph_break [] 2025-12-04T12:05:01.4275502Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4276246Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4276343Z warnings.warn( 2025-12-04T12:05:01.4276568Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4276677Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4276790Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4277020Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4277141Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4277235Z graph_break [] 2025-12-04T12:05:01.4277461Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4278190Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4278297Z warnings.warn( 2025-12-04T12:05:01.4279051Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9520e13942c1e127.xml - 2025-12-04T12:05:01.4279220Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4280062Z FAILED [0.2582s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4280069Z 2025-12-04T12:05:01.4280285Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4281020Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4281028Z 2025-12-04T12:05:01.4281295Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4281473Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4281688Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.4281788Z Got exit code 1 2025-12-04T12:05:01.4281893Z Retrying single test... 2025-12-04T12:05:01.4282481Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5b20aba83da29ea2.xml 2025-12-04T12:05:01.4282640Z ============================= test session starts ============================== 2025-12-04T12:05:01.4282997Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4283103Z cachedir: .pytest_cache 2025-12-04T12:05:01.4283685Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4283828Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4283935Z configfile: pytest.ini 2025-12-04T12:05:01.4284537Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4284794Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4285607Z stepcurrent: skipping 11 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4285727Z Running 1 items in this shard 2025-12-04T12:05:01.4285732Z 2025-12-04T12:05:01.4286418Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9273s] [100%] 2025-12-04T12:05:01.4287147Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2735s] [100%] 2025-12-04T12:05:01.4288036Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2640s] [100%] 2025-12-04T12:05:01.4288047Z 2025-12-04T12:05:01.4288189Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4288567Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4288735Z Traceback (most recent call last): 2025-12-04T12:05:01.4289245Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4289396Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4289996Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4290132Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4290393Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4290398Z 2025-12-04T12:05:01.4290625Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4291347Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4291353Z 2025-12-04T12:05:01.4291615Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4291840Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4291946Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4292056Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4292182Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4292402Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4292510Z graph_break [] 2025-12-04T12:05:01.4292724Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4293456Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4293567Z warnings.warn( 2025-12-04T12:05:01.4293927Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4294043Z Traceback (most recent call last): 2025-12-04T12:05:01.4294520Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4294672Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4295199Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4295398Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4295628Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4295634Z 2025-12-04T12:05:01.4295858Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4296577Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4296620Z 2025-12-04T12:05:01.4296894Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4297110Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4297219Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4297340Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4297459Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4297675Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4297826Z graph_break [] 2025-12-04T12:05:01.4298040Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4298848Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4298951Z warnings.warn( 2025-12-04T12:05:01.4299162Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4299276Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4299387Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4299601Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4299734Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4299827Z graph_break [] 2025-12-04T12:05:01.4300040Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4300782Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4300885Z warnings.warn( 2025-12-04T12:05:01.4301040Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4301399Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4301521Z Traceback (most recent call last): 2025-12-04T12:05:01.4302003Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4302154Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4302678Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4302800Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4302988Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4302995Z 2025-12-04T12:05:01.4303223Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4303946Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4303951Z 2025-12-04T12:05:01.4304225Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4304438Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4304543Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4304664Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4304785Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4305001Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4305106Z graph_break [] 2025-12-04T12:05:01.4305313Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4306132Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4306231Z warnings.warn( 2025-12-04T12:05:01.4306442Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4306591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4306705Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4306921Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4307048Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4307144Z graph_break [] 2025-12-04T12:05:01.4307354Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4308090Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4308187Z warnings.warn( 2025-12-04T12:05:01.4308442Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4308548Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4308662Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4308886Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4308999Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4309095Z graph_break [] 2025-12-04T12:05:01.4309314Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4310038Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4310140Z warnings.warn( 2025-12-04T12:05:01.4310893Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5b20aba83da29ea2.xml - 2025-12-04T12:05:01.4311064Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4311925Z FAILED [0.2640s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4311931Z 2025-12-04T12:05:01.4312143Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4312875Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4312881Z 2025-12-04T12:05:01.4313143Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4313316Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4313523Z ================== 1 failed, 294 deselected, 2 rerun in 5.16s ================== 2025-12-04T12:05:01.4313624Z Got exit code 1 2025-12-04T12:05:01.4314275Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4314683Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4315256Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4f43dfc4bd691e8.xml 2025-12-04T12:05:01.4315425Z ============================= test session starts ============================== 2025-12-04T12:05:01.4315776Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4315887Z cachedir: .pytest_cache 2025-12-04T12:05:01.4316403Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4316525Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4316638Z configfile: pytest.ini 2025-12-04T12:05:01.4317294Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4317515Z collecting ... collected 571 items / 12 deselected / 559 selected 2025-12-04T12:05:01.4317662Z stepcurrent: skipping 12 already run items. 2025-12-04T12:05:01.4317801Z Running 283 items in this shard 2025-12-04T12:05:01.4317807Z 2025-12-04T12:05:01.4318420Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_float16_cuda PASSED [3.4722s] [ 0%] 2025-12-04T12:05:01.4319010Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_True_float16_cuda PASSED [0.3358s] [ 0%] 2025-12-04T12:05:01.4319597Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_False_float16_cuda PASSED [0.3500s] [ 1%] 2025-12-04T12:05:01.4320193Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan2_load_upcast_to_fp32_True_float16_cuda PASSED [0.3424s] [ 1%] 2025-12-04T12:05:01.4320903Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0931s] [ 1%] 2025-12-04T12:05:01.4321597Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2675s] [ 1%] 2025-12-04T12:05:01.4322186Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2601s] [ 1%] 2025-12-04T12:05:01.4322192Z 2025-12-04T12:05:01.4322325Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4322693Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4322812Z Traceback (most recent call last): 2025-12-04T12:05:01.4323298Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4323455Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4323974Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4324104Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4324291Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4324296Z 2025-12-04T12:05:01.4324512Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4325234Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4325240Z 2025-12-04T12:05:01.4325503Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4325732Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4325842Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4325960Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4326179Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4326296Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4326403Z graph_break [] 2025-12-04T12:05:01.4326615Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4327347Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4327461Z warnings.warn( 2025-12-04T12:05:01.4327830Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4327962Z Traceback (most recent call last): 2025-12-04T12:05:01.4328470Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4328629Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4329288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4329416Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4329635Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4329641Z 2025-12-04T12:05:01.4329885Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4330681Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4330690Z 2025-12-04T12:05:01.4330995Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4331213Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4331365Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4331550Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4331777Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4331896Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4332007Z graph_break [] 2025-12-04T12:05:01.4332228Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4332973Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4333076Z warnings.warn( 2025-12-04T12:05:01.4333289Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4333407Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4333521Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4333741Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4333873Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4333971Z graph_break [] 2025-12-04T12:05:01.4334203Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4334933Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4335032Z warnings.warn( 2025-12-04T12:05:01.4335187Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4335550Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4335670Z Traceback (most recent call last): 2025-12-04T12:05:01.4336157Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4336311Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4336848Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4336971Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4337161Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4337168Z 2025-12-04T12:05:01.4337391Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4338168Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4338175Z 2025-12-04T12:05:01.4338456Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4338672Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4338782Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4338910Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4339127Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4339298Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4339405Z graph_break [] 2025-12-04T12:05:01.4339647Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4340392Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4340518Z warnings.warn( 2025-12-04T12:05:01.4340731Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4340850Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4340962Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4341181Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4341314Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4341407Z graph_break [] 2025-12-04T12:05:01.4341616Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4342403Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4342500Z warnings.warn( 2025-12-04T12:05:01.4342723Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4342829Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4342938Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4343168Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4343282Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4343377Z graph_break [] 2025-12-04T12:05:01.4343602Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4344328Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4344437Z warnings.warn( 2025-12-04T12:05:01.4345203Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4f43dfc4bd691e8.xml - 2025-12-04T12:05:01.4345372Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4346218Z FAILED [0.2601s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4346224Z 2025-12-04T12:05:01.4346438Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4347168Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4347173Z 2025-12-04T12:05:01.4347437Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4347619Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4347842Z ============= 1 failed, 4 passed, 12 deselected, 2 rerun in 6.82s ============== 2025-12-04T12:05:01.4347942Z Got exit code 1 2025-12-04T12:05:01.4348057Z Retrying single test... 2025-12-04T12:05:01.4348629Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1ceca9168882f805.xml 2025-12-04T12:05:01.4348793Z ============================= test session starts ============================== 2025-12-04T12:05:01.4349153Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4349258Z cachedir: .pytest_cache 2025-12-04T12:05:01.4349777Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4349908Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4350010Z configfile: pytest.ini 2025-12-04T12:05:01.4350674Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4350904Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4351707Z stepcurrent: skipping 16 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4351861Z Running 1 items in this shard 2025-12-04T12:05:01.4351866Z 2025-12-04T12:05:01.4352548Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9088s] [100%] 2025-12-04T12:05:01.4353236Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2679s] [100%] 2025-12-04T12:05:01.4353864Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2692s] [100%] 2025-12-04T12:05:01.4353872Z 2025-12-04T12:05:01.4354020Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4354380Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4354506Z Traceback (most recent call last): 2025-12-04T12:05:01.4354998Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4355152Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4355671Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4355804Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4355995Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4356002Z 2025-12-04T12:05:01.4356232Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4356954Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4356959Z 2025-12-04T12:05:01.4357226Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4357457Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4357566Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4357688Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4357810Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4358031Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4358137Z graph_break [] 2025-12-04T12:05:01.4358351Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4359095Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4359205Z warnings.warn( 2025-12-04T12:05:01.4359571Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4359703Z Traceback (most recent call last): 2025-12-04T12:05:01.4360176Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4360331Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4360854Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4360975Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4361163Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4361169Z 2025-12-04T12:05:01.4361389Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4362171Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4362176Z 2025-12-04T12:05:01.4362456Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4362706Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4362817Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4362938Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4363056Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4363275Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4363380Z graph_break [] 2025-12-04T12:05:01.4363591Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4364336Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4364469Z warnings.warn( 2025-12-04T12:05:01.4364684Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4364803Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4364915Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4365133Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4365266Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4365359Z graph_break [] 2025-12-04T12:05:01.4365582Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4366311Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4366411Z warnings.warn( 2025-12-04T12:05:01.4366565Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4366931Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4367052Z Traceback (most recent call last): 2025-12-04T12:05:01.4367537Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4367691Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4368220Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4368339Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4368526Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4368531Z 2025-12-04T12:05:01.4368752Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4369471Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4369481Z 2025-12-04T12:05:01.4369758Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4369973Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4370083Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4370208Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4370327Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4370546Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4370654Z graph_break [] 2025-12-04T12:05:01.4370868Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4371617Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4371715Z warnings.warn( 2025-12-04T12:05:01.4371927Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4372091Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4372239Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4372464Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4372595Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4372724Z graph_break [] 2025-12-04T12:05:01.4372949Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4373673Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4373770Z warnings.warn( 2025-12-04T12:05:01.4373991Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4374100Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4374213Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4374441Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4374916Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4375010Z graph_break [] 2025-12-04T12:05:01.4375234Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4375964Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4376077Z warnings.warn( 2025-12-04T12:05:01.4376837Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1ceca9168882f805.xml - 2025-12-04T12:05:01.4377008Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4377849Z FAILED [0.2692s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4377858Z 2025-12-04T12:05:01.4378075Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4378888Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4378897Z 2025-12-04T12:05:01.4379162Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4379342Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4379554Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.4379656Z Got exit code 1 2025-12-04T12:05:01.4379778Z Retrying single test... 2025-12-04T12:05:01.4380357Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c6f480cabc56b496.xml 2025-12-04T12:05:01.4380521Z ============================= test session starts ============================== 2025-12-04T12:05:01.4380892Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4381003Z cachedir: .pytest_cache 2025-12-04T12:05:01.4381535Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4381662Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4381773Z configfile: pytest.ini 2025-12-04T12:05:01.4382378Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4382605Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4383412Z stepcurrent: skipping 16 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4383537Z Running 1 items in this shard 2025-12-04T12:05:01.4383544Z 2025-12-04T12:05:01.4384299Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9154s] [100%] 2025-12-04T12:05:01.4384992Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2691s] [100%] 2025-12-04T12:05:01.4385633Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2640s] [100%] 2025-12-04T12:05:01.4385638Z 2025-12-04T12:05:01.4385790Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4386153Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4386274Z Traceback (most recent call last): 2025-12-04T12:05:01.4386768Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4386956Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4387477Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4387617Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4388045Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4388052Z 2025-12-04T12:05:01.4388284Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4389007Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4389013Z 2025-12-04T12:05:01.4389279Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4389514Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4389633Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4389763Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4389886Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4390109Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4390218Z graph_break [] 2025-12-04T12:05:01.4390440Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4391175Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4391289Z warnings.warn( 2025-12-04T12:05:01.4391651Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4391782Z Traceback (most recent call last): 2025-12-04T12:05:01.4392258Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4392416Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4392949Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4393073Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4393266Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4393274Z 2025-12-04T12:05:01.4393505Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4394219Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4394224Z 2025-12-04T12:05:01.4394501Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4394716Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4394826Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4394950Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4395150Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4395423Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4395522Z graph_break [] 2025-12-04T12:05:01.4395734Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4396517Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4396616Z warnings.warn( 2025-12-04T12:05:01.4396832Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4396950Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4397063Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4397283Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4397421Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4397560Z graph_break [] 2025-12-04T12:05:01.4397791Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4398518Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4398620Z warnings.warn( 2025-12-04T12:05:01.4398773Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4399137Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4399258Z Traceback (most recent call last): 2025-12-04T12:05:01.4399745Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4399902Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4400433Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4400558Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4400752Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4400758Z 2025-12-04T12:05:01.4400986Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4401704Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4401711Z 2025-12-04T12:05:01.4401987Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4402203Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4402315Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4402440Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4402563Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4402785Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4402898Z graph_break [] 2025-12-04T12:05:01.4403117Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4403862Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4403964Z warnings.warn( 2025-12-04T12:05:01.4404176Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4404298Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4404410Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4404628Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4404763Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4404859Z graph_break [] 2025-12-04T12:05:01.4405083Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4405850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4405981Z warnings.warn( 2025-12-04T12:05:01.4406210Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4406321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4406464Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4406695Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4406816Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4406931Z graph_break [] 2025-12-04T12:05:01.4407144Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4407870Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4407985Z warnings.warn( 2025-12-04T12:05:01.4408752Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c6f480cabc56b496.xml - 2025-12-04T12:05:01.4408958Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4409807Z FAILED [0.2640s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4409816Z 2025-12-04T12:05:01.4410033Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4410765Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4410770Z 2025-12-04T12:05:01.4411038Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4411225Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4411429Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.4411530Z Got exit code 1 2025-12-04T12:05:01.4412176Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4412589Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4413163Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9497acc525242c8a.xml 2025-12-04T12:05:01.4413335Z ============================= test session starts ============================== 2025-12-04T12:05:01.4413687Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4413805Z cachedir: .pytest_cache 2025-12-04T12:05:01.4414328Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4414454Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4414577Z configfile: pytest.ini 2025-12-04T12:05:01.4415168Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4415409Z collecting ... collected 571 items / 17 deselected / 554 selected 2025-12-04T12:05:01.4415549Z stepcurrent: skipping 17 already run items. 2025-12-04T12:05:01.4415662Z Running 278 items in this shard 2025-12-04T12:05:01.4415668Z 2025-12-04T12:05:01.4416274Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_True_float16_cuda PASSED [3.4485s] [ 0%] 2025-12-04T12:05:01.4416873Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_False_float16_cuda PASSED [0.3205s] [ 0%] 2025-12-04T12:05:01.4417616Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0911s] [ 1%] 2025-12-04T12:05:01.4418391Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2728s] [ 1%] 2025-12-04T12:05:01.4419025Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2629s] [ 1%] 2025-12-04T12:05:01.4419030Z 2025-12-04T12:05:01.4419182Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4419546Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4419682Z Traceback (most recent call last): 2025-12-04T12:05:01.4420161Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4420353Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4420894Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4421017Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4421208Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4421215Z 2025-12-04T12:05:01.4421443Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4422164Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4422170Z 2025-12-04T12:05:01.4422446Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4422664Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4422775Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4422899Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4423122Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4423255Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4423352Z graph_break [] 2025-12-04T12:05:01.4423571Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4424319Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4424418Z warnings.warn( 2025-12-04T12:05:01.4424779Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4424912Z Traceback (most recent call last): 2025-12-04T12:05:01.4425387Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4425558Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4426082Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4426208Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4426408Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4426413Z 2025-12-04T12:05:01.4426633Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4427353Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4427371Z 2025-12-04T12:05:01.4427634Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4427853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4427978Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4428092Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4428347Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4428483Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4428624Z graph_break [] 2025-12-04T12:05:01.4428841Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4429588Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4429721Z warnings.warn( 2025-12-04T12:05:01.4429944Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4430054Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4430166Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4430399Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4430518Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4430613Z graph_break [] 2025-12-04T12:05:01.4430839Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4431611Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4431722Z warnings.warn( 2025-12-04T12:05:01.4431866Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4432232Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4432365Z Traceback (most recent call last): 2025-12-04T12:05:01.4432841Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4432995Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4433525Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4433647Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4433861Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4433867Z 2025-12-04T12:05:01.4434084Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4434802Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4434823Z 2025-12-04T12:05:01.4435089Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4435308Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4435434Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4435549Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4435768Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4435902Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4436002Z graph_break [] 2025-12-04T12:05:01.4436220Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4436977Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4437080Z warnings.warn( 2025-12-04T12:05:01.4437313Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4437427Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4437542Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4437777Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4437898Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4437996Z graph_break [] 2025-12-04T12:05:01.4438223Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4438953Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4439102Z warnings.warn( 2025-12-04T12:05:01.4439347Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4439458Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4439586Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4439833Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4439955Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4440067Z graph_break [] 2025-12-04T12:05:01.4440284Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4441028Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4441132Z warnings.warn( 2025-12-04T12:05:01.4441886Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9497acc525242c8a.xml - 2025-12-04T12:05:01.4442107Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4442937Z FAILED [0.2629s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4442945Z 2025-12-04T12:05:01.4443174Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4443894Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4443899Z 2025-12-04T12:05:01.4444168Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4444361Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4444573Z ============= 1 failed, 2 passed, 17 deselected, 2 rerun in 6.08s ============== 2025-12-04T12:05:01.4444678Z Got exit code 1 2025-12-04T12:05:01.4444797Z Retrying single test... 2025-12-04T12:05:01.4445374Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-158a50fa606ffe95.xml 2025-12-04T12:05:01.4445548Z ============================= test session starts ============================== 2025-12-04T12:05:01.4445902Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4446008Z cachedir: .pytest_cache 2025-12-04T12:05:01.4446549Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4446672Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4446779Z configfile: pytest.ini 2025-12-04T12:05:01.4447384Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4447615Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4448433Z stepcurrent: skipping 19 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4448549Z Running 1 items in this shard 2025-12-04T12:05:01.4448554Z 2025-12-04T12:05:01.4449245Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9176s] [100%] 2025-12-04T12:05:01.4449924Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2702s] [100%] 2025-12-04T12:05:01.4450518Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2619s] [100%] 2025-12-04T12:05:01.4450525Z 2025-12-04T12:05:01.4450714Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4451115Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4451250Z Traceback (most recent call last): 2025-12-04T12:05:01.4451761Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4451919Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4452459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4452583Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4452769Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4452774Z 2025-12-04T12:05:01.4453004Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4453722Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4453761Z 2025-12-04T12:05:01.4454039Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4454257Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4454371Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4454495Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4454618Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4454852Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4454951Z graph_break [] 2025-12-04T12:05:01.4455167Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4455915Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4456017Z warnings.warn( 2025-12-04T12:05:01.4456384Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4456518Z Traceback (most recent call last): 2025-12-04T12:05:01.4456986Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4457149Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4457668Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4457789Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4457988Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4457993Z 2025-12-04T12:05:01.4458285Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4459006Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4459024Z 2025-12-04T12:05:01.4459293Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4459509Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4459637Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4459752Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4459874Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4460107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4460207Z graph_break [] 2025-12-04T12:05:01.4460424Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4461172Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4461276Z warnings.warn( 2025-12-04T12:05:01.4461547Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4461660Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4461802Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4462036Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4462156Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4462284Z graph_break [] 2025-12-04T12:05:01.4462510Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4463240Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4463355Z warnings.warn( 2025-12-04T12:05:01.4463500Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4463868Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4464034Z Traceback (most recent call last): 2025-12-04T12:05:01.4464514Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4464668Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4465197Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4465321Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4465520Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4465525Z 2025-12-04T12:05:01.4465739Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4466456Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4466474Z 2025-12-04T12:05:01.4466737Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4466957Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4467082Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4467195Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4467317Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4467547Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4467647Z graph_break [] 2025-12-04T12:05:01.4467862Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4468601Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4468700Z warnings.warn( 2025-12-04T12:05:01.4468923Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4469033Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4469144Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4469379Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4469501Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4469597Z graph_break [] 2025-12-04T12:05:01.4469825Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4470555Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4470667Z warnings.warn( 2025-12-04T12:05:01.4470877Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4470986Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4471109Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4471326Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4471444Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4471548Z graph_break [] 2025-12-04T12:05:01.4471806Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4472581Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4472680Z warnings.warn( 2025-12-04T12:05:01.4473469Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-158a50fa606ffe95.xml - 2025-12-04T12:05:01.4473650Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4474476Z FAILED [0.2619s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4474482Z 2025-12-04T12:05:01.4474709Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4475574Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4475580Z 2025-12-04T12:05:01.4475847Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4476035Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4476309Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.4476421Z Got exit code 1 2025-12-04T12:05:01.4476530Z Retrying single test... 2025-12-04T12:05:01.4477174Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-27359545a6656d51.xml 2025-12-04T12:05:01.4477350Z ============================= test session starts ============================== 2025-12-04T12:05:01.4477742Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4477878Z cachedir: .pytest_cache 2025-12-04T12:05:01.4478415Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4478601Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4478723Z configfile: pytest.ini 2025-12-04T12:05:01.4479389Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4479620Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4480506Z stepcurrent: skipping 19 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4480622Z Running 1 items in this shard 2025-12-04T12:05:01.4480628Z 2025-12-04T12:05:01.4481402Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9298s] [100%] 2025-12-04T12:05:01.4482163Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2750s] [100%] 2025-12-04T12:05:01.4482829Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2760s] [100%] 2025-12-04T12:05:01.4482837Z 2025-12-04T12:05:01.4482990Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4483418Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4483556Z Traceback (most recent call last): 2025-12-04T12:05:01.4484033Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4484188Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4484773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4484927Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4485122Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4485140Z 2025-12-04T12:05:01.4485389Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4486108Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4486113Z 2025-12-04T12:05:01.4486390Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4486607Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4486718Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4486847Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4487003Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4487239Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4487340Z graph_break [] 2025-12-04T12:05:01.4487556Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4488516Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4488630Z warnings.warn( 2025-12-04T12:05:01.4488995Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4489132Z Traceback (most recent call last): 2025-12-04T12:05:01.4489607Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4489777Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4490300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4490428Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4490636Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4490641Z 2025-12-04T12:05:01.4490859Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4491602Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4491608Z 2025-12-04T12:05:01.4491875Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4492094Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4492219Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4492337Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4492457Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4492694Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4492798Z graph_break [] 2025-12-04T12:05:01.4493030Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4493769Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4493872Z warnings.warn( 2025-12-04T12:05:01.4494103Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4494217Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4494332Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4494566Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4494689Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4494788Z graph_break [] 2025-12-04T12:05:01.4495023Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4495869Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4495985Z warnings.warn( 2025-12-04T12:05:01.4496132Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4496536Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4496669Z Traceback (most recent call last): 2025-12-04T12:05:01.4497144Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4497314Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4497838Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4497962Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4498224Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4498279Z 2025-12-04T12:05:01.4498502Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4499224Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4499247Z 2025-12-04T12:05:01.4499515Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4499734Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4499863Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4499976Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4500096Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4500331Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4500428Z graph_break [] 2025-12-04T12:05:01.4500642Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4501393Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4501491Z warnings.warn( 2025-12-04T12:05:01.4501716Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4501826Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4501940Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4502173Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4502292Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4502389Z graph_break [] 2025-12-04T12:05:01.4502613Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4503342Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4503455Z warnings.warn( 2025-12-04T12:05:01.4503668Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4503782Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4503907Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4504126Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4504246Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4504354Z graph_break [] 2025-12-04T12:05:01.4504566Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4505307Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4505407Z warnings.warn( 2025-12-04T12:05:01.4506152Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-27359545a6656d51.xml - 2025-12-04T12:05:01.4506371Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4507235Z FAILED [0.2760s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4507268Z 2025-12-04T12:05:01.4507497Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4508217Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4508223Z 2025-12-04T12:05:01.4508487Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4508679Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4508879Z ================== 1 failed, 294 deselected, 2 rerun in 5.18s ================== 2025-12-04T12:05:01.4509039Z Got exit code 1 2025-12-04T12:05:01.4509688Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4510101Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4510695Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1f2cd5255ddbd20d.xml 2025-12-04T12:05:01.4510857Z ============================= test session starts ============================== 2025-12-04T12:05:01.4511209Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4511331Z cachedir: .pytest_cache 2025-12-04T12:05:01.4511853Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4511988Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4512098Z configfile: pytest.ini 2025-12-04T12:05:01.4512692Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4512928Z collecting ... collected 571 items / 20 deselected / 551 selected 2025-12-04T12:05:01.4513073Z stepcurrent: skipping 20 already run items. 2025-12-04T12:05:01.4513188Z Running 275 items in this shard 2025-12-04T12:05:01.4513203Z 2025-12-04T12:05:01.4513799Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_float16_cuda PASSED [3.4405s] [ 0%] 2025-12-04T12:05:01.4514485Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0927s] [ 0%] 2025-12-04T12:05:01.4515177Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2698s] [ 0%] 2025-12-04T12:05:01.4515778Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2699s] [ 0%] 2025-12-04T12:05:01.4515783Z 2025-12-04T12:05:01.4515935Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4516298Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4516422Z Traceback (most recent call last): 2025-12-04T12:05:01.4516912Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4517070Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4517607Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4517729Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4517921Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4517960Z 2025-12-04T12:05:01.4518218Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4518936Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4518970Z 2025-12-04T12:05:01.4519249Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4519467Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4519579Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4519705Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4519925Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4520046Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4520157Z graph_break [] 2025-12-04T12:05:01.4520371Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4521138Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4521249Z warnings.warn( 2025-12-04T12:05:01.4521611Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4521748Z Traceback (most recent call last): 2025-12-04T12:05:01.4522219Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4522374Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4522904Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4523028Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4523232Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4523239Z 2025-12-04T12:05:01.4523455Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4524179Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4524185Z 2025-12-04T12:05:01.4524463Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4524683Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4524808Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4524923Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4525140Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4525274Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4525369Z graph_break [] 2025-12-04T12:05:01.4525585Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4526331Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4526437Z warnings.warn( 2025-12-04T12:05:01.4526651Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4526773Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4526887Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4527118Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4527237Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4527336Z graph_break [] 2025-12-04T12:05:01.4527559Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4528284Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4528383Z warnings.warn( 2025-12-04T12:05:01.4528576Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4528968Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4529103Z Traceback (most recent call last): 2025-12-04T12:05:01.4529577Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4529759Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4530291Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4530413Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4530604Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4530621Z 2025-12-04T12:05:01.4530836Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4531560Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4531594Z 2025-12-04T12:05:01.4531875Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4532090Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4532204Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4532333Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4532553Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4532685Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4532783Z graph_break [] 2025-12-04T12:05:01.4532999Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4533742Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4533845Z warnings.warn( 2025-12-04T12:05:01.4534066Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4534190Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4534304Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4534531Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4534656Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4534755Z graph_break [] 2025-12-04T12:05:01.4534982Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4535712Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4535812Z warnings.warn( 2025-12-04T12:05:01.4536037Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4536147Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4536260Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4536495Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4536618Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4536726Z graph_break [] 2025-12-04T12:05:01.4536939Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4537670Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4537780Z warnings.warn( 2025-12-04T12:05:01.4538612Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1f2cd5255ddbd20d.xml - 2025-12-04T12:05:01.4538799Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4539671Z FAILED [0.2699s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4539682Z 2025-12-04T12:05:01.4539929Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4540667Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4540702Z 2025-12-04T12:05:01.4540966Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4541165Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4541375Z ============= 1 failed, 1 passed, 20 deselected, 2 rerun in 5.76s ============== 2025-12-04T12:05:01.4541478Z Got exit code 1 2025-12-04T12:05:01.4541600Z Retrying single test... 2025-12-04T12:05:01.4542177Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6e021b62864c2856.xml 2025-12-04T12:05:01.4542371Z ============================= test session starts ============================== 2025-12-04T12:05:01.4542736Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4542846Z cachedir: .pytest_cache 2025-12-04T12:05:01.4543384Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4543512Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4543624Z configfile: pytest.ini 2025-12-04T12:05:01.4544233Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4544464Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4545282Z stepcurrent: skipping 21 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4545404Z Running 1 items in this shard 2025-12-04T12:05:01.4545409Z 2025-12-04T12:05:01.4546092Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9142s] [100%] 2025-12-04T12:05:01.4546794Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2685s] [100%] 2025-12-04T12:05:01.4547389Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2648s] [100%] 2025-12-04T12:05:01.4547395Z 2025-12-04T12:05:01.4547548Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4547910Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4548030Z Traceback (most recent call last): 2025-12-04T12:05:01.4548524Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4548685Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4549221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4549350Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4549539Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4549545Z 2025-12-04T12:05:01.4549772Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4550493Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4550498Z 2025-12-04T12:05:01.4550778Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4551027Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4551143Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4551300Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4551421Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4551643Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4551800Z graph_break [] 2025-12-04T12:05:01.4552015Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4552760Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4552862Z warnings.warn( 2025-12-04T12:05:01.4553224Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4553356Z Traceback (most recent call last): 2025-12-04T12:05:01.4553833Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4554016Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4554551Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4554672Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4554875Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4554880Z 2025-12-04T12:05:01.4555097Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4555814Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4555819Z 2025-12-04T12:05:01.4556094Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4556307Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4556429Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4556546Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4556668Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4556898Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4556997Z graph_break [] 2025-12-04T12:05:01.4557210Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4557953Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4558051Z warnings.warn( 2025-12-04T12:05:01.4558262Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4558384Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4558495Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4558727Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4558850Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4558949Z graph_break [] 2025-12-04T12:05:01.4559176Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4559904Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4560006Z warnings.warn( 2025-12-04T12:05:01.4560156Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4560515Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4560647Z Traceback (most recent call last): 2025-12-04T12:05:01.4561122Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4561275Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4561833Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4561986Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4562178Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4562196Z 2025-12-04T12:05:01.4562411Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4563158Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4563164Z 2025-12-04T12:05:01.4563437Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4563652Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4563760Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4563883Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4564004Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4564264Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4564359Z graph_break [] 2025-12-04T12:05:01.4564573Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4565311Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4565413Z warnings.warn( 2025-12-04T12:05:01.4565624Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4565744Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4565855Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4566081Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4566202Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4566296Z graph_break [] 2025-12-04T12:05:01.4566519Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4567255Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4567357Z warnings.warn( 2025-12-04T12:05:01.4567582Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4567694Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4567818Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4568037Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4568156Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4568263Z graph_break [] 2025-12-04T12:05:01.4568475Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4569203Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4569315Z warnings.warn( 2025-12-04T12:05:01.4570069Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6e021b62864c2856.xml - 2025-12-04T12:05:01.4570247Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4571079Z FAILED [0.2648s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4571087Z 2025-12-04T12:05:01.4571302Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4572028Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4572033Z 2025-12-04T12:05:01.4572296Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4572514Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4572745Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.4572844Z Got exit code 1 2025-12-04T12:05:01.4572961Z Retrying single test... 2025-12-04T12:05:01.4573534Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-49627e719c8e890b.xml 2025-12-04T12:05:01.4573722Z ============================= test session starts ============================== 2025-12-04T12:05:01.4574082Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4574190Z cachedir: .pytest_cache 2025-12-04T12:05:01.4574724Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4574845Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4574952Z configfile: pytest.ini 2025-12-04T12:05:01.4575595Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4575820Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4576631Z stepcurrent: skipping 21 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4576746Z Running 1 items in this shard 2025-12-04T12:05:01.4576751Z 2025-12-04T12:05:01.4577428Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9138s] [100%] 2025-12-04T12:05:01.4578187Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2634s] [100%] 2025-12-04T12:05:01.4578788Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2590s] [100%] 2025-12-04T12:05:01.4578798Z 2025-12-04T12:05:01.4578950Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4579312Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4579434Z Traceback (most recent call last): 2025-12-04T12:05:01.4579925Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4580081Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4580616Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4580739Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4580930Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4580939Z 2025-12-04T12:05:01.4581167Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4581884Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4581890Z 2025-12-04T12:05:01.4582168Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4582384Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4582495Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4582621Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4582742Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4582959Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4583069Z graph_break [] 2025-12-04T12:05:01.4583283Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4584065Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4584196Z warnings.warn( 2025-12-04T12:05:01.4584558Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4584722Z Traceback (most recent call last): 2025-12-04T12:05:01.4585195Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4585349Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4585880Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4586002Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4586201Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4586206Z 2025-12-04T12:05:01.4586421Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4587179Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4587185Z 2025-12-04T12:05:01.4587462Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4587678Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4587956Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4588067Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4588185Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4588419Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4588514Z graph_break [] 2025-12-04T12:05:01.4588727Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4589477Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4589579Z warnings.warn( 2025-12-04T12:05:01.4589809Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4589917Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4590030Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4590261Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4590381Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4590476Z graph_break [] 2025-12-04T12:05:01.4590701Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4591492Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4591649Z warnings.warn( 2025-12-04T12:05:01.4591887Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4592290Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4592477Z Traceback (most recent call last): 2025-12-04T12:05:01.4620655Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4620915Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4621447Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4621571Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4621764Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4621772Z 2025-12-04T12:05:01.4622006Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4622731Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4622740Z 2025-12-04T12:05:01.4623158Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4623453Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4623569Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4623739Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4623861Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4624082Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4624193Z graph_break [] 2025-12-04T12:05:01.4624413Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4625154Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4625272Z warnings.warn( 2025-12-04T12:05:01.4625489Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4625670Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4625784Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4626006Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4626143Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4626242Z graph_break [] 2025-12-04T12:05:01.4626455Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4627199Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4627296Z warnings.warn( 2025-12-04T12:05:01.4627508Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4627629Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4627742Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4627969Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4628095Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4628191Z graph_break [] 2025-12-04T12:05:01.4628421Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4629156Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4629257Z warnings.warn( 2025-12-04T12:05:01.4630024Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-49627e719c8e890b.xml - 2025-12-04T12:05:01.4630195Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4631048Z FAILED [0.2590s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4631058Z 2025-12-04T12:05:01.4631276Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4632003Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4632023Z 2025-12-04T12:05:01.4632286Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4632462Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4632670Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.4632769Z Got exit code 1 2025-12-04T12:05:01.4633403Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4633821Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4634470Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66537c67c1172e95.xml 2025-12-04T12:05:01.4634645Z ============================= test session starts ============================== 2025-12-04T12:05:01.4635000Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4635139Z cachedir: .pytest_cache 2025-12-04T12:05:01.4635674Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4635796Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4635914Z configfile: pytest.ini 2025-12-04T12:05:01.4636507Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4636735Z collecting ... collected 571 items / 22 deselected / 549 selected 2025-12-04T12:05:01.4636935Z stepcurrent: skipping 22 already run items. 2025-12-04T12:05:01.4637049Z Running 273 items in this shard 2025-12-04T12:05:01.4637055Z 2025-12-04T12:05:01.4637654Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_float16_cuda PASSED [3.4395s] [ 0%] 2025-12-04T12:05:01.4638375Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0966s] [ 0%] 2025-12-04T12:05:01.4639080Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2658s] [ 0%] 2025-12-04T12:05:01.4639700Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2694s] [ 0%] 2025-12-04T12:05:01.4639706Z 2025-12-04T12:05:01.4639846Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4640256Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4640381Z Traceback (most recent call last): 2025-12-04T12:05:01.4640859Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4641028Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4641551Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4641674Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4641878Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4641884Z 2025-12-04T12:05:01.4642099Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4642851Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4642862Z 2025-12-04T12:05:01.4643129Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4643347Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4643468Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4643585Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4643815Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4643937Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4644034Z graph_break [] 2025-12-04T12:05:01.4644260Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4644995Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4645098Z warnings.warn( 2025-12-04T12:05:01.4645522Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4645648Z Traceback (most recent call last): 2025-12-04T12:05:01.4646162Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4646314Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4646863Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4646996Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4647184Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4647190Z 2025-12-04T12:05:01.4647404Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4648156Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4648192Z 2025-12-04T12:05:01.4648461Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4648693Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4648804Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4648918Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4649150Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4649271Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4649380Z graph_break [] 2025-12-04T12:05:01.4649594Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4650329Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4650440Z warnings.warn( 2025-12-04T12:05:01.4650653Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4650766Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4650885Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4651107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4651227Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4651332Z graph_break [] 2025-12-04T12:05:01.4651548Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4652284Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4652383Z warnings.warn( 2025-12-04T12:05:01.4652524Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4652920Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4653039Z Traceback (most recent call last): 2025-12-04T12:05:01.4653521Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4653687Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4654204Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4654339Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4654528Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4654534Z 2025-12-04T12:05:01.4654746Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4655497Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4655503Z 2025-12-04T12:05:01.4655765Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4655993Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4656134Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4656279Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4656510Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4656632Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4656759Z graph_break [] 2025-12-04T12:05:01.4656980Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4657713Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4657824Z warnings.warn( 2025-12-04T12:05:01.4658035Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4658220Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4658351Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4658571Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4658735Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4658845Z graph_break [] 2025-12-04T12:05:01.4659063Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4659803Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4659904Z warnings.warn( 2025-12-04T12:05:01.4660122Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4660243Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4660352Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4660569Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4660701Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4660796Z graph_break [] 2025-12-04T12:05:01.4661005Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4661755Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4661855Z warnings.warn( 2025-12-04T12:05:01.4662617Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66537c67c1172e95.xml - 2025-12-04T12:05:01.4662787Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4663651Z FAILED [0.2694s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4663670Z 2025-12-04T12:05:01.4663886Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4664630Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4664637Z 2025-12-04T12:05:01.4664918Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4665095Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4665324Z ============= 1 failed, 1 passed, 22 deselected, 2 rerun in 5.76s ============== 2025-12-04T12:05:01.4665425Z Got exit code 1 2025-12-04T12:05:01.4665533Z Retrying single test... 2025-12-04T12:05:01.4666129Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b567c9f6400d034b.xml 2025-12-04T12:05:01.4666287Z ============================= test session starts ============================== 2025-12-04T12:05:01.4666639Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4666758Z cachedir: .pytest_cache 2025-12-04T12:05:01.4667314Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4667484Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4667592Z configfile: pytest.ini 2025-12-04T12:05:01.4668184Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4668450Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4669273Z stepcurrent: skipping 23 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4669388Z Running 1 items in this shard 2025-12-04T12:05:01.4669401Z 2025-12-04T12:05:01.4670107Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9352s] [100%] 2025-12-04T12:05:01.4670865Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2646s] [100%] 2025-12-04T12:05:01.4671488Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2607s] [100%] 2025-12-04T12:05:01.4671496Z 2025-12-04T12:05:01.4671636Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4672030Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4672148Z Traceback (most recent call last): 2025-12-04T12:05:01.4672627Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4672792Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4673318Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4673446Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4673649Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4673655Z 2025-12-04T12:05:01.4673869Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4674622Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4674627Z 2025-12-04T12:05:01.4674891Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4675112Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4675235Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4675350Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4675484Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4675710Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4675807Z graph_break [] 2025-12-04T12:05:01.4676035Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4676765Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4676868Z warnings.warn( 2025-12-04T12:05:01.4677260Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4677381Z Traceback (most recent call last): 2025-12-04T12:05:01.4677868Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4678021Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4678541Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4678707Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4678925Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4678931Z 2025-12-04T12:05:01.4679159Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4679907Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4679943Z 2025-12-04T12:05:01.4680207Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4680436Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4680550Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4680662Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4680794Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4681017Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4681161Z graph_break [] 2025-12-04T12:05:01.4681375Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4682109Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4682229Z warnings.warn( 2025-12-04T12:05:01.4682440Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4682551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4682676Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4682894Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4683029Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4683123Z graph_break [] 2025-12-04T12:05:01.4683335Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4684082Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4684185Z warnings.warn( 2025-12-04T12:05:01.4684326Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4684721Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4684841Z Traceback (most recent call last): 2025-12-04T12:05:01.4685328Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4685483Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4685999Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4686132Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4686320Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4686328Z 2025-12-04T12:05:01.4686540Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4687294Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4687302Z 2025-12-04T12:05:01.4687567Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4687960Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4688070Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4688184Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4688320Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4688541Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4688657Z graph_break [] 2025-12-04T12:05:01.4688872Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4689707Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4689823Z warnings.warn( 2025-12-04T12:05:01.4690038Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4690185Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4690314Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4690536Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4690657Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4690764Z graph_break [] 2025-12-04T12:05:01.4690978Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4691718Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4691817Z warnings.warn( 2025-12-04T12:05:01.4692075Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4692199Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4692312Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4692531Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4692664Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4692761Z graph_break [] 2025-12-04T12:05:01.4692985Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4693715Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4693813Z warnings.warn( 2025-12-04T12:05:01.4694577Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b567c9f6400d034b.xml - 2025-12-04T12:05:01.4694750Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4695622Z FAILED [0.2607s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4695628Z 2025-12-04T12:05:01.4695846Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4696586Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4696591Z 2025-12-04T12:05:01.4696869Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4697046Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4697255Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.4697355Z Got exit code 1 2025-12-04T12:05:01.4697462Z Retrying single test... 2025-12-04T12:05:01.4698059Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2dff14642abe4d78.xml 2025-12-04T12:05:01.4698287Z ============================= test session starts ============================== 2025-12-04T12:05:01.4698650Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4698772Z cachedir: .pytest_cache 2025-12-04T12:05:01.4699297Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4699435Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4699543Z configfile: pytest.ini 2025-12-04T12:05:01.4700139Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4700383Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4701286Z stepcurrent: skipping 23 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4701416Z Running 1 items in this shard 2025-12-04T12:05:01.4701452Z 2025-12-04T12:05:01.4702157Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9127s] [100%] 2025-12-04T12:05:01.4702856Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2680s] [100%] 2025-12-04T12:05:01.4703481Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2676s] [100%] 2025-12-04T12:05:01.4703487Z 2025-12-04T12:05:01.4703626Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4704060Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4704179Z Traceback (most recent call last): 2025-12-04T12:05:01.4704657Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4704825Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4705349Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4705486Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4705676Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4705682Z 2025-12-04T12:05:01.4705894Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4706654Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4706661Z 2025-12-04T12:05:01.4706929Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4707160Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4707273Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4707390Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4707525Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4707748Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4707844Z graph_break [] 2025-12-04T12:05:01.4708070Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4708803Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4708902Z warnings.warn( 2025-12-04T12:05:01.4709305Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4709428Z Traceback (most recent call last): 2025-12-04T12:05:01.4709914Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4710069Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4710589Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4710724Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4710914Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4710920Z 2025-12-04T12:05:01.4711146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4711882Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4711890Z 2025-12-04T12:05:01.4712189Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4712448Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4712562Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4712721Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4712852Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4713070Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4713181Z graph_break [] 2025-12-04T12:05:01.4713395Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4714128Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4714238Z warnings.warn( 2025-12-04T12:05:01.4714451Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4714595Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4714718Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4714943Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4715077Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4715173Z graph_break [] 2025-12-04T12:05:01.4715392Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4716132Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4716231Z warnings.warn( 2025-12-04T12:05:01.4716375Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4716769Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4716890Z Traceback (most recent call last): 2025-12-04T12:05:01.4717389Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4717546Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4718065Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4718200Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4718388Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4718393Z 2025-12-04T12:05:01.4718625Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4719370Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4719376Z 2025-12-04T12:05:01.4719639Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4719866Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4719980Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4720097Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4720227Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4720449Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4720561Z graph_break [] 2025-12-04T12:05:01.4720777Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4721510Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4721620Z warnings.warn( 2025-12-04T12:05:01.4721836Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4721947Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4722068Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4722288Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4722453Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4722577Z graph_break [] 2025-12-04T12:05:01.4722792Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4723530Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4723675Z warnings.warn( 2025-12-04T12:05:01.4723886Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4724005Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4724118Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4724336Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4724467Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4724565Z graph_break [] 2025-12-04T12:05:01.4724788Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4725553Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4725655Z warnings.warn( 2025-12-04T12:05:01.4726426Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2dff14642abe4d78.xml - 2025-12-04T12:05:01.4726603Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4727461Z FAILED [0.2676s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4727467Z 2025-12-04T12:05:01.4727683Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4728425Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4728444Z 2025-12-04T12:05:01.4728711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4728887Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4729096Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.4729195Z Got exit code 1 2025-12-04T12:05:01.4729848Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4730367Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4730973Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65b5f001c721490a.xml 2025-12-04T12:05:01.4731216Z ============================= test session starts ============================== 2025-12-04T12:05:01.4731588Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4731697Z cachedir: .pytest_cache 2025-12-04T12:05:01.4732318Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4732445Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4732554Z configfile: pytest.ini 2025-12-04T12:05:01.4733245Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4733468Z collecting ... collected 571 items / 24 deselected / 547 selected 2025-12-04T12:05:01.4733699Z stepcurrent: skipping 24 already run items. 2025-12-04T12:05:01.4733813Z Running 271 items in this shard 2025-12-04T12:05:01.4733818Z 2025-12-04T12:05:01.4734585Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_True_float16_cuda PASSED [3.4341s] [ 0%] 2025-12-04T12:05:01.4735313Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0923s] [ 0%] 2025-12-04T12:05:01.4736025Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2690s] [ 0%] 2025-12-04T12:05:01.4736630Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2662s] [ 0%] 2025-12-04T12:05:01.4736637Z 2025-12-04T12:05:01.4736777Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4737140Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4737276Z Traceback (most recent call last): 2025-12-04T12:05:01.4737797Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4737968Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4738568Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4738698Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4738904Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4738909Z 2025-12-04T12:05:01.4739126Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4739863Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4739869Z 2025-12-04T12:05:01.4740135Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4740358Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4740482Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4740599Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4740819Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4740951Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4741051Z graph_break [] 2025-12-04T12:05:01.4741280Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4742010Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4742108Z warnings.warn( 2025-12-04T12:05:01.4742477Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4742600Z Traceback (most recent call last): 2025-12-04T12:05:01.4743074Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4743242Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4743758Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4743890Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4744079Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4744084Z 2025-12-04T12:05:01.4744297Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4745023Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4745029Z 2025-12-04T12:05:01.4745296Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4745523Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4745678Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4745793Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4746056Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4746177Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4746275Z graph_break [] 2025-12-04T12:05:01.4746529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4747263Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4747373Z warnings.warn( 2025-12-04T12:05:01.4747589Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4747697Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4747816Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4748033Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4748191Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4748298Z graph_break [] 2025-12-04T12:05:01.4748513Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4749249Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4749350Z warnings.warn( 2025-12-04T12:05:01.4749492Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4749860Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4749977Z Traceback (most recent call last): 2025-12-04T12:05:01.4750449Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4750610Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4751130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4751262Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4751451Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4751457Z 2025-12-04T12:05:01.4751669Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4752400Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4752408Z 2025-12-04T12:05:01.4752673Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4752900Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4753010Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4753124Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4753353Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4753476Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4753571Z graph_break [] 2025-12-04T12:05:01.4753796Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4754534Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4754645Z warnings.warn( 2025-12-04T12:05:01.4754858Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4754970Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4755091Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4755307Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4755428Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4755536Z graph_break [] 2025-12-04T12:05:01.4755748Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4756558Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4756669Z warnings.warn( 2025-12-04T12:05:01.4756881Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4757033Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4757145Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4757362Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4757488Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4757583Z graph_break [] 2025-12-04T12:05:01.4757794Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4758528Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4758659Z warnings.warn( 2025-12-04T12:05:01.4759425Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65b5f001c721490a.xml - 2025-12-04T12:05:01.4759593Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4760430Z FAILED [0.2662s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4760436Z 2025-12-04T12:05:01.4760661Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4761379Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4761385Z 2025-12-04T12:05:01.4761661Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4761844Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4762059Z ============= 1 failed, 1 passed, 24 deselected, 2 rerun in 5.74s ============== 2025-12-04T12:05:01.4762165Z Got exit code 1 2025-12-04T12:05:01.4762270Z Retrying single test... 2025-12-04T12:05:01.4762858Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-51dd6a289fe92b8c.xml 2025-12-04T12:05:01.4763019Z ============================= test session starts ============================== 2025-12-04T12:05:01.4763374Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4763495Z cachedir: .pytest_cache 2025-12-04T12:05:01.4764018Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4764139Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4764257Z configfile: pytest.ini 2025-12-04T12:05:01.4764853Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4765092Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4765897Z stepcurrent: skipping 25 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4766009Z Running 1 items in this shard 2025-12-04T12:05:01.4766014Z 2025-12-04T12:05:01.4766707Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9171s] [100%] 2025-12-04T12:05:01.4767384Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2617s] [100%] 2025-12-04T12:05:01.4768047Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2598s] [100%] 2025-12-04T12:05:01.4768054Z 2025-12-04T12:05:01.4768192Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4768563Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4768715Z Traceback (most recent call last): 2025-12-04T12:05:01.4769191Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4769358Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4769879Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4770002Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4770199Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4770234Z 2025-12-04T12:05:01.4770451Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4771181Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4771188Z 2025-12-04T12:05:01.4771454Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4771670Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4771792Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4771905Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4772023Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4772252Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4772345Z graph_break [] 2025-12-04T12:05:01.4772570Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4773308Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4773408Z warnings.warn( 2025-12-04T12:05:01.4773787Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4773909Z Traceback (most recent call last): 2025-12-04T12:05:01.4774382Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4774552Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4775070Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4775200Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4775387Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4775393Z 2025-12-04T12:05:01.4775607Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4776337Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4776343Z 2025-12-04T12:05:01.4776606Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4776838Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4776947Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4777059Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4777188Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4777406Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4777503Z graph_break [] 2025-12-04T12:05:01.4777721Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4778569Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4778686Z warnings.warn( 2025-12-04T12:05:01.4778928Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4779038Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4779190Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4779411Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4779534Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4779640Z graph_break [] 2025-12-04T12:05:01.4779852Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4780592Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4780695Z warnings.warn( 2025-12-04T12:05:01.4780838Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4781248Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4781371Z Traceback (most recent call last): 2025-12-04T12:05:01.4781844Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4782009Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4782529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4782665Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4782856Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4782861Z 2025-12-04T12:05:01.4783076Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4783814Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4783824Z 2025-12-04T12:05:01.4784094Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4784320Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4784432Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4784548Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4784683Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4784903Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4785002Z graph_break [] 2025-12-04T12:05:01.4785227Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4785962Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4786075Z warnings.warn( 2025-12-04T12:05:01.4786293Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4786408Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4786536Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4786757Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4786878Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4786989Z graph_break [] 2025-12-04T12:05:01.4787203Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4788164Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4788266Z warnings.warn( 2025-12-04T12:05:01.4788479Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4788604Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4788715Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4789010Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4789145Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4789283Z graph_break [] 2025-12-04T12:05:01.4789494Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4790235Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4790382Z warnings.warn( 2025-12-04T12:05:01.4791144Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-51dd6a289fe92b8c.xml - 2025-12-04T12:05:01.4791313Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4792145Z FAILED [0.2598s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4792204Z 2025-12-04T12:05:01.4792425Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4793148Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4793155Z 2025-12-04T12:05:01.4793428Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4793609Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4793807Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.4793914Z Got exit code 1 2025-12-04T12:05:01.4794021Z Retrying single test... 2025-12-04T12:05:01.4794608Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-fcdeca5855991d94.xml 2025-12-04T12:05:01.4794766Z ============================= test session starts ============================== 2025-12-04T12:05:01.4795125Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4795244Z cachedir: .pytest_cache 2025-12-04T12:05:01.4795768Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4795890Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4796005Z configfile: pytest.ini 2025-12-04T12:05:01.4796599Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4796834Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4797637Z stepcurrent: skipping 25 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4797751Z Running 1 items in this shard 2025-12-04T12:05:01.4797759Z 2025-12-04T12:05:01.4798452Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9018s] [100%] 2025-12-04T12:05:01.4799133Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2649s] [100%] 2025-12-04T12:05:01.4799738Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2610s] [100%] 2025-12-04T12:05:01.4799744Z 2025-12-04T12:05:01.4799881Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4800251Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4800372Z Traceback (most recent call last): 2025-12-04T12:05:01.4800900Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4801099Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4801623Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4801776Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4801978Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4801983Z 2025-12-04T12:05:01.4802196Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4802926Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4802931Z 2025-12-04T12:05:01.4803196Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4803413Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4803578Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4803692Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4803828Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4804049Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4804148Z graph_break [] 2025-12-04T12:05:01.4804374Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4805111Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4805210Z warnings.warn( 2025-12-04T12:05:01.4805583Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4805705Z Traceback (most recent call last): 2025-12-04T12:05:01.4806192Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4806351Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4806870Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4807001Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4807194Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4807200Z 2025-12-04T12:05:01.4807412Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4808143Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4808149Z 2025-12-04T12:05:01.4808413Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4808640Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4808751Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4808871Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4809004Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4809227Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4809324Z graph_break [] 2025-12-04T12:05:01.4809546Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4810283Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4810393Z warnings.warn( 2025-12-04T12:05:01.4810606Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4810716Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4810837Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4811054Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4811175Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4811283Z graph_break [] 2025-12-04T12:05:01.4811532Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4812303Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4812430Z warnings.warn( 2025-12-04T12:05:01.4812571Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4812946Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4813067Z Traceback (most recent call last): 2025-12-04T12:05:01.4813541Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4813704Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4814221Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4814425Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4814619Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4814625Z 2025-12-04T12:05:01.4814842Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4815577Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4815583Z 2025-12-04T12:05:01.4815848Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4816073Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4816185Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4816298Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4816432Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4816651Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4816753Z graph_break [] 2025-12-04T12:05:01.4816980Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4817712Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4817826Z warnings.warn( 2025-12-04T12:05:01.4818039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4818242Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4818368Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4818587Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4818709Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4818815Z graph_break [] 2025-12-04T12:05:01.4819027Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4819770Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4819873Z warnings.warn( 2025-12-04T12:05:01.4820086Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4820208Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4820322Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4820537Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4820674Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4820769Z graph_break [] 2025-12-04T12:05:01.4820978Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4821719Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4821818Z warnings.warn( 2025-12-04T12:05:01.4822654Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-fcdeca5855991d94.xml - 2025-12-04T12:05:01.4822824Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4823658Z FAILED [0.2610s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4823707Z 2025-12-04T12:05:01.4823924Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4824645Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4824651Z 2025-12-04T12:05:01.4824924Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4825102Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4825336Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.4825448Z Got exit code 1 2025-12-04T12:05:01.4826089Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4826512Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4827091Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97fc9d1afafb45ba.xml 2025-12-04T12:05:01.4827253Z ============================= test session starts ============================== 2025-12-04T12:05:01.4827615Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4827727Z cachedir: .pytest_cache 2025-12-04T12:05:01.4828265Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4828392Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4828501Z configfile: pytest.ini 2025-12-04T12:05:01.4829105Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4829334Z collecting ... collected 571 items / 26 deselected / 545 selected 2025-12-04T12:05:01.4829476Z stepcurrent: skipping 26 already run items. 2025-12-04T12:05:01.4829599Z Running 269 items in this shard 2025-12-04T12:05:01.4829604Z 2025-12-04T12:05:01.4830199Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_float16_cuda PASSED [3.4329s] [ 0%] 2025-12-04T12:05:01.4830883Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0920s] [ 0%] 2025-12-04T12:05:01.4831563Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2631s] [ 0%] 2025-12-04T12:05:01.4832288Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2526s] [ 0%] 2025-12-04T12:05:01.4832296Z 2025-12-04T12:05:01.4832436Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4832805Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4833011Z Traceback (most recent call last): 2025-12-04T12:05:01.4833489Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4833669Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4834300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4834439Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4834754Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4834761Z 2025-12-04T12:05:01.4834977Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4835813Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4835819Z 2025-12-04T12:05:01.4836152Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4836405Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4836533Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4836649Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4836874Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4837067Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4837170Z graph_break [] 2025-12-04T12:05:01.4837390Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4838143Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4838248Z warnings.warn( 2025-12-04T12:05:01.4838621Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4838743Z Traceback (most recent call last): 2025-12-04T12:05:01.4839220Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4839390Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4839911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4840038Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4840243Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4840251Z 2025-12-04T12:05:01.4840467Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4841201Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4841209Z 2025-12-04T12:05:01.4841477Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4841696Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4841821Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4841937Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4842173Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4842295Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4842397Z graph_break [] 2025-12-04T12:05:01.4842629Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4843365Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4843470Z warnings.warn( 2025-12-04T12:05:01.4843699Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4843811Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4843937Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4844158Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4844278Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4844389Z graph_break [] 2025-12-04T12:05:01.4844602Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4845365Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4845483Z warnings.warn( 2025-12-04T12:05:01.4845670Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4846044Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4846198Z Traceback (most recent call last): 2025-12-04T12:05:01.4846679Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4846847Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4847367Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4847488Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4847692Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4847697Z 2025-12-04T12:05:01.4847950Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4848674Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4848680Z 2025-12-04T12:05:01.4848946Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4849162Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4849284Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4849399Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4849630Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4849752Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4849848Z graph_break [] 2025-12-04T12:05:01.4850073Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4850811Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4850916Z warnings.warn( 2025-12-04T12:05:01.4851140Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4851249Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4851365Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4851594Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4851776Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4851884Z graph_break [] 2025-12-04T12:05:01.4852096Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4852826Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4852937Z warnings.warn( 2025-12-04T12:05:01.4853149Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4853262Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4853388Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4853607Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4853744Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4853843Z graph_break [] 2025-12-04T12:05:01.4854052Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4854790Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4854891Z warnings.warn( 2025-12-04T12:05:01.4855645Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97fc9d1afafb45ba.xml - 2025-12-04T12:05:01.4855825Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4856719Z FAILED [0.2526s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4856725Z 2025-12-04T12:05:01.4856954Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4857698Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4857703Z 2025-12-04T12:05:01.4857978Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4858266Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4858483Z ============= 1 failed, 1 passed, 26 deselected, 2 rerun in 5.72s ============== 2025-12-04T12:05:01.4858594Z Got exit code 1 2025-12-04T12:05:01.4858701Z Retrying single test... 2025-12-04T12:05:01.4859331Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58940be16b8df1f1.xml 2025-12-04T12:05:01.4859508Z ============================= test session starts ============================== 2025-12-04T12:05:01.4859861Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4859981Z cachedir: .pytest_cache 2025-12-04T12:05:01.4860507Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4860632Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4860753Z configfile: pytest.ini 2025-12-04T12:05:01.4861350Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4861580Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4862399Z stepcurrent: skipping 27 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4862513Z Running 1 items in this shard 2025-12-04T12:05:01.4862518Z 2025-12-04T12:05:01.4863207Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9304s] [100%] 2025-12-04T12:05:01.4863884Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2785s] [100%] 2025-12-04T12:05:01.4864486Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2724s] [100%] 2025-12-04T12:05:01.4864491Z 2025-12-04T12:05:01.4864631Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4864992Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4865130Z Traceback (most recent call last): 2025-12-04T12:05:01.4865613Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4865772Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4866307Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4866432Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4866636Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4866642Z 2025-12-04T12:05:01.4866858Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4867576Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4867595Z 2025-12-04T12:05:01.4867913Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4868163Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4868288Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4868403Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4868555Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4868792Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4868890Z graph_break [] 2025-12-04T12:05:01.4869104Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4869854Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4869955Z warnings.warn( 2025-12-04T12:05:01.4870322Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4870478Z Traceback (most recent call last): 2025-12-04T12:05:01.4870956Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4871125Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4871644Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4871768Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4871974Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4871980Z 2025-12-04T12:05:01.4872194Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4872923Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4872929Z 2025-12-04T12:05:01.4873196Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4873419Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4873544Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4873657Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4873789Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4874009Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4874107Z graph_break [] 2025-12-04T12:05:01.4874333Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4875063Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4875163Z warnings.warn( 2025-12-04T12:05:01.4875387Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4875497Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4875622Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4875842Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4875963Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4876069Z graph_break [] 2025-12-04T12:05:01.4876283Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4877012Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4877124Z warnings.warn( 2025-12-04T12:05:01.4877268Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4877638Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4877760Z Traceback (most recent call last): 2025-12-04T12:05:01.4878237Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4878457Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4879009Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4879134Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4879393Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4879399Z 2025-12-04T12:05:01.4879615Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4880345Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4880350Z 2025-12-04T12:05:01.4880614Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4880833Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4880958Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4881110Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4881246Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4881467Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4881565Z graph_break [] 2025-12-04T12:05:01.4881790Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4882530Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4882630Z warnings.warn( 2025-12-04T12:05:01.4882855Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4882966Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4883079Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4883308Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4883427Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4883536Z graph_break [] 2025-12-04T12:05:01.4883751Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4884483Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4884597Z warnings.warn( 2025-12-04T12:05:01.4884810Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4884919Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4885044Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4885262Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4885394Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4885490Z graph_break [] 2025-12-04T12:05:01.4885704Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4886449Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4886554Z warnings.warn( 2025-12-04T12:05:01.4887308Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58940be16b8df1f1.xml - 2025-12-04T12:05:01.4887491Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4888506Z FAILED [0.2724s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4888512Z 2025-12-04T12:05:01.4888741Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4889457Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4889466Z 2025-12-04T12:05:01.4889840Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4890066Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4890266Z ================== 1 failed, 294 deselected, 2 rerun in 5.18s ================== 2025-12-04T12:05:01.4890419Z Got exit code 1 2025-12-04T12:05:01.4890527Z Retrying single test... 2025-12-04T12:05:01.4891101Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9349febb39593fe3.xml 2025-12-04T12:05:01.4891276Z ============================= test session starts ============================== 2025-12-04T12:05:01.4891630Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4891754Z cachedir: .pytest_cache 2025-12-04T12:05:01.4892281Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4892450Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4892571Z configfile: pytest.ini 2025-12-04T12:05:01.4893169Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4893399Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4894216Z stepcurrent: skipping 27 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4894332Z Running 1 items in this shard 2025-12-04T12:05:01.4894338Z 2025-12-04T12:05:01.4895031Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9075s] [100%] 2025-12-04T12:05:01.4895705Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2663s] [100%] 2025-12-04T12:05:01.4896314Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2616s] [100%] 2025-12-04T12:05:01.4896320Z 2025-12-04T12:05:01.4896463Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4896823Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4896958Z Traceback (most recent call last): 2025-12-04T12:05:01.4897438Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4897597Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4898198Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4898326Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4898534Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4898540Z 2025-12-04T12:05:01.4898757Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4899473Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4899493Z 2025-12-04T12:05:01.4899759Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4899977Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4900102Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4900216Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4900336Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4900571Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4900669Z graph_break [] 2025-12-04T12:05:01.4900931Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4901709Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4901810Z warnings.warn( 2025-12-04T12:05:01.4902207Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4902328Z Traceback (most recent call last): 2025-12-04T12:05:01.4902800Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4902968Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4903487Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4903611Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4903812Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4903850Z 2025-12-04T12:05:01.4904069Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4904797Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4904804Z 2025-12-04T12:05:01.4905069Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4905289Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4905416Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4905531Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4905667Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4905887Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4905983Z graph_break [] 2025-12-04T12:05:01.4906211Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4906950Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4907051Z warnings.warn( 2025-12-04T12:05:01.4907276Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4907388Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4907513Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4907731Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4907853Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4907962Z graph_break [] 2025-12-04T12:05:01.4908176Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4908911Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4909024Z warnings.warn( 2025-12-04T12:05:01.4909170Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4909539Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.4909661Z Traceback (most recent call last): 2025-12-04T12:05:01.4910143Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4910311Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4910828Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4910950Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4911150Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4911156Z 2025-12-04T12:05:01.4911369Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4912152Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4912160Z 2025-12-04T12:05:01.4912424Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4912642Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4912797Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4912909Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4913045Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4913265Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4913363Z graph_break [] 2025-12-04T12:05:01.4913589Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4914323Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4914455Z warnings.warn( 2025-12-04T12:05:01.4914684Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4914796Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4914921Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4915140Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4915263Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4915373Z graph_break [] 2025-12-04T12:05:01.4915586Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4916315Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4916429Z warnings.warn( 2025-12-04T12:05:01.4916640Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4916748Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4916880Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4917102Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4917235Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4917330Z graph_break [] 2025-12-04T12:05:01.4917543Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4918288Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4918388Z warnings.warn( 2025-12-04T12:05:01.4919139Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9349febb39593fe3.xml - 2025-12-04T12:05:01.4919318Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4920143Z FAILED [0.2616s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4920154Z 2025-12-04T12:05:01.4920380Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4921088Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4921096Z 2025-12-04T12:05:01.4921375Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4921550Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4921746Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.4921856Z Got exit code 1 2025-12-04T12:05:01.4922490Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.4922943Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.4923576Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-610ad64c0d2affb1.xml 2025-12-04T12:05:01.4923740Z ============================= test session starts ============================== 2025-12-04T12:05:01.4924136Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4924241Z cachedir: .pytest_cache 2025-12-04T12:05:01.4924767Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4924901Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4925010Z configfile: pytest.ini 2025-12-04T12:05:01.4925612Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4925876Z collecting ... collected 571 items / 28 deselected / 543 selected 2025-12-04T12:05:01.4926023Z stepcurrent: skipping 28 already run items. 2025-12-04T12:05:01.4926150Z Running 267 items in this shard 2025-12-04T12:05:01.4926156Z 2025-12-04T12:05:01.4926746Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erf_load_upcast_to_fp32_False_float16_cuda PASSED [3.4331s] [ 0%] 2025-12-04T12:05:01.4927446Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2224s] [ 0%] 2025-12-04T12:05:01.4928128Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2661s] [ 0%] 2025-12-04T12:05:01.4928716Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2631s] [ 0%] 2025-12-04T12:05:01.4928725Z 2025-12-04T12:05:01.4928880Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4929242Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4929376Z Traceback (most recent call last): 2025-12-04T12:05:01.4929856Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4930009Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4930543Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4930664Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4930857Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4930862Z 2025-12-04T12:05:01.4931087Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4931813Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4931819Z 2025-12-04T12:05:01.4932095Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4932313Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4932426Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4932557Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4932777Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4932909Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4933006Z graph_break [] 2025-12-04T12:05:01.4933222Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4933973Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4934114Z warnings.warn( 2025-12-04T12:05:01.4934509Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4934646Z Traceback (most recent call last): 2025-12-04T12:05:01.4935120Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4935319Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4935839Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4935964Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4936167Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4936172Z 2025-12-04T12:05:01.4936388Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4937108Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4937156Z 2025-12-04T12:05:01.4937424Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4937642Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4937768Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4937882Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4938103Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4938303Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4938403Z graph_break [] 2025-12-04T12:05:01.4938617Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4939364Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4939464Z warnings.warn( 2025-12-04T12:05:01.4939697Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4939811Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4939923Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4940156Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4940278Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4940374Z graph_break [] 2025-12-04T12:05:01.4940599Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4941329Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4941452Z warnings.warn( 2025-12-04T12:05:01.4941596Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4941955Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4942096Z Traceback (most recent call last): 2025-12-04T12:05:01.4942575Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4942733Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4943269Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4943395Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4943602Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4943608Z 2025-12-04T12:05:01.4943823Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4944546Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4944565Z 2025-12-04T12:05:01.4944831Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4945093Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4945249Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4945366Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4945585Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4945751Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4945851Z graph_break [] 2025-12-04T12:05:01.4946067Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4946816Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4946918Z warnings.warn( 2025-12-04T12:05:01.4947147Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4947258Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4947406Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4947644Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4947769Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4947868Z graph_break [] 2025-12-04T12:05:01.4948094Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4948830Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4948945Z warnings.warn( 2025-12-04T12:05:01.4949161Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4949268Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4949396Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4949616Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4949736Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4949852Z graph_break [] 2025-12-04T12:05:01.4950067Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4950811Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4950914Z warnings.warn( 2025-12-04T12:05:01.4951671Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-610ad64c0d2affb1.xml - 2025-12-04T12:05:01.4951859Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4952691Z FAILED [0.2631s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4952696Z 2025-12-04T12:05:01.4952924Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4953650Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4953656Z 2025-12-04T12:05:01.4953919Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4954109Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4954319Z ============= 1 failed, 1 passed, 28 deselected, 2 rerun in 5.87s ============== 2025-12-04T12:05:01.4954431Z Got exit code 1 2025-12-04T12:05:01.4954539Z Retrying single test... 2025-12-04T12:05:01.4955109Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-535142d7b2475ad0.xml 2025-12-04T12:05:01.4955281Z ============================= test session starts ============================== 2025-12-04T12:05:01.4955631Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4955784Z cachedir: .pytest_cache 2025-12-04T12:05:01.4956350Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4956475Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4956594Z configfile: pytest.ini 2025-12-04T12:05:01.4957221Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4957450Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4958264Z stepcurrent: skipping 29 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4958379Z Running 1 items in this shard 2025-12-04T12:05:01.4958384Z 2025-12-04T12:05:01.4959077Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9268s] [100%] 2025-12-04T12:05:01.4959797Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2761s] [100%] 2025-12-04T12:05:01.4960390Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2745s] [100%] 2025-12-04T12:05:01.4960398Z 2025-12-04T12:05:01.4960550Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4960910Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4961044Z Traceback (most recent call last): 2025-12-04T12:05:01.4961522Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4961676Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4962218Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4962341Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4962532Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4962553Z 2025-12-04T12:05:01.4962768Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4963490Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4963495Z 2025-12-04T12:05:01.4963775Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4963995Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4964109Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4964236Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4964360Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4964595Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4964696Z graph_break [] 2025-12-04T12:05:01.4964912Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4965656Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4965758Z warnings.warn( 2025-12-04T12:05:01.4966119Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4966252Z Traceback (most recent call last): 2025-12-04T12:05:01.4966724Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4966892Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4967460Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4967616Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4967821Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4967827Z 2025-12-04T12:05:01.4968039Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4968806Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4968812Z 2025-12-04T12:05:01.4969076Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4969293Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4969420Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4969534Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4969658Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4969932Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4970032Z graph_break [] 2025-12-04T12:05:01.4970261Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4970993Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4971095Z warnings.warn( 2025-12-04T12:05:01.4971319Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4971430Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4971547Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4971777Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4971899Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4971996Z graph_break [] 2025-12-04T12:05:01.4972222Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4972959Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4973071Z warnings.warn( 2025-12-04T12:05:01.4973213Z =================================== FAILURES =================================== 2025-12-04T12:05:01.4973576Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4973709Z Traceback (most recent call last): 2025-12-04T12:05:01.4974185Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4974350Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4974868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4974991Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4975200Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4975208Z 2025-12-04T12:05:01.4975423Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4976143Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4976164Z 2025-12-04T12:05:01.4976429Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4976645Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4976771Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4976886Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4977008Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4977241Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4977339Z graph_break [] 2025-12-04T12:05:01.4977554Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4978443Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4978546Z warnings.warn( 2025-12-04T12:05:01.4978773Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4978916Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4979029Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4979260Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4979380Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4979476Z graph_break [] 2025-12-04T12:05:01.4979702Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4980432Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4980577Z warnings.warn( 2025-12-04T12:05:01.4980795Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4980903Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4981027Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4981246Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4981367Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4981474Z graph_break [] 2025-12-04T12:05:01.4981686Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4982423Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4982522Z warnings.warn( 2025-12-04T12:05:01.4983273Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-535142d7b2475ad0.xml - 2025-12-04T12:05:01.4983459Z =========================== short test summary info ============================ 2025-12-04T12:05:01.4984289Z FAILED [0.2745s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4984297Z 2025-12-04T12:05:01.4984523Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4985242Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4985247Z 2025-12-04T12:05:01.4985511Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4985703Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.4985903Z ================== 1 failed, 294 deselected, 2 rerun in 5.17s ================== 2025-12-04T12:05:01.4986020Z Got exit code 1 2025-12-04T12:05:01.4986129Z Retrying single test... 2025-12-04T12:05:01.4986711Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-44b8ebfaa38db266.xml 2025-12-04T12:05:01.4986888Z ============================= test session starts ============================== 2025-12-04T12:05:01.4987241Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.4987349Z cachedir: .pytest_cache 2025-12-04T12:05:01.4988080Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.4988207Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.4988330Z configfile: pytest.ini 2025-12-04T12:05:01.4988929Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.4989229Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.4990086Z stepcurrent: skipping 29 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4990201Z Running 1 items in this shard 2025-12-04T12:05:01.4990245Z 2025-12-04T12:05:01.4990940Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9284s] [100%] 2025-12-04T12:05:01.4991620Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2665s] [100%] 2025-12-04T12:05:01.4992212Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2641s] [100%] 2025-12-04T12:05:01.4992219Z 2025-12-04T12:05:01.4992435Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.4992800Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4992933Z Traceback (most recent call last): 2025-12-04T12:05:01.4993413Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4993574Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4994107Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4994233Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4994424Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4994442Z 2025-12-04T12:05:01.4994657Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.4995391Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.4995398Z 2025-12-04T12:05:01.4995680Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.4995900Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.4996012Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.4996141Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.4996263Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.4996499Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.4996598Z graph_break [] 2025-12-04T12:05:01.4996814Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.4997559Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.4997664Z warnings.warn( 2025-12-04T12:05:01.4998027Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.4998166Z Traceback (most recent call last): 2025-12-04T12:05:01.4998643Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.4998814Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.4999333Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.4999456Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.4999661Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.4999666Z 2025-12-04T12:05:01.4999883Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5000619Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5000627Z 2025-12-04T12:05:01.5000927Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5001179Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5001309Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5001455Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5001578Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5001814Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5001914Z graph_break [] 2025-12-04T12:05:01.5002142Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5002880Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5002983Z warnings.warn( 2025-12-04T12:05:01.5003211Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5003373Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5003488Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5003724Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5003845Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5003948Z graph_break [] 2025-12-04T12:05:01.5004180Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5004909Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5005023Z warnings.warn( 2025-12-04T12:05:01.5005165Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5005527Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5005664Z Traceback (most recent call last): 2025-12-04T12:05:01.5006145Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5006316Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5006831Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5006957Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5007164Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5007169Z 2025-12-04T12:05:01.5007379Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5008095Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5008115Z 2025-12-04T12:05:01.5008378Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5008596Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5008723Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5008840Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5008960Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5009189Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5009295Z graph_break [] 2025-12-04T12:05:01.5009509Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5010255Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5010358Z warnings.warn( 2025-12-04T12:05:01.5010584Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5010693Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5010806Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5011071Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5011195Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5011325Z graph_break [] 2025-12-04T12:05:01.5011553Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5012283Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5012426Z warnings.warn( 2025-12-04T12:05:01.5012642Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5012754Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5012880Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5013100Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5013222Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5013330Z graph_break [] 2025-12-04T12:05:01.5013541Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5014316Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5014417Z warnings.warn( 2025-12-04T12:05:01.5015174Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-44b8ebfaa38db266.xml - 2025-12-04T12:05:01.5015362Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5016194Z FAILED [0.2641s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5016200Z 2025-12-04T12:05:01.5016428Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5017151Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5017159Z 2025-12-04T12:05:01.5017426Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5017615Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5017815Z ================== 1 failed, 294 deselected, 2 rerun in 5.16s ================== 2025-12-04T12:05:01.5017927Z Got exit code 1 2025-12-04T12:05:01.5018638Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5019050Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5019636Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bba5c35945b28810.xml 2025-12-04T12:05:01.5019802Z ============================= test session starts ============================== 2025-12-04T12:05:01.5020168Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5020278Z cachedir: .pytest_cache 2025-12-04T12:05:01.5020801Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5020941Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5021049Z configfile: pytest.ini 2025-12-04T12:05:01.5021641Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5021885Z collecting ... collected 571 items / 30 deselected / 541 selected 2025-12-04T12:05:01.5022032Z stepcurrent: skipping 30 already run items. 2025-12-04T12:05:01.5022160Z Running 265 items in this shard 2025-12-04T12:05:01.5022165Z 2025-12-04T12:05:01.5022881Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9265s] [ 0%] 2025-12-04T12:05:01.5023590Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2645s] [ 0%] 2025-12-04T12:05:01.5024225Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2561s] [ 0%] 2025-12-04T12:05:01.5024231Z 2025-12-04T12:05:01.5024371Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5024742Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5024865Z Traceback (most recent call last): 2025-12-04T12:05:01.5025341Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5025543Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5026071Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5026206Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5026396Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5026404Z 2025-12-04T12:05:01.5026622Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5027350Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5027356Z 2025-12-04T12:05:01.5027620Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5027850Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5027961Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5028076Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5028211Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5028431Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5028527Z graph_break [] 2025-12-04T12:05:01.5028754Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5029487Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5029588Z warnings.warn( 2025-12-04T12:05:01.5029957Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5030077Z Traceback (most recent call last): 2025-12-04T12:05:01.5030566Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5030724Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5031248Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5031384Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5031576Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5031581Z 2025-12-04T12:05:01.5031810Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5032522Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5032528Z 2025-12-04T12:05:01.5032794Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5033022Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5033134Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5033248Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5033381Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5033633Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5033751Z graph_break [] 2025-12-04T12:05:01.5034004Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5034738Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5034881Z warnings.warn( 2025-12-04T12:05:01.5035093Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5035202Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5035325Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5035544Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5035677Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5035773Z graph_break [] 2025-12-04T12:05:01.5035986Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5036761Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5036860Z warnings.warn( 2025-12-04T12:05:01.5037003Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5037372Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5037494Z Traceback (most recent call last): 2025-12-04T12:05:01.5037982Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5038135Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5038652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5038786Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5038981Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5038987Z 2025-12-04T12:05:01.5039209Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5039934Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5039942Z 2025-12-04T12:05:01.5040208Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5040437Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5040551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5040670Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5040805Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5041026Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5041137Z graph_break [] 2025-12-04T12:05:01.5041353Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5042091Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5042204Z warnings.warn( 2025-12-04T12:05:01.5042417Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5042534Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5042666Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5042883Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5043003Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5043113Z graph_break [] 2025-12-04T12:05:01.5043324Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5044062Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5044194Z warnings.warn( 2025-12-04T12:05:01.5044439Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5044561Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5044674Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5044938Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5045071Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5045167Z graph_break [] 2025-12-04T12:05:01.5045389Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5046112Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5046212Z warnings.warn( 2025-12-04T12:05:01.5046992Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bba5c35945b28810.xml - 2025-12-04T12:05:01.5047204Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5048046Z FAILED [0.2561s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5048053Z 2025-12-04T12:05:01.5048268Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5048984Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5048990Z 2025-12-04T12:05:01.5049273Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5049453Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5049665Z ================== 1 failed, 30 deselected, 2 rerun in 5.13s =================== 2025-12-04T12:05:01.5049771Z Got exit code 1 2025-12-04T12:05:01.5049880Z Retrying single test... 2025-12-04T12:05:01.5050472Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a3c90c249d149d10.xml 2025-12-04T12:05:01.5050637Z ============================= test session starts ============================== 2025-12-04T12:05:01.5050989Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5051114Z cachedir: .pytest_cache 2025-12-04T12:05:01.5051641Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5051780Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5051889Z configfile: pytest.ini 2025-12-04T12:05:01.5052484Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5052733Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5053541Z stepcurrent: skipping 30 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5053674Z Running 1 items in this shard 2025-12-04T12:05:01.5053679Z 2025-12-04T12:05:01.5054358Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9232s] [100%] 2025-12-04T12:05:01.5055033Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2677s] [100%] 2025-12-04T12:05:01.5055639Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2586s] [100%] 2025-12-04T12:05:01.5055647Z 2025-12-04T12:05:01.5055832Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5056243Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5056367Z Traceback (most recent call last): 2025-12-04T12:05:01.5056853Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5057051Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5057578Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5057719Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5057915Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5057920Z 2025-12-04T12:05:01.5058210Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5058946Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5058994Z 2025-12-04T12:05:01.5059264Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5059496Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5059614Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5059727Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5059863Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5060085Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5060183Z graph_break [] 2025-12-04T12:05:01.5060412Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5061140Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5061240Z warnings.warn( 2025-12-04T12:05:01.5061614Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5061736Z Traceback (most recent call last): 2025-12-04T12:05:01.5062220Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5062376Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5062895Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5063028Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5063219Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5063225Z 2025-12-04T12:05:01.5063451Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5064164Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5064174Z 2025-12-04T12:05:01.5064441Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5064665Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5064775Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5064891Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5065023Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5065245Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5065358Z graph_break [] 2025-12-04T12:05:01.5065570Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5066300Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5066412Z warnings.warn( 2025-12-04T12:05:01.5066668Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5066782Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5066942Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5067166Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5067297Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5067431Z graph_break [] 2025-12-04T12:05:01.5067645Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5068387Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5068488Z warnings.warn( 2025-12-04T12:05:01.5068632Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5069000Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5069120Z Traceback (most recent call last): 2025-12-04T12:05:01.5069645Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5069802Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5070318Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5070455Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5070643Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5070649Z 2025-12-04T12:05:01.5070867Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5071599Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5071604Z 2025-12-04T12:05:01.5071869Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5072102Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5072212Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5072327Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5072461Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5072681Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5072792Z graph_break [] 2025-12-04T12:05:01.5073006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5073736Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5073849Z warnings.warn( 2025-12-04T12:05:01.5074062Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5074172Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5074297Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5074520Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5074654Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5074753Z graph_break [] 2025-12-04T12:05:01.5074966Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5075705Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5075812Z warnings.warn( 2025-12-04T12:05:01.5076024Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5076145Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5076258Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5076476Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5076612Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5076707Z graph_break [] 2025-12-04T12:05:01.5076976Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5077738Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5077842Z warnings.warn( 2025-12-04T12:05:01.5078644Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a3c90c249d149d10.xml - 2025-12-04T12:05:01.5078817Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5079654Z FAILED [0.2586s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5079660Z 2025-12-04T12:05:01.5079876Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5080586Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5080628Z 2025-12-04T12:05:01.5080905Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5081083Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5081295Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.5081392Z Got exit code 1 2025-12-04T12:05:01.5081497Z Retrying single test... 2025-12-04T12:05:01.5082086Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c896d4e441aec272.xml 2025-12-04T12:05:01.5082246Z ============================= test session starts ============================== 2025-12-04T12:05:01.5082600Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5082721Z cachedir: .pytest_cache 2025-12-04T12:05:01.5083249Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5083386Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5083494Z configfile: pytest.ini 2025-12-04T12:05:01.5084089Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5084334Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5085139Z stepcurrent: skipping 30 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5085265Z Running 1 items in this shard 2025-12-04T12:05:01.5085270Z 2025-12-04T12:05:01.5085943Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9091s] [100%] 2025-12-04T12:05:01.5086621Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2650s] [100%] 2025-12-04T12:05:01.5087220Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2602s] [100%] 2025-12-04T12:05:01.5087228Z 2025-12-04T12:05:01.5087369Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5087934Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5088110Z Traceback (most recent call last): 2025-12-04T12:05:01.5088592Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5088762Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5089381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5089582Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5089777Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5089783Z 2025-12-04T12:05:01.5089998Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5090770Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5090776Z 2025-12-04T12:05:01.5091040Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5091274Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5091389Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5091503Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5091635Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5091902Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5091999Z graph_break [] 2025-12-04T12:05:01.5092229Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5092965Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5093086Z warnings.warn( 2025-12-04T12:05:01.5093442Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5093564Z Traceback (most recent call last): 2025-12-04T12:05:01.5094051Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5094207Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5094722Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5094864Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5095055Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5095061Z 2025-12-04T12:05:01.5095292Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5096006Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5096013Z 2025-12-04T12:05:01.5096279Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5096510Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5096623Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5096750Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5096872Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5097094Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5097209Z graph_break [] 2025-12-04T12:05:01.5097426Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5098217Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5098338Z warnings.warn( 2025-12-04T12:05:01.5106937Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5107145Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5107284Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5107528Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5107653Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5107768Z graph_break [] 2025-12-04T12:05:01.5107995Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5108881Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5109033Z warnings.warn( 2025-12-04T12:05:01.5109183Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5109560Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5109729Z Traceback (most recent call last): 2025-12-04T12:05:01.5110210Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5110384Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5110912Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5111050Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5111243Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5111291Z 2025-12-04T12:05:01.5111512Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5112244Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5112250Z 2025-12-04T12:05:01.5112521Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5112756Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5112870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5112986Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5113125Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5113350Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5113448Z graph_break [] 2025-12-04T12:05:01.5113679Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5114425Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5114543Z warnings.warn( 2025-12-04T12:05:01.5114756Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5114870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5114998Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5115220Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5115344Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5115455Z graph_break [] 2025-12-04T12:05:01.5115669Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5116409Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5116511Z warnings.warn( 2025-12-04T12:05:01.5116729Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5116854Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5116972Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5117191Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5117330Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5117429Z graph_break [] 2025-12-04T12:05:01.5117642Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5118382Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5118481Z warnings.warn( 2025-12-04T12:05:01.5119245Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c896d4e441aec272.xml - 2025-12-04T12:05:01.5119413Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5120307Z FAILED [0.2602s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5120327Z 2025-12-04T12:05:01.5120544Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5121291Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5121297Z 2025-12-04T12:05:01.5121575Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5121752Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5121962Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.5122061Z Got exit code 1 2025-12-04T12:05:01.5122725Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5123153Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5123730Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0f626bbdc83148be.xml 2025-12-04T12:05:01.5123894Z ============================= test session starts ============================== 2025-12-04T12:05:01.5124261Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5124371Z cachedir: .pytest_cache 2025-12-04T12:05:01.5124906Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5125038Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5125150Z configfile: pytest.ini 2025-12-04T12:05:01.5125764Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5125988Z collecting ... collected 571 items / 31 deselected / 540 selected 2025-12-04T12:05:01.5126131Z stepcurrent: skipping 31 already run items. 2025-12-04T12:05:01.5126258Z Running 264 items in this shard 2025-12-04T12:05:01.5126263Z 2025-12-04T12:05:01.5126855Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_float16_cuda PASSED [3.4418s] [ 0%] 2025-12-04T12:05:01.5127564Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2230s] [ 0%] 2025-12-04T12:05:01.5128253Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2714s] [ 0%] 2025-12-04T12:05:01.5128876Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2677s] [ 0%] 2025-12-04T12:05:01.5128881Z 2025-12-04T12:05:01.5129022Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5129396Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5129531Z Traceback (most recent call last): 2025-12-04T12:05:01.5130007Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5130166Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5130702Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5130828Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5131032Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5131040Z 2025-12-04T12:05:01.5131285Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5132049Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5132085Z 2025-12-04T12:05:01.5132364Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5132584Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5132709Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5132825Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5133049Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5133182Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5133281Z graph_break [] 2025-12-04T12:05:01.5133499Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5134285Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5134387Z warnings.warn( 2025-12-04T12:05:01.5134775Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5134899Z Traceback (most recent call last): 2025-12-04T12:05:01.5135374Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5135544Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5136063Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5136184Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5136389Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5136395Z 2025-12-04T12:05:01.5136610Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5137361Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5137366Z 2025-12-04T12:05:01.5137632Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5137853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5137977Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5138091Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5138433Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5138557Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5138655Z graph_break [] 2025-12-04T12:05:01.5138892Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5139633Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5139739Z warnings.warn( 2025-12-04T12:05:01.5139965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5140075Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5140204Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5140423Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5140543Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5140654Z graph_break [] 2025-12-04T12:05:01.5140870Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5141601Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5141712Z warnings.warn( 2025-12-04T12:05:01.5141855Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5142327Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5142448Z Traceback (most recent call last): 2025-12-04T12:05:01.5142924Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5143122Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5143644Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5143766Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5143973Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5143979Z 2025-12-04T12:05:01.5144195Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5144938Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5144974Z 2025-12-04T12:05:01.5145243Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5145457Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5145581Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5145696Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5145925Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5146045Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5146145Z graph_break [] 2025-12-04T12:05:01.5146368Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5147102Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5147202Z warnings.warn( 2025-12-04T12:05:01.5147431Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5147540Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5147656Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5147887Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5148007Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5148117Z graph_break [] 2025-12-04T12:05:01.5148329Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5149058Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5149169Z warnings.warn( 2025-12-04T12:05:01.5149382Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5149496Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5149621Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5149843Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5149976Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5150076Z graph_break [] 2025-12-04T12:05:01.5150288Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5151029Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5151134Z warnings.warn( 2025-12-04T12:05:01.5151885Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0f626bbdc83148be.xml - 2025-12-04T12:05:01.5152069Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5152915Z FAILED [0.2677s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5152954Z 2025-12-04T12:05:01.5153214Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5153946Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5153985Z 2025-12-04T12:05:01.5154264Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5154442Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5154655Z ============= 1 failed, 1 passed, 31 deselected, 2 rerun in 5.88s ============== 2025-12-04T12:05:01.5154765Z Got exit code 1 2025-12-04T12:05:01.5154873Z Retrying single test... 2025-12-04T12:05:01.5155447Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e68881511844a645.xml 2025-12-04T12:05:01.5155622Z ============================= test session starts ============================== 2025-12-04T12:05:01.5156006Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5156130Z cachedir: .pytest_cache 2025-12-04T12:05:01.5156654Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5156781Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5156901Z configfile: pytest.ini 2025-12-04T12:05:01.5157503Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5157733Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5158566Z stepcurrent: skipping 32 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5158682Z Running 1 items in this shard 2025-12-04T12:05:01.5158690Z 2025-12-04T12:05:01.5159399Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9283s] [100%] 2025-12-04T12:05:01.5160092Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2706s] [100%] 2025-12-04T12:05:01.5160711Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2624s] [100%] 2025-12-04T12:05:01.5160717Z 2025-12-04T12:05:01.5160858Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5161231Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5161370Z Traceback (most recent call last): 2025-12-04T12:05:01.5161853Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5162013Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5162547Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5162674Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5162879Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5162885Z 2025-12-04T12:05:01.5163099Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5163824Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5163843Z 2025-12-04T12:05:01.5164113Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5164332Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5164492Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5164641Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5164764Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5164999Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5165129Z graph_break [] 2025-12-04T12:05:01.5165342Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5166090Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5166193Z warnings.warn( 2025-12-04T12:05:01.5166577Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5166699Z Traceback (most recent call last): 2025-12-04T12:05:01.5167174Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5167375Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5167896Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5168034Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5168229Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5168234Z 2025-12-04T12:05:01.5168447Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5169193Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5169198Z 2025-12-04T12:05:01.5169468Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5169699Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5169812Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5169930Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5170066Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5170287Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5170384Z graph_break [] 2025-12-04T12:05:01.5170607Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5171343Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5171443Z warnings.warn( 2025-12-04T12:05:01.5171669Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5171780Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5171906Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5172124Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5172248Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5172358Z graph_break [] 2025-12-04T12:05:01.5172571Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5173302Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5173414Z warnings.warn( 2025-12-04T12:05:01.5173557Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5173943Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5174068Z Traceback (most recent call last): 2025-12-04T12:05:01.5174545Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5174713Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5175263Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5175391Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5175624Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5175630Z 2025-12-04T12:05:01.5175843Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5176614Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5176620Z 2025-12-04T12:05:01.5176887Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5177105Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5177228Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5177343Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5177479Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5177767Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5177866Z graph_break [] 2025-12-04T12:05:01.5178094Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5178922Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5179029Z warnings.warn( 2025-12-04T12:05:01.5179254Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5179364Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5179491Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5179711Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5179831Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5179939Z graph_break [] 2025-12-04T12:05:01.5180152Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5180890Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5181001Z warnings.warn( 2025-12-04T12:05:01.5181214Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5181326Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5181452Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5181672Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5181804Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5181899Z graph_break [] 2025-12-04T12:05:01.5182115Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5182854Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5182957Z warnings.warn( 2025-12-04T12:05:01.5183714Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e68881511844a645.xml - 2025-12-04T12:05:01.5183898Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5184738Z FAILED [0.2624s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5184746Z 2025-12-04T12:05:01.5184977Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5185706Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5185711Z 2025-12-04T12:05:01.5185990Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5186214Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5186446Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.5186563Z Got exit code 1 2025-12-04T12:05:01.5186673Z Retrying single test... 2025-12-04T12:05:01.5187243Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a161840b55d48987.xml 2025-12-04T12:05:01.5187447Z ============================= test session starts ============================== 2025-12-04T12:05:01.5187986Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5188112Z cachedir: .pytest_cache 2025-12-04T12:05:01.5188639Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5188764Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5188887Z configfile: pytest.ini 2025-12-04T12:05:01.5189561Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5189792Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5190625Z stepcurrent: skipping 32 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5190742Z Running 1 items in this shard 2025-12-04T12:05:01.5190747Z 2025-12-04T12:05:01.5191463Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9124s] [100%] 2025-12-04T12:05:01.5192155Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2611s] [100%] 2025-12-04T12:05:01.5192771Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2606s] [100%] 2025-12-04T12:05:01.5192781Z 2025-12-04T12:05:01.5192925Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5193295Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5193436Z Traceback (most recent call last): 2025-12-04T12:05:01.5193915Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5194086Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5194606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5194729Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5194929Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5194937Z 2025-12-04T12:05:01.5195156Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5195887Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5195905Z 2025-12-04T12:05:01.5196172Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5196390Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5196518Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5196631Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5196753Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5196987Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5197085Z graph_break [] 2025-12-04T12:05:01.5197298Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5198136Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5198235Z warnings.warn( 2025-12-04T12:05:01.5198617Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5198782Z Traceback (most recent call last): 2025-12-04T12:05:01.5199257Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5199427Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5199945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5200084Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5200273Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5200278Z 2025-12-04T12:05:01.5200492Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5201263Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5201270Z 2025-12-04T12:05:01.5201534Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5201768Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5201880Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5201996Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5202130Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5202349Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5202447Z graph_break [] 2025-12-04T12:05:01.5202673Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5203407Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5203524Z warnings.warn( 2025-12-04T12:05:01.5203738Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5203849Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5203975Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5204195Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5204319Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5204434Z graph_break [] 2025-12-04T12:05:01.5204648Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5205378Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5205489Z warnings.warn( 2025-12-04T12:05:01.5205630Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5206021Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5206145Z Traceback (most recent call last): 2025-12-04T12:05:01.5206622Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5206792Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5207308Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5207433Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5207637Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5207644Z 2025-12-04T12:05:01.5207859Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5208600Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5208636Z 2025-12-04T12:05:01.5208932Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5209152Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5209276Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5209419Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5209553Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5209772Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5209870Z graph_break [] 2025-12-04T12:05:01.5210098Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5210833Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5210934Z warnings.warn( 2025-12-04T12:05:01.5211159Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5211302Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5211431Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5211649Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5211771Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5211882Z graph_break [] 2025-12-04T12:05:01.5212095Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5212822Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5212943Z warnings.warn( 2025-12-04T12:05:01.5213155Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5213275Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5213388Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5213607Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5213741Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5213839Z graph_break [] 2025-12-04T12:05:01.5214058Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5214798Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5214902Z warnings.warn( 2025-12-04T12:05:01.5215669Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a161840b55d48987.xml - 2025-12-04T12:05:01.5215840Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5216684Z FAILED [0.2606s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5216693Z 2025-12-04T12:05:01.5216922Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5217652Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5217659Z 2025-12-04T12:05:01.5217938Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5218209Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5218412Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.5218523Z Got exit code 1 2025-12-04T12:05:01.5219173Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5219597Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5220247Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-84da9e83d0a63a49.xml 2025-12-04T12:05:01.5220415Z ============================= test session starts ============================== 2025-12-04T12:05:01.5220777Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5220934Z cachedir: .pytest_cache 2025-12-04T12:05:01.5221457Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5221596Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5221705Z configfile: pytest.ini 2025-12-04T12:05:01.5222310Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5222537Z collecting ... collected 571 items / 33 deselected / 538 selected 2025-12-04T12:05:01.5222713Z stepcurrent: skipping 33 already run items. 2025-12-04T12:05:01.5222840Z Running 262 items in this shard 2025-12-04T12:05:01.5222847Z 2025-12-04T12:05:01.5223458Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_float16_cuda PASSED [3.4423s] [ 0%] 2025-12-04T12:05:01.5224166Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0957s] [ 0%] 2025-12-04T12:05:01.5224850Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2678s] [ 0%] 2025-12-04T12:05:01.5225453Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2709s] [ 0%] 2025-12-04T12:05:01.5225459Z 2025-12-04T12:05:01.5225613Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5225988Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5226124Z Traceback (most recent call last): 2025-12-04T12:05:01.5226597Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5226752Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5227284Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5227405Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5227603Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5227609Z 2025-12-04T12:05:01.5227822Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5228552Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5228560Z 2025-12-04T12:05:01.5228839Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5229054Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5229180Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5229295Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5229515Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5229646Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5229742Z graph_break [] 2025-12-04T12:05:01.5229956Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5230703Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5230802Z warnings.warn( 2025-12-04T12:05:01.5231202Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5231365Z Traceback (most recent call last): 2025-12-04T12:05:01.5231842Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5232008Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5232553Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5232675Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5232875Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5232881Z 2025-12-04T12:05:01.5233094Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5233824Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5233872Z 2025-12-04T12:05:01.5234140Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5234361Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5234484Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5234601Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5234825Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5234957Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5235054Z graph_break [] 2025-12-04T12:05:01.5235282Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5236015Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5236116Z warnings.warn( 2025-12-04T12:05:01.5236339Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5236454Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5236566Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5236796Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5236917Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5237024Z graph_break [] 2025-12-04T12:05:01.5237236Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5237964Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5238072Z warnings.warn( 2025-12-04T12:05:01.5238213Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5238582Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5238711Z Traceback (most recent call last): 2025-12-04T12:05:01.5239194Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5239366Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5239885Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5240009Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5240210Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5240215Z 2025-12-04T12:05:01.5240431Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5241164Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5241170Z 2025-12-04T12:05:01.5241435Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5241652Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5241813Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5241958Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5242180Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5242311Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5242438Z graph_break [] 2025-12-04T12:05:01.5242665Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5243401Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5243500Z warnings.warn( 2025-12-04T12:05:01.5243724Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5243837Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5243952Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5244181Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5244336Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5244433Z graph_break [] 2025-12-04T12:05:01.5244657Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5245389Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5245499Z warnings.warn( 2025-12-04T12:05:01.5245711Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5245821Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5245942Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5246160Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5246281Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5246386Z graph_break [] 2025-12-04T12:05:01.5246599Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5247342Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5247441Z warnings.warn( 2025-12-04T12:05:01.5248188Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-84da9e83d0a63a49.xml - 2025-12-04T12:05:01.5248371Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5249201Z FAILED [0.2709s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5249206Z 2025-12-04T12:05:01.5249432Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5250164Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5250172Z 2025-12-04T12:05:01.5250439Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5250625Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5250840Z ============= 1 failed, 1 passed, 33 deselected, 2 rerun in 5.76s ============== 2025-12-04T12:05:01.5250948Z Got exit code 1 2025-12-04T12:05:01.5251055Z Retrying single test... 2025-12-04T12:05:01.5251635Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5ba1ce04884dd264.xml 2025-12-04T12:05:01.5251806Z ============================= test session starts ============================== 2025-12-04T12:05:01.5252156Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5252266Z cachedir: .pytest_cache 2025-12-04T12:05:01.5252834Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5252991Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5253112Z configfile: pytest.ini 2025-12-04T12:05:01.5253703Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5253962Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5254783Z stepcurrent: skipping 34 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5254895Z Running 1 items in this shard 2025-12-04T12:05:01.5254900Z 2025-12-04T12:05:01.5255601Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9083s] [100%] 2025-12-04T12:05:01.5256320Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2644s] [100%] 2025-12-04T12:05:01.5256919Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2649s] [100%] 2025-12-04T12:05:01.5256938Z 2025-12-04T12:05:01.5257079Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5257445Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5257577Z Traceback (most recent call last): 2025-12-04T12:05:01.5258052Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5258290Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5258829Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5258957Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5259162Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5259168Z 2025-12-04T12:05:01.5259381Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5260107Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5260112Z 2025-12-04T12:05:01.5260388Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5260604Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5260730Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5260846Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5260966Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5261203Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5261306Z graph_break [] 2025-12-04T12:05:01.5261523Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5262270Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5262373Z warnings.warn( 2025-12-04T12:05:01.5262755Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5262875Z Traceback (most recent call last): 2025-12-04T12:05:01.5263348Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5263513Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5264028Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5264197Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5264400Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5264451Z 2025-12-04T12:05:01.5264669Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5265400Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5265437Z 2025-12-04T12:05:01.5265703Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5265920Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5266043Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5266156Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5266285Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5266505Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5266639Z graph_break [] 2025-12-04T12:05:01.5266872Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5267611Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5267715Z warnings.warn( 2025-12-04T12:05:01.5267940Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5268050Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5268165Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5268400Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5268522Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5268633Z graph_break [] 2025-12-04T12:05:01.5268849Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5269583Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5269698Z warnings.warn( 2025-12-04T12:05:01.5269844Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5270215Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5270350Z Traceback (most recent call last): 2025-12-04T12:05:01.5270828Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5270995Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5271513Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5271637Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5271842Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5271847Z 2025-12-04T12:05:01.5272070Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5272805Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5272811Z 2025-12-04T12:05:01.5273080Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5273297Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5273421Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5273536Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5273661Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5273894Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5273993Z graph_break [] 2025-12-04T12:05:01.5274217Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5274984Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5275655Z warnings.warn( 2025-12-04T12:05:01.5275885Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5275996Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5276145Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5276378Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5276500Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5276607Z graph_break [] 2025-12-04T12:05:01.5276823Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5277552Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5277667Z warnings.warn( 2025-12-04T12:05:01.5277883Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5278027Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5278153Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5278368Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5278487Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5278596Z graph_break [] 2025-12-04T12:05:01.5278807Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5279544Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5279643Z warnings.warn( 2025-12-04T12:05:01.5280395Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5ba1ce04884dd264.xml - 2025-12-04T12:05:01.5280573Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5281411Z FAILED [0.2649s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5281417Z 2025-12-04T12:05:01.5281641Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5282372Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5282377Z 2025-12-04T12:05:01.5282641Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5282827Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5283027Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.5283140Z Got exit code 1 2025-12-04T12:05:01.5283244Z Retrying single test... 2025-12-04T12:05:01.5283828Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7aa6851dee05bf26.xml 2025-12-04T12:05:01.5283999Z ============================= test session starts ============================== 2025-12-04T12:05:01.5284351Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5284463Z cachedir: .pytest_cache 2025-12-04T12:05:01.5284996Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5285120Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5285239Z configfile: pytest.ini 2025-12-04T12:05:01.5285833Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5286059Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5286950Z stepcurrent: skipping 34 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5287065Z Running 1 items in this shard 2025-12-04T12:05:01.5287071Z 2025-12-04T12:05:01.5287951Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9015s] [100%] 2025-12-04T12:05:01.5288720Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2652s] [100%] 2025-12-04T12:05:01.5289316Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2599s] [100%] 2025-12-04T12:05:01.5289336Z 2025-12-04T12:05:01.5289474Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5289892Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5290033Z Traceback (most recent call last): 2025-12-04T12:05:01.5290507Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5290665Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5291198Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5291319Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5291521Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5291526Z 2025-12-04T12:05:01.5291739Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5292466Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5292474Z 2025-12-04T12:05:01.5292752Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5292972Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5293096Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5293209Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5293330Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5293563Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5293659Z graph_break [] 2025-12-04T12:05:01.5293875Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5294616Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5294715Z warnings.warn( 2025-12-04T12:05:01.5295095Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5295220Z Traceback (most recent call last): 2025-12-04T12:05:01.5295697Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5295866Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5296387Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5296509Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5296713Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5296718Z 2025-12-04T12:05:01.5296935Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5297671Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5297677Z 2025-12-04T12:05:01.5297990Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5298357Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5298487Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5298602Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5298738Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5298993Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5299094Z graph_break [] 2025-12-04T12:05:01.5299324Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5300057Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5300160Z warnings.warn( 2025-12-04T12:05:01.5300386Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5300495Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5300656Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5300875Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5300995Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5301102Z graph_break [] 2025-12-04T12:05:01.5301317Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5302048Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5302157Z warnings.warn( 2025-12-04T12:05:01.5302298Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5302665Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5302800Z Traceback (most recent call last): 2025-12-04T12:05:01.5303274Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5303443Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5303962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5304087Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5304290Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5304295Z 2025-12-04T12:05:01.5304510Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5305247Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5305253Z 2025-12-04T12:05:01.5305516Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5305732Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5305856Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5305973Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5306095Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5306327Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5306423Z graph_break [] 2025-12-04T12:05:01.5306648Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5307379Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5307478Z warnings.warn( 2025-12-04T12:05:01.5307702Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5307810Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5307924Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5308155Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5308277Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5308437Z graph_break [] 2025-12-04T12:05:01.5308685Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5309416Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5309553Z warnings.warn( 2025-12-04T12:05:01.5309766Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5309875Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5309996Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5310214Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5310346Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5310440Z graph_break [] 2025-12-04T12:05:01.5310652Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5311426Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5311525Z warnings.warn( 2025-12-04T12:05:01.5312280Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7aa6851dee05bf26.xml - 2025-12-04T12:05:01.5312462Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5313302Z FAILED [0.2599s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5313308Z 2025-12-04T12:05:01.5313535Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5314270Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5314280Z 2025-12-04T12:05:01.5314557Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5314734Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5314932Z ================== 1 failed, 294 deselected, 2 rerun in 5.10s ================== 2025-12-04T12:05:01.5315045Z Got exit code 1 2025-12-04T12:05:01.5315688Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5316102Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5316692Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cff0107c6e55c745.xml 2025-12-04T12:05:01.5316852Z ============================= test session starts ============================== 2025-12-04T12:05:01.5317215Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5317324Z cachedir: .pytest_cache 2025-12-04T12:05:01.5317847Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5317983Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5318088Z configfile: pytest.ini 2025-12-04T12:05:01.5318683Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5318923Z collecting ... collected 571 items / 35 deselected / 536 selected 2025-12-04T12:05:01.5319064Z stepcurrent: skipping 35 already run items. 2025-12-04T12:05:01.5319188Z Running 260 items in this shard 2025-12-04T12:05:01.5319193Z 2025-12-04T12:05:01.5319823Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_False_float16_cuda PASSED [3.4206s] [ 0%] 2025-12-04T12:05:01.5320441Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp2_load_upcast_to_fp32_True_float16_cuda PASSED [0.2986s] [ 0%] 2025-12-04T12:05:01.5321133Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0909s] [ 1%] 2025-12-04T12:05:01.5321835Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2688s] [ 1%] 2025-12-04T12:05:01.5322444Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2721s] [ 1%] 2025-12-04T12:05:01.5322450Z 2025-12-04T12:05:01.5322592Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5322963Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5323122Z Traceback (most recent call last): 2025-12-04T12:05:01.5323605Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5323777Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5324303Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5324429Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5324636Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5324642Z 2025-12-04T12:05:01.5324856Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5325586Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5325591Z 2025-12-04T12:05:01.5325863Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5326085Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5326211Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5326329Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5326566Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5326693Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5326793Z graph_break [] 2025-12-04T12:05:01.5327025Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5327763Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5327864Z warnings.warn( 2025-12-04T12:05:01.5328237Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5328362Z Traceback (most recent call last): 2025-12-04T12:05:01.5328854Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5329011Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5329531Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5329672Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5329864Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5329870Z 2025-12-04T12:05:01.5330086Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5330815Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5330821Z 2025-12-04T12:05:01.5331083Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5331359Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5331501Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5331616Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5331851Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5332004Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5332100Z graph_break [] 2025-12-04T12:05:01.5332328Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5333066Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5333181Z warnings.warn( 2025-12-04T12:05:01.5333394Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5333507Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5333633Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5333888Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5334013Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5334119Z graph_break [] 2025-12-04T12:05:01.5334336Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5335084Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5335183Z warnings.warn( 2025-12-04T12:05:01.5335325Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5335692Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5335812Z Traceback (most recent call last): 2025-12-04T12:05:01.5336381Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5336607Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5337133Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5337352Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5337545Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5337553Z 2025-12-04T12:05:01.5337769Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5338568Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5338575Z 2025-12-04T12:05:01.5338843Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5339075Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5339187Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5339306Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5339543Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5339665Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5339762Z graph_break [] 2025-12-04T12:05:01.5339991Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5340725Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5340835Z warnings.warn( 2025-12-04T12:05:01.5341049Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5341160Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5341285Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5341504Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5341623Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5341735Z graph_break [] 2025-12-04T12:05:01.5342014Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5342784Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5342885Z warnings.warn( 2025-12-04T12:05:01.5343130Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5343251Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5343362Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5343582Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5343715Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5343811Z graph_break [] 2025-12-04T12:05:01.5344023Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5344764Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5344921Z warnings.warn( 2025-12-04T12:05:01.5345690Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cff0107c6e55c745.xml - 2025-12-04T12:05:01.5345862Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5346691Z FAILED [0.2721s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5346712Z 2025-12-04T12:05:01.5346929Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5347639Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5347644Z 2025-12-04T12:05:01.5347925Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5348107Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5348334Z ============= 1 failed, 2 passed, 35 deselected, 2 rerun in 6.03s ============== 2025-12-04T12:05:01.5348435Z Got exit code 1 2025-12-04T12:05:01.5348549Z Retrying single test... 2025-12-04T12:05:01.5349141Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65ccdb61caed4adb.xml 2025-12-04T12:05:01.5349302Z ============================= test session starts ============================== 2025-12-04T12:05:01.5349654Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5349776Z cachedir: .pytest_cache 2025-12-04T12:05:01.5350303Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5350442Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5350550Z configfile: pytest.ini 2025-12-04T12:05:01.5351147Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5351390Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5352195Z stepcurrent: skipping 37 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5352310Z Running 1 items in this shard 2025-12-04T12:05:01.5352327Z 2025-12-04T12:05:01.5353000Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9099s] [100%] 2025-12-04T12:05:01.5353706Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2645s] [100%] 2025-12-04T12:05:01.5354338Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2537s] [100%] 2025-12-04T12:05:01.5354344Z 2025-12-04T12:05:01.5354485Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5354884Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5355006Z Traceback (most recent call last): 2025-12-04T12:05:01.5355486Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5355651Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5356171Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5356293Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5356537Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5356543Z 2025-12-04T12:05:01.5356760Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5357487Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5357495Z 2025-12-04T12:05:01.5357761Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5357977Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5358100Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5358215Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5358349Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5358570Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5358666Z graph_break [] 2025-12-04T12:05:01.5358897Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5359636Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5359738Z warnings.warn( 2025-12-04T12:05:01.5360106Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5360231Z Traceback (most recent call last): 2025-12-04T12:05:01.5360715Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5360871Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5361389Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5361522Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5361713Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5361721Z 2025-12-04T12:05:01.5361933Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5362662Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5362670Z 2025-12-04T12:05:01.5362933Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5363159Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5363270Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5363383Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5363515Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5363733Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5363841Z graph_break [] 2025-12-04T12:05:01.5364054Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5364850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5364967Z warnings.warn( 2025-12-04T12:05:01.5365180Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5365322Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5365448Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5365669Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5365795Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5365909Z graph_break [] 2025-12-04T12:05:01.5366123Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5366867Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5366971Z warnings.warn( 2025-12-04T12:05:01.5367154Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5367532Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5367654Z Traceback (most recent call last): 2025-12-04T12:05:01.5368132Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5368303Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5368823Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5368959Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5369150Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5369155Z 2025-12-04T12:05:01.5369369Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5370100Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5370109Z 2025-12-04T12:05:01.5370377Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5370606Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5370720Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5370832Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5370964Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5371184Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5371281Z graph_break [] 2025-12-04T12:05:01.5371511Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5372248Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5372363Z warnings.warn( 2025-12-04T12:05:01.5372577Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5372691Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5372819Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5373038Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5373160Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5373271Z graph_break [] 2025-12-04T12:05:01.5373484Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5374229Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5374330Z warnings.warn( 2025-12-04T12:05:01.5374545Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5374670Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5374788Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5375047Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5375213Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5375312Z graph_break [] 2025-12-04T12:05:01.5375524Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5376299Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5376401Z warnings.warn( 2025-12-04T12:05:01.5377171Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65ccdb61caed4adb.xml - 2025-12-04T12:05:01.5377344Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5378235Z FAILED [0.2537s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5378293Z 2025-12-04T12:05:01.5378516Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5379235Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5379242Z 2025-12-04T12:05:01.5379523Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5379705Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5379921Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.5380025Z Got exit code 1 2025-12-04T12:05:01.5380131Z Retrying single test... 2025-12-04T12:05:01.5380721Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-332efb6825e8ee00.xml 2025-12-04T12:05:01.5380889Z ============================= test session starts ============================== 2025-12-04T12:05:01.5381244Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5381371Z cachedir: .pytest_cache 2025-12-04T12:05:01.5381894Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5382035Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5382143Z configfile: pytest.ini 2025-12-04T12:05:01.5382740Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5382985Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5383786Z stepcurrent: skipping 37 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5383906Z Running 1 items in this shard 2025-12-04T12:05:01.5383926Z 2025-12-04T12:05:01.5384610Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9176s] [100%] 2025-12-04T12:05:01.5385283Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2647s] [100%] 2025-12-04T12:05:01.5385882Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2629s] [100%] 2025-12-04T12:05:01.5385888Z 2025-12-04T12:05:01.5386032Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5386405Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5386528Z Traceback (most recent call last): 2025-12-04T12:05:01.5387050Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5387248Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5387947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5388168Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5388375Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5388381Z 2025-12-04T12:05:01.5388595Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5389323Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5389329Z 2025-12-04T12:05:01.5389592Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5389859Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5389986Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5390101Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5390233Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5390454Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5390554Z graph_break [] 2025-12-04T12:05:01.5390779Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5391518Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5391617Z warnings.warn( 2025-12-04T12:05:01.5391985Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5392108Z Traceback (most recent call last): 2025-12-04T12:05:01.5392598Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5392757Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5393278Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5393413Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5393607Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5393612Z 2025-12-04T12:05:01.5393828Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5394558Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5394563Z 2025-12-04T12:05:01.5394828Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5395058Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5395177Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5395292Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5395429Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5395653Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5395764Z graph_break [] 2025-12-04T12:05:01.5395981Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5396713Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5396825Z warnings.warn( 2025-12-04T12:05:01.5397039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5397148Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5397276Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5397497Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5397667Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5397779Z graph_break [] 2025-12-04T12:05:01.5398036Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5398776Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5398908Z warnings.warn( 2025-12-04T12:05:01.5399051Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5399419Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5399540Z Traceback (most recent call last): 2025-12-04T12:05:01.5400025Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5400181Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5400703Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5400882Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5401076Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5401081Z 2025-12-04T12:05:01.5401296Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5402027Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5402033Z 2025-12-04T12:05:01.5402302Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5402532Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5402644Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5402760Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5402893Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5403117Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5403214Z graph_break [] 2025-12-04T12:05:01.5403444Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5404180Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5404296Z warnings.warn( 2025-12-04T12:05:01.5404509Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5404618Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5404744Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5404962Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5405082Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5405191Z graph_break [] 2025-12-04T12:05:01.5405402Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5406276Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5406381Z warnings.warn( 2025-12-04T12:05:01.5406593Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5406719Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5406878Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5407140Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5407273Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5407371Z graph_break [] 2025-12-04T12:05:01.5407599Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5408415Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5408586Z warnings.warn( 2025-12-04T12:05:01.5409519Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-332efb6825e8ee00.xml - 2025-12-04T12:05:01.5409693Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5410550Z FAILED [0.2629s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5410571Z 2025-12-04T12:05:01.5410787Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5411501Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5411507Z 2025-12-04T12:05:01.5411784Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5412003Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5412220Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.5412319Z Got exit code 1 2025-12-04T12:05:01.5412949Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5413374Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5413948Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cf5b975b5ed0b41.xml 2025-12-04T12:05:01.5414108Z ============================= test session starts ============================== 2025-12-04T12:05:01.5414470Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5414578Z cachedir: .pytest_cache 2025-12-04T12:05:01.5415119Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5415245Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5415351Z configfile: pytest.ini 2025-12-04T12:05:01.5415955Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5416181Z collecting ... collected 571 items / 38 deselected / 533 selected 2025-12-04T12:05:01.5416333Z stepcurrent: skipping 38 already run items. 2025-12-04T12:05:01.5416449Z Running 257 items in this shard 2025-12-04T12:05:01.5416455Z 2025-12-04T12:05:01.5417046Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_float16_cuda PASSED [3.4340s] [ 0%] 2025-12-04T12:05:01.5417730Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0914s] [ 0%] 2025-12-04T12:05:01.5418465Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2611s] [ 0%] 2025-12-04T12:05:01.5419062Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2630s] [ 0%] 2025-12-04T12:05:01.5419070Z 2025-12-04T12:05:01.5419209Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5419562Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5419698Z Traceback (most recent call last): 2025-12-04T12:05:01.5420176Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5420349Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5420914Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5421070Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5421276Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5421282Z 2025-12-04T12:05:01.5421498Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5422241Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5422260Z 2025-12-04T12:05:01.5422525Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5422743Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5422867Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5422985Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5423208Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5423376Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5423477Z graph_break [] 2025-12-04T12:05:01.5423695Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5424439Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5424542Z warnings.warn( 2025-12-04T12:05:01.5424910Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5425032Z Traceback (most recent call last): 2025-12-04T12:05:01.5425507Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5425675Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5426195Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5426336Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5426527Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5426533Z 2025-12-04T12:05:01.5426748Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5427472Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5427480Z 2025-12-04T12:05:01.5427745Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5427975Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5428086Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5428199Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5428435Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5428556Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5428658Z graph_break [] 2025-12-04T12:05:01.5428898Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5429629Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5429733Z warnings.warn( 2025-12-04T12:05:01.5429962Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5430074Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5430201Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5430422Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5430545Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5430656Z graph_break [] 2025-12-04T12:05:01.5430872Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5431633Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5431795Z warnings.warn( 2025-12-04T12:05:01.5431941Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5432307Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5432460Z Traceback (most recent call last): 2025-12-04T12:05:01.5432937Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5433109Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5433629Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5433754Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5433961Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5433997Z 2025-12-04T12:05:01.5434218Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5434945Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5434951Z 2025-12-04T12:05:01.5435222Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5435442Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5435574Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5435689Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5435925Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5436050Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5436149Z graph_break [] 2025-12-04T12:05:01.5436381Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5437116Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5437219Z warnings.warn( 2025-12-04T12:05:01.5437444Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5437559Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5437686Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5437909Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5438029Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5438141Z graph_break [] 2025-12-04T12:05:01.5438352Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5439080Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5439191Z warnings.warn( 2025-12-04T12:05:01.5439407Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5439516Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5439650Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5439867Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5439997Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5440095Z graph_break [] 2025-12-04T12:05:01.5440307Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5441045Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5441143Z warnings.warn( 2025-12-04T12:05:01.5441896Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cf5b975b5ed0b41.xml - 2025-12-04T12:05:01.5442078Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5442959Z FAILED [0.2630s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5442966Z 2025-12-04T12:05:01.5443195Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5443958Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5443963Z 2025-12-04T12:05:01.5444244Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5444422Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5444639Z ============= 1 failed, 1 passed, 38 deselected, 2 rerun in 5.73s ============== 2025-12-04T12:05:01.5444752Z Got exit code 1 2025-12-04T12:05:01.5444892Z Retrying single test... 2025-12-04T12:05:01.5445476Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66c6826e1f29c7ae.xml 2025-12-04T12:05:01.5445648Z ============================= test session starts ============================== 2025-12-04T12:05:01.5446000Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5446123Z cachedir: .pytest_cache 2025-12-04T12:05:01.5446647Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5446770Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5446890Z configfile: pytest.ini 2025-12-04T12:05:01.5447484Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5447713Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5448530Z stepcurrent: skipping 39 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5448644Z Running 1 items in this shard 2025-12-04T12:05:01.5448649Z 2025-12-04T12:05:01.5449337Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9157s] [100%] 2025-12-04T12:05:01.5450005Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2633s] [100%] 2025-12-04T12:05:01.5450600Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2587s] [100%] 2025-12-04T12:05:01.5450606Z 2025-12-04T12:05:01.5450748Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5451107Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5451243Z Traceback (most recent call last): 2025-12-04T12:05:01.5451722Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5451890Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5452415Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5452539Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5452741Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5452746Z 2025-12-04T12:05:01.5452960Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5453672Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5453692Z 2025-12-04T12:05:01.5453990Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5454240Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5454366Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5454508Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5454627Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5454861Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5454958Z graph_break [] 2025-12-04T12:05:01.5455172Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5455920Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5456024Z warnings.warn( 2025-12-04T12:05:01.5456396Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5456552Z Traceback (most recent call last): 2025-12-04T12:05:01.5457026Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5457195Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5457718Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5457854Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5458044Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5458049Z 2025-12-04T12:05:01.5458349Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5459078Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5459083Z 2025-12-04T12:05:01.5459355Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5459590Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5459704Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5459820Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5459956Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5460180Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5460279Z graph_break [] 2025-12-04T12:05:01.5460507Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5461238Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5461337Z warnings.warn( 2025-12-04T12:05:01.5461564Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5461675Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5461805Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5462024Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5462144Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5462254Z graph_break [] 2025-12-04T12:05:01.5462473Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5463210Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5463326Z warnings.warn( 2025-12-04T12:05:01.5463468Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5463832Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5463956Z Traceback (most recent call last): 2025-12-04T12:05:01.5464432Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5464651Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5465202Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5465328Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5465565Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5465570Z 2025-12-04T12:05:01.5465785Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5466507Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5466512Z 2025-12-04T12:05:01.5466778Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5466995Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5467147Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5467264Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5467397Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5467617Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5467715Z graph_break [] 2025-12-04T12:05:01.5467943Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5468681Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5468779Z warnings.warn( 2025-12-04T12:05:01.5469004Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5469113Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5469239Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5469458Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5469585Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5469698Z graph_break [] 2025-12-04T12:05:01.5469912Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5470639Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5470752Z warnings.warn( 2025-12-04T12:05:01.5470968Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5471090Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5471203Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5471425Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5471561Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5471658Z graph_break [] 2025-12-04T12:05:01.5471871Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5472620Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5472724Z warnings.warn( 2025-12-04T12:05:01.5473475Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66c6826e1f29c7ae.xml - 2025-12-04T12:05:01.5473663Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5474486Z FAILED [0.2587s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5474492Z 2025-12-04T12:05:01.5474722Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5475546Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5475554Z 2025-12-04T12:05:01.5475910Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5476120Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5476416Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.5476567Z Got exit code 1 2025-12-04T12:05:01.5476675Z Retrying single test... 2025-12-04T12:05:01.5477343Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2a970c00eea5bec9.xml 2025-12-04T12:05:01.5477518Z ============================= test session starts ============================== 2025-12-04T12:05:01.5477939Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5478072Z cachedir: .pytest_cache 2025-12-04T12:05:01.5478602Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5478833Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5478958Z configfile: pytest.ini 2025-12-04T12:05:01.5479630Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5479863Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5480763Z stepcurrent: skipping 39 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5480878Z Running 1 items in this shard 2025-12-04T12:05:01.5480883Z 2025-12-04T12:05:01.5481663Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9097s] [100%] 2025-12-04T12:05:01.5482336Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2652s] [100%] 2025-12-04T12:05:01.5482938Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2551s] [100%] 2025-12-04T12:05:01.5482944Z 2025-12-04T12:05:01.5483090Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5483442Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5483578Z Traceback (most recent call last): 2025-12-04T12:05:01.5484057Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5484228Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5484752Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5484882Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5485090Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5485098Z 2025-12-04T12:05:01.5485316Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5486028Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5486050Z 2025-12-04T12:05:01.5486319Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5486545Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5486672Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5486788Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5486912Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5487155Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5487258Z graph_break [] 2025-12-04T12:05:01.5487523Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5488492Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5488636Z warnings.warn( 2025-12-04T12:05:01.5489005Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5489129Z Traceback (most recent call last): 2025-12-04T12:05:01.5489609Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5489783Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5490304Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5490443Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5490681Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5490688Z 2025-12-04T12:05:01.5490906Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5491636Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5491644Z 2025-12-04T12:05:01.5491909Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5492143Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5492257Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5492372Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5492508Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5492727Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5492824Z graph_break [] 2025-12-04T12:05:01.5493053Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5493788Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5493889Z warnings.warn( 2025-12-04T12:05:01.5494115Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5494232Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5494359Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5494576Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5494698Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5494805Z graph_break [] 2025-12-04T12:05:01.5495016Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5495750Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5495863Z warnings.warn( 2025-12-04T12:05:01.5496010Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5496372Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5496495Z Traceback (most recent call last): 2025-12-04T12:05:01.5496974Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5497139Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5497656Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5497778Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5497985Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5497991Z 2025-12-04T12:05:01.5498262Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5499070Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5499076Z 2025-12-04T12:05:01.5499344Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5499593Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5499718Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5499835Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5499969Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5500187Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5500285Z graph_break [] 2025-12-04T12:05:01.5500512Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5501247Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5501382Z warnings.warn( 2025-12-04T12:05:01.5501610Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5501721Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5501848Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5502071Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5502191Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5502303Z graph_break [] 2025-12-04T12:05:01.5502516Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5503244Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5503357Z warnings.warn( 2025-12-04T12:05:01.5503573Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5503699Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5503813Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5504033Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5504165Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5504262Z graph_break [] 2025-12-04T12:05:01.5504475Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5505218Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5505319Z warnings.warn( 2025-12-04T12:05:01.5506073Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2a970c00eea5bec9.xml - 2025-12-04T12:05:01.5506260Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5507084Z FAILED [0.2551s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5507094Z 2025-12-04T12:05:01.5507325Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5508039Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5508046Z 2025-12-04T12:05:01.5508321Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5508500Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5508701Z ================== 1 failed, 294 deselected, 2 rerun in 5.10s ================== 2025-12-04T12:05:01.5508815Z Got exit code 1 2025-12-04T12:05:01.5509447Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5509911Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5510524Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-284ef1fed3dcf877.xml 2025-12-04T12:05:01.5510718Z ============================= test session starts ============================== 2025-12-04T12:05:01.5511082Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5511192Z cachedir: .pytest_cache 2025-12-04T12:05:01.5511715Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5511853Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5511961Z configfile: pytest.ini 2025-12-04T12:05:01.5512565Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5512840Z collecting ... collected 571 items / 40 deselected / 531 selected 2025-12-04T12:05:01.5512984Z stepcurrent: skipping 40 already run items. 2025-12-04T12:05:01.5513110Z Running 255 items in this shard 2025-12-04T12:05:01.5513115Z 2025-12-04T12:05:01.5513802Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9151s] [ 0%] 2025-12-04T12:05:01.5514498Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2626s] [ 0%] 2025-12-04T12:05:01.5515093Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2629s] [ 0%] 2025-12-04T12:05:01.5515099Z 2025-12-04T12:05:01.5515239Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5515617Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5515742Z Traceback (most recent call last): 2025-12-04T12:05:01.5516230Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5516386Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5516908Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5517042Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5517231Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5517237Z 2025-12-04T12:05:01.5517452Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5518184Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5518192Z 2025-12-04T12:05:01.5518462Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5518694Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5518805Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5518921Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5519055Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5519278Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5519373Z graph_break [] 2025-12-04T12:05:01.5519600Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5520334Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5520446Z warnings.warn( 2025-12-04T12:05:01.5520809Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5520968Z Traceback (most recent call last): 2025-12-04T12:05:01.5521484Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5521641Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5522197Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5522318Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5522509Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5522515Z 2025-12-04T12:05:01.5522740Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5523455Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5523461Z 2025-12-04T12:05:01.5523775Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5523993Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5524104Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5524230Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5524350Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5524572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5524683Z graph_break [] 2025-12-04T12:05:01.5524895Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5525643Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5525744Z warnings.warn( 2025-12-04T12:05:01.5525955Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5526077Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5526194Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5526417Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5526552Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5526652Z graph_break [] 2025-12-04T12:05:01.5526865Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5527610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5527712Z warnings.warn( 2025-12-04T12:05:01.5527866Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5528228Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5528352Z Traceback (most recent call last): 2025-12-04T12:05:01.5528849Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5529011Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5529534Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5529673Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5529866Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5529871Z 2025-12-04T12:05:01.5530099Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5530818Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5530824Z 2025-12-04T12:05:01.5531089Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5531318Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5531431Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5531594Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5531744Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5531967Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5532076Z graph_break [] 2025-12-04T12:05:01.5532324Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5533054Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5533168Z warnings.warn( 2025-12-04T12:05:01.5533381Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5533503Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5533616Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5533835Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5534033Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5534132Z graph_break [] 2025-12-04T12:05:01.5534358Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5535104Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5535208Z warnings.warn( 2025-12-04T12:05:01.5535437Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5535551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5535666Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5535899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5536022Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5536119Z graph_break [] 2025-12-04T12:05:01.5536346Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5537079Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5537195Z warnings.warn( 2025-12-04T12:05:01.5537953Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-284ef1fed3dcf877.xml - 2025-12-04T12:05:01.5538197Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5539052Z FAILED [0.2629s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5539058Z 2025-12-04T12:05:01.5539275Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5540008Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5540018Z 2025-12-04T12:05:01.5540287Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5540467Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5540679Z ================== 1 failed, 40 deselected, 2 rerun in 5.12s =================== 2025-12-04T12:05:01.5540781Z Got exit code 1 2025-12-04T12:05:01.5540890Z Retrying single test... 2025-12-04T12:05:01.5541477Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5553e88673e5c7d3.xml 2025-12-04T12:05:01.5541639Z ============================= test session starts ============================== 2025-12-04T12:05:01.5542005Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5542115Z cachedir: .pytest_cache 2025-12-04T12:05:01.5542682Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5542825Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5542964Z configfile: pytest.ini 2025-12-04T12:05:01.5543570Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5543827Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5544639Z stepcurrent: skipping 40 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5544767Z Running 1 items in this shard 2025-12-04T12:05:01.5544772Z 2025-12-04T12:05:01.5545451Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9335s] [100%] 2025-12-04T12:05:01.5546149Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2697s] [100%] 2025-12-04T12:05:01.5546785Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2666s] [100%] 2025-12-04T12:05:01.5546794Z 2025-12-04T12:05:01.5546936Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5547312Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5547440Z Traceback (most recent call last): 2025-12-04T12:05:01.5547937Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5548094Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5548617Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5548757Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5548947Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5548953Z 2025-12-04T12:05:01.5549169Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5549901Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5549909Z 2025-12-04T12:05:01.5550173Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5550399Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5550510Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5550624Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5550756Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5550976Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5551092Z graph_break [] 2025-12-04T12:05:01.5551308Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5552046Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5552161Z warnings.warn( 2025-12-04T12:05:01.5552520Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5552640Z Traceback (most recent call last): 2025-12-04T12:05:01.5553126Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5553280Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5553807Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5553933Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5554158Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5554165Z 2025-12-04T12:05:01.5554427Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5555146Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5555199Z 2025-12-04T12:05:01.5555479Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5555694Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5555805Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5555931Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5556052Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5556272Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5556383Z graph_break [] 2025-12-04T12:05:01.5556641Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5557394Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5557495Z warnings.warn( 2025-12-04T12:05:01.5557710Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5557832Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5557943Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5558162Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5558292Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5558390Z graph_break [] 2025-12-04T12:05:01.5558603Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5559340Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5559445Z warnings.warn( 2025-12-04T12:05:01.5559599Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5559961Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5560085Z Traceback (most recent call last): 2025-12-04T12:05:01.5560573Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5560728Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5561255Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5561378Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5561569Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5561575Z 2025-12-04T12:05:01.5561803Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5562532Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5562538Z 2025-12-04T12:05:01.5562815Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5563032Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5563143Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5563271Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5563392Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5563614Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5563727Z graph_break [] 2025-12-04T12:05:01.5563942Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5564708Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5564824Z warnings.warn( 2025-12-04T12:05:01.5565075Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5565199Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5565312Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5565560Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5565694Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5565790Z graph_break [] 2025-12-04T12:05:01.5566006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5566745Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5566847Z warnings.warn( 2025-12-04T12:05:01.5567074Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5567218Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5567330Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5567567Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5567687Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5567783Z graph_break [] 2025-12-04T12:05:01.5568007Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5568737Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5568851Z warnings.warn( 2025-12-04T12:05:01.5569602Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5553e88673e5c7d3.xml - 2025-12-04T12:05:01.5569772Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5570615Z FAILED [0.2666s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5570622Z 2025-12-04T12:05:01.5570837Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5571574Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5571582Z 2025-12-04T12:05:01.5571846Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5572024Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5572234Z ================== 1 failed, 294 deselected, 2 rerun in 5.16s ================== 2025-12-04T12:05:01.5572332Z Got exit code 1 2025-12-04T12:05:01.5572441Z Retrying single test... 2025-12-04T12:05:01.5573031Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-93bd8f6af98a9b4c.xml 2025-12-04T12:05:01.5573197Z ============================= test session starts ============================== 2025-12-04T12:05:01.5573560Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5573669Z cachedir: .pytest_cache 2025-12-04T12:05:01.5574189Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5574323Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5574429Z configfile: pytest.ini 2025-12-04T12:05:01.5575032Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5575257Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5576097Z stepcurrent: skipping 40 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5576259Z Running 1 items in this shard 2025-12-04T12:05:01.5576264Z 2025-12-04T12:05:01.5576951Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9234s] [100%] 2025-12-04T12:05:01.5577674Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2642s] [100%] 2025-12-04T12:05:01.5578348Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2643s] [100%] 2025-12-04T12:05:01.5578355Z 2025-12-04T12:05:01.5578495Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5578888Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5579054Z Traceback (most recent call last): 2025-12-04T12:05:01.5579546Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5579704Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5580226Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5580360Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5580553Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5580558Z 2025-12-04T12:05:01.5580777Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5581509Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5581515Z 2025-12-04T12:05:01.5581785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5582022Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5582139Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5582252Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5582387Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5582611Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5582721Z graph_break [] 2025-12-04T12:05:01.5582936Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5583669Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5583783Z warnings.warn( 2025-12-04T12:05:01.5584145Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5584269Z Traceback (most recent call last): 2025-12-04T12:05:01.5584759Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5584913Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5585443Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5585568Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5585759Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5585765Z 2025-12-04T12:05:01.5596424Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5597285Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5597295Z 2025-12-04T12:05:01.5597581Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5597965Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5598145Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5598284Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5598412Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5598645Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5598812Z graph_break [] 2025-12-04T12:05:01.5599038Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5599803Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5599934Z warnings.warn( 2025-12-04T12:05:01.5600156Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5600288Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5600406Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5600687Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5600831Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5600935Z graph_break [] 2025-12-04T12:05:01.5601154Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5601915Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5602022Z warnings.warn( 2025-12-04T12:05:01.5602186Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5602552Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5602681Z Traceback (most recent call last): 2025-12-04T12:05:01.5603178Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5603344Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5603898Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5604042Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5604235Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5604245Z 2025-12-04T12:05:01.5604480Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5605205Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5605211Z 2025-12-04T12:05:01.5605484Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5605723Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5605839Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5605975Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5606106Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5606334Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5606453Z graph_break [] 2025-12-04T12:05:01.5606676Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5607414Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5607534Z warnings.warn( 2025-12-04T12:05:01.5607752Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5607879Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5607998Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5608222Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5608362Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5608466Z graph_break [] 2025-12-04T12:05:01.5608719Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5609497Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5609630Z warnings.warn( 2025-12-04T12:05:01.5609845Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5609970Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5610087Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5610321Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5610447Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5610548Z graph_break [] 2025-12-04T12:05:01.5610777Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5611515Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5611651Z warnings.warn( 2025-12-04T12:05:01.5612428Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-93bd8f6af98a9b4c.xml - 2025-12-04T12:05:01.5612606Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5613461Z FAILED [0.2643s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5613468Z 2025-12-04T12:05:01.5613683Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5614478Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5614486Z 2025-12-04T12:05:01.5614760Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5614944Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5615162Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.5615273Z Got exit code 1 2025-12-04T12:05:01.5615930Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5616347Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5616960Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58f36c545585b24b.xml 2025-12-04T12:05:01.5617159Z ============================= test session starts ============================== 2025-12-04T12:05:01.5617619Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5617812Z cachedir: .pytest_cache 2025-12-04T12:05:01.5623557Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5623687Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5623811Z configfile: pytest.ini 2025-12-04T12:05:01.5624405Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5624626Z collecting ... collected 571 items / 41 deselected / 530 selected 2025-12-04T12:05:01.5624771Z stepcurrent: skipping 41 already run items. 2025-12-04T12:05:01.5624879Z Running 254 items in this shard 2025-12-04T12:05:01.5624885Z 2025-12-04T12:05:01.5625483Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_float16_cuda PASSED [3.4458s] [ 0%] 2025-12-04T12:05:01.5626130Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_False_float16_cuda PASSED [0.3063s] [ 0%] 2025-12-04T12:05:01.5626840Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0906s] [ 1%] 2025-12-04T12:05:01.5627553Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2705s] [ 1%] 2025-12-04T12:05:01.5628148Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2554s] [ 1%] 2025-12-04T12:05:01.5628155Z 2025-12-04T12:05:01.5628303Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5628660Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5628809Z Traceback (most recent call last): 2025-12-04T12:05:01.5629292Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5629444Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5629970Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5630090Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5630278Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5630284Z 2025-12-04T12:05:01.5630501Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5631222Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5631228Z 2025-12-04T12:05:01.5631493Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5631712Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5631822Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5631940Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5652782Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5652938Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5653041Z graph_break [] 2025-12-04T12:05:01.5653255Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5653995Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5654091Z warnings.warn( 2025-12-04T12:05:01.5654448Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5654571Z Traceback (most recent call last): 2025-12-04T12:05:01.5655041Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5655197Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5655720Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5655840Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5656035Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5656041Z 2025-12-04T12:05:01.5656252Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5656961Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5656967Z 2025-12-04T12:05:01.5657233Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5657446Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5657620Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5657731Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5657981Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5658109Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5658320Z graph_break [] 2025-12-04T12:05:01.5658533Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5659276Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5659375Z warnings.warn( 2025-12-04T12:05:01.5659586Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5659698Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5659805Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5660029Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5660182Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5660273Z graph_break [] 2025-12-04T12:05:01.5660492Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5661221Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5661318Z warnings.warn( 2025-12-04T12:05:01.5661463Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5661817Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5661938Z Traceback (most recent call last): 2025-12-04T12:05:01.5662404Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5662555Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5663079Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5663202Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5663389Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5663403Z 2025-12-04T12:05:01.5663614Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5664333Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5664338Z 2025-12-04T12:05:01.5664606Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5664815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5664922Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5665038Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5665252Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5665381Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5665483Z graph_break [] 2025-12-04T12:05:01.5665698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5666445Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5666548Z warnings.warn( 2025-12-04T12:05:01.5666760Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5666882Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5666995Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5667225Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5667344Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5667442Z graph_break [] 2025-12-04T12:05:01.5667667Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5668458Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5668561Z warnings.warn( 2025-12-04T12:05:01.5668787Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5668926Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5669056Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5669274Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5669397Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5669508Z graph_break [] 2025-12-04T12:05:01.5669722Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5670448Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5670590Z warnings.warn( 2025-12-04T12:05:01.5671343Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58f36c545585b24b.xml - 2025-12-04T12:05:01.5671524Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5672352Z FAILED [0.2554s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5672358Z 2025-12-04T12:05:01.5672573Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5673298Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5673304Z 2025-12-04T12:05:01.5673571Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5673768Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5673984Z ============= 1 failed, 2 passed, 41 deselected, 2 rerun in 6.07s ============== 2025-12-04T12:05:01.5674083Z Got exit code 1 2025-12-04T12:05:01.5674203Z Retrying single test... 2025-12-04T12:05:01.5674786Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6ef09e48fb8b226a.xml 2025-12-04T12:05:01.5674947Z ============================= test session starts ============================== 2025-12-04T12:05:01.5675311Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5675419Z cachedir: .pytest_cache 2025-12-04T12:05:01.5675955Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5676078Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5676188Z configfile: pytest.ini 2025-12-04T12:05:01.5676802Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5677032Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5677852Z stepcurrent: skipping 43 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5677967Z Running 1 items in this shard 2025-12-04T12:05:01.5677972Z 2025-12-04T12:05:01.5678652Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9186s] [100%] 2025-12-04T12:05:01.5679340Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2658s] [100%] 2025-12-04T12:05:01.5679990Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2646s] [100%] 2025-12-04T12:05:01.5679996Z 2025-12-04T12:05:01.5680151Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5680542Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5680665Z Traceback (most recent call last): 2025-12-04T12:05:01.5681153Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5681311Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5681845Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5681969Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5682160Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5682193Z 2025-12-04T12:05:01.5682425Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5683145Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5683152Z 2025-12-04T12:05:01.5683428Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5683644Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5683756Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5683881Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5684002Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5684222Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5684333Z graph_break [] 2025-12-04T12:05:01.5684548Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5685302Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5685403Z warnings.warn( 2025-12-04T12:05:01.5685764Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5685899Z Traceback (most recent call last): 2025-12-04T12:05:01.5686374Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5686528Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5687057Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5687180Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5687387Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5687393Z 2025-12-04T12:05:01.5687612Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5688602Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5688611Z 2025-12-04T12:05:01.5688893Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5689114Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5689241Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5689355Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5689476Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5689708Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5689807Z graph_break [] 2025-12-04T12:05:01.5690022Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5690856Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5691060Z warnings.warn( 2025-12-04T12:05:01.5691291Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5691401Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5691557Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5691790Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5691913Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5692010Z graph_break [] 2025-12-04T12:05:01.5692236Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5692964Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5693063Z warnings.warn( 2025-12-04T12:05:01.5693220Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5693633Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5693768Z Traceback (most recent call last): 2025-12-04T12:05:01.5694244Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5694402Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5694932Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5695056Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5695263Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5695269Z 2025-12-04T12:05:01.5695483Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5696209Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5696217Z 2025-12-04T12:05:01.5696500Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5696720Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5696848Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5696966Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5697087Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5697319Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5697419Z graph_break [] 2025-12-04T12:05:01.5697633Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5698451Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5698557Z warnings.warn( 2025-12-04T12:05:01.5698780Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5698905Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5699022Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5699255Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5699378Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5699478Z graph_break [] 2025-12-04T12:05:01.5699706Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5700434Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5700535Z warnings.warn( 2025-12-04T12:05:01.5700760Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5700869Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5700998Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5701255Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5701377Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5701531Z graph_break [] 2025-12-04T12:05:01.5701748Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5702474Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5702617Z warnings.warn( 2025-12-04T12:05:01.5703366Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6ef09e48fb8b226a.xml - 2025-12-04T12:05:01.5703554Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5704383Z FAILED [0.2646s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5704421Z 2025-12-04T12:05:01.5704644Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5705378Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5705386Z 2025-12-04T12:05:01.5705650Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5705843Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5706043Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.5706141Z Got exit code 1 2025-12-04T12:05:01.5706263Z Retrying single test... 2025-12-04T12:05:01.5706842Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e71119cdd1d43b1e.xml 2025-12-04T12:05:01.5707021Z ============================= test session starts ============================== 2025-12-04T12:05:01.5707376Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5707484Z cachedir: .pytest_cache 2025-12-04T12:05:01.5708016Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5708141Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5708248Z configfile: pytest.ini 2025-12-04T12:05:01.5708856Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5709082Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5709898Z stepcurrent: skipping 43 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5710013Z Running 1 items in this shard 2025-12-04T12:05:01.5710020Z 2025-12-04T12:05:01.5710708Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9132s] [100%] 2025-12-04T12:05:01.5711396Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2641s] [100%] 2025-12-04T12:05:01.5711985Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2614s] [100%] 2025-12-04T12:05:01.5711991Z 2025-12-04T12:05:01.5712143Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5712503Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5712623Z Traceback (most recent call last): 2025-12-04T12:05:01.5713142Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5713337Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5713879Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5714035Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5714229Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5714235Z 2025-12-04T12:05:01.5714462Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5715179Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5715185Z 2025-12-04T12:05:01.5715463Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5715682Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5715828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5715956Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5716079Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5716302Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5716416Z graph_break [] 2025-12-04T12:05:01.5716632Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5717383Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5717486Z warnings.warn( 2025-12-04T12:05:01.5717849Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5717985Z Traceback (most recent call last): 2025-12-04T12:05:01.5718458Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5718621Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5719155Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5719276Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5719484Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5719489Z 2025-12-04T12:05:01.5719703Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5720423Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5720428Z 2025-12-04T12:05:01.5720707Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5720926Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5721049Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5721167Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5721289Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5721522Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5721619Z graph_break [] 2025-12-04T12:05:01.5721834Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5722582Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5722685Z warnings.warn( 2025-12-04T12:05:01.5722912Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5723022Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5723136Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5723367Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5723488Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5723588Z graph_break [] 2025-12-04T12:05:01.5723872Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5724607Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5724751Z warnings.warn( 2025-12-04T12:05:01.5724893Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5725257Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5725391Z Traceback (most recent call last): 2025-12-04T12:05:01.5725867Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5726023Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5726556Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5726707Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5726912Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5726918Z 2025-12-04T12:05:01.5727131Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5727849Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5727855Z 2025-12-04T12:05:01.5728132Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5728348Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5728471Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5728585Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5728706Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5728938Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5729039Z graph_break [] 2025-12-04T12:05:01.5729257Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5729997Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5730101Z warnings.warn( 2025-12-04T12:05:01.5730314Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5730434Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5730548Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5730777Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5730898Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5730995Z graph_break [] 2025-12-04T12:05:01.5731221Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5731954Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5732052Z warnings.warn( 2025-12-04T12:05:01.5732278Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5732391Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5732516Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5732731Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5732853Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5732961Z graph_break [] 2025-12-04T12:05:01.5733173Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5733901Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5734011Z warnings.warn( 2025-12-04T12:05:01.5734826Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e71119cdd1d43b1e.xml - 2025-12-04T12:05:01.5735009Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5735837Z FAILED [0.2614s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5735875Z 2025-12-04T12:05:01.5736092Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5736820Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5736826Z 2025-12-04T12:05:01.5737089Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5737279Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5737513Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.5737613Z Got exit code 1 2025-12-04T12:05:01.5738342Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5738758Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5739350Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c8dc22bc4fac83a2.xml 2025-12-04T12:05:01.5739513Z ============================= test session starts ============================== 2025-12-04T12:05:01.5739865Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5739992Z cachedir: .pytest_cache 2025-12-04T12:05:01.5740517Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5740662Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5740773Z configfile: pytest.ini 2025-12-04T12:05:01.5741365Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5741609Z collecting ... collected 571 items / 44 deselected / 527 selected 2025-12-04T12:05:01.5741752Z stepcurrent: skipping 44 already run items. 2025-12-04T12:05:01.5741866Z Running 251 items in this shard 2025-12-04T12:05:01.5741872Z 2025-12-04T12:05:01.5742569Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9365s] [ 0%] 2025-12-04T12:05:01.5743247Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2709s] [ 0%] 2025-12-04T12:05:01.5743858Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2659s] [ 0%] 2025-12-04T12:05:01.5743864Z 2025-12-04T12:05:01.5744003Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5744367Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5744505Z Traceback (most recent call last): 2025-12-04T12:05:01.5744980Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5745148Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5745668Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5745790Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5745994Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5746049Z 2025-12-04T12:05:01.5746295Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5747029Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5747066Z 2025-12-04T12:05:01.5747332Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5747549Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5747672Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5747787Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5747909Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5748143Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5748243Z graph_break [] 2025-12-04T12:05:01.5748474Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5749246Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5749348Z warnings.warn( 2025-12-04T12:05:01.5749723Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5749848Z Traceback (most recent call last): 2025-12-04T12:05:01.5750321Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5750497Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5751012Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5751147Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5751335Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5751343Z 2025-12-04T12:05:01.5751561Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5752296Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5752302Z 2025-12-04T12:05:01.5752569Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5752799Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5752910Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5753025Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5753159Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5753376Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5753474Z graph_break [] 2025-12-04T12:05:01.5753704Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5754440Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5754555Z warnings.warn( 2025-12-04T12:05:01.5754768Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5754882Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5755010Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5755229Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5755345Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5755458Z graph_break [] 2025-12-04T12:05:01.5755669Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5756410Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5756511Z warnings.warn( 2025-12-04T12:05:01.5756692Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5757100Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5757223Z Traceback (most recent call last): 2025-12-04T12:05:01.5757700Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5757902Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5758419Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5758554Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5758745Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5758751Z 2025-12-04T12:05:01.5758964Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5759702Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5759736Z 2025-12-04T12:05:01.5760004Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5760235Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5760348Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5760461Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5760596Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5760813Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5760909Z graph_break [] 2025-12-04T12:05:01.5761133Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5761864Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5761977Z warnings.warn( 2025-12-04T12:05:01.5762193Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5762302Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5762428Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5762646Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5762766Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5762873Z graph_break [] 2025-12-04T12:05:01.5763083Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5763823Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5763923Z warnings.warn( 2025-12-04T12:05:01.5764133Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5764257Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5764364Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5764585Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5764718Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5764812Z graph_break [] 2025-12-04T12:05:01.5765024Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5765766Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5765863Z warnings.warn( 2025-12-04T12:05:01.5766633Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c8dc22bc4fac83a2.xml - 2025-12-04T12:05:01.5766800Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5767659Z FAILED [0.2659s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5767680Z 2025-12-04T12:05:01.5767930Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5768653Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5768687Z 2025-12-04T12:05:01.5768965Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5769141Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5769338Z ================== 1 failed, 44 deselected, 2 rerun in 5.16s =================== 2025-12-04T12:05:01.5769451Z Got exit code 1 2025-12-04T12:05:01.5769555Z Retrying single test... 2025-12-04T12:05:01.5770140Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8170ae01b738ee47.xml 2025-12-04T12:05:01.5770336Z ============================= test session starts ============================== 2025-12-04T12:05:01.5770691Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5770811Z cachedir: .pytest_cache 2025-12-04T12:05:01.5771329Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5771457Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5771573Z configfile: pytest.ini 2025-12-04T12:05:01.5772164Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5772406Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5773214Z stepcurrent: skipping 44 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5773335Z Running 1 items in this shard 2025-12-04T12:05:01.5773340Z 2025-12-04T12:05:01.5774033Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9123s] [100%] 2025-12-04T12:05:01.5774716Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2658s] [100%] 2025-12-04T12:05:01.5775323Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2618s] [100%] 2025-12-04T12:05:01.5775329Z 2025-12-04T12:05:01.5775466Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5775837Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5775956Z Traceback (most recent call last): 2025-12-04T12:05:01.5776435Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5776605Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5777124Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5777249Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5777454Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5777460Z 2025-12-04T12:05:01.5777671Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5778469Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5778476Z 2025-12-04T12:05:01.5778785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5779045Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5779177Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5779323Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5779446Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5779680Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5779810Z graph_break [] 2025-12-04T12:05:01.5780039Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5780769Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5780869Z warnings.warn( 2025-12-04T12:05:01.5781245Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5781363Z Traceback (most recent call last): 2025-12-04T12:05:01.5781850Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5782051Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5782570Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5782704Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5782895Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5782900Z 2025-12-04T12:05:01.5783112Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5783840Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5783846Z 2025-12-04T12:05:01.5784110Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5784338Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5784452Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5784564Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5784698Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5784915Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5785012Z graph_break [] 2025-12-04T12:05:01.5785239Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5785968Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5786080Z warnings.warn( 2025-12-04T12:05:01.5786293Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5786400Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5786526Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5786744Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5786866Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5786976Z graph_break [] 2025-12-04T12:05:01.5787189Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5788208Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5788314Z warnings.warn( 2025-12-04T12:05:01.5788459Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5788832Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5788951Z Traceback (most recent call last): 2025-12-04T12:05:01.5789424Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5789594Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5790182Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5790360Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5790553Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5790559Z 2025-12-04T12:05:01.5790774Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5791551Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5791557Z 2025-12-04T12:05:01.5791821Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5792048Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5792159Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5792271Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5792402Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5792668Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5792763Z graph_break [] 2025-12-04T12:05:01.5792994Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5793728Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5793843Z warnings.warn( 2025-12-04T12:05:01.5794054Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5794160Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5794285Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5794503Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5794628Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5794738Z graph_break [] 2025-12-04T12:05:01.5794951Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5795694Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5795792Z warnings.warn( 2025-12-04T12:05:01.5796003Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5796124Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5796233Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5796450Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5796579Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5796674Z graph_break [] 2025-12-04T12:05:01.5796885Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5797618Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5797717Z warnings.warn( 2025-12-04T12:05:01.5798487Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8170ae01b738ee47.xml - 2025-12-04T12:05:01.5798654Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5799486Z FAILED [0.2618s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5799505Z 2025-12-04T12:05:01.5799718Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5800434Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5800440Z 2025-12-04T12:05:01.5800711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5800918Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5801145Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.5801258Z Got exit code 1 2025-12-04T12:05:01.5801364Z Retrying single test... 2025-12-04T12:05:01.5801946Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0595c5125e4115a2.xml 2025-12-04T12:05:01.5802133Z ============================= test session starts ============================== 2025-12-04T12:05:01.5802482Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5802603Z cachedir: .pytest_cache 2025-12-04T12:05:01.5803126Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5803250Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5803367Z configfile: pytest.ini 2025-12-04T12:05:01.5803992Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5804234Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5805036Z stepcurrent: skipping 44 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5805151Z Running 1 items in this shard 2025-12-04T12:05:01.5805156Z 2025-12-04T12:05:01.5805851Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9103s] [100%] 2025-12-04T12:05:01.5806528Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2671s] [100%] 2025-12-04T12:05:01.5807134Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2571s] [100%] 2025-12-04T12:05:01.5807143Z 2025-12-04T12:05:01.5807283Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5807654Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5807782Z Traceback (most recent call last): 2025-12-04T12:05:01.5808258Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5808428Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5808947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5809067Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5809272Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5809280Z 2025-12-04T12:05:01.5809497Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5810230Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5810236Z 2025-12-04T12:05:01.5810502Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5810721Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5810850Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5810962Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5811098Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5811316Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5811413Z graph_break [] 2025-12-04T12:05:01.5811640Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5812429Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5812531Z warnings.warn( 2025-12-04T12:05:01.5812904Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5813055Z Traceback (most recent call last): 2025-12-04T12:05:01.5813540Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5813697Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5814215Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5814352Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5814541Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5814546Z 2025-12-04T12:05:01.5814758Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5815517Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5815524Z 2025-12-04T12:05:01.5815785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5816014Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5816126Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5816238Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5816370Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5816586Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5816686Z graph_break [] 2025-12-04T12:05:01.5816909Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5817646Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5817760Z warnings.warn( 2025-12-04T12:05:01.5817974Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5818082Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5818275Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5818495Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5818614Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5818723Z graph_break [] 2025-12-04T12:05:01.5818934Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5819679Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5819777Z warnings.warn( 2025-12-04T12:05:01.5819917Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5820298Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5820418Z Traceback (most recent call last): 2025-12-04T12:05:01.5820889Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5821060Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5821577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5821710Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5821898Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5821904Z 2025-12-04T12:05:01.5822117Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5822845Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5822891Z 2025-12-04T12:05:01.5823182Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5823410Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5823519Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5823676Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5823806Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5824025Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5824123Z graph_break [] 2025-12-04T12:05:01.5824347Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5825072Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5825182Z warnings.warn( 2025-12-04T12:05:01.5825394Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5825535Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5825659Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5825881Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5826000Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5826111Z graph_break [] 2025-12-04T12:05:01.5826321Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5827064Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5827162Z warnings.warn( 2025-12-04T12:05:01.5827373Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5827494Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5827607Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5827826Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5827962Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5828058Z graph_break [] 2025-12-04T12:05:01.5828272Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5829006Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5829108Z warnings.warn( 2025-12-04T12:05:01.5829867Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0595c5125e4115a2.xml - 2025-12-04T12:05:01.5830036Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5830862Z FAILED [0.2571s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5830881Z 2025-12-04T12:05:01.5831100Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5831824Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5831832Z 2025-12-04T12:05:01.5832105Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5832283Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5832480Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.5832587Z Got exit code 1 2025-12-04T12:05:01.5833220Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5833641Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5834278Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ea830dd4e98e3dce.xml 2025-12-04T12:05:01.5834441Z ============================= test session starts ============================== 2025-12-04T12:05:01.5834799Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5834940Z cachedir: .pytest_cache 2025-12-04T12:05:01.5835470Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5835591Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5835698Z configfile: pytest.ini 2025-12-04T12:05:01.5836300Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5836522Z collecting ... collected 571 items / 45 deselected / 526 selected 2025-12-04T12:05:01.5836697Z stepcurrent: skipping 45 already run items. 2025-12-04T12:05:01.5836818Z Running 250 items in this shard 2025-12-04T12:05:01.5836826Z 2025-12-04T12:05:01.5837416Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_True_float16_cuda PASSED [3.4827s] [ 0%] 2025-12-04T12:05:01.5838137Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_hypot_load_upcast_to_fp32_False_float16_cuda PASSED [0.3196s] [ 0%] 2025-12-04T12:05:01.5838893Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0932s] [ 1%] 2025-12-04T12:05:01.5839656Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2839s] [ 1%] 2025-12-04T12:05:01.5840321Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2610s] [ 1%] 2025-12-04T12:05:01.5840330Z 2025-12-04T12:05:01.5840475Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5840900Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5841035Z Traceback (most recent call last): 2025-12-04T12:05:01.5841525Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5841749Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5842273Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5842460Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5842653Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5842659Z 2025-12-04T12:05:01.5842876Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5843618Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5843624Z 2025-12-04T12:05:01.5843894Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5844126Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5844237Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5844350Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5844580Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5844700Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5844795Z graph_break [] 2025-12-04T12:05:01.5845019Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5845800Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5845913Z warnings.warn( 2025-12-04T12:05:01.5846303Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5846423Z Traceback (most recent call last): 2025-12-04T12:05:01.5846941Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5847092Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5847621Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5847741Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5847936Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5847942Z 2025-12-04T12:05:01.5848169Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5848879Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5848921Z 2025-12-04T12:05:01.5849200Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5849417Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5849533Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5849659Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5849878Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5850000Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5850107Z graph_break [] 2025-12-04T12:05:01.5850318Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5851050Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5851162Z warnings.warn( 2025-12-04T12:05:01.5851381Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5851503Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5851614Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5851833Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5851967Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5852064Z graph_break [] 2025-12-04T12:05:01.5852280Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5853019Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5853121Z warnings.warn( 2025-12-04T12:05:01.5853273Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5853634Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5853758Z Traceback (most recent call last): 2025-12-04T12:05:01.5854244Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5854397Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5854919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5855054Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5855242Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5855248Z 2025-12-04T12:05:01.5855471Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5856189Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5856194Z 2025-12-04T12:05:01.5856493Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5856763Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5856877Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5857005Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5857254Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5857376Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5857488Z graph_break [] 2025-12-04T12:05:01.5857704Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5858520Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5858637Z warnings.warn( 2025-12-04T12:05:01.5858853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5859015Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5859132Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5859355Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5859493Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5859591Z graph_break [] 2025-12-04T12:05:01.5859804Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5860552Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5860656Z warnings.warn( 2025-12-04T12:05:01.5860887Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5860998Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5861112Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5861347Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5861468Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5861571Z graph_break [] 2025-12-04T12:05:01.5861802Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5862532Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5862648Z warnings.warn( 2025-12-04T12:05:01.5863407Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ea830dd4e98e3dce.xml - 2025-12-04T12:05:01.5863578Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5864424Z FAILED [0.2610s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5864430Z 2025-12-04T12:05:01.5864647Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5865386Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5865392Z 2025-12-04T12:05:01.5865661Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5865842Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5866069Z ============= 1 failed, 2 passed, 45 deselected, 2 rerun in 6.13s ============== 2025-12-04T12:05:01.5866168Z Got exit code 1 2025-12-04T12:05:01.5866275Z Retrying single test... 2025-12-04T12:05:01.5866863Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0ddafb1da2805e03.xml 2025-12-04T12:05:01.5867024Z ============================= test session starts ============================== 2025-12-04T12:05:01.5867476Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5867588Z cachedir: .pytest_cache 2025-12-04T12:05:01.5868156Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5868293Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5868432Z configfile: pytest.ini 2025-12-04T12:05:01.5869027Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5869269Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5870078Z stepcurrent: skipping 47 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5870202Z Running 1 items in this shard 2025-12-04T12:05:01.5870207Z 2025-12-04T12:05:01.5870922Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9120s] [100%] 2025-12-04T12:05:01.5871616Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2656s] [100%] 2025-12-04T12:05:01.5872213Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2593s] [100%] 2025-12-04T12:05:01.5872219Z 2025-12-04T12:05:01.5872357Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5872726Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5872847Z Traceback (most recent call last): 2025-12-04T12:05:01.5873343Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5873507Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5874035Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5874170Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5874365Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5874372Z 2025-12-04T12:05:01.5874587Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5875315Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5875320Z 2025-12-04T12:05:01.5875587Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5875818Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5875929Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5876049Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5876181Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5876405Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5876518Z graph_break [] 2025-12-04T12:05:01.5876732Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5877469Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5877584Z warnings.warn( 2025-12-04T12:05:01.5877949Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5878070Z Traceback (most recent call last): 2025-12-04T12:05:01.5878554Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5878710Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5879278Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5879430Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5879624Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5879630Z 2025-12-04T12:05:01.5879884Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5880606Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5880611Z 2025-12-04T12:05:01.5880889Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5881104Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5881216Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5881342Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5881461Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5881713Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5881824Z graph_break [] 2025-12-04T12:05:01.5882043Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5882789Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5882892Z warnings.warn( 2025-12-04T12:05:01.5883104Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5883228Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5883341Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5883560Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5883693Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5883791Z graph_break [] 2025-12-04T12:05:01.5884003Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5884752Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5884854Z warnings.warn( 2025-12-04T12:05:01.5885008Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5885373Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5885495Z Traceback (most recent call last): 2025-12-04T12:05:01.5885986Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5886141Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5886670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5886792Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5886989Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5886994Z 2025-12-04T12:05:01.5887224Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5888140Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5888150Z 2025-12-04T12:05:01.5888433Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5888651Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5888762Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5888894Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5889015Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5889236Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5889346Z graph_break [] 2025-12-04T12:05:01.5889636Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5890405Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5890519Z warnings.warn( 2025-12-04T12:05:01.5890772Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5890894Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5891007Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5891227Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5891360Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5891456Z graph_break [] 2025-12-04T12:05:01.5891669Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5892413Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5892560Z warnings.warn( 2025-12-04T12:05:01.5892787Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5892897Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5893009Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5893240Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5893361Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5893457Z graph_break [] 2025-12-04T12:05:01.5893680Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5894402Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5894514Z warnings.warn( 2025-12-04T12:05:01.5895269Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0ddafb1da2805e03.xml - 2025-12-04T12:05:01.5895445Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5896289Z FAILED [0.2593s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5896297Z 2025-12-04T12:05:01.5896514Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5897248Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5897253Z 2025-12-04T12:05:01.5897518Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5897693Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5897902Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.5898005Z Got exit code 1 2025-12-04T12:05:01.5898179Z Retrying single test... 2025-12-04T12:05:01.5898772Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3b676b001d5055f4.xml 2025-12-04T12:05:01.5898933Z ============================= test session starts ============================== 2025-12-04T12:05:01.5899296Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5899405Z cachedir: .pytest_cache 2025-12-04T12:05:01.5899926Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5900061Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5900168Z configfile: pytest.ini 2025-12-04T12:05:01.5900775Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5901042Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5901878Z stepcurrent: skipping 47 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5902035Z Running 1 items in this shard 2025-12-04T12:05:01.5902040Z 2025-12-04T12:05:01.5902719Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9030s] [100%] 2025-12-04T12:05:01.5903410Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2635s] [100%] 2025-12-04T12:05:01.5904003Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2538s] [100%] 2025-12-04T12:05:01.5904037Z 2025-12-04T12:05:01.5904181Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5904557Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5904681Z Traceback (most recent call last): 2025-12-04T12:05:01.5905175Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5905332Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5905857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5905993Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5906185Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5906191Z 2025-12-04T12:05:01.5906405Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5907136Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5907146Z 2025-12-04T12:05:01.5907412Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5907638Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5907750Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5907866Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5907999Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5908220Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5908329Z graph_break [] 2025-12-04T12:05:01.5908543Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5909275Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5909389Z warnings.warn( 2025-12-04T12:05:01.5909755Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5909877Z Traceback (most recent call last): 2025-12-04T12:05:01.5910364Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5910521Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5911052Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5911175Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5911367Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5911373Z 2025-12-04T12:05:01.5911602Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5912371Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5912379Z 2025-12-04T12:05:01.5912689Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5912907Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5913047Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5913175Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5913295Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5913519Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5913631Z graph_break [] 2025-12-04T12:05:01.5913847Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5914596Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5914700Z warnings.warn( 2025-12-04T12:05:01.5914949Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5915073Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5915191Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5915411Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5915542Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5915643Z graph_break [] 2025-12-04T12:05:01.5915855Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5916601Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5916702Z warnings.warn( 2025-12-04T12:05:01.5916860Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5917222Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.5917348Z Traceback (most recent call last): 2025-12-04T12:05:01.5917842Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5917998Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5918530Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5918655Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5918848Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5918853Z 2025-12-04T12:05:01.5919080Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5919798Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5919804Z 2025-12-04T12:05:01.5920081Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5920299Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5920417Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5920547Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5920668Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5920887Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5921003Z graph_break [] 2025-12-04T12:05:01.5921216Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5921948Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5922061Z warnings.warn( 2025-12-04T12:05:01.5922272Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5922392Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5922504Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5922754Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5922917Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5923015Z graph_break [] 2025-12-04T12:05:01.5923227Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5923996Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5924095Z warnings.warn( 2025-12-04T12:05:01.5924317Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5924425Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5924537Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5924768Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5924892Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5924988Z graph_break [] 2025-12-04T12:05:01.5925244Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5925973Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5926085Z warnings.warn( 2025-12-04T12:05:01.5926839Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3b676b001d5055f4.xml - 2025-12-04T12:05:01.5927012Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5927849Z FAILED [0.2538s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5927855Z 2025-12-04T12:05:01.5928070Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5928806Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5928810Z 2025-12-04T12:05:01.5929075Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5929250Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5929464Z ================== 1 failed, 294 deselected, 2 rerun in 5.09s ================== 2025-12-04T12:05:01.5929567Z Got exit code 1 2025-12-04T12:05:01.5930214Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.5930623Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.5931198Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-15a19448251eca38.xml 2025-12-04T12:05:01.5931376Z ============================= test session starts ============================== 2025-12-04T12:05:01.5931730Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5931839Z cachedir: .pytest_cache 2025-12-04T12:05:01.5932370Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5932495Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5932614Z configfile: pytest.ini 2025-12-04T12:05:01.5933203Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5933426Z collecting ... collected 571 items / 48 deselected / 523 selected 2025-12-04T12:05:01.5933579Z stepcurrent: skipping 48 already run items. 2025-12-04T12:05:01.5933690Z Running 247 items in this shard 2025-12-04T12:05:01.5933698Z 2025-12-04T12:05:01.5934463Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9115s] [ 0%] 2025-12-04T12:05:01.5935150Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2681s] [ 0%] 2025-12-04T12:05:01.5935781Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2626s] [ 0%] 2025-12-04T12:05:01.5935787Z 2025-12-04T12:05:01.5935940Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5936309Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5936442Z Traceback (most recent call last): 2025-12-04T12:05:01.5936917Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5937108Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5937644Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5937769Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5937963Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5937981Z 2025-12-04T12:05:01.5938272Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5939000Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5939006Z 2025-12-04T12:05:01.5939289Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5939508Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5939624Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5939755Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5939879Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5940116Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5940215Z graph_break [] 2025-12-04T12:05:01.5940429Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5941181Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5941282Z warnings.warn( 2025-12-04T12:05:01.5941647Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5941779Z Traceback (most recent call last): 2025-12-04T12:05:01.5942253Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5942420Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5942942Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5943064Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5943265Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5943273Z 2025-12-04T12:05:01.5943484Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5944220Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5944225Z 2025-12-04T12:05:01.5944491Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5944704Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5944829Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5944942Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5945099Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5945356Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5945453Z graph_break [] 2025-12-04T12:05:01.5945681Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5946442Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5946543Z warnings.warn( 2025-12-04T12:05:01.5946769Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5946881Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5946993Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5947224Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5947342Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5947449Z graph_break [] 2025-12-04T12:05:01.5947710Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5948442Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5948553Z warnings.warn( 2025-12-04T12:05:01.5948697Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5949069Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5949198Z Traceback (most recent call last): 2025-12-04T12:05:01.5949670Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5949837Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5950357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5950482Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5950687Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5950695Z 2025-12-04T12:05:01.5950909Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5951634Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5951654Z 2025-12-04T12:05:01.5951917Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5952132Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5952252Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5952368Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5952490Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5952721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5952819Z graph_break [] 2025-12-04T12:05:01.5953037Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5953782Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5953885Z warnings.warn( 2025-12-04T12:05:01.5954107Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5954215Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5954327Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5954561Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5954681Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5954778Z graph_break [] 2025-12-04T12:05:01.5955003Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5955768Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5955883Z warnings.warn( 2025-12-04T12:05:01.5956123Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5956234Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5956393Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5956612Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5956732Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5956839Z graph_break [] 2025-12-04T12:05:01.5957052Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5957791Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5957889Z warnings.warn( 2025-12-04T12:05:01.5958641Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-15a19448251eca38.xml - 2025-12-04T12:05:01.5958850Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5959687Z FAILED [0.2626s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5959696Z 2025-12-04T12:05:01.5959922Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5960644Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5960650Z 2025-12-04T12:05:01.5960913Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5961102Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5961303Z ================== 1 failed, 48 deselected, 2 rerun in 5.12s =================== 2025-12-04T12:05:01.5961415Z Got exit code 1 2025-12-04T12:05:01.5961523Z Retrying single test... 2025-12-04T12:05:01.5962103Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7dd506857cedd350.xml 2025-12-04T12:05:01.5962287Z ============================= test session starts ============================== 2025-12-04T12:05:01.5962638Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5962747Z cachedir: .pytest_cache 2025-12-04T12:05:01.5963283Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5963408Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5963532Z configfile: pytest.ini 2025-12-04T12:05:01.5964127Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5964363Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5965189Z stepcurrent: skipping 48 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5965307Z Running 1 items in this shard 2025-12-04T12:05:01.5965312Z 2025-12-04T12:05:01.5966013Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9155s] [100%] 2025-12-04T12:05:01.5966704Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2642s] [100%] 2025-12-04T12:05:01.5967307Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2590s] [100%] 2025-12-04T12:05:01.5967345Z 2025-12-04T12:05:01.5967534Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5967905Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5968039Z Traceback (most recent call last): 2025-12-04T12:05:01.5968549Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5968704Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5969242Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5969370Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5969577Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5969582Z 2025-12-04T12:05:01.5969799Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5970561Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5970567Z 2025-12-04T12:05:01.5970845Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5971066Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5971193Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5971309Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5971432Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5971665Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5971765Z graph_break [] 2025-12-04T12:05:01.5971979Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5972728Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5972835Z warnings.warn( 2025-12-04T12:05:01.5973206Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5973349Z Traceback (most recent call last): 2025-12-04T12:05:01.5973821Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5973995Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5974514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5974639Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5974845Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5974851Z 2025-12-04T12:05:01.5975064Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5975802Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5975812Z 2025-12-04T12:05:01.5976077Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5976291Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5976415Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5976528Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5976648Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5976878Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5976976Z graph_break [] 2025-12-04T12:05:01.5977203Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5977933Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5978034Z warnings.warn( 2025-12-04T12:05:01.5978374Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5978518Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5978633Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5978866Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5979019Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5979129Z graph_break [] 2025-12-04T12:05:01.5979344Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5980071Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5980185Z warnings.warn( 2025-12-04T12:05:01.5980327Z =================================== FAILURES =================================== 2025-12-04T12:05:01.5980698Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5980872Z Traceback (most recent call last): 2025-12-04T12:05:01.5981350Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.5981516Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.5982039Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.5982162Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.5982365Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5982371Z 2025-12-04T12:05:01.5982585Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5983322Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5983327Z 2025-12-04T12:05:01.5983599Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5983819Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5983944Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5984060Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5984181Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5984415Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5984513Z graph_break [] 2025-12-04T12:05:01.5984739Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5985473Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5985573Z warnings.warn( 2025-12-04T12:05:01.5985798Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5985908Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5986024Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5986258Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5986379Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5986474Z graph_break [] 2025-12-04T12:05:01.5986698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5987429Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5987543Z warnings.warn( 2025-12-04T12:05:01.5987939Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.5988056Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.5988185Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.5988406Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.5988531Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.5988648Z graph_break [] 2025-12-04T12:05:01.5988972Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.5989724Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.5989896Z warnings.warn( 2025-12-04T12:05:01.5990647Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7dd506857cedd350.xml - 2025-12-04T12:05:01.5990838Z =========================== short test summary info ============================ 2025-12-04T12:05:01.5991684Z FAILED [0.2590s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.5991691Z 2025-12-04T12:05:01.5991926Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.5992698Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5992703Z 2025-12-04T12:05:01.5992970Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.5993164Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.5993362Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.5993473Z Got exit code 1 2025-12-04T12:05:01.5993579Z Retrying single test... 2025-12-04T12:05:01.5994149Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a068bf6354320ba5.xml 2025-12-04T12:05:01.5994322Z ============================= test session starts ============================== 2025-12-04T12:05:01.5994676Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.5994786Z cachedir: .pytest_cache 2025-12-04T12:05:01.5995324Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.5995450Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.5995578Z configfile: pytest.ini 2025-12-04T12:05:01.5996170Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.5996399Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.5997220Z stepcurrent: skipping 48 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.5997337Z Running 1 items in this shard 2025-12-04T12:05:01.5997342Z 2025-12-04T12:05:01.5998045Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9047s] [100%] 2025-12-04T12:05:01.5998733Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2629s] [100%] 2025-12-04T12:05:01.5999330Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2606s] [100%] 2025-12-04T12:05:01.5999348Z 2025-12-04T12:05:01.5999487Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.5999857Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.5999993Z Traceback (most recent call last): 2025-12-04T12:05:01.6000470Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6000630Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6001227Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6001352Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6001553Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6001588Z 2025-12-04T12:05:01.6001805Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6002530Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6002536Z 2025-12-04T12:05:01.6002814Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6003031Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6003155Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6003298Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6003420Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6003656Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6003755Z graph_break [] 2025-12-04T12:05:01.6003967Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6004716Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6004816Z warnings.warn( 2025-12-04T12:05:01.6005192Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6005311Z Traceback (most recent call last): 2025-12-04T12:05:01.6005785Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6005951Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6006473Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6006599Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6006803Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6006809Z 2025-12-04T12:05:01.6007027Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6007764Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6007770Z 2025-12-04T12:05:01.6008037Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6008254Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6008382Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6008500Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6008621Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6008862Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6008964Z graph_break [] 2025-12-04T12:05:01.6009192Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6009919Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6010021Z warnings.warn( 2025-12-04T12:05:01.6010247Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6010357Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6010471Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6010704Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6010826Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6010936Z graph_break [] 2025-12-04T12:05:01.6011151Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6011941Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6012054Z warnings.warn( 2025-12-04T12:05:01.6012197Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6012597Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6012732Z Traceback (most recent call last): 2025-12-04T12:05:01.6013203Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6013370Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6013888Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6014010Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6014247Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6014253Z 2025-12-04T12:05:01.6014468Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6015205Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6015213Z 2025-12-04T12:05:01.6015477Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6015693Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6015818Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6015932Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6016051Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6016283Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6016381Z graph_break [] 2025-12-04T12:05:01.6016613Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6017349Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6017452Z warnings.warn( 2025-12-04T12:05:01.6017681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6017793Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6017908Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6018239Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6018364Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6018475Z graph_break [] 2025-12-04T12:05:01.6018689Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6019418Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6019540Z warnings.warn( 2025-12-04T12:05:01.6019757Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6019870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6019999Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6020221Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6020343Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6020454Z graph_break [] 2025-12-04T12:05:01.6020668Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6021410Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6021512Z warnings.warn( 2025-12-04T12:05:01.6022264Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a068bf6354320ba5.xml - 2025-12-04T12:05:01.6022514Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6023348Z FAILED [0.2606s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6023383Z 2025-12-04T12:05:01.6023615Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6024337Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6024342Z 2025-12-04T12:05:01.6024606Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6024799Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6024998Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.6025150Z Got exit code 1 2025-12-04T12:05:01.6025802Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6026212Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6026799Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0bf202cdb375dae7.xml 2025-12-04T12:05:01.6026965Z ============================= test session starts ============================== 2025-12-04T12:05:01.6027328Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6027437Z cachedir: .pytest_cache 2025-12-04T12:05:01.6027958Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6028097Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6028204Z configfile: pytest.ini 2025-12-04T12:05:01.6028797Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6029034Z collecting ... collected 571 items / 49 deselected / 522 selected 2025-12-04T12:05:01.6029176Z stepcurrent: skipping 49 already run items. 2025-12-04T12:05:01.6029301Z Running 246 items in this shard 2025-12-04T12:05:01.6029307Z 2025-12-04T12:05:01.6029991Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9065s] [ 0%] 2025-12-04T12:05:01.6030672Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2678s] [ 0%] 2025-12-04T12:05:01.6031281Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2544s] [ 0%] 2025-12-04T12:05:01.6031289Z 2025-12-04T12:05:01.6031432Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6031803Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6031927Z Traceback (most recent call last): 2025-12-04T12:05:01.6032402Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6032568Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6033091Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6033229Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6033420Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6033425Z 2025-12-04T12:05:01.6033674Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6034451Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6034457Z 2025-12-04T12:05:01.6034722Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6034986Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6035100Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6035214Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6035345Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6035569Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6035665Z graph_break [] 2025-12-04T12:05:01.6035891Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6036625Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6036831Z warnings.warn( 2025-12-04T12:05:01.6037194Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6037316Z Traceback (most recent call last): 2025-12-04T12:05:01.6037803Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6037960Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6038481Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6038617Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6038808Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6038813Z 2025-12-04T12:05:01.6039042Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6039768Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6039774Z 2025-12-04T12:05:01.6040040Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6040272Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6040385Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6040511Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6040631Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6040850Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6040958Z graph_break [] 2025-12-04T12:05:01.6041171Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6041903Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6042020Z warnings.warn( 2025-12-04T12:05:01.6042238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6042360Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6042473Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6042694Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6042827Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6042925Z graph_break [] 2025-12-04T12:05:01.6043139Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6043877Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6043979Z warnings.warn( 2025-12-04T12:05:01.6044122Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6044531Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6044683Z Traceback (most recent call last): 2025-12-04T12:05:01.6045174Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6045329Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6045875Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6046011Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6046201Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6046207Z 2025-12-04T12:05:01.6046433Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6047152Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6047186Z 2025-12-04T12:05:01.6047455Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6047685Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6047798Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6047909Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6048042Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6048261Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6048371Z graph_break [] 2025-12-04T12:05:01.6048581Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6049313Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6049424Z warnings.warn( 2025-12-04T12:05:01.6049636Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6049752Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6049877Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6050098Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6050231Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6050329Z graph_break [] 2025-12-04T12:05:01.6050542Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6051282Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6051381Z warnings.warn( 2025-12-04T12:05:01.6051594Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6051713Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6051826Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6052056Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6052180Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6052277Z graph_break [] 2025-12-04T12:05:01.6052502Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6053228Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6053329Z warnings.warn( 2025-12-04T12:05:01.6054092Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0bf202cdb375dae7.xml - 2025-12-04T12:05:01.6054261Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6055101Z FAILED [0.2544s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6055109Z 2025-12-04T12:05:01.6055352Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6056103Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6056123Z 2025-12-04T12:05:01.6056417Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6056596Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6056802Z ================== 1 failed, 49 deselected, 2 rerun in 5.11s =================== 2025-12-04T12:05:01.6056905Z Got exit code 1 2025-12-04T12:05:01.6057010Z Retrying single test... 2025-12-04T12:05:01.6057595Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-796bdc853aa8ac88.xml 2025-12-04T12:05:01.6057756Z ============================= test session starts ============================== 2025-12-04T12:05:01.6058246Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6058360Z cachedir: .pytest_cache 2025-12-04T12:05:01.6058884Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6059022Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6059130Z configfile: pytest.ini 2025-12-04T12:05:01.6059722Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6059961Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6060769Z stepcurrent: skipping 49 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6060898Z Running 1 items in this shard 2025-12-04T12:05:01.6060907Z 2025-12-04T12:05:01.6061592Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9270s] [100%] 2025-12-04T12:05:01.6062274Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2671s] [100%] 2025-12-04T12:05:01.6062887Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2644s] [100%] 2025-12-04T12:05:01.6062892Z 2025-12-04T12:05:01.6063033Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6063410Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6063533Z Traceback (most recent call last): 2025-12-04T12:05:01.6064010Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6064185Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6064711Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6064845Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6065039Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6065044Z 2025-12-04T12:05:01.6065257Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6065989Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6065996Z 2025-12-04T12:05:01.6066261Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6066492Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6066603Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6066757Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6066891Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6067140Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6067239Z graph_break [] 2025-12-04T12:05:01.6067467Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6068228Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6068341Z warnings.warn( 2025-12-04T12:05:01.6068707Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6068828Z Traceback (most recent call last): 2025-12-04T12:05:01.6069313Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6069466Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6070023Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6070164Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6070355Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6070362Z 2025-12-04T12:05:01.6070588Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6071305Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6071310Z 2025-12-04T12:05:01.6071575Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6071805Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6071917Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6072045Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6072172Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6072394Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6072507Z graph_break [] 2025-12-04T12:05:01.6072721Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6073457Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6073575Z warnings.warn( 2025-12-04T12:05:01.6073789Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6073915Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6074028Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6074247Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6074382Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6074479Z graph_break [] 2025-12-04T12:05:01.6074696Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6075440Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6075539Z warnings.warn( 2025-12-04T12:05:01.6075698Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6076060Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6076183Z Traceback (most recent call last): 2025-12-04T12:05:01.6076673Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6076829Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6077349Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6077532Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6077727Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6077761Z 2025-12-04T12:05:01.6077987Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6078711Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6078747Z 2025-12-04T12:05:01.6079013Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6079243Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6079353Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6079482Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6079600Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6079818Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6079961Z graph_break [] 2025-12-04T12:05:01.6080181Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6080920Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6081031Z warnings.warn( 2025-12-04T12:05:01.6081244Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6081364Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6081476Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6081692Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6081822Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6081917Z graph_break [] 2025-12-04T12:05:01.6082127Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6082878Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6082980Z warnings.warn( 2025-12-04T12:05:01.6083195Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6083318Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6083429Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6083657Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6083776Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6083872Z graph_break [] 2025-12-04T12:05:01.6084092Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6084818Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6084920Z warnings.warn( 2025-12-04T12:05:01.6085688Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-796bdc853aa8ac88.xml - 2025-12-04T12:05:01.6085864Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6086707Z FAILED [0.2644s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6086716Z 2025-12-04T12:05:01.6086933Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6087658Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6087663Z 2025-12-04T12:05:01.6088122Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6088304Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6088587Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.6088688Z Got exit code 1 2025-12-04T12:05:01.6088834Z Retrying single test... 2025-12-04T12:05:01.6089425Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-abef13434279995d.xml 2025-12-04T12:05:01.6089627Z ============================= test session starts ============================== 2025-12-04T12:05:01.6089990Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6090096Z cachedir: .pytest_cache 2025-12-04T12:05:01.6090618Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6090752Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6090860Z configfile: pytest.ini 2025-12-04T12:05:01.6091455Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6091738Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6092544Z stepcurrent: skipping 49 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6092670Z Running 1 items in this shard 2025-12-04T12:05:01.6092676Z 2025-12-04T12:05:01.6093357Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9006s] [100%] 2025-12-04T12:05:01.6094044Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2755s] [100%] 2025-12-04T12:05:01.6094638Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2633s] [100%] 2025-12-04T12:05:01.6094648Z 2025-12-04T12:05:01.6094791Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6095163Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6095282Z Traceback (most recent call last): 2025-12-04T12:05:01.6095762Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6095933Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6096457Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6096596Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6096786Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6096792Z 2025-12-04T12:05:01.6097006Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6097746Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6097751Z 2025-12-04T12:05:01.6098017Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6098345Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6098458Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6098572Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6098710Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6104558Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6104730Z graph_break [] 2025-12-04T12:05:01.6104980Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6105733Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6105925Z warnings.warn( 2025-12-04T12:05:01.6106333Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6106457Z Traceback (most recent call last): 2025-12-04T12:05:01.6106940Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6107139Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6107664Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6107795Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6107985Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6107994Z 2025-12-04T12:05:01.6108213Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6108935Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6108980Z 2025-12-04T12:05:01.6109252Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6109473Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6109584Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6109698Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6109825Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6110045Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6110143Z graph_break [] 2025-12-04T12:05:01.6110356Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6111089Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6111198Z warnings.warn( 2025-12-04T12:05:01.6111413Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6111521Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6111642Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6111860Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6111984Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6112078Z graph_break [] 2025-12-04T12:05:01.6112290Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6113024Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6113121Z warnings.warn( 2025-12-04T12:05:01.6113259Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6113629Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6113754Z Traceback (most recent call last): 2025-12-04T12:05:01.6114235Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6114391Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6114904Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6115034Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6115225Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6115231Z 2025-12-04T12:05:01.6115442Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6116165Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6116171Z 2025-12-04T12:05:01.6116439Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6116719Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6116829Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6116941Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6117065Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6117311Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6117412Z graph_break [] 2025-12-04T12:05:01.6117624Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6118355Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6118457Z warnings.warn( 2025-12-04T12:05:01.6118669Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6118775Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6118939Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6119155Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6119273Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6119375Z graph_break [] 2025-12-04T12:05:01.6119584Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6120318Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6120415Z warnings.warn( 2025-12-04T12:05:01.6120621Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6120741Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6120850Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6121064Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6121187Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6121282Z graph_break [] 2025-12-04T12:05:01.6121499Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6122225Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6122321Z warnings.warn( 2025-12-04T12:05:01.6123075Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-abef13434279995d.xml - 2025-12-04T12:05:01.6123240Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6124087Z FAILED [0.2633s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6124094Z 2025-12-04T12:05:01.6124307Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6125028Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6125034Z 2025-12-04T12:05:01.6125309Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6125484Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6125690Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.6125788Z Got exit code 1 2025-12-04T12:05:01.6126426Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6126847Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6127448Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-47824ae1e1d0f6cb.xml 2025-12-04T12:05:01.6127656Z ============================= test session starts ============================== 2025-12-04T12:05:01.6128011Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6128119Z cachedir: .pytest_cache 2025-12-04T12:05:01.6128680Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6128803Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6128907Z configfile: pytest.ini 2025-12-04T12:05:01.6129509Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6129731Z collecting ... collected 571 items / 50 deselected / 521 selected 2025-12-04T12:05:01.6129882Z stepcurrent: skipping 50 already run items. 2025-12-04T12:05:01.6129995Z Running 245 items in this shard 2025-12-04T12:05:01.6130030Z 2025-12-04T12:05:01.6130631Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_float16_cuda PASSED [3.4307s] [ 0%] 2025-12-04T12:05:01.6131330Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.4343s] [ 0%] 2025-12-04T12:05:01.6132018Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2716s] [ 0%] 2025-12-04T12:05:01.6132627Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2600s] [ 0%] 2025-12-04T12:05:01.6132633Z 2025-12-04T12:05:01.6132770Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6133138Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6133272Z Traceback (most recent call last): 2025-12-04T12:05:01.6133753Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6133916Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6134443Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6134562Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6134764Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6134769Z 2025-12-04T12:05:01.6134984Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6135706Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6135723Z 2025-12-04T12:05:01.6135990Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6136208Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6136327Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6136442Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6136657Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6136793Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6136889Z graph_break [] 2025-12-04T12:05:01.6137113Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6137842Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6137943Z warnings.warn( 2025-12-04T12:05:01.6138411Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6138538Z Traceback (most recent call last): 2025-12-04T12:05:01.6139084Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6139250Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6139768Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6139932Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6140126Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6140132Z 2025-12-04T12:05:01.6140340Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6141076Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6141081Z 2025-12-04T12:05:01.6141344Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6141603Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6141715Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6141825Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6142054Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6142176Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6142269Z graph_break [] 2025-12-04T12:05:01.6142495Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6143225Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6143339Z warnings.warn( 2025-12-04T12:05:01.6143550Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6143656Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6143780Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6144003Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6144124Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6144231Z graph_break [] 2025-12-04T12:05:01.6144437Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6145164Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6145277Z warnings.warn( 2025-12-04T12:05:01.6145418Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6145801Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6145917Z Traceback (most recent call last): 2025-12-04T12:05:01.6146394Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6146563Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6147082Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6147217Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6147406Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6147413Z 2025-12-04T12:05:01.6147627Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6148354Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6148360Z 2025-12-04T12:05:01.6148624Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6148853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6148960Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6149074Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6149334Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6149479Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6149576Z graph_break [] 2025-12-04T12:05:01.6149797Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6150551Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6150651Z warnings.warn( 2025-12-04T12:05:01.6150872Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6150979Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6151104Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6151318Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6151435Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6151567Z graph_break [] 2025-12-04T12:05:01.6151778Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6152509Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6152622Z warnings.warn( 2025-12-04T12:05:01.6152832Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6152953Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6153064Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6153281Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6153410Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6153504Z graph_break [] 2025-12-04T12:05:01.6153716Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6154453Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6154555Z warnings.warn( 2025-12-04T12:05:01.6155319Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-47824ae1e1d0f6cb.xml - 2025-12-04T12:05:01.6155489Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6156321Z FAILED [0.2600s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6156327Z 2025-12-04T12:05:01.6156550Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6157274Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6157281Z 2025-12-04T12:05:01.6157560Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6157737Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6157948Z ============= 1 failed, 1 passed, 50 deselected, 2 rerun in 6.08s ============== 2025-12-04T12:05:01.6158057Z Got exit code 1 2025-12-04T12:05:01.6158159Z Retrying single test... 2025-12-04T12:05:01.6158741Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4038060382adbf9a.xml 2025-12-04T12:05:01.6158901Z ============================= test session starts ============================== 2025-12-04T12:05:01.6159250Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6159371Z cachedir: .pytest_cache 2025-12-04T12:05:01.6159893Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6160046Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6160162Z configfile: pytest.ini 2025-12-04T12:05:01.6160779Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6161018Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6161870Z stepcurrent: skipping 51 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6161982Z Running 1 items in this shard 2025-12-04T12:05:01.6161988Z 2025-12-04T12:05:01.6162685Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9220s] [100%] 2025-12-04T12:05:01.6163364Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2714s] [100%] 2025-12-04T12:05:01.6164008Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2744s] [100%] 2025-12-04T12:05:01.6164014Z 2025-12-04T12:05:01.6164161Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6164525Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6164656Z Traceback (most recent call last): 2025-12-04T12:05:01.6165127Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6165298Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6165815Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6165934Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6166139Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6166145Z 2025-12-04T12:05:01.6166356Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6167092Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6167100Z 2025-12-04T12:05:01.6167366Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6167579Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6167699Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6167809Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6167927Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6168156Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6168252Z graph_break [] 2025-12-04T12:05:01.6168480Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6169214Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6169310Z warnings.warn( 2025-12-04T12:05:01.6169689Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6169804Z Traceback (most recent call last): 2025-12-04T12:05:01.6170276Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6170438Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6170956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6171091Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6171278Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6171286Z 2025-12-04T12:05:01.6171531Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6172291Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6172324Z 2025-12-04T12:05:01.6172586Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6172815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6172922Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6173033Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6173163Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6173382Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6173479Z graph_break [] 2025-12-04T12:05:01.6173698Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6175074Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6175189Z warnings.warn( 2025-12-04T12:05:01.6175402Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6175510Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6175636Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6175853Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6175976Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6176080Z graph_break [] 2025-12-04T12:05:01.6176292Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6177035Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6177133Z warnings.warn( 2025-12-04T12:05:01.6177277Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6177657Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6177776Z Traceback (most recent call last): 2025-12-04T12:05:01.6178352Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6178523Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6179044Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6179176Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6179364Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6179370Z 2025-12-04T12:05:01.6179583Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6180322Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6180332Z 2025-12-04T12:05:01.6180593Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6180821Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6180934Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6181044Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6181174Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6181388Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6181483Z graph_break [] 2025-12-04T12:05:01.6181705Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6182432Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6182587Z warnings.warn( 2025-12-04T12:05:01.6182833Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6182943Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6183067Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6183288Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6183439Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6183548Z graph_break [] 2025-12-04T12:05:01.6183758Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6184485Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6184597Z warnings.warn( 2025-12-04T12:05:01.6184805Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6184925Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6185069Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6185286Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6185417Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6185509Z graph_break [] 2025-12-04T12:05:01.6185719Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6186455Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6186550Z warnings.warn( 2025-12-04T12:05:01.6187313Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4038060382adbf9a.xml - 2025-12-04T12:05:01.6187483Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6188558Z FAILED [0.2744s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6188569Z 2025-12-04T12:05:01.6188803Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6189529Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6189537Z 2025-12-04T12:05:01.6189815Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6189991Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6190185Z ================== 1 failed, 294 deselected, 2 rerun in 5.16s ================== 2025-12-04T12:05:01.6190297Z Got exit code 1 2025-12-04T12:05:01.6190402Z Retrying single test... 2025-12-04T12:05:01.6190993Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d0c6827a4c24a96e.xml 2025-12-04T12:05:01.6191159Z ============================= test session starts ============================== 2025-12-04T12:05:01.6191510Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6191630Z cachedir: .pytest_cache 2025-12-04T12:05:01.6192157Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6192278Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6192394Z configfile: pytest.ini 2025-12-04T12:05:01.6192984Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6193220Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6194110Z stepcurrent: skipping 51 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6194226Z Running 1 items in this shard 2025-12-04T12:05:01.6194274Z 2025-12-04T12:05:01.6194978Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9270s] [100%] 2025-12-04T12:05:01.6195720Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2695s] [100%] 2025-12-04T12:05:01.6196330Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2779s] [100%] 2025-12-04T12:05:01.6196336Z 2025-12-04T12:05:01.6196474Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6196839Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6197016Z Traceback (most recent call last): 2025-12-04T12:05:01.6197499Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6197664Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6198184Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6198308Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6198508Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6198513Z 2025-12-04T12:05:01.6198727Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6199461Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6199467Z 2025-12-04T12:05:01.6199733Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6199953Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6200078Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6200191Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6200310Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6200542Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6200640Z graph_break [] 2025-12-04T12:05:01.6200864Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6201597Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6201696Z warnings.warn( 2025-12-04T12:05:01.6202075Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6202196Z Traceback (most recent call last): 2025-12-04T12:05:01.6202674Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6202842Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6203357Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6203490Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6203681Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6203687Z 2025-12-04T12:05:01.6203898Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6204628Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6204633Z 2025-12-04T12:05:01.6204893Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6205118Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6205266Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6205422Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6205553Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6205772Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6205895Z graph_break [] 2025-12-04T12:05:01.6206119Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6206850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6206960Z warnings.warn( 2025-12-04T12:05:01.6207171Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6207277Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6207402Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6207618Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6207773Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6207884Z graph_break [] 2025-12-04T12:05:01.6208093Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6208832Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6208934Z warnings.warn( 2025-12-04T12:05:01.6209075Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6209453Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6209570Z Traceback (most recent call last): 2025-12-04T12:05:01.6210047Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6210205Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6210727Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6210856Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6211045Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6211053Z 2025-12-04T12:05:01.6211263Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6211991Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6211997Z 2025-12-04T12:05:01.6212262Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6212482Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6212591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6212700Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6212831Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6213048Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6213143Z graph_break [] 2025-12-04T12:05:01.6213366Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6214102Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6214216Z warnings.warn( 2025-12-04T12:05:01.6214430Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6214538Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6214656Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6214873Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6214988Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6215096Z graph_break [] 2025-12-04T12:05:01.6215339Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6216101Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6216203Z warnings.warn( 2025-12-04T12:05:01.6216497Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6216620Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6216732Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6216946Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6217072Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6217166Z graph_break [] 2025-12-04T12:05:01.6217375Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6218191Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6218334Z warnings.warn( 2025-12-04T12:05:01.6219099Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d0c6827a4c24a96e.xml - 2025-12-04T12:05:01.6219268Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6220103Z FAILED [0.2779s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6220124Z 2025-12-04T12:05:01.6220340Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6221058Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6221063Z 2025-12-04T12:05:01.6221335Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6221518Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6221710Z ================== 1 failed, 294 deselected, 2 rerun in 5.16s ================== 2025-12-04T12:05:01.6221820Z Got exit code 1 2025-12-04T12:05:01.6222461Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6222879Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6223448Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eb0f3705043be69a.xml 2025-12-04T12:05:01.6223605Z ============================= test session starts ============================== 2025-12-04T12:05:01.6223965Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6224078Z cachedir: .pytest_cache 2025-12-04T12:05:01.6224610Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6224732Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6224835Z configfile: pytest.ini 2025-12-04T12:05:01.6225430Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6225652Z collecting ... collected 571 items / 52 deselected / 519 selected 2025-12-04T12:05:01.6225788Z stepcurrent: skipping 52 already run items. 2025-12-04T12:05:01.6225904Z Running 243 items in this shard 2025-12-04T12:05:01.6225909Z 2025-12-04T12:05:01.6226601Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9108s] [ 0%] 2025-12-04T12:05:01.6227320Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2639s] [ 0%] 2025-12-04T12:05:01.6227949Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2561s] [ 0%] 2025-12-04T12:05:01.6227983Z 2025-12-04T12:05:01.6228135Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6228497Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6228617Z Traceback (most recent call last): 2025-12-04T12:05:01.6229097Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6229250Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6229769Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6229932Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6230125Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6230132Z 2025-12-04T12:05:01.6230354Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6231074Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6231082Z 2025-12-04T12:05:01.6231345Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6231568Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6231678Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6231798Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6231913Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6232133Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6232241Z graph_break [] 2025-12-04T12:05:01.6232455Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6233187Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6233295Z warnings.warn( 2025-12-04T12:05:01.6233657Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6233785Z Traceback (most recent call last): 2025-12-04T12:05:01.6234252Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6234403Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6234924Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6235043Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6235235Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6235240Z 2025-12-04T12:05:01.6235458Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6236178Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6236186Z 2025-12-04T12:05:01.6236453Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6236665Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6236775Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6236897Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6237012Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6237237Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6237330Z graph_break [] 2025-12-04T12:05:01.6237575Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6238344Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6238441Z warnings.warn( 2025-12-04T12:05:01.6238681Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6238798Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6238909Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6239126Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6239249Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6239344Z graph_break [] 2025-12-04T12:05:01.6239560Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6240293Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6240425Z warnings.warn( 2025-12-04T12:05:01.6240575Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6240943Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6241067Z Traceback (most recent call last): 2025-12-04T12:05:01.6241548Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6241697Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6242214Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6242333Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6242521Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6242528Z 2025-12-04T12:05:01.6242746Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6243472Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6243477Z 2025-12-04T12:05:01.6243750Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6243967Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6244078Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6244200Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6244317Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6244532Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6244637Z graph_break [] 2025-12-04T12:05:01.6244849Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6245593Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6245694Z warnings.warn( 2025-12-04T12:05:01.6245905Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6246023Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6246138Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6246351Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6246484Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6246577Z graph_break [] 2025-12-04T12:05:01.6246799Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6247535Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6247627Z warnings.warn( 2025-12-04T12:05:01.6247850Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6248009Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6248121Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6248380Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6248501Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6248593Z graph_break [] 2025-12-04T12:05:01.6248850Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6249579Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6249688Z warnings.warn( 2025-12-04T12:05:01.6250439Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eb0f3705043be69a.xml - 2025-12-04T12:05:01.6250607Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6251490Z FAILED [0.2561s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6251496Z 2025-12-04T12:05:01.6251710Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6252445Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6252450Z 2025-12-04T12:05:01.6252712Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6252898Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6253090Z ================== 1 failed, 52 deselected, 2 rerun in 5.11s =================== 2025-12-04T12:05:01.6253186Z Got exit code 1 2025-12-04T12:05:01.6253300Z Retrying single test... 2025-12-04T12:05:01.6253877Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b4b4a4cac36fd9fb.xml 2025-12-04T12:05:01.6254037Z ============================= test session starts ============================== 2025-12-04T12:05:01.6254398Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6254503Z cachedir: .pytest_cache 2025-12-04T12:05:01.6255034Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6255157Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6255261Z configfile: pytest.ini 2025-12-04T12:05:01.6255863Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6256087Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6256891Z stepcurrent: skipping 52 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6257017Z Running 1 items in this shard 2025-12-04T12:05:01.6257022Z 2025-12-04T12:05:01.6257708Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9180s] [100%] 2025-12-04T12:05:01.6258488Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2646s] [100%] 2025-12-04T12:05:01.6259082Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2654s] [100%] 2025-12-04T12:05:01.6259088Z 2025-12-04T12:05:01.6259234Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6259598Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6259766Z Traceback (most recent call last): 2025-12-04T12:05:01.6260275Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6260427Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6260979Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6261109Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6261297Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6261303Z 2025-12-04T12:05:01.6261525Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6262248Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6262254Z 2025-12-04T12:05:01.6262552Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6262778Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6262889Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6263011Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6263131Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6263352Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6263457Z graph_break [] 2025-12-04T12:05:01.6263672Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6264403Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6264511Z warnings.warn( 2025-12-04T12:05:01.6264877Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6265029Z Traceback (most recent call last): 2025-12-04T12:05:01.6265593Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6265748Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6266271Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6266395Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6266583Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6266597Z 2025-12-04T12:05:01.6266806Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6267524Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6267530Z 2025-12-04T12:05:01.6267803Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6268025Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6268140Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6268263Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6268382Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6268610Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6268708Z graph_break [] 2025-12-04T12:05:01.6268921Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6269662Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6269761Z warnings.warn( 2025-12-04T12:05:01.6269975Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6270091Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6270201Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6270471Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6270621Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6270719Z graph_break [] 2025-12-04T12:05:01.6270940Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6271696Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6271796Z warnings.warn( 2025-12-04T12:05:01.6271950Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6272314Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6272441Z Traceback (most recent call last): 2025-12-04T12:05:01.6272917Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6273108Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6273640Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6273764Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6273955Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6273962Z 2025-12-04T12:05:01.6274184Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6274907Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6274912Z 2025-12-04T12:05:01.6275182Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6275397Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6275509Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6275635Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6275758Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6275975Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6276081Z graph_break [] 2025-12-04T12:05:01.6276295Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6277035Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6277131Z warnings.warn( 2025-12-04T12:05:01.6277343Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6277458Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6277569Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6277781Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6277909Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6278004Z graph_break [] 2025-12-04T12:05:01.6278225Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6278959Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6279056Z warnings.warn( 2025-12-04T12:05:01.6279278Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6279387Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6279493Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6279721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6279838Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6279941Z graph_break [] 2025-12-04T12:05:01.6280151Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6280906Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6281014Z warnings.warn( 2025-12-04T12:05:01.6281793Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b4b4a4cac36fd9fb.xml - 2025-12-04T12:05:01.6281990Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6282832Z FAILED [0.2654s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6282838Z 2025-12-04T12:05:01.6283046Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6283773Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6283837Z 2025-12-04T12:05:01.6284105Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6284288Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6284486Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.6284582Z Got exit code 1 2025-12-04T12:05:01.6284695Z Retrying single test... 2025-12-04T12:05:01.6285272Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5f059c7d0f50bda4.xml 2025-12-04T12:05:01.6285427Z ============================= test session starts ============================== 2025-12-04T12:05:01.6285781Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6285889Z cachedir: .pytest_cache 2025-12-04T12:05:01.6286418Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6286543Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6286647Z configfile: pytest.ini 2025-12-04T12:05:01.6287254Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6287484Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6288521Z stepcurrent: skipping 52 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6288651Z Running 1 items in this shard 2025-12-04T12:05:01.6288656Z 2025-12-04T12:05:01.6289341Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9079s] [100%] 2025-12-04T12:05:01.6290043Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2641s] [100%] 2025-12-04T12:05:01.6290649Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2602s] [100%] 2025-12-04T12:05:01.6290654Z 2025-12-04T12:05:01.6290808Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6291178Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6291302Z Traceback (most recent call last): 2025-12-04T12:05:01.6291786Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6291943Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6292466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6292594Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6292866Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6292872Z 2025-12-04T12:05:01.6293143Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6293868Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6293910Z 2025-12-04T12:05:01.6294178Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6294406Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6294519Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6294641Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6294762Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6294982Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6295088Z graph_break [] 2025-12-04T12:05:01.6295348Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6296082Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6296190Z warnings.warn( 2025-12-04T12:05:01.6296556Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6296684Z Traceback (most recent call last): 2025-12-04T12:05:01.6297153Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6297305Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6297832Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6297953Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6298226Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6298246Z 2025-12-04T12:05:01.6298469Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6299185Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6299194Z 2025-12-04T12:05:01.6299469Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6299685Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6299793Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6299915Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6300036Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6300263Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6300358Z graph_break [] 2025-12-04T12:05:01.6300571Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6301324Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6301424Z warnings.warn( 2025-12-04T12:05:01.6301638Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6301758Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6301871Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6302097Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6302213Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6302308Z graph_break [] 2025-12-04T12:05:01.6302529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6303257Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6303357Z warnings.warn( 2025-12-04T12:05:01.6303542Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6303940Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6304067Z Traceback (most recent call last): 2025-12-04T12:05:01.6304566Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6304715Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6305236Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6305352Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6305541Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6305547Z 2025-12-04T12:05:01.6305767Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6306518Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6306524Z 2025-12-04T12:05:01.6306794Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6307008Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6307118Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6307233Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6307350Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6307573Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6307672Z graph_break [] 2025-12-04T12:05:01.6307884Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6308621Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6308726Z warnings.warn( 2025-12-04T12:05:01.6308936Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6309053Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6309166Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6309381Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6309512Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6309608Z graph_break [] 2025-12-04T12:05:01.6309832Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6310561Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6310655Z warnings.warn( 2025-12-04T12:05:01.6310870Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6310978Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6311092Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6311320Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6311436Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6311537Z graph_break [] 2025-12-04T12:05:01.6311746Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6312470Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6312576Z warnings.warn( 2025-12-04T12:05:01.6313333Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5f059c7d0f50bda4.xml - 2025-12-04T12:05:01.6313499Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6314379Z FAILED [0.2602s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6314427Z 2025-12-04T12:05:01.6314650Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6315380Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6315416Z 2025-12-04T12:05:01.6315681Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6315866Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6316064Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.6316164Z Got exit code 1 2025-12-04T12:05:01.6316811Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6317254Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6317841Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c9ff80f1a6820ed2.xml 2025-12-04T12:05:01.6318005Z ============================= test session starts ============================== 2025-12-04T12:05:01.6318358Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6318474Z cachedir: .pytest_cache 2025-12-04T12:05:01.6318996Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6319121Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6319238Z configfile: pytest.ini 2025-12-04T12:05:01.6319832Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6320071Z collecting ... collected 571 items / 53 deselected / 518 selected 2025-12-04T12:05:01.6320217Z stepcurrent: skipping 53 already run items. 2025-12-04T12:05:01.6320330Z Running 242 items in this shard 2025-12-04T12:05:01.6320335Z 2025-12-04T12:05:01.6320930Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_True_float16_cuda PASSED [3.4477s] [ 0%] 2025-12-04T12:05:01.6321622Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0935s] [ 0%] 2025-12-04T12:05:01.6322317Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2689s] [ 0%] 2025-12-04T12:05:01.6322916Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2592s] [ 0%] 2025-12-04T12:05:01.6322927Z 2025-12-04T12:05:01.6323070Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6323450Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6323572Z Traceback (most recent call last): 2025-12-04T12:05:01.6324061Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6324217Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6324737Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6324875Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6325062Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6325068Z 2025-12-04T12:05:01.6325282Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6326074Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6326081Z 2025-12-04T12:05:01.6326346Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6326623Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6326734Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6326848Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6327083Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6327206Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6327310Z graph_break [] 2025-12-04T12:05:01.6327527Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6328256Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6328399Z warnings.warn( 2025-12-04T12:05:01.6328763Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6328882Z Traceback (most recent call last): 2025-12-04T12:05:01.6329365Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6329521Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6330040Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6330162Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6330349Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6330354Z 2025-12-04T12:05:01.6330574Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6331300Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6331308Z 2025-12-04T12:05:01.6331580Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6331796Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6331911Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6332029Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6332248Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6332367Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6332470Z graph_break [] 2025-12-04T12:05:01.6332681Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6333423Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6333525Z warnings.warn( 2025-12-04T12:05:01.6333740Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6333860Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6333972Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6334187Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6334318Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6334414Z graph_break [] 2025-12-04T12:05:01.6334625Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6335363Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6335462Z warnings.warn( 2025-12-04T12:05:01.6335609Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6335974Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6336127Z Traceback (most recent call last): 2025-12-04T12:05:01.6336641Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6336797Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6337347Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6337469Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6337658Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6337663Z 2025-12-04T12:05:01.6337884Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6338697Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6338704Z 2025-12-04T12:05:01.6338980Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6339238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6339349Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6339468Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6339685Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6339808Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6339912Z graph_break [] 2025-12-04T12:05:01.6340126Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6340853Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6340964Z warnings.warn( 2025-12-04T12:05:01.6341178Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6341298Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6341417Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6341639Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6341770Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6341866Z graph_break [] 2025-12-04T12:05:01.6342078Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6342924Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6343026Z warnings.warn( 2025-12-04T12:05:01.6343248Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6343376Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6343537Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6343770Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6343890Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6343985Z graph_break [] 2025-12-04T12:05:01.6344212Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6344939Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6345045Z warnings.warn( 2025-12-04T12:05:01.6345802Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c9ff80f1a6820ed2.xml - 2025-12-04T12:05:01.6345969Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6346809Z FAILED [0.2592s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6346816Z 2025-12-04T12:05:01.6347032Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6347845Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6347851Z 2025-12-04T12:05:01.6348116Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6348321Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6348537Z ============= 1 failed, 1 passed, 53 deselected, 2 rerun in 5.76s ============== 2025-12-04T12:05:01.6348635Z Got exit code 1 2025-12-04T12:05:01.6348739Z Retrying single test... 2025-12-04T12:05:01.6349330Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c2ef3587eca8dd3f.xml 2025-12-04T12:05:01.6349490Z ============================= test session starts ============================== 2025-12-04T12:05:01.6349853Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6349991Z cachedir: .pytest_cache 2025-12-04T12:05:01.6350512Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6350647Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6350756Z configfile: pytest.ini 2025-12-04T12:05:01.6351355Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6351581Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6352390Z stepcurrent: skipping 54 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6352511Z Running 1 items in this shard 2025-12-04T12:05:01.6352516Z 2025-12-04T12:05:01.6353205Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9147s] [100%] 2025-12-04T12:05:01.6353900Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2679s] [100%] 2025-12-04T12:05:01.6354496Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2675s] [100%] 2025-12-04T12:05:01.6354502Z 2025-12-04T12:05:01.6354641Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6355014Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6355135Z Traceback (most recent call last): 2025-12-04T12:05:01.6355619Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6355777Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6356303Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6356433Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6356624Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6356631Z 2025-12-04T12:05:01.6356851Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6357574Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6357579Z 2025-12-04T12:05:01.6357843Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6358066Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6358175Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6358291Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6358458Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6358708Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6358817Z graph_break [] 2025-12-04T12:05:01.6359031Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6359799Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6359908Z warnings.warn( 2025-12-04T12:05:01.6360270Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6360389Z Traceback (most recent call last): 2025-12-04T12:05:01.6360864Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6361015Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6361573Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6361699Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6361889Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6361895Z 2025-12-04T12:05:01.6362115Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6362840Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6362846Z 2025-12-04T12:05:01.6363117Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6363333Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6363443Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6363569Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6363689Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6363913Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6364018Z graph_break [] 2025-12-04T12:05:01.6364235Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6364975Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6365076Z warnings.warn( 2025-12-04T12:05:01.6365287Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6365402Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6365514Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6365732Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6365861Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6365958Z graph_break [] 2025-12-04T12:05:01.6366168Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6366911Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6367011Z warnings.warn( 2025-12-04T12:05:01.6367159Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6367529Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6367645Z Traceback (most recent call last): 2025-12-04T12:05:01.6368134Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6368287Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6368813Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6368937Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6369157Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6369163Z 2025-12-04T12:05:01.6369434Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6370161Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6370197Z 2025-12-04T12:05:01.6370469Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6370687Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6370799Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6370924Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6371045Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6371262Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6371369Z graph_break [] 2025-12-04T12:05:01.6371617Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6372360Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6372461Z warnings.warn( 2025-12-04T12:05:01.6372677Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6372796Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6372910Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6373131Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6373260Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6373356Z graph_break [] 2025-12-04T12:05:01.6373564Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6374306Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6374410Z warnings.warn( 2025-12-04T12:05:01.6374634Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6374743Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6374854Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6375085Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6375205Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6375297Z graph_break [] 2025-12-04T12:05:01.6375521Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6376251Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6376358Z warnings.warn( 2025-12-04T12:05:01.6377111Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c2ef3587eca8dd3f.xml - 2025-12-04T12:05:01.6377285Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6378219Z FAILED [0.2675s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6378229Z 2025-12-04T12:05:01.6378446Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6379181Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6379186Z 2025-12-04T12:05:01.6379449Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6379628Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6379835Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.6379977Z Got exit code 1 2025-12-04T12:05:01.6380094Z Retrying single test... 2025-12-04T12:05:01.6380702Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f520a42aa1f82bba.xml 2025-12-04T12:05:01.6380864Z ============================= test session starts ============================== 2025-12-04T12:05:01.6381253Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6381359Z cachedir: .pytest_cache 2025-12-04T12:05:01.6381878Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6382005Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6382115Z configfile: pytest.ini 2025-12-04T12:05:01.6382713Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6382976Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6383788Z stepcurrent: skipping 54 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6383915Z Running 1 items in this shard 2025-12-04T12:05:01.6383920Z 2025-12-04T12:05:01.6384607Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9211s] [100%] 2025-12-04T12:05:01.6385301Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2687s] [100%] 2025-12-04T12:05:01.6385894Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2698s] [100%] 2025-12-04T12:05:01.6385901Z 2025-12-04T12:05:01.6386043Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6386419Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6386538Z Traceback (most recent call last): 2025-12-04T12:05:01.6387024Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6387179Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6387878Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6388023Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6388218Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6388223Z 2025-12-04T12:05:01.6388444Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6389172Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6389182Z 2025-12-04T12:05:01.6389448Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6389673Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6389787Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6389900Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6390028Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6390248Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6390356Z graph_break [] 2025-12-04T12:05:01.6390570Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6391301Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6391411Z warnings.warn( 2025-12-04T12:05:01.6391968Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6392093Z Traceback (most recent call last): 2025-12-04T12:05:01.6392576Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6392770Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6393293Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6393414Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6393605Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6393611Z 2025-12-04T12:05:01.6393831Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6394556Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6394601Z 2025-12-04T12:05:01.6394876Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6395091Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6395205Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6395329Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6395450Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6395670Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6395777Z graph_break [] 2025-12-04T12:05:01.6395993Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6396728Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6396829Z warnings.warn( 2025-12-04T12:05:01.6397050Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6397167Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6397282Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6397499Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6397628Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6397727Z graph_break [] 2025-12-04T12:05:01.6397947Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6398678Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6398777Z warnings.warn( 2025-12-04T12:05:01.6398926Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6399290Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6399411Z Traceback (most recent call last): 2025-12-04T12:05:01.6399901Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6400055Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6400577Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6400700Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6400890Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6400895Z 2025-12-04T12:05:01.6401114Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6401831Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6401837Z 2025-12-04T12:05:01.6402108Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6402360Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6402501Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6402625Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6402744Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6402958Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6403093Z graph_break [] 2025-12-04T12:05:01.6403306Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6404040Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6404146Z warnings.warn( 2025-12-04T12:05:01.6404359Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6404483Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6404595Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6404847Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6404978Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6405072Z graph_break [] 2025-12-04T12:05:01.6405284Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6406024Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6406124Z warnings.warn( 2025-12-04T12:05:01.6406345Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6406456Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6406566Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6406796Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6406916Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6407007Z graph_break [] 2025-12-04T12:05:01.6407233Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6407959Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6408066Z warnings.warn( 2025-12-04T12:05:01.6408823Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f520a42aa1f82bba.xml - 2025-12-04T12:05:01.6408989Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6409836Z FAILED [0.2698s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6409843Z 2025-12-04T12:05:01.6410057Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6410796Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6410802Z 2025-12-04T12:05:01.6411067Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6411244Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6411454Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.6411552Z Got exit code 1 2025-12-04T12:05:01.6412200Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6412608Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6413176Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-26d6067018821038.xml 2025-12-04T12:05:01.6413400Z ============================= test session starts ============================== 2025-12-04T12:05:01.6413781Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6413899Z cachedir: .pytest_cache 2025-12-04T12:05:01.6414449Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6414573Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6414689Z configfile: pytest.ini 2025-12-04T12:05:01.6415278Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6415500Z collecting ... collected 571 items / 55 deselected / 516 selected 2025-12-04T12:05:01.6415652Z stepcurrent: skipping 55 already run items. 2025-12-04T12:05:01.6415763Z Running 240 items in this shard 2025-12-04T12:05:01.6415802Z 2025-12-04T12:05:01.6416418Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_float16_cuda PASSED [3.4315s] [ 0%] 2025-12-04T12:05:01.6417013Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_True_float16_cuda PASSED [0.3175s] [ 0%] 2025-12-04T12:05:01.6417696Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0918s] [ 1%] 2025-12-04T12:05:01.6418471Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2722s] [ 1%] 2025-12-04T12:05:01.6419064Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2590s] [ 1%] 2025-12-04T12:05:01.6419069Z 2025-12-04T12:05:01.6419219Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6419589Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6419708Z Traceback (most recent call last): 2025-12-04T12:05:01.6420191Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6420348Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6420878Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6420999Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6421191Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6421196Z 2025-12-04T12:05:01.6421419Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6422138Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6422146Z 2025-12-04T12:05:01.6422421Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6422636Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6422750Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6422883Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6423105Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6423223Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6423333Z graph_break [] 2025-12-04T12:05:01.6423551Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6424300Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6424402Z warnings.warn( 2025-12-04T12:05:01.6424809Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6424973Z Traceback (most recent call last): 2025-12-04T12:05:01.6425449Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6425602Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6426164Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6426289Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6426491Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6426497Z 2025-12-04T12:05:01.6426714Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6427432Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6427468Z 2025-12-04T12:05:01.6427750Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6427975Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6428097Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6428212Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6428435Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6428574Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6428674Z graph_break [] 2025-12-04T12:05:01.6428890Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6429637Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6429739Z warnings.warn( 2025-12-04T12:05:01.6429966Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6430083Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6430196Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6430437Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6430556Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6430654Z graph_break [] 2025-12-04T12:05:01.6430883Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6431615Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6431727Z warnings.warn( 2025-12-04T12:05:01.6431868Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6432224Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6432355Z Traceback (most recent call last): 2025-12-04T12:05:01.6432832Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6432986Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6433514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6433637Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6433838Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6433844Z 2025-12-04T12:05:01.6434057Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6434772Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6434778Z 2025-12-04T12:05:01.6435056Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6435271Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6435431Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6435574Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6435794Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6435923Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6436051Z graph_break [] 2025-12-04T12:05:01.6436262Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6437003Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6437107Z warnings.warn( 2025-12-04T12:05:01.6437326Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6437435Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6437546Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6437774Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6437927Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6438022Z graph_break [] 2025-12-04T12:05:01.6438247Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6438976Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6439075Z warnings.warn( 2025-12-04T12:05:01.6439296Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6439406Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6439528Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6439744Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6439864Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6439972Z graph_break [] 2025-12-04T12:05:01.6440181Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6440915Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6441027Z warnings.warn( 2025-12-04T12:05:01.6441775Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-26d6067018821038.xml - 2025-12-04T12:05:01.6441957Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6442784Z FAILED [0.2590s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6442790Z 2025-12-04T12:05:01.6443005Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6443732Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6443739Z 2025-12-04T12:05:01.6444004Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6444189Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6444401Z ============= 1 failed, 2 passed, 55 deselected, 2 rerun in 6.06s ============== 2025-12-04T12:05:01.6444496Z Got exit code 1 2025-12-04T12:05:01.6444611Z Retrying single test... 2025-12-04T12:05:01.6445184Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-323433d4d00bc615.xml 2025-12-04T12:05:01.6445353Z ============================= test session starts ============================== 2025-12-04T12:05:01.6445705Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6445813Z cachedir: .pytest_cache 2025-12-04T12:05:01.6446379Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6446536Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6446644Z configfile: pytest.ini 2025-12-04T12:05:01.6447246Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6447508Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6448325Z stepcurrent: skipping 57 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6448438Z Running 1 items in this shard 2025-12-04T12:05:01.6448443Z 2025-12-04T12:05:01.6449122Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9038s] [100%] 2025-12-04T12:05:01.6449863Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2644s] [100%] 2025-12-04T12:05:01.6450454Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2601s] [100%] 2025-12-04T12:05:01.6450462Z 2025-12-04T12:05:01.6450614Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6450973Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6451095Z Traceback (most recent call last): 2025-12-04T12:05:01.6451586Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6451739Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6452278Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6452405Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6452595Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6452601Z 2025-12-04T12:05:01.6452824Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6453542Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6453548Z 2025-12-04T12:05:01.6453822Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6454039Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6454149Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6454271Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6454391Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6454607Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6454718Z graph_break [] 2025-12-04T12:05:01.6454934Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6455680Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6455783Z warnings.warn( 2025-12-04T12:05:01.6456141Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6456272Z Traceback (most recent call last): 2025-12-04T12:05:01.6456746Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6456898Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6457427Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6457583Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6457787Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6457823Z 2025-12-04T12:05:01.6458040Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6458840Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6458898Z 2025-12-04T12:05:01.6459165Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6459382Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6459505Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6459619Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6459739Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6459969Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6460098Z graph_break [] 2025-12-04T12:05:01.6460311Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6461057Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6461159Z warnings.warn( 2025-12-04T12:05:01.6461382Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6461492Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6461603Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6461834Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6461954Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6462045Z graph_break [] 2025-12-04T12:05:01.6462266Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6462994Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6463110Z warnings.warn( 2025-12-04T12:05:01.6463253Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6463613Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6463748Z Traceback (most recent call last): 2025-12-04T12:05:01.6464223Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6464374Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6464904Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6465024Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6465226Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6465232Z 2025-12-04T12:05:01.6465451Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6466174Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6466179Z 2025-12-04T12:05:01.6466460Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6466677Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6466800Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6466914Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6467037Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6467266Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6467361Z graph_break [] 2025-12-04T12:05:01.6467572Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6468344Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6468480Z warnings.warn( 2025-12-04T12:05:01.6468704Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6468814Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6468957Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6469185Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6469305Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6469400Z graph_break [] 2025-12-04T12:05:01.6469624Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6470353Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6470460Z warnings.warn( 2025-12-04T12:05:01.6470670Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6470809Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6470934Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6471150Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6471265Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6471375Z graph_break [] 2025-12-04T12:05:01.6471588Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6472320Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6472430Z warnings.warn( 2025-12-04T12:05:01.6473182Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-323433d4d00bc615.xml - 2025-12-04T12:05:01.6473363Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6474204Z FAILED [0.2601s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6474210Z 2025-12-04T12:05:01.6474424Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6475158Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6475164Z 2025-12-04T12:05:01.6475526Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6475715Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6475914Z ================== 1 failed, 294 deselected, 2 rerun in 5.10s ================== 2025-12-04T12:05:01.6476012Z Got exit code 1 2025-12-04T12:05:01.6476159Z Retrying single test... 2025-12-04T12:05:01.6476784Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-171ced1c6f8950ca.xml 2025-12-04T12:05:01.6477014Z ============================= test session starts ============================== 2025-12-04T12:05:01.6477374Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6477484Z cachedir: .pytest_cache 2025-12-04T12:05:01.6478084Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6478209Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6478318Z configfile: pytest.ini 2025-12-04T12:05:01.6478926Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6479155Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6480122Z stepcurrent: skipping 57 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6480238Z Running 1 items in this shard 2025-12-04T12:05:01.6480243Z 2025-12-04T12:05:01.6480923Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9076s] [100%] 2025-12-04T12:05:01.6481723Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2685s] [100%] 2025-12-04T12:05:01.6482310Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2675s] [100%] 2025-12-04T12:05:01.6482317Z 2025-12-04T12:05:01.6482474Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6482881Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6483006Z Traceback (most recent call last): 2025-12-04T12:05:01.6483495Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6483650Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6484184Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6484310Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6484502Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6484508Z 2025-12-04T12:05:01.6484729Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6485521Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6485529Z 2025-12-04T12:05:01.6485810Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6486035Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6486148Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6486274Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6486396Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6486618Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6486722Z graph_break [] 2025-12-04T12:05:01.6486935Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6487683Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6487968Z warnings.warn( 2025-12-04T12:05:01.6488332Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6488472Z Traceback (most recent call last): 2025-12-04T12:05:01.6488949Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6489103Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6489633Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6489756Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6489960Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6489966Z 2025-12-04T12:05:01.6490180Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6490894Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6490912Z 2025-12-04T12:05:01.6491176Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6491515Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6491640Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6491754Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6491872Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6492160Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6492259Z graph_break [] 2025-12-04T12:05:01.6492474Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6493220Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6493320Z warnings.warn( 2025-12-04T12:05:01.6493545Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6493655Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6493812Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6494046Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6494167Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6494263Z graph_break [] 2025-12-04T12:05:01.6494487Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6495221Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6495331Z warnings.warn( 2025-12-04T12:05:01.6495473Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6495833Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6495965Z Traceback (most recent call last): 2025-12-04T12:05:01.6496442Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6496598Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6497130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6497251Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6497454Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6497460Z 2025-12-04T12:05:01.6497670Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6498446Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6498453Z 2025-12-04T12:05:01.6498731Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6498947Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6499070Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6499186Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6499306Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6499537Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6499637Z graph_break [] 2025-12-04T12:05:01.6499851Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6500591Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6500691Z warnings.warn( 2025-12-04T12:05:01.6500918Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6501026Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6501140Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6501368Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6501487Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6501622Z graph_break [] 2025-12-04T12:05:01.6501879Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6502608Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6502749Z warnings.warn( 2025-12-04T12:05:01.6502960Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6503068Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6503191Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6503407Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6503528Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6503634Z graph_break [] 2025-12-04T12:05:01.6503846Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6504571Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6504710Z warnings.warn( 2025-12-04T12:05:01.6505469Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-171ced1c6f8950ca.xml - 2025-12-04T12:05:01.6505651Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6506480Z FAILED [0.2675s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6506486Z 2025-12-04T12:05:01.6506698Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6507422Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6507430Z 2025-12-04T12:05:01.6507697Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6507886Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6508087Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.6508186Z Got exit code 1 2025-12-04T12:05:01.6508832Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6509243Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6509830Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-25dfec8b4de20cd4.xml 2025-12-04T12:05:01.6509988Z ============================= test session starts ============================== 2025-12-04T12:05:01.6510344Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6510465Z cachedir: .pytest_cache 2025-12-04T12:05:01.6510985Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6511119Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6511224Z configfile: pytest.ini 2025-12-04T12:05:01.6511813Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6512047Z collecting ... collected 571 items / 58 deselected / 513 selected 2025-12-04T12:05:01.6512188Z stepcurrent: skipping 58 already run items. 2025-12-04T12:05:01.6512300Z Running 237 items in this shard 2025-12-04T12:05:01.6512305Z 2025-12-04T12:05:01.6512998Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9167s] [ 0%] 2025-12-04T12:05:01.6513735Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2667s] [ 0%] 2025-12-04T12:05:01.6514328Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2566s] [ 0%] 2025-12-04T12:05:01.6514364Z 2025-12-04T12:05:01.6514505Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6514870Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6514993Z Traceback (most recent call last): 2025-12-04T12:05:01.6515468Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6515628Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6516153Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6516307Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6516507Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6516513Z 2025-12-04T12:05:01.6516727Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6517453Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6517458Z 2025-12-04T12:05:01.6517723Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6517941Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6518060Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6518174Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6518305Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6518535Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6518632Z graph_break [] 2025-12-04T12:05:01.6518861Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6519597Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6519698Z warnings.warn( 2025-12-04T12:05:01.6520062Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6520182Z Traceback (most recent call last): 2025-12-04T12:05:01.6520669Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6520821Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6521340Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6521477Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6521668Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6521674Z 2025-12-04T12:05:01.6521887Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6522607Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6522614Z 2025-12-04T12:05:01.6522878Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6523106Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6523217Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6523330Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6523462Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6523682Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6523784Z graph_break [] 2025-12-04T12:05:01.6524069Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6524801Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6524942Z warnings.warn( 2025-12-04T12:05:01.6525159Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6525268Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6525390Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6525609Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6525728Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6525832Z graph_break [] 2025-12-04T12:05:01.6526042Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6526784Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6526917Z warnings.warn( 2025-12-04T12:05:01.6527060Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6527424Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6527545Z Traceback (most recent call last): 2025-12-04T12:05:01.6528028Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6528194Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6528712Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6528845Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6529036Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6529044Z 2025-12-04T12:05:01.6529259Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6529990Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6529995Z 2025-12-04T12:05:01.6530263Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6530488Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6530599Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6530713Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6530844Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6531061Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6531154Z graph_break [] 2025-12-04T12:05:01.6531381Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6532119Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6532229Z warnings.warn( 2025-12-04T12:05:01.6532443Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6532557Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6532680Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6532899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6533019Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6533127Z graph_break [] 2025-12-04T12:05:01.6533340Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6534073Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6534175Z warnings.warn( 2025-12-04T12:05:01.6534424Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6534544Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6534706Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6534924Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6535056Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6535188Z graph_break [] 2025-12-04T12:05:01.6535395Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6536139Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6536238Z warnings.warn( 2025-12-04T12:05:01.6537000Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-25dfec8b4de20cd4.xml - 2025-12-04T12:05:01.6537171Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6538028Z FAILED [0.2566s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6538046Z 2025-12-04T12:05:01.6538351Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6539067Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6539072Z 2025-12-04T12:05:01.6539344Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6539522Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6539719Z ================== 1 failed, 58 deselected, 2 rerun in 5.13s =================== 2025-12-04T12:05:01.6539830Z Got exit code 1 2025-12-04T12:05:01.6539937Z Retrying single test... 2025-12-04T12:05:01.6540518Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-301e5b291449627e.xml 2025-12-04T12:05:01.6540680Z ============================= test session starts ============================== 2025-12-04T12:05:01.6541032Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6541149Z cachedir: .pytest_cache 2025-12-04T12:05:01.6541670Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6541803Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6541909Z configfile: pytest.ini 2025-12-04T12:05:01.6542501Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6542737Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6543549Z stepcurrent: skipping 58 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6543663Z Running 1 items in this shard 2025-12-04T12:05:01.6543668Z 2025-12-04T12:05:01.6544357Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9118s] [100%] 2025-12-04T12:05:01.6545033Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2673s] [100%] 2025-12-04T12:05:01.6545626Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2588s] [100%] 2025-12-04T12:05:01.6545632Z 2025-12-04T12:05:01.6545771Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6546177Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6546326Z Traceback (most recent call last): 2025-12-04T12:05:01.6546806Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6547001Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6547524Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6547646Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6547841Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6547846Z 2025-12-04T12:05:01.6548062Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6548781Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6548817Z 2025-12-04T12:05:01.6549085Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6549306Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6549424Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6549538Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6549668Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6549887Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6549986Z graph_break [] 2025-12-04T12:05:01.6550208Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6550938Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6551038Z warnings.warn( 2025-12-04T12:05:01.6551399Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6551524Z Traceback (most recent call last): 2025-12-04T12:05:01.6552006Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6552160Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6552682Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6552813Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6553003Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6553009Z 2025-12-04T12:05:01.6553224Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6553943Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6553949Z 2025-12-04T12:05:01.6554219Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6554445Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6554554Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6554667Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6554796Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6555019Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6555116Z graph_break [] 2025-12-04T12:05:01.6555341Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6556071Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6556180Z warnings.warn( 2025-12-04T12:05:01.6556393Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6556503Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6556660Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6556909Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6557031Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6557139Z graph_break [] 2025-12-04T12:05:01.6557351Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6558117Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6558218Z warnings.warn( 2025-12-04T12:05:01.6558360Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6558730Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6558851Z Traceback (most recent call last): 2025-12-04T12:05:01.6559324Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6559524Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6560042Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6560175Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6560369Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6560374Z 2025-12-04T12:05:01.6560589Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6561313Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6561318Z 2025-12-04T12:05:01.6561580Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6561807Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6561920Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6562036Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6562170Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6562387Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6562484Z graph_break [] 2025-12-04T12:05:01.6562714Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6563449Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6563561Z warnings.warn( 2025-12-04T12:05:01.6563774Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6563884Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6564007Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6564225Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6564348Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6564457Z graph_break [] 2025-12-04T12:05:01.6564671Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6565411Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6565513Z warnings.warn( 2025-12-04T12:05:01.6565725Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6565845Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6565956Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6566175Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6566306Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6566401Z graph_break [] 2025-12-04T12:05:01.6566614Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6567489Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6567591Z warnings.warn( 2025-12-04T12:05:01.6568350Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-301e5b291449627e.xml - 2025-12-04T12:05:01.6568549Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6569377Z FAILED [0.2588s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6569396Z 2025-12-04T12:05:01.6569613Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6570325Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6570363Z 2025-12-04T12:05:01.6570641Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6570818Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6571027Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.6571130Z Got exit code 1 2025-12-04T12:05:01.6571238Z Retrying single test... 2025-12-04T12:05:01.6571827Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0b50a11bd83abca6.xml 2025-12-04T12:05:01.6571990Z ============================= test session starts ============================== 2025-12-04T12:05:01.6572341Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6572461Z cachedir: .pytest_cache 2025-12-04T12:05:01.6572983Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6573122Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6573232Z configfile: pytest.ini 2025-12-04T12:05:01.6573828Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6574071Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6574870Z stepcurrent: skipping 58 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6574983Z Running 1 items in this shard 2025-12-04T12:05:01.6575001Z 2025-12-04T12:05:01.6575677Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9176s] [100%] 2025-12-04T12:05:01.6576352Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2663s] [100%] 2025-12-04T12:05:01.6576960Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2599s] [100%] 2025-12-04T12:05:01.6576968Z 2025-12-04T12:05:01.6577109Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6577475Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6577597Z Traceback (most recent call last): 2025-12-04T12:05:01.6578071Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6582952Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6583562Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6583807Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6584015Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6584061Z 2025-12-04T12:05:01.6584284Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6585017Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6585061Z 2025-12-04T12:05:01.6585330Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6585557Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6585675Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6585787Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6585905Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6586136Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6586267Z graph_break [] 2025-12-04T12:05:01.6586493Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6587240Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6587338Z warnings.warn( 2025-12-04T12:05:01.6587930Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6588059Z Traceback (most recent call last): 2025-12-04T12:05:01.6588534Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6588701Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6589224Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6589353Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6589548Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6589554Z 2025-12-04T12:05:01.6589771Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6590493Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6590501Z 2025-12-04T12:05:01.6590770Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6590998Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6591113Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6591222Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6591353Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6591572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6591667Z graph_break [] 2025-12-04T12:05:01.6591894Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6592637Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6592747Z warnings.warn( 2025-12-04T12:05:01.6592960Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6593069Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6593188Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6593404Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6593523Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6593628Z graph_break [] 2025-12-04T12:05:01.6593842Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6594577Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6594752Z warnings.warn( 2025-12-04T12:05:01.6594936Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6595304Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6595422Z Traceback (most recent call last): 2025-12-04T12:05:01.6595940Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6596100Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6596617Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6596750Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6596939Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6596945Z 2025-12-04T12:05:01.6597160Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6597934Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6597940Z 2025-12-04T12:05:01.6598205Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6598427Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6598534Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6598647Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6598775Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6598990Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6599086Z graph_break [] 2025-12-04T12:05:01.6599307Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6600038Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6600137Z warnings.warn( 2025-12-04T12:05:01.6600365Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6600472Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6600591Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6600809Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6600926Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6601026Z graph_break [] 2025-12-04T12:05:01.6601236Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6601957Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6602064Z warnings.warn( 2025-12-04T12:05:01.6602276Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6602394Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6602506Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6602728Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6602855Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6602949Z graph_break [] 2025-12-04T12:05:01.6603160Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6603895Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6603993Z warnings.warn( 2025-12-04T12:05:01.6604761Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0b50a11bd83abca6.xml - 2025-12-04T12:05:01.6604931Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6605820Z FAILED [0.2599s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6605829Z 2025-12-04T12:05:01.6606058Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6606770Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6606805Z 2025-12-04T12:05:01.6607081Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6607261Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6607462Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.6607574Z Got exit code 1 2025-12-04T12:05:01.6608207Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6608661Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6609230Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28422965aaa270fa.xml 2025-12-04T12:05:01.6609393Z ============================= test session starts ============================== 2025-12-04T12:05:01.6609754Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6609861Z cachedir: .pytest_cache 2025-12-04T12:05:01.6610381Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6610514Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6610621Z configfile: pytest.ini 2025-12-04T12:05:01.6611220Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6611448Z collecting ... collected 571 items / 59 deselected / 512 selected 2025-12-04T12:05:01.6611590Z stepcurrent: skipping 59 already run items. 2025-12-04T12:05:01.6611713Z Running 236 items in this shard 2025-12-04T12:05:01.6611718Z 2025-12-04T12:05:01.6612313Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_float16_cuda PASSED [3.4886s] [ 0%] 2025-12-04T12:05:01.6613002Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0937s] [ 0%] 2025-12-04T12:05:01.6613671Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2676s] [ 0%] 2025-12-04T12:05:01.6614255Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2658s] [ 0%] 2025-12-04T12:05:01.6614275Z 2025-12-04T12:05:01.6614418Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6614769Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6614908Z Traceback (most recent call last): 2025-12-04T12:05:01.6615383Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6615536Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6616069Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6616191Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6616387Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6616394Z 2025-12-04T12:05:01.6616611Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6617388Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6617393Z 2025-12-04T12:05:01.6617666Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6617916Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6618033Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6618217Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6618441Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6618571Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6618671Z graph_break [] 2025-12-04T12:05:01.6618888Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6619639Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6619793Z warnings.warn( 2025-12-04T12:05:01.6620151Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6620280Z Traceback (most recent call last): 2025-12-04T12:05:01.6620755Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6620920Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6621440Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6621559Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6621755Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6621761Z 2025-12-04T12:05:01.6621974Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6622696Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6622707Z 2025-12-04T12:05:01.6622970Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6623185Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6623309Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6623422Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6623642Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6623768Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6623861Z graph_break [] 2025-12-04T12:05:01.6624086Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6624821Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6624922Z warnings.warn( 2025-12-04T12:05:01.6625147Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6625259Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6625370Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6625595Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6625717Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6625822Z graph_break [] 2025-12-04T12:05:01.6626036Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6626762Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6626870Z warnings.warn( 2025-12-04T12:05:01.6627010Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6627368Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6627542Z Traceback (most recent call last): 2025-12-04T12:05:01.6628046Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6628209Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6628755Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6628874Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6629072Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6629078Z 2025-12-04T12:05:01.6629289Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6630012Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6630017Z 2025-12-04T12:05:01.6630317Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6630535Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6630656Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6630766Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6630982Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6631111Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6631205Z graph_break [] 2025-12-04T12:05:01.6631427Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6632159Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6632256Z warnings.warn( 2025-12-04T12:05:01.6632481Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6632591Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6632705Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6632937Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6633056Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6633152Z graph_break [] 2025-12-04T12:05:01.6633371Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6634101Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6634208Z warnings.warn( 2025-12-04T12:05:01.6634419Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6634528Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6634646Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6634865Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6634985Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6635090Z graph_break [] 2025-12-04T12:05:01.6635308Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6636043Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6636142Z warnings.warn( 2025-12-04T12:05:01.6636899Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28422965aaa270fa.xml - 2025-12-04T12:05:01.6637076Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6637898Z FAILED [0.2658s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6637904Z 2025-12-04T12:05:01.6638129Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6638908Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6638914Z 2025-12-04T12:05:01.6639177Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6639401Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6639615Z ============= 1 failed, 1 passed, 59 deselected, 2 rerun in 5.81s ============== 2025-12-04T12:05:01.6639721Z Got exit code 1 2025-12-04T12:05:01.6639825Z Retrying single test... 2025-12-04T12:05:01.6640405Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cc1d7ca9ed71a647.xml 2025-12-04T12:05:01.6640574Z ============================= test session starts ============================== 2025-12-04T12:05:01.6640925Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6641067Z cachedir: .pytest_cache 2025-12-04T12:05:01.6641606Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6641730Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6641849Z configfile: pytest.ini 2025-12-04T12:05:01.6642442Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6642668Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6643475Z stepcurrent: skipping 60 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6643589Z Running 1 items in this shard 2025-12-04T12:05:01.6643595Z 2025-12-04T12:05:01.6644279Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9108s] [100%] 2025-12-04T12:05:01.6644955Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2646s] [100%] 2025-12-04T12:05:01.6645542Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2609s] [100%] 2025-12-04T12:05:01.6645557Z 2025-12-04T12:05:01.6645697Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6646053Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6646187Z Traceback (most recent call last): 2025-12-04T12:05:01.6646660Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6646816Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6647358Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6647479Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6647677Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6647686Z 2025-12-04T12:05:01.6647902Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6648617Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6648622Z 2025-12-04T12:05:01.6648892Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6649106Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6649223Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6649334Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6649492Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6649751Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6649850Z graph_break [] 2025-12-04T12:05:01.6650064Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6650843Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6650945Z warnings.warn( 2025-12-04T12:05:01.6651306Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6651424Z Traceback (most recent call last): 2025-12-04T12:05:01.6651897Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6652058Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6652610Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6652734Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6652931Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6652937Z 2025-12-04T12:05:01.6653149Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6653871Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6653877Z 2025-12-04T12:05:01.6654141Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6654359Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6654479Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6654592Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6654712Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6654947Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6655044Z graph_break [] 2025-12-04T12:05:01.6655270Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6656006Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6656108Z warnings.warn( 2025-12-04T12:05:01.6656329Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6656439Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6656552Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6656776Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6656895Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6657000Z graph_break [] 2025-12-04T12:05:01.6657216Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6657951Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6658057Z warnings.warn( 2025-12-04T12:05:01.6658292Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6658651Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6658782Z Traceback (most recent call last): 2025-12-04T12:05:01.6659259Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6659423Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6659945Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6660069Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6660318Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6660324Z 2025-12-04T12:05:01.6660570Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6661293Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6661347Z 2025-12-04T12:05:01.6661613Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6661828Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6661950Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6662063Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6662182Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6662411Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6662506Z graph_break [] 2025-12-04T12:05:01.6662731Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6663498Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6663603Z warnings.warn( 2025-12-04T12:05:01.6663823Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6663933Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6664044Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6664272Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6664391Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6664495Z graph_break [] 2025-12-04T12:05:01.6664706Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6665434Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6665546Z warnings.warn( 2025-12-04T12:05:01.6665759Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6665864Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6665990Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6666206Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6666325Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6666429Z graph_break [] 2025-12-04T12:05:01.6666638Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6667372Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6667471Z warnings.warn( 2025-12-04T12:05:01.6668221Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cc1d7ca9ed71a647.xml - 2025-12-04T12:05:01.6668403Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6669221Z FAILED [0.2609s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6669229Z 2025-12-04T12:05:01.6669452Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6670163Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6670168Z 2025-12-04T12:05:01.6670432Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6670618Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6670816Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.6670968Z Got exit code 1 2025-12-04T12:05:01.6671075Z Retrying single test... 2025-12-04T12:05:01.6671684Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28fdda0d819ffd5b.xml 2025-12-04T12:05:01.6671860Z ============================= test session starts ============================== 2025-12-04T12:05:01.6672241Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6672344Z cachedir: .pytest_cache 2025-12-04T12:05:01.6672880Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6673001Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6673120Z configfile: pytest.ini 2025-12-04T12:05:01.6673713Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6673975Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6674786Z stepcurrent: skipping 60 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6674902Z Running 1 items in this shard 2025-12-04T12:05:01.6674907Z 2025-12-04T12:05:01.6675597Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9222s] [100%] 2025-12-04T12:05:01.6676270Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2635s] [100%] 2025-12-04T12:05:01.6676854Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2615s] [100%] 2025-12-04T12:05:01.6676871Z 2025-12-04T12:05:01.6677017Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6677373Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6677503Z Traceback (most recent call last): 2025-12-04T12:05:01.6677978Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6678131Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6678664Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6678788Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6678987Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6678993Z 2025-12-04T12:05:01.6679209Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6679929Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6679937Z 2025-12-04T12:05:01.6680216Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6680433Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6680555Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6680667Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6680785Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6681011Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6681108Z graph_break [] 2025-12-04T12:05:01.6681320Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6682065Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6682168Z warnings.warn( 2025-12-04T12:05:01.6682564Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6682712Z Traceback (most recent call last): 2025-12-04T12:05:01.6683186Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6683379Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6683893Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6684013Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6684212Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6684217Z 2025-12-04T12:05:01.6684431Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6685149Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6685187Z 2025-12-04T12:05:01.6685455Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6685670Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6685788Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6685904Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6686023Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6686251Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6686347Z graph_break [] 2025-12-04T12:05:01.6686572Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6687301Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6687401Z warnings.warn( 2025-12-04T12:05:01.6687623Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6687930Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6688060Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6688280Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6688398Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6688508Z graph_break [] 2025-12-04T12:05:01.6688721Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6689447Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6689559Z warnings.warn( 2025-12-04T12:05:01.6689699Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6690051Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6690181Z Traceback (most recent call last): 2025-12-04T12:05:01.6690659Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6690826Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6691343Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6691463Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6691664Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6691670Z 2025-12-04T12:05:01.6691881Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6692607Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6692613Z 2025-12-04T12:05:01.6692878Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6693167Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6693287Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6693441Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6693562Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6693793Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6693928Z graph_break [] 2025-12-04T12:05:01.6694148Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6694879Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6694976Z warnings.warn( 2025-12-04T12:05:01.6695199Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6695310Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6695420Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6695698Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6695815Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6695920Z graph_break [] 2025-12-04T12:05:01.6696136Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6696859Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6696971Z warnings.warn( 2025-12-04T12:05:01.6697183Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6697289Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6697406Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6697623Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6697754Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6697848Z graph_break [] 2025-12-04T12:05:01.6698056Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6698865Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6698964Z warnings.warn( 2025-12-04T12:05:01.6699722Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28fdda0d819ffd5b.xml - 2025-12-04T12:05:01.6699907Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6700731Z FAILED [0.2615s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6700737Z 2025-12-04T12:05:01.6700960Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6701677Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6701687Z 2025-12-04T12:05:01.6701953Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6702125Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6702323Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.6702430Z Got exit code 1 2025-12-04T12:05:01.6703057Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6703466Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6704051Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2efd0ac46f0ffe8c.xml 2025-12-04T12:05:01.6704248Z ============================= test session starts ============================== 2025-12-04T12:05:01.6704650Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6704757Z cachedir: .pytest_cache 2025-12-04T12:05:01.6705273Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6705436Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6705543Z configfile: pytest.ini 2025-12-04T12:05:01.6706130Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6706367Z collecting ... collected 571 items / 61 deselected / 510 selected 2025-12-04T12:05:01.6706508Z stepcurrent: skipping 61 already run items. 2025-12-04T12:05:01.6706623Z Running 234 items in this shard 2025-12-04T12:05:01.6706628Z 2025-12-04T12:05:01.6707342Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9212s] [ 0%] 2025-12-04T12:05:01.6708014Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2688s] [ 0%] 2025-12-04T12:05:01.6708604Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2636s] [ 0%] 2025-12-04T12:05:01.6708611Z 2025-12-04T12:05:01.6708747Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6709099Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6709217Z Traceback (most recent call last): 2025-12-04T12:05:01.6709690Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6709851Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6710379Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6710508Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6710695Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6710702Z 2025-12-04T12:05:01.6710912Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6711626Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6711632Z 2025-12-04T12:05:01.6711894Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6712115Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6712221Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6712337Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6712465Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6712688Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6712783Z graph_break [] 2025-12-04T12:05:01.6713003Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6713737Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6713841Z warnings.warn( 2025-12-04T12:05:01.6714195Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6714316Z Traceback (most recent call last): 2025-12-04T12:05:01.6714796Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6714948Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6715497Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6715654Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6715844Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6715849Z 2025-12-04T12:05:01.6716102Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6716813Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6716819Z 2025-12-04T12:05:01.6717081Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6717307Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6717417Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6717539Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6717657Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6717905Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6718008Z graph_break [] 2025-12-04T12:05:01.6718221Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6718951Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6719063Z warnings.warn( 2025-12-04T12:05:01.6719275Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6719390Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6719498Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6719717Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6719846Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6719939Z graph_break [] 2025-12-04T12:05:01.6720147Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6720892Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6720990Z warnings.warn( 2025-12-04T12:05:01.6721129Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6721492Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6721610Z Traceback (most recent call last): 2025-12-04T12:05:01.6722095Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6722252Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6722767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6722897Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6723089Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6723095Z 2025-12-04T12:05:01.6723314Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6724023Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6724031Z 2025-12-04T12:05:01.6724294Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6724517Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6724625Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6724744Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6724863Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6725078Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6725184Z graph_break [] 2025-12-04T12:05:01.6725429Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6726190Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6726302Z warnings.warn( 2025-12-04T12:05:01.6726516Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6726653Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6726774Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6726989Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6727120Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6727215Z graph_break [] 2025-12-04T12:05:01.6727424Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6728158Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6728291Z warnings.warn( 2025-12-04T12:05:01.6728503Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6728619Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6728728Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6728953Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6729072Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6729162Z graph_break [] 2025-12-04T12:05:01.6729383Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6730109Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6730204Z warnings.warn( 2025-12-04T12:05:01.6730973Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2efd0ac46f0ffe8c.xml - 2025-12-04T12:05:01.6731142Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6731972Z FAILED [0.2636s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6731980Z 2025-12-04T12:05:01.6732194Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6732900Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6732914Z 2025-12-04T12:05:01.6733174Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6733348Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6733549Z ================== 1 failed, 61 deselected, 2 rerun in 5.13s =================== 2025-12-04T12:05:01.6733651Z Got exit code 1 2025-12-04T12:05:01.6733755Z Retrying single test... 2025-12-04T12:05:01.6734340Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a06616e9f3ca63a0.xml 2025-12-04T12:05:01.6734501Z ============================= test session starts ============================== 2025-12-04T12:05:01.6734858Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6734966Z cachedir: .pytest_cache 2025-12-04T12:05:01.6735484Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6735611Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6735715Z configfile: pytest.ini 2025-12-04T12:05:01.6736299Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6736566Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6737390Z stepcurrent: skipping 61 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6737535Z Running 1 items in this shard 2025-12-04T12:05:01.6737540Z 2025-12-04T12:05:01.6738285Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9067s] [100%] 2025-12-04T12:05:01.6738953Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2656s] [100%] 2025-12-04T12:05:01.6739541Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2636s] [100%] 2025-12-04T12:05:01.6739547Z 2025-12-04T12:05:01.6739797Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6740157Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6740277Z Traceback (most recent call last): 2025-12-04T12:05:01.6740756Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6740923Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6741446Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6741576Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6741765Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6741770Z 2025-12-04T12:05:01.6741984Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6742706Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6742714Z 2025-12-04T12:05:01.6742979Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6743202Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6743315Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6743425Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6743552Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6743773Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6743867Z graph_break [] 2025-12-04T12:05:01.6744093Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6744825Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6744934Z warnings.warn( 2025-12-04T12:05:01.6745288Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6745413Z Traceback (most recent call last): 2025-12-04T12:05:01.6745891Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6746046Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6746562Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6746692Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6746885Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6746890Z 2025-12-04T12:05:01.6747108Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6747816Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6747874Z 2025-12-04T12:05:01.6748166Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6748396Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6748505Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6748655Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6748775Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6748994Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6749100Z graph_break [] 2025-12-04T12:05:01.6749314Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6750041Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6750149Z warnings.warn( 2025-12-04T12:05:01.6750359Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6750518Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6750637Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6750853Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6750983Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6751079Z graph_break [] 2025-12-04T12:05:01.6751283Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6752023Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6752120Z warnings.warn( 2025-12-04T12:05:01.6752266Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6752617Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6752731Z Traceback (most recent call last): 2025-12-04T12:05:01.6753217Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6753373Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6753889Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6754021Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6754205Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6754213Z 2025-12-04T12:05:01.6754433Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6755136Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6755142Z 2025-12-04T12:05:01.6755403Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6755633Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6755743Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6755865Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6755983Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6756196Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6756305Z graph_break [] 2025-12-04T12:05:01.6756517Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6757240Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6757346Z warnings.warn( 2025-12-04T12:05:01.6757557Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6757662Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6757784Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6758034Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6758160Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6758284Z graph_break [] 2025-12-04T12:05:01.6758493Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6759227Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6759361Z warnings.warn( 2025-12-04T12:05:01.6759569Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6759684Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6759792Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6760016Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6760132Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6760224Z graph_break [] 2025-12-04T12:05:01.6760442Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6761198Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6761294Z warnings.warn( 2025-12-04T12:05:01.6762058Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a06616e9f3ca63a0.xml - 2025-12-04T12:05:01.6762223Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6763052Z FAILED [0.2636s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6763058Z 2025-12-04T12:05:01.6763276Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6763984Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6764002Z 2025-12-04T12:05:01.6764264Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6764437Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6764640Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.6764736Z Got exit code 1 2025-12-04T12:05:01.6764844Z Retrying single test... 2025-12-04T12:05:01.6765436Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ac835e731ca0e136.xml 2025-12-04T12:05:01.6765601Z ============================= test session starts ============================== 2025-12-04T12:05:01.6765961Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6766068Z cachedir: .pytest_cache 2025-12-04T12:05:01.6766594Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6766726Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6766832Z configfile: pytest.ini 2025-12-04T12:05:01.6767422Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6767657Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6768456Z stepcurrent: skipping 61 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6768575Z Running 1 items in this shard 2025-12-04T12:05:01.6768580Z 2025-12-04T12:05:01.6769250Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9164s] [100%] 2025-12-04T12:05:01.6769985Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2643s] [100%] 2025-12-04T12:05:01.6770575Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2614s] [100%] 2025-12-04T12:05:01.6770611Z 2025-12-04T12:05:01.6770751Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6771105Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6771225Z Traceback (most recent call last): 2025-12-04T12:05:01.6771697Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6771859Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6772379Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6772538Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6772729Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6772735Z 2025-12-04T12:05:01.6772948Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6773668Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6773673Z 2025-12-04T12:05:01.6773937Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6774158Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6774268Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6774380Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6774507Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6774730Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6774825Z graph_break [] 2025-12-04T12:05:01.6775048Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6775776Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6775884Z warnings.warn( 2025-12-04T12:05:01.6776233Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6776352Z Traceback (most recent call last): 2025-12-04T12:05:01.6776834Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6776988Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6777509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6777647Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6777835Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6777841Z 2025-12-04T12:05:01.6778062Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6778855Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6778865Z 2025-12-04T12:05:01.6779131Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6779361Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6779470Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6779590Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6779710Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6779925Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6780036Z graph_break [] 2025-12-04T12:05:01.6780288Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6781048Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6781185Z warnings.warn( 2025-12-04T12:05:01.6781395Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6781512Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6781623Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6781838Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6781968Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6782061Z graph_break [] 2025-12-04T12:05:01.6782269Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6783012Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6783156Z warnings.warn( 2025-12-04T12:05:01.6783309Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6783657Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6783780Z Traceback (most recent call last): 2025-12-04T12:05:01.6784265Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6784417Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6784930Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6785059Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6785244Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6785249Z 2025-12-04T12:05:01.6785480Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6786188Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6786195Z 2025-12-04T12:05:01.6786457Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6786682Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6786794Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6786916Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6787035Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6787251Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6787354Z graph_break [] 2025-12-04T12:05:01.6787567Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6788511Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6788628Z warnings.warn( 2025-12-04T12:05:01.6788839Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6788945Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6789071Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6789287Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6789410Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6789504Z graph_break [] 2025-12-04T12:05:01.6789714Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6790450Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6790546Z warnings.warn( 2025-12-04T12:05:01.6790927Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6791062Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6791216Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6791449Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6791570Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6791708Z graph_break [] 2025-12-04T12:05:01.6791937Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6792661Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6792758Z warnings.warn( 2025-12-04T12:05:01.6793529Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ac835e731ca0e136.xml - 2025-12-04T12:05:01.6793697Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6794577Z FAILED [0.2614s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6794584Z 2025-12-04T12:05:01.6794801Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6795510Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6795528Z 2025-12-04T12:05:01.6795795Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6795972Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6796183Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.6796282Z Got exit code 1 2025-12-04T12:05:01.6796913Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6797344Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6797997Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b97506f71fc95dd3.xml 2025-12-04T12:05:01.6798172Z ============================= test session starts ============================== 2025-12-04T12:05:01.6798527Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6798632Z cachedir: .pytest_cache 2025-12-04T12:05:01.6799167Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6799289Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6799399Z configfile: pytest.ini 2025-12-04T12:05:01.6800015Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6800242Z collecting ... collected 571 items / 62 deselected / 509 selected 2025-12-04T12:05:01.6800397Z stepcurrent: skipping 62 already run items. 2025-12-04T12:05:01.6800514Z Running 233 items in this shard 2025-12-04T12:05:01.6800520Z 2025-12-04T12:05:01.6801230Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9051s] [ 0%] 2025-12-04T12:05:01.6801949Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2637s] [ 0%] 2025-12-04T12:05:01.6802563Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2604s] [ 0%] 2025-12-04T12:05:01.6802571Z 2025-12-04T12:05:01.6802765Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6803181Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6803305Z Traceback (most recent call last): 2025-12-04T12:05:01.6803791Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6803974Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6804506Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6804636Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6804824Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6804830Z 2025-12-04T12:05:01.6805054Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6805798Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6805837Z 2025-12-04T12:05:01.6806113Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6806334Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6806444Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6806568Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6806688Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6806906Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6807016Z graph_break [] 2025-12-04T12:05:01.6807225Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6807972Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6808074Z warnings.warn( 2025-12-04T12:05:01.6808462Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6808597Z Traceback (most recent call last): 2025-12-04T12:05:01.6809070Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6809225Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6809757Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6809876Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6810074Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6810080Z 2025-12-04T12:05:01.6810294Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6811033Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6811050Z 2025-12-04T12:05:01.6811318Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6811536Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6811660Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6811775Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6811893Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6812124Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6812221Z graph_break [] 2025-12-04T12:05:01.6812436Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6813181Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6813280Z warnings.warn( 2025-12-04T12:05:01.6813537Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6813649Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6813795Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6814024Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6814142Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6814264Z graph_break [] 2025-12-04T12:05:01.6814487Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6815218Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6815325Z warnings.warn( 2025-12-04T12:05:01.6815468Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6815845Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6816009Z Traceback (most recent call last): 2025-12-04T12:05:01.6816493Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6816644Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6817172Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6817294Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6817492Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6817497Z 2025-12-04T12:05:01.6817708Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6818529Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6818536Z 2025-12-04T12:05:01.6818812Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6819036Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6819160Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6819274Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6819396Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6819628Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6819732Z graph_break [] 2025-12-04T12:05:01.6819945Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6820692Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6820790Z warnings.warn( 2025-12-04T12:05:01.6821012Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6821122Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6821234Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6821465Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6821585Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6821679Z graph_break [] 2025-12-04T12:05:01.6821901Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6822632Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6822742Z warnings.warn( 2025-12-04T12:05:01.6822954Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6823062Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6823184Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6823401Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6823516Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6823622Z graph_break [] 2025-12-04T12:05:01.6823872Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6824623Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6824733Z warnings.warn( 2025-12-04T12:05:01.6825537Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b97506f71fc95dd3.xml - 2025-12-04T12:05:01.6825716Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6826572Z FAILED [0.2604s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6826579Z 2025-12-04T12:05:01.6826791Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6827581Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6827587Z 2025-12-04T12:05:01.6827854Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6828043Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6828237Z ================== 1 failed, 62 deselected, 2 rerun in 5.11s =================== 2025-12-04T12:05:01.6828332Z Got exit code 1 2025-12-04T12:05:01.6828447Z Retrying single test... 2025-12-04T12:05:01.6829023Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e0e0cb20cc598f84.xml 2025-12-04T12:05:01.6829193Z ============================= test session starts ============================== 2025-12-04T12:05:01.6829542Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6829649Z cachedir: .pytest_cache 2025-12-04T12:05:01.6830188Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6830309Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6830418Z configfile: pytest.ini 2025-12-04T12:05:01.6831021Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6831247Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6832082Z stepcurrent: skipping 62 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6832193Z Running 1 items in this shard 2025-12-04T12:05:01.6832199Z 2025-12-04T12:05:01.6832905Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9073s] [100%] 2025-12-04T12:05:01.6833620Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2651s] [100%] 2025-12-04T12:05:01.6834235Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2598s] [100%] 2025-12-04T12:05:01.6834243Z 2025-12-04T12:05:01.6834397Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6834777Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6834904Z Traceback (most recent call last): 2025-12-04T12:05:01.6835378Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6835528Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6836128Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6836252Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6836442Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6836477Z 2025-12-04T12:05:01.6836703Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6837445Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6837451Z 2025-12-04T12:05:01.6837726Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6837948Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6838055Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6838178Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6838332Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6838564Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6838662Z graph_break [] 2025-12-04T12:05:01.6838875Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6839622Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6839720Z warnings.warn( 2025-12-04T12:05:01.6840098Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6840233Z Traceback (most recent call last): 2025-12-04T12:05:01.6840706Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6840871Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6841390Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6841516Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6841713Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6841718Z 2025-12-04T12:05:01.6841930Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6842673Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6842690Z 2025-12-04T12:05:01.6842955Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6843169Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6843289Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6843400Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6843519Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6843752Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6843889Z graph_break [] 2025-12-04T12:05:01.6844162Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6844909Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6845012Z warnings.warn( 2025-12-04T12:05:01.6845234Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6845343Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6845458Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6845688Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6845808Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6845903Z graph_break [] 2025-12-04T12:05:01.6846128Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6846936Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6847047Z warnings.warn( 2025-12-04T12:05:01.6847187Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6847602Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6847736Z Traceback (most recent call last): 2025-12-04T12:05:01.6848212Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6848367Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6848897Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6849019Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6849257Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6849262Z 2025-12-04T12:05:01.6849479Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6850219Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6850239Z 2025-12-04T12:05:01.6850505Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6850719Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6850843Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6850956Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6851074Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6851306Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6851401Z graph_break [] 2025-12-04T12:05:01.6851614Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6852366Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6852468Z warnings.warn( 2025-12-04T12:05:01.6852691Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6852805Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6852919Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6853151Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6853272Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6853366Z graph_break [] 2025-12-04T12:05:01.6853589Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6854319Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6854434Z warnings.warn( 2025-12-04T12:05:01.6854647Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6854759Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6854879Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6855099Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6855218Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6855321Z graph_break [] 2025-12-04T12:05:01.6855532Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6856257Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6856365Z warnings.warn( 2025-12-04T12:05:01.6857120Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e0e0cb20cc598f84.xml - 2025-12-04T12:05:01.6857389Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6858368Z FAILED [0.2598s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6858405Z 2025-12-04T12:05:01.6858635Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6859373Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6859379Z 2025-12-04T12:05:01.6859642Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6859834Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6860027Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.6860160Z Got exit code 1 2025-12-04T12:05:01.6860280Z Retrying single test... 2025-12-04T12:05:01.6860859Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7ccf4f5fc7381dbf.xml 2025-12-04T12:05:01.6861032Z ============================= test session starts ============================== 2025-12-04T12:05:01.6861383Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6861489Z cachedir: .pytest_cache 2025-12-04T12:05:01.6862022Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6862145Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6862248Z configfile: pytest.ini 2025-12-04T12:05:01.6862853Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6863086Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6863927Z stepcurrent: skipping 62 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6864041Z Running 1 items in this shard 2025-12-04T12:05:01.6864046Z 2025-12-04T12:05:01.6864761Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9299s] [100%] 2025-12-04T12:05:01.6865465Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2609s] [100%] 2025-12-04T12:05:01.6866078Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2588s] [100%] 2025-12-04T12:05:01.6866086Z 2025-12-04T12:05:01.6866239Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6866621Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6866753Z Traceback (most recent call last): 2025-12-04T12:05:01.6867230Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6867383Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6867911Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6868032Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6868220Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6868238Z 2025-12-04T12:05:01.6868452Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6869220Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6869273Z 2025-12-04T12:05:01.6869547Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6869762Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6869906Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6870029Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6870148Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6870374Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6870471Z graph_break [] 2025-12-04T12:05:01.6870681Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6871422Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6871555Z warnings.warn( 2025-12-04T12:05:01.6871940Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6872072Z Traceback (most recent call last): 2025-12-04T12:05:01.6872547Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6872713Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6873231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6873352Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6873554Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6873559Z 2025-12-04T12:05:01.6873771Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6874520Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6874527Z 2025-12-04T12:05:01.6874793Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6875007Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6875128Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6875242Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6875362Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6875592Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6875688Z graph_break [] 2025-12-04T12:05:01.6875910Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6876640Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6876739Z warnings.warn( 2025-12-04T12:05:01.6876967Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6877078Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6877188Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6877416Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6877540Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6877631Z graph_break [] 2025-12-04T12:05:01.6877854Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6878579Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6878684Z warnings.warn( 2025-12-04T12:05:01.6878826Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6879207Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6879372Z Traceback (most recent call last): 2025-12-04T12:05:01.6879879Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6880043Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6880558Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6880707Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6880909Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6880915Z 2025-12-04T12:05:01.6881130Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6881868Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6881884Z 2025-12-04T12:05:01.6882148Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6882397Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6882518Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6882633Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6882749Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6882982Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6883080Z graph_break [] 2025-12-04T12:05:01.6883291Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6884040Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6884138Z warnings.warn( 2025-12-04T12:05:01.6884362Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6884469Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6884583Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6884814Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6884935Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6885029Z graph_break [] 2025-12-04T12:05:01.6885250Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6885977Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6886088Z warnings.warn( 2025-12-04T12:05:01.6886299Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6886406Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6886529Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6886745Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6886862Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6886970Z graph_break [] 2025-12-04T12:05:01.6887182Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6888159Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6888266Z warnings.warn( 2025-12-04T12:05:01.6889022Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7ccf4f5fc7381dbf.xml - 2025-12-04T12:05:01.6889207Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6890060Z FAILED [0.2588s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6890066Z 2025-12-04T12:05:01.6890294Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6891147Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6891154Z 2025-12-04T12:05:01.6891416Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6891646Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6891845Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.6891960Z Got exit code 1 2025-12-04T12:05:01.6892619Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6893027Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6893609Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9c9073ed00c82380.xml 2025-12-04T12:05:01.6893815Z ============================= test session starts ============================== 2025-12-04T12:05:01.6894175Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6894284Z cachedir: .pytest_cache 2025-12-04T12:05:01.6894807Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6894939Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6895045Z configfile: pytest.ini 2025-12-04T12:05:01.6895637Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6895873Z collecting ... collected 571 items / 63 deselected / 508 selected 2025-12-04T12:05:01.6896013Z stepcurrent: skipping 63 already run items. 2025-12-04T12:05:01.6896131Z Running 232 items in this shard 2025-12-04T12:05:01.6896139Z 2025-12-04T12:05:01.6896762Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_float16_cuda PASSED [3.4340s] [ 0%] 2025-12-04T12:05:01.6897436Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0946s] [ 0%] 2025-12-04T12:05:01.6898186Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2664s] [ 0%] 2025-12-04T12:05:01.6898775Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2599s] [ 0%] 2025-12-04T12:05:01.6898781Z 2025-12-04T12:05:01.6898929Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6899288Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6899415Z Traceback (most recent call last): 2025-12-04T12:05:01.6899907Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6900071Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6900608Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6900733Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6900923Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6900929Z 2025-12-04T12:05:01.6901155Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6901876Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6901882Z 2025-12-04T12:05:01.6902200Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6902449Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6902560Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6902689Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6902910Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6903097Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6903206Z graph_break [] 2025-12-04T12:05:01.6903423Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6904170Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6904272Z warnings.warn( 2025-12-04T12:05:01.6904631Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6904763Z Traceback (most recent call last): 2025-12-04T12:05:01.6905278Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6905434Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6905968Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6906094Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6906301Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6906306Z 2025-12-04T12:05:01.6906521Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6907237Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6907243Z 2025-12-04T12:05:01.6907521Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6907744Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6907870Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6907989Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6908207Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6908340Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6908439Z graph_break [] 2025-12-04T12:05:01.6908657Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6909397Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6909499Z warnings.warn( 2025-12-04T12:05:01.6909715Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6909842Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6909953Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6910189Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6910311Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6910408Z graph_break [] 2025-12-04T12:05:01.6910630Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6911365Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6911465Z warnings.warn( 2025-12-04T12:05:01.6911616Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6911972Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6912106Z Traceback (most recent call last): 2025-12-04T12:05:01.6912582Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6912733Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6913352Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6913474Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6913664Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6913789Z 2025-12-04T12:05:01.6914003Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6914717Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6914723Z 2025-12-04T12:05:01.6914995Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6915210Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6915321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6915451Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6915701Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6915843Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6915941Z graph_break [] 2025-12-04T12:05:01.6916154Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6916901Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6917003Z warnings.warn( 2025-12-04T12:05:01.6917212Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6917334Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6917445Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6917677Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6917798Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6917892Z graph_break [] 2025-12-04T12:05:01.6918125Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6918853Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6918953Z warnings.warn( 2025-12-04T12:05:01.6919180Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6919290Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6919415Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6919633Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6919752Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6919859Z graph_break [] 2025-12-04T12:05:01.6920073Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6920802Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6920913Z warnings.warn( 2025-12-04T12:05:01.6921668Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9c9073ed00c82380.xml - 2025-12-04T12:05:01.6921848Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6922676Z FAILED [0.2599s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6922682Z 2025-12-04T12:05:01.6922895Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6923622Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6923628Z 2025-12-04T12:05:01.6923925Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6924146Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6924360Z ============= 1 failed, 1 passed, 63 deselected, 2 rerun in 5.73s ============== 2025-12-04T12:05:01.6924458Z Got exit code 1 2025-12-04T12:05:01.6924607Z Retrying single test... 2025-12-04T12:05:01.6925182Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-24dd415c882be612.xml 2025-12-04T12:05:01.6925343Z ============================= test session starts ============================== 2025-12-04T12:05:01.6925705Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6925813Z cachedir: .pytest_cache 2025-12-04T12:05:01.6926347Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6926504Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6926616Z configfile: pytest.ini 2025-12-04T12:05:01.6927222Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6927450Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6928258Z stepcurrent: skipping 64 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6928373Z Running 1 items in this shard 2025-12-04T12:05:01.6928378Z 2025-12-04T12:05:01.6929050Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9385s] [100%] 2025-12-04T12:05:01.6929736Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2671s] [100%] 2025-12-04T12:05:01.6930326Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2560s] [100%] 2025-12-04T12:05:01.6930332Z 2025-12-04T12:05:01.6930481Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6930840Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6930959Z Traceback (most recent call last): 2025-12-04T12:05:01.6931444Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6931599Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6932131Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6932251Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6932444Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6932449Z 2025-12-04T12:05:01.6932676Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6933389Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6933397Z 2025-12-04T12:05:01.6933671Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6933892Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6934005Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6934129Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6934251Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6934470Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6934576Z graph_break [] 2025-12-04T12:05:01.6934787Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6935593Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6935693Z warnings.warn( 2025-12-04T12:05:01.6936052Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6936217Z Traceback (most recent call last): 2025-12-04T12:05:01.6936693Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6936847Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6937377Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6937500Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6937701Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6937741Z 2025-12-04T12:05:01.6937957Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6938761Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6938768Z 2025-12-04T12:05:01.6939046Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6939263Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6939390Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6939507Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6939627Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6939860Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6939958Z graph_break [] 2025-12-04T12:05:01.6940171Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6940924Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6941025Z warnings.warn( 2025-12-04T12:05:01.6941253Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6941367Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6941480Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6941709Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6941828Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6941925Z graph_break [] 2025-12-04T12:05:01.6942149Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6942881Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6942983Z warnings.warn( 2025-12-04T12:05:01.6943140Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6943499Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6943631Z Traceback (most recent call last): 2025-12-04T12:05:01.6944107Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6944263Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6944791Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6944912Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6945103Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6945121Z 2025-12-04T12:05:01.6945334Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6946094Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6946103Z 2025-12-04T12:05:01.6946412Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6946630Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6946774Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6946900Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6947021Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6947251Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6947348Z graph_break [] 2025-12-04T12:05:01.6947562Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6948314Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6948460Z warnings.warn( 2025-12-04T12:05:01.6948671Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6948792Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6948903Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6949132Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6949254Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6949352Z graph_break [] 2025-12-04T12:05:01.6949574Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6950298Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6950397Z warnings.warn( 2025-12-04T12:05:01.6950623Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6950732Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6950863Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6951089Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6951211Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6951325Z graph_break [] 2025-12-04T12:05:01.6951539Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6952267Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6952377Z warnings.warn( 2025-12-04T12:05:01.6953132Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-24dd415c882be612.xml - 2025-12-04T12:05:01.6953313Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6954142Z FAILED [0.2560s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6954149Z 2025-12-04T12:05:01.6954366Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6955092Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6955099Z 2025-12-04T12:05:01.6955364Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6955554Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6955751Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.6955848Z Got exit code 1 2025-12-04T12:05:01.6955971Z Retrying single test... 2025-12-04T12:05:01.6956543Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ebb637005a79032b.xml 2025-12-04T12:05:01.6956737Z ============================= test session starts ============================== 2025-12-04T12:05:01.6957131Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6957242Z cachedir: .pytest_cache 2025-12-04T12:05:01.6957780Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6957937Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6958045Z configfile: pytest.ini 2025-12-04T12:05:01.6958654Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6958882Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.6959698Z stepcurrent: skipping 64 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6959846Z Running 1 items in this shard 2025-12-04T12:05:01.6959852Z 2025-12-04T12:05:01.6960526Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9133s] [100%] 2025-12-04T12:05:01.6961214Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2682s] [100%] 2025-12-04T12:05:01.6961798Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2626s] [100%] 2025-12-04T12:05:01.6961804Z 2025-12-04T12:05:01.6961960Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6962317Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6962441Z Traceback (most recent call last): 2025-12-04T12:05:01.6962935Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6963097Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6963637Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6963765Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6963959Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6963965Z 2025-12-04T12:05:01.6964190Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6964908Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6964913Z 2025-12-04T12:05:01.6965191Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6965411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6965522Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6965649Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6965768Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6965990Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6966101Z graph_break [] 2025-12-04T12:05:01.6966314Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6967063Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6967166Z warnings.warn( 2025-12-04T12:05:01.6967523Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6967652Z Traceback (most recent call last): 2025-12-04T12:05:01.6968160Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6968316Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6968872Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6968997Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6969222Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6969228Z 2025-12-04T12:05:01.6969443Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6970160Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6970165Z 2025-12-04T12:05:01.6970442Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6970658Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6970813Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6970932Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6971054Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6971285Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6971381Z graph_break [] 2025-12-04T12:05:01.6971599Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6972343Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6972443Z warnings.warn( 2025-12-04T12:05:01.6972668Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6972776Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6972887Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6973119Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6973242Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6973341Z graph_break [] 2025-12-04T12:05:01.6973566Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6974294Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6974394Z warnings.warn( 2025-12-04T12:05:01.6974546Z =================================== FAILURES =================================== 2025-12-04T12:05:01.6974903Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.6975031Z Traceback (most recent call last): 2025-12-04T12:05:01.6975501Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6975652Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6976183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6976310Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6976511Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6976516Z 2025-12-04T12:05:01.6976730Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6977448Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6977453Z 2025-12-04T12:05:01.6977727Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6977941Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6978062Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6978242Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6978367Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6978645Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6978745Z graph_break [] 2025-12-04T12:05:01.6978990Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6979739Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6979867Z warnings.warn( 2025-12-04T12:05:01.6980083Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6980209Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6980322Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6980555Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6980676Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6980772Z graph_break [] 2025-12-04T12:05:01.6980995Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6981759Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6981856Z warnings.warn( 2025-12-04T12:05:01.6982082Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6982192Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6982317Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6982534Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6982651Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6982758Z graph_break [] 2025-12-04T12:05:01.6982969Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6983689Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6983803Z warnings.warn( 2025-12-04T12:05:01.6984555Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ebb637005a79032b.xml - 2025-12-04T12:05:01.6984737Z =========================== short test summary info ============================ 2025-12-04T12:05:01.6985560Z FAILED [0.2626s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6985567Z 2025-12-04T12:05:01.6985781Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6986509Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6986514Z 2025-12-04T12:05:01.6986777Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6986970Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.6987171Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.6987269Z Got exit code 1 2025-12-04T12:05:01.6988112Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.6988530Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.6989113Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-64323540e9247afe.xml 2025-12-04T12:05:01.6989275Z ============================= test session starts ============================== 2025-12-04T12:05:01.6989626Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.6989750Z cachedir: .pytest_cache 2025-12-04T12:05:01.6990338Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.6990502Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.6990624Z configfile: pytest.ini 2025-12-04T12:05:01.6991219Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.6991513Z collecting ... collected 571 items / 65 deselected / 506 selected 2025-12-04T12:05:01.6991654Z stepcurrent: skipping 65 already run items. 2025-12-04T12:05:01.6991766Z Running 230 items in this shard 2025-12-04T12:05:01.6991772Z 2025-12-04T12:05:01.6992386Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_False_float16_cuda PASSED [3.4277s] [ 0%] 2025-12-04T12:05:01.6993067Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0921s] [ 0%] 2025-12-04T12:05:01.6993817Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2671s] [ 0%] 2025-12-04T12:05:01.6994415Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2651s] [ 0%] 2025-12-04T12:05:01.6994424Z 2025-12-04T12:05:01.6994561Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.6994935Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.6995058Z Traceback (most recent call last): 2025-12-04T12:05:01.6995543Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.6995699Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.6996223Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.6996359Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.6996553Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.6996558Z 2025-12-04T12:05:01.6996790Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.6997513Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.6997520Z 2025-12-04T12:05:01.6997784Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.6998009Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.6998120Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.6998233Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.6998463Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.6998588Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.6998701Z graph_break [] 2025-12-04T12:05:01.6998921Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.6999656Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.6999770Z warnings.warn( 2025-12-04T12:05:01.7000128Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7000249Z Traceback (most recent call last): 2025-12-04T12:05:01.7000735Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7000890Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7001416Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7001573Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7001794Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7001800Z 2025-12-04T12:05:01.7002026Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7002743Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7002779Z 2025-12-04T12:05:01.7003055Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7003270Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7003380Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7003502Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7003723Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7003844Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7003978Z graph_break [] 2025-12-04T12:05:01.7004196Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7004944Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7005046Z warnings.warn( 2025-12-04T12:05:01.7005260Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7005379Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7005496Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7005713Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7005845Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7005939Z graph_break [] 2025-12-04T12:05:01.7006164Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7006891Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7006996Z warnings.warn( 2025-12-04T12:05:01.7007150Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7007513Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7007634Z Traceback (most recent call last): 2025-12-04T12:05:01.7008123Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7008278Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7008807Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7008931Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7009119Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7009126Z 2025-12-04T12:05:01.7009360Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7010078Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7010086Z 2025-12-04T12:05:01.7010361Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7010578Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7010697Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7010823Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7011044Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7011165Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7011276Z graph_break [] 2025-12-04T12:05:01.7011496Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7012300Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7012405Z warnings.warn( 2025-12-04T12:05:01.7012621Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7012777Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7012889Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7013108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7013244Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7013339Z graph_break [] 2025-12-04T12:05:01.7013549Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7014287Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7014388Z warnings.warn( 2025-12-04T12:05:01.7014647Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7014759Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7014867Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7015098Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7015219Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7015313Z graph_break [] 2025-12-04T12:05:01.7015540Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7016263Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7016374Z warnings.warn( 2025-12-04T12:05:01.7017121Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-64323540e9247afe.xml - 2025-12-04T12:05:01.7017291Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7018208Z FAILED [0.2651s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7018214Z 2025-12-04T12:05:01.7018432Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7019157Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7019164Z 2025-12-04T12:05:01.7019427Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7019603Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7019824Z ============= 1 failed, 1 passed, 65 deselected, 2 rerun in 5.74s ============== 2025-12-04T12:05:01.7019921Z Got exit code 1 2025-12-04T12:05:01.7020040Z Retrying single test... 2025-12-04T12:05:01.7020616Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a52cc93387944e77.xml 2025-12-04T12:05:01.7020773Z ============================= test session starts ============================== 2025-12-04T12:05:01.7021138Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7021246Z cachedir: .pytest_cache 2025-12-04T12:05:01.7021764Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7021899Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7022004Z configfile: pytest.ini 2025-12-04T12:05:01.7022607Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7022836Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7023734Z stepcurrent: skipping 66 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7023863Z Running 1 items in this shard 2025-12-04T12:05:01.7023868Z 2025-12-04T12:05:01.7024586Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9034s] [100%] 2025-12-04T12:05:01.7025277Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2627s] [100%] 2025-12-04T12:05:01.7025873Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2610s] [100%] 2025-12-04T12:05:01.7025879Z 2025-12-04T12:05:01.7026016Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7026424Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7026547Z Traceback (most recent call last): 2025-12-04T12:05:01.7027034Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7027194Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7027713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7027845Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7028035Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7028041Z 2025-12-04T12:05:01.7028264Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7028985Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7028995Z 2025-12-04T12:05:01.7029262Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7029489Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7029599Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7029721Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7029842Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7030060Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7030166Z graph_break [] 2025-12-04T12:05:01.7030381Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7031113Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7031221Z warnings.warn( 2025-12-04T12:05:01.7031585Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7031708Z Traceback (most recent call last): 2025-12-04T12:05:01.7032198Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7032349Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7032882Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7033005Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7033192Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7033198Z 2025-12-04T12:05:01.7033427Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7034146Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7034154Z 2025-12-04T12:05:01.7034460Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7035263Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7035381Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7035511Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7035668Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7035890Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7036001Z graph_break [] 2025-12-04T12:05:01.7036214Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7036966Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7037067Z warnings.warn( 2025-12-04T12:05:01.7037279Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7037431Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7037547Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7037766Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7037899Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7037995Z graph_break [] 2025-12-04T12:05:01.7038218Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7038947Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7039046Z warnings.warn( 2025-12-04T12:05:01.7039201Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7039562Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7039682Z Traceback (most recent call last): 2025-12-04T12:05:01.7040172Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7040332Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7040860Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7040983Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7041174Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7041179Z 2025-12-04T12:05:01.7041406Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7042127Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7042133Z 2025-12-04T12:05:01.7042408Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7042628Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7042744Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7042868Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7042992Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7043209Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7043317Z graph_break [] 2025-12-04T12:05:01.7043532Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7044275Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7044373Z warnings.warn( 2025-12-04T12:05:01.7044587Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7044705Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7044816Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7045034Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7045199Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7045297Z graph_break [] 2025-12-04T12:05:01.7045540Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7046283Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7046413Z warnings.warn( 2025-12-04T12:05:01.7046633Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7046740Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7046850Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7047079Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7047199Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7047296Z graph_break [] 2025-12-04T12:05:01.7047519Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7048282Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7048394Z warnings.warn( 2025-12-04T12:05:01.7049146Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a52cc93387944e77.xml - 2025-12-04T12:05:01.7049317Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7050156Z FAILED [0.2610s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7050162Z 2025-12-04T12:05:01.7050377Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7051105Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7051113Z 2025-12-04T12:05:01.7051377Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7051555Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7051766Z ================== 1 failed, 294 deselected, 2 rerun in 5.10s ================== 2025-12-04T12:05:01.7051867Z Got exit code 1 2025-12-04T12:05:01.7051985Z Retrying single test... 2025-12-04T12:05:01.7052563Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c3ebbd7913c9335f.xml 2025-12-04T12:05:01.7052722Z ============================= test session starts ============================== 2025-12-04T12:05:01.7053085Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7053193Z cachedir: .pytest_cache 2025-12-04T12:05:01.7053717Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7053858Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7053964Z configfile: pytest.ini 2025-12-04T12:05:01.7054568Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7054799Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7060544Z stepcurrent: skipping 66 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7060739Z Running 1 items in this shard 2025-12-04T12:05:01.7060750Z 2025-12-04T12:05:01.7061460Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9125s] [100%] 2025-12-04T12:05:01.7062293Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2641s] [100%] 2025-12-04T12:05:01.7062891Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2529s] [100%] 2025-12-04T12:05:01.7062936Z 2025-12-04T12:05:01.7063080Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7063455Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7063577Z Traceback (most recent call last): 2025-12-04T12:05:01.7064067Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7064225Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7064753Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7064929Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7065125Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7065131Z 2025-12-04T12:05:01.7065345Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7066082Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7066090Z 2025-12-04T12:05:01.7066357Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7066595Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7066707Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7066820Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7066955Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7067178Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7067292Z graph_break [] 2025-12-04T12:05:01.7067516Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7068255Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7068369Z warnings.warn( 2025-12-04T12:05:01.7068735Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7068851Z Traceback (most recent call last): 2025-12-04T12:05:01.7069337Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7069493Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7070023Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7070144Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7070340Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7070345Z 2025-12-04T12:05:01.7070572Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7071290Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7071298Z 2025-12-04T12:05:01.7071577Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7071796Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7071905Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7072031Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7072154Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7072374Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7072482Z graph_break [] 2025-12-04T12:05:01.7072755Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7073533Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7073634Z warnings.warn( 2025-12-04T12:05:01.7073880Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7073997Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7074107Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7074325Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7074457Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7074552Z graph_break [] 2025-12-04T12:05:01.7074762Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7075505Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7075635Z warnings.warn( 2025-12-04T12:05:01.7075789Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7076149Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7076270Z Traceback (most recent call last): 2025-12-04T12:05:01.7076756Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7076910Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7077439Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7077562Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7077755Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7077760Z 2025-12-04T12:05:01.7077981Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7078703Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7078709Z 2025-12-04T12:05:01.7078979Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7079209Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7079322Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7079447Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7079564Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7079781Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7079887Z graph_break [] 2025-12-04T12:05:01.7080100Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7080835Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7080949Z warnings.warn( 2025-12-04T12:05:01.7081164Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7081281Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7081395Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7081613Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7081744Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7081840Z graph_break [] 2025-12-04T12:05:01.7082052Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7082793Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7082894Z warnings.warn( 2025-12-04T12:05:01.7083112Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7083276Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7083390Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7083648Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7083770Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7083895Z graph_break [] 2025-12-04T12:05:01.7084121Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7084851Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7084966Z warnings.warn( 2025-12-04T12:05:01.7085724Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c3ebbd7913c9335f.xml - 2025-12-04T12:05:01.7085890Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7086767Z FAILED [0.2529s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7086773Z 2025-12-04T12:05:01.7086990Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7087942Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7087952Z 2025-12-04T12:05:01.7088261Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7088439Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7088651Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.7088751Z Got exit code 1 2025-12-04T12:05:01.7089405Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7089819Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7090395Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4acaad8a10b8235e.xml 2025-12-04T12:05:01.7090574Z ============================= test session starts ============================== 2025-12-04T12:05:01.7090928Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7091038Z cachedir: .pytest_cache 2025-12-04T12:05:01.7091571Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7091693Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7091813Z configfile: pytest.ini 2025-12-04T12:05:01.7092408Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7092637Z collecting ... collected 571 items / 67 deselected / 504 selected 2025-12-04T12:05:01.7092790Z stepcurrent: skipping 67 already run items. 2025-12-04T12:05:01.7092902Z Running 228 items in this shard 2025-12-04T12:05:01.7092910Z 2025-12-04T12:05:01.7093613Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9404s] [ 0%] 2025-12-04T12:05:01.7094296Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2753s] [ 0%] 2025-12-04T12:05:01.7094894Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2746s] [ 0%] 2025-12-04T12:05:01.7094901Z 2025-12-04T12:05:01.7095126Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7095618Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7095754Z Traceback (most recent call last): 2025-12-04T12:05:01.7096232Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7096427Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7096962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7097085Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7097277Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7097294Z 2025-12-04T12:05:01.7097510Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7098298Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7098356Z 2025-12-04T12:05:01.7098639Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7098858Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7098970Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7099098Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7099218Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7099453Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7099551Z graph_break [] 2025-12-04T12:05:01.7099768Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7100515Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7100617Z warnings.warn( 2025-12-04T12:05:01.7100986Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7101120Z Traceback (most recent call last): 2025-12-04T12:05:01.7101594Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7101767Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7102288Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7102408Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7102610Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7102615Z 2025-12-04T12:05:01.7102830Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7103563Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7103571Z 2025-12-04T12:05:01.7103838Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7104056Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7104179Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7104292Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7104415Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7104644Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7104742Z graph_break [] 2025-12-04T12:05:01.7104968Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7105698Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7105799Z warnings.warn( 2025-12-04T12:05:01.7106024Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7106169Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7106283Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7106542Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7106664Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7106790Z graph_break [] 2025-12-04T12:05:01.7107016Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7107750Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7107858Z warnings.warn( 2025-12-04T12:05:01.7107999Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7108365Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7108494Z Traceback (most recent call last): 2025-12-04T12:05:01.7108999Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7109166Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7109683Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7109804Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7110006Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7110012Z 2025-12-04T12:05:01.7110224Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7110947Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7110965Z 2025-12-04T12:05:01.7111230Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7113873Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7114018Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7114149Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7114272Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7114512Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7114614Z graph_break [] 2025-12-04T12:05:01.7114835Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7115584Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7115685Z warnings.warn( 2025-12-04T12:05:01.7115901Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7116021Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7116134Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7116369Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7116526Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7116624Z graph_break [] 2025-12-04T12:05:01.7116837Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7117574Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7117673Z warnings.warn( 2025-12-04T12:05:01.7117895Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7118007Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7118116Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7118347Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7118466Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7118561Z graph_break [] 2025-12-04T12:05:01.7118787Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7119576Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7119690Z warnings.warn( 2025-12-04T12:05:01.7120443Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4acaad8a10b8235e.xml - 2025-12-04T12:05:01.7120653Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7121498Z FAILED [0.2746s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7121505Z 2025-12-04T12:05:01.7121718Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7122457Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7122514Z 2025-12-04T12:05:01.7122784Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7122956Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7123168Z ================== 1 failed, 67 deselected, 2 rerun in 5.18s =================== 2025-12-04T12:05:01.7123265Z Got exit code 1 2025-12-04T12:05:01.7123384Z Retrying single test... 2025-12-04T12:05:01.7123956Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7532210fc6e9ae4a.xml 2025-12-04T12:05:01.7124118Z ============================= test session starts ============================== 2025-12-04T12:05:01.7124485Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7124593Z cachedir: .pytest_cache 2025-12-04T12:05:01.7125185Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7125324Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7125430Z configfile: pytest.ini 2025-12-04T12:05:01.7126038Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7126269Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7127074Z stepcurrent: skipping 67 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7127196Z Running 1 items in this shard 2025-12-04T12:05:01.7127202Z 2025-12-04T12:05:01.7127885Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9245s] [100%] 2025-12-04T12:05:01.7128588Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2718s] [100%] 2025-12-04T12:05:01.7129185Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2617s] [100%] 2025-12-04T12:05:01.7129193Z 2025-12-04T12:05:01.7129342Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7129711Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7129832Z Traceback (most recent call last): 2025-12-04T12:05:01.7130318Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7130474Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7130999Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7131165Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7131359Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7131364Z 2025-12-04T12:05:01.7131590Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7132346Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7132351Z 2025-12-04T12:05:01.7132616Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7132841Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7132954Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7133078Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7133198Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7133453Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7133558Z graph_break [] 2025-12-04T12:05:01.7133775Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7134507Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7134622Z warnings.warn( 2025-12-04T12:05:01.7134990Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7135118Z Traceback (most recent call last): 2025-12-04T12:05:01.7135591Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7135747Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7136274Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7136441Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7136636Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7136641Z 2025-12-04T12:05:01.7136867Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7137589Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7137596Z 2025-12-04T12:05:01.7137867Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7138084Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7138284Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7138409Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7138530Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7138750Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7138862Z graph_break [] 2025-12-04T12:05:01.7139080Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7139825Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7139930Z warnings.warn( 2025-12-04T12:05:01.7140141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7140260Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7140375Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7140598Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7140730Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7140829Z graph_break [] 2025-12-04T12:05:01.7141048Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7141780Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7141923Z warnings.warn( 2025-12-04T12:05:01.7142079Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7142449Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7142600Z Traceback (most recent call last): 2025-12-04T12:05:01.7143090Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7143246Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7143776Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7143899Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7144089Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7144126Z 2025-12-04T12:05:01.7144356Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7145083Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7145089Z 2025-12-04T12:05:01.7145364Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7145580Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7145690Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7145812Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7145932Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7146149Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7146254Z graph_break [] 2025-12-04T12:05:01.7146466Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7147245Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7147345Z warnings.warn( 2025-12-04T12:05:01.7147559Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7147677Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7147788Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7148007Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7148138Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7148232Z graph_break [] 2025-12-04T12:05:01.7148445Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7149186Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7149283Z warnings.warn( 2025-12-04T12:05:01.7149507Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7149615Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7149732Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7149963Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7150081Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7150177Z graph_break [] 2025-12-04T12:05:01.7150399Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7151132Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7151239Z warnings.warn( 2025-12-04T12:05:01.7151985Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7532210fc6e9ae4a.xml - 2025-12-04T12:05:01.7152153Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7153039Z FAILED [0.2617s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7153046Z 2025-12-04T12:05:01.7153259Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7154029Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7154034Z 2025-12-04T12:05:01.7154297Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7154475Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7154679Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.7154776Z Got exit code 1 2025-12-04T12:05:01.7154927Z Retrying single test... 2025-12-04T12:05:01.7155506Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d5d3164b3ad67fad.xml 2025-12-04T12:05:01.7155667Z ============================= test session starts ============================== 2025-12-04T12:05:01.7156025Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7156140Z cachedir: .pytest_cache 2025-12-04T12:05:01.7156661Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7156796Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7156905Z configfile: pytest.ini 2025-12-04T12:05:01.7157508Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7157734Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7158587Z stepcurrent: skipping 67 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7158719Z Running 1 items in this shard 2025-12-04T12:05:01.7158724Z 2025-12-04T12:05:01.7159413Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9269s] [100%] 2025-12-04T12:05:01.7160111Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2807s] [100%] 2025-12-04T12:05:01.7160707Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2734s] [100%] 2025-12-04T12:05:01.7160713Z 2025-12-04T12:05:01.7160862Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7161233Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7161356Z Traceback (most recent call last): 2025-12-04T12:05:01.7161839Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7161998Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7162521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7162656Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7162845Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7162850Z 2025-12-04T12:05:01.7163075Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7163799Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7163809Z 2025-12-04T12:05:01.7164107Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7164334Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7164443Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7164625Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7164743Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7164963Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7165070Z graph_break [] 2025-12-04T12:05:01.7165286Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7166017Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7166126Z warnings.warn( 2025-12-04T12:05:01.7166492Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7166652Z Traceback (most recent call last): 2025-12-04T12:05:01.7167128Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7167283Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7167814Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7167936Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7168127Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7168132Z 2025-12-04T12:05:01.7168356Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7169082Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7169087Z 2025-12-04T12:05:01.7169395Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7169623Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7169734Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7169852Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7169975Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7170202Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7170297Z graph_break [] 2025-12-04T12:05:01.7170512Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7171251Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7171349Z warnings.warn( 2025-12-04T12:05:01.7171562Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7171678Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7171791Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7172021Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7172139Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7172236Z graph_break [] 2025-12-04T12:05:01.7172455Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7173181Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7173281Z warnings.warn( 2025-12-04T12:05:01.7173431Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7173796Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7173927Z Traceback (most recent call last): 2025-12-04T12:05:01.7174404Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7174563Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7175119Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7175242Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7175459Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7175464Z 2025-12-04T12:05:01.7175689Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7176414Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7176419Z 2025-12-04T12:05:01.7176693Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7176911Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7177056Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7177182Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7177304Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7177520Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7177625Z graph_break [] 2025-12-04T12:05:01.7177843Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7178670Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7178771Z warnings.warn( 2025-12-04T12:05:01.7178984Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7179102Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7179212Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7179432Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7179613Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7179709Z graph_break [] 2025-12-04T12:05:01.7179931Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7180663Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7180763Z warnings.warn( 2025-12-04T12:05:01.7180986Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7181094Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7181203Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7181436Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7181555Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7181656Z graph_break [] 2025-12-04T12:05:01.7181866Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7182595Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7182698Z warnings.warn( 2025-12-04T12:05:01.7183450Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d5d3164b3ad67fad.xml - 2025-12-04T12:05:01.7183616Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7184455Z FAILED [0.2734s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7184462Z 2025-12-04T12:05:01.7184677Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7185406Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7185414Z 2025-12-04T12:05:01.7185709Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7185894Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7186090Z ================== 1 failed, 294 deselected, 2 rerun in 5.18s ================== 2025-12-04T12:05:01.7186220Z Got exit code 1 2025-12-04T12:05:01.7186871Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7187279Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7188024Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4e5ad1e60a526460.xml 2025-12-04T12:05:01.7188196Z ============================= test session starts ============================== 2025-12-04T12:05:01.7188612Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7188730Z cachedir: .pytest_cache 2025-12-04T12:05:01.7189253Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7189375Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7189491Z configfile: pytest.ini 2025-12-04T12:05:01.7190078Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7190312Z collecting ... collected 571 items / 68 deselected / 503 selected 2025-12-04T12:05:01.7190450Z stepcurrent: skipping 68 already run items. 2025-12-04T12:05:01.7190560Z Running 227 items in this shard 2025-12-04T12:05:01.7190566Z 2025-12-04T12:05:01.7191309Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9265s] [ 0%] 2025-12-04T12:05:01.7191995Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2662s] [ 0%] 2025-12-04T12:05:01.7192599Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2625s] [ 0%] 2025-12-04T12:05:01.7192606Z 2025-12-04T12:05:01.7192743Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7193102Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7193227Z Traceback (most recent call last): 2025-12-04T12:05:01.7193702Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7193855Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7194388Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7194509Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7194705Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7194710Z 2025-12-04T12:05:01.7194926Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7195644Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7195661Z 2025-12-04T12:05:01.7195923Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7196141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7196258Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7196370Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7196493Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7196722Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7196863Z graph_break [] 2025-12-04T12:05:01.7197078Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7197821Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7197961Z warnings.warn( 2025-12-04T12:05:01.7198332Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7198451Z Traceback (most recent call last): 2025-12-04T12:05:01.7198921Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7199085Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7199603Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7199759Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7199959Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7199965Z 2025-12-04T12:05:01.7200178Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7200906Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7200913Z 2025-12-04T12:05:01.7201176Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7201391Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7201506Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7201619Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7201747Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7202005Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7202101Z graph_break [] 2025-12-04T12:05:01.7202326Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7203057Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7203158Z warnings.warn( 2025-12-04T12:05:01.7203382Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7203491Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7203610Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7203825Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7203945Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7204044Z graph_break [] 2025-12-04T12:05:01.7204257Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7204989Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7205094Z warnings.warn( 2025-12-04T12:05:01.7205233Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7205605Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7205724Z Traceback (most recent call last): 2025-12-04T12:05:01.7206201Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7206368Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7206883Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7207000Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7207198Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7207208Z 2025-12-04T12:05:01.7207473Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7208196Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7208232Z 2025-12-04T12:05:01.7208499Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7208712Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7208828Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7208941Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7209074Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7209293Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7209387Z graph_break [] 2025-12-04T12:05:01.7209606Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7210371Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7210469Z warnings.warn( 2025-12-04T12:05:01.7210689Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7210800Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7210912Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7211138Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7211257Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7211359Z graph_break [] 2025-12-04T12:05:01.7211570Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7212302Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7212410Z warnings.warn( 2025-12-04T12:05:01.7212656Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7212770Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7212887Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7213106Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7213234Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7213330Z graph_break [] 2025-12-04T12:05:01.7213540Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7214272Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7214370Z warnings.warn( 2025-12-04T12:05:01.7215118Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4e5ad1e60a526460.xml - 2025-12-04T12:05:01.7215300Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7216132Z FAILED [0.2625s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7216141Z 2025-12-04T12:05:01.7216362Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7217079Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7217085Z 2025-12-04T12:05:01.7217356Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7217532Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7217724Z ================== 1 failed, 68 deselected, 2 rerun in 5.15s =================== 2025-12-04T12:05:01.7217830Z Got exit code 1 2025-12-04T12:05:01.7217937Z Retrying single test... 2025-12-04T12:05:01.7218636Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4c3c91d6f895ec60.xml 2025-12-04T12:05:01.7218806Z ============================= test session starts ============================== 2025-12-04T12:05:01.7219190Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7219306Z cachedir: .pytest_cache 2025-12-04T12:05:01.7219825Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7219948Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7220065Z configfile: pytest.ini 2025-12-04T12:05:01.7220657Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7220918Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7221732Z stepcurrent: skipping 68 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7221845Z Running 1 items in this shard 2025-12-04T12:05:01.7221852Z 2025-12-04T12:05:01.7222541Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9113s] [100%] 2025-12-04T12:05:01.7223223Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2666s] [100%] 2025-12-04T12:05:01.7223829Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2627s] [100%] 2025-12-04T12:05:01.7223835Z 2025-12-04T12:05:01.7224008Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7224375Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7224503Z Traceback (most recent call last): 2025-12-04T12:05:01.7224977Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7225133Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7225662Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7225782Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7225979Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7225984Z 2025-12-04T12:05:01.7226197Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7226915Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7226933Z 2025-12-04T12:05:01.7227200Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7227416Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7227538Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7227651Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7227770Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7227997Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7228094Z graph_break [] 2025-12-04T12:05:01.7228307Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7229048Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7229146Z warnings.warn( 2025-12-04T12:05:01.7229520Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7229675Z Traceback (most recent call last): 2025-12-04T12:05:01.7230151Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7230340Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7230857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7230987Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7231181Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7231186Z 2025-12-04T12:05:01.7231400Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7232133Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7232172Z 2025-12-04T12:05:01.7232440Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7232667Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7232776Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7232890Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7233017Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7233236Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7233332Z graph_break [] 2025-12-04T12:05:01.7233553Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7234287Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7234385Z warnings.warn( 2025-12-04T12:05:01.7234606Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7234751Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7234900Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7235228Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7235347Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7235450Z graph_break [] 2025-12-04T12:05:01.7235666Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7236394Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7236500Z warnings.warn( 2025-12-04T12:05:01.7236640Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7237001Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7237124Z Traceback (most recent call last): 2025-12-04T12:05:01.7237601Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7237765Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7238282Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7238403Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7238601Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7238607Z 2025-12-04T12:05:01.7238821Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7239545Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7239551Z 2025-12-04T12:05:01.7239813Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7240028Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7240149Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7240303Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7240430Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7240646Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7240773Z graph_break [] 2025-12-04T12:05:01.7240994Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7241725Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7241825Z warnings.warn( 2025-12-04T12:05:01.7242042Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7242149Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7242270Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7242487Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7242640Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7242743Z graph_break [] 2025-12-04T12:05:01.7242958Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7243684Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7243791Z warnings.warn( 2025-12-04T12:05:01.7244001Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7244107Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7244225Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7244441Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7244567Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7244661Z graph_break [] 2025-12-04T12:05:01.7244871Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7245659Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7245758Z warnings.warn( 2025-12-04T12:05:01.7246507Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4c3c91d6f895ec60.xml - 2025-12-04T12:05:01.7246688Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7247525Z FAILED [0.2627s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7247531Z 2025-12-04T12:05:01.7247750Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7248466Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7248474Z 2025-12-04T12:05:01.7248749Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7248926Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7249128Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.7249236Z Got exit code 1 2025-12-04T12:05:01.7249341Z Retrying single test... 2025-12-04T12:05:01.7249915Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-95a17199eb5946d1.xml 2025-12-04T12:05:01.7250081Z ============================= test session starts ============================== 2025-12-04T12:05:01.7250433Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7250545Z cachedir: .pytest_cache 2025-12-04T12:05:01.7251071Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7251225Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7251339Z configfile: pytest.ini 2025-12-04T12:05:01.7251930Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7252191Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7253003Z stepcurrent: skipping 68 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7253113Z Running 1 items in this shard 2025-12-04T12:05:01.7253118Z 2025-12-04T12:05:01.7253809Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9078s] [100%] 2025-12-04T12:05:01.7254518Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2639s] [100%] 2025-12-04T12:05:01.7255119Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2607s] [100%] 2025-12-04T12:05:01.7255127Z 2025-12-04T12:05:01.7255263Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7255623Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7255751Z Traceback (most recent call last): 2025-12-04T12:05:01.7256226Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7256388Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7256941Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7257065Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7257266Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7257273Z 2025-12-04T12:05:01.7257487Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7258318Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7258334Z 2025-12-04T12:05:01.7258598Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7258815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7258936Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7259047Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7259167Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7259400Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7259495Z graph_break [] 2025-12-04T12:05:01.7259715Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7260453Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7260554Z warnings.warn( 2025-12-04T12:05:01.7260920Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7261042Z Traceback (most recent call last): 2025-12-04T12:05:01.7261629Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7261791Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7262308Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7262443Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7262630Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7262682Z 2025-12-04T12:05:01.7262897Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7263622Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7263660Z 2025-12-04T12:05:01.7263924Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7264146Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7264254Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7264366Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7264493Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7264710Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7264839Z graph_break [] 2025-12-04T12:05:01.7265064Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7265801Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7265901Z warnings.warn( 2025-12-04T12:05:01.7266128Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7266238Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7266356Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7266572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7266691Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7266793Z graph_break [] 2025-12-04T12:05:01.7267006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7267841Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7267953Z warnings.warn( 2025-12-04T12:05:01.7268095Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7268462Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7268587Z Traceback (most recent call last): 2025-12-04T12:05:01.7269062Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7269223Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7269740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7269860Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7270059Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7270065Z 2025-12-04T12:05:01.7270282Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7271008Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7271014Z 2025-12-04T12:05:01.7271281Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7271494Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7271613Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7271724Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7271849Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7272067Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7272163Z graph_break [] 2025-12-04T12:05:01.7272382Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7273113Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7273246Z warnings.warn( 2025-12-04T12:05:01.7273476Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7273586Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7273742Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7273960Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7274082Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7274193Z graph_break [] 2025-12-04T12:05:01.7274406Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7275138Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7275250Z warnings.warn( 2025-12-04T12:05:01.7275466Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7275615Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7275729Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7275949Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7276079Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7276181Z graph_break [] 2025-12-04T12:05:01.7276391Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7277136Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7277237Z warnings.warn( 2025-12-04T12:05:01.7277990Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-95a17199eb5946d1.xml - 2025-12-04T12:05:01.7278174Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7279043Z FAILED [0.2607s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7279049Z 2025-12-04T12:05:01.7279278Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7279997Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7280002Z 2025-12-04T12:05:01.7280280Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7280459Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7280658Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.7280773Z Got exit code 1 2025-12-04T12:05:01.7281415Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7281842Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7282416Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-81b9e43a20c07e3d.xml 2025-12-04T12:05:01.7282579Z ============================= test session starts ============================== 2025-12-04T12:05:01.7282941Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7283047Z cachedir: .pytest_cache 2025-12-04T12:05:01.7283568Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7283702Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7283810Z configfile: pytest.ini 2025-12-04T12:05:01.7284416Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7284676Z collecting ... collected 571 items / 69 deselected / 502 selected 2025-12-04T12:05:01.7284818Z stepcurrent: skipping 69 already run items. 2025-12-04T12:05:01.7284976Z Running 226 items in this shard 2025-12-04T12:05:01.7284981Z 2025-12-04T12:05:01.7285598Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_False_float16_cuda PASSED [3.3469s] [ 0%] 2025-12-04T12:05:01.7286305Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0914s] [ 0%] 2025-12-04T12:05:01.7286994Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2680s] [ 0%] 2025-12-04T12:05:01.7287647Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2608s] [ 0%] 2025-12-04T12:05:01.7287656Z 2025-12-04T12:05:01.7287963Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7288334Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7288469Z Traceback (most recent call last): 2025-12-04T12:05:01.7288948Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7289102Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7289635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7289757Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7289948Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7289968Z 2025-12-04T12:05:01.7290256Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7290991Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7290998Z 2025-12-04T12:05:01.7291275Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7291493Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7291604Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7291730Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7291948Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7292086Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7292182Z graph_break [] 2025-12-04T12:05:01.7292397Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7293149Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7293247Z warnings.warn( 2025-12-04T12:05:01.7293623Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7293755Z Traceback (most recent call last): 2025-12-04T12:05:01.7294235Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7294399Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7294919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7295041Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7295242Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7295247Z 2025-12-04T12:05:01.7295463Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7296251Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7296258Z 2025-12-04T12:05:01.7296525Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7296783Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7296903Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7297016Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7297232Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7297362Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7297458Z graph_break [] 2025-12-04T12:05:01.7297684Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7298482Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7298633Z warnings.warn( 2025-12-04T12:05:01.7298862Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7298972Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7299087Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7299317Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7299436Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7299540Z graph_break [] 2025-12-04T12:05:01.7299755Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7300480Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7300591Z warnings.warn( 2025-12-04T12:05:01.7300735Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7301167Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7301305Z Traceback (most recent call last): 2025-12-04T12:05:01.7301783Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7301955Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7302474Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7302595Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7302799Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7302804Z 2025-12-04T12:05:01.7303016Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7303744Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7303765Z 2025-12-04T12:05:01.7304032Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7304248Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7304368Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7304484Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7304702Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7304833Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7304929Z graph_break [] 2025-12-04T12:05:01.7305143Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7305888Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7305988Z warnings.warn( 2025-12-04T12:05:01.7306213Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7306324Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7306473Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7306706Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7306824Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7306952Z graph_break [] 2025-12-04T12:05:01.7307175Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7307905Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7308017Z warnings.warn( 2025-12-04T12:05:01.7308230Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7308338Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7308466Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7308684Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7308837Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7308948Z graph_break [] 2025-12-04T12:05:01.7309157Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7309898Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7309999Z warnings.warn( 2025-12-04T12:05:01.7310751Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-81b9e43a20c07e3d.xml - 2025-12-04T12:05:01.7310933Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7311776Z FAILED [0.2608s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7311785Z 2025-12-04T12:05:01.7312040Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7312765Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7312773Z 2025-12-04T12:05:01.7313036Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7313227Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7313438Z ============= 1 failed, 1 passed, 69 deselected, 2 rerun in 5.65s ============== 2025-12-04T12:05:01.7313549Z Got exit code 1 2025-12-04T12:05:01.7313653Z Retrying single test... 2025-12-04T12:05:01.7314230Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1de8202a5dd1888a.xml 2025-12-04T12:05:01.7314404Z ============================= test session starts ============================== 2025-12-04T12:05:01.7314761Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7314869Z cachedir: .pytest_cache 2025-12-04T12:05:01.7315405Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7315528Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7315649Z configfile: pytest.ini 2025-12-04T12:05:01.7316241Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7316467Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7317299Z stepcurrent: skipping 70 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7317412Z Running 1 items in this shard 2025-12-04T12:05:01.7317419Z 2025-12-04T12:05:01.7318160Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9093s] [100%] 2025-12-04T12:05:01.7318851Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2627s] [100%] 2025-12-04T12:05:01.7319489Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2759s] [100%] 2025-12-04T12:05:01.7319494Z 2025-12-04T12:05:01.7319646Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7320020Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7320150Z Traceback (most recent call last): 2025-12-04T12:05:01.7320626Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7320818Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7321353Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7321477Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7321679Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7321684Z 2025-12-04T12:05:01.7321898Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7322625Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7322631Z 2025-12-04T12:05:01.7322906Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7323124Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7323279Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7323396Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7323519Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7323754Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7323853Z graph_break [] 2025-12-04T12:05:01.7324064Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7324811Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7324912Z warnings.warn( 2025-12-04T12:05:01.7325284Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7325418Z Traceback (most recent call last): 2025-12-04T12:05:01.7325888Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7326057Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7326576Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7326696Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7326900Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7326906Z 2025-12-04T12:05:01.7327121Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7327859Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7327864Z 2025-12-04T12:05:01.7328132Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7328351Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7328473Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7328591Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7328713Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7328975Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7329075Z graph_break [] 2025-12-04T12:05:01.7329298Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7330082Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7330183Z warnings.warn( 2025-12-04T12:05:01.7330411Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7330524Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7330635Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7330867Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7330990Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7331133Z graph_break [] 2025-12-04T12:05:01.7331354Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7332088Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7332205Z warnings.warn( 2025-12-04T12:05:01.7332346Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7332720Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7332856Z Traceback (most recent call last): 2025-12-04T12:05:01.7333335Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7333506Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7334056Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7334182Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7334388Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7334394Z 2025-12-04T12:05:01.7334609Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7335357Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7335362Z 2025-12-04T12:05:01.7335631Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7335848Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7335973Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7336087Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7336208Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7336439Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7336543Z graph_break [] 2025-12-04T12:05:01.7336772Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7337500Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7337607Z warnings.warn( 2025-12-04T12:05:01.7337831Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7337940Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7338055Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7338362Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7338485Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7338580Z graph_break [] 2025-12-04T12:05:01.7338807Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7339582Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7339694Z warnings.warn( 2025-12-04T12:05:01.7339907Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7340046Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7340175Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7340390Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7340508Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7340616Z graph_break [] 2025-12-04T12:05:01.7340828Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7341567Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7341665Z warnings.warn( 2025-12-04T12:05:01.7342452Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1de8202a5dd1888a.xml - 2025-12-04T12:05:01.7342633Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7343477Z FAILED [0.2759s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7343485Z 2025-12-04T12:05:01.7343715Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7344443Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7344448Z 2025-12-04T12:05:01.7344711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7344933Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7345136Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.7345246Z Got exit code 1 2025-12-04T12:05:01.7345352Z Retrying single test... 2025-12-04T12:05:01.7345929Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c241afcfaa8e2579.xml 2025-12-04T12:05:01.7346102Z ============================= test session starts ============================== 2025-12-04T12:05:01.7346451Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7346558Z cachedir: .pytest_cache 2025-12-04T12:05:01.7347092Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7347213Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7347333Z configfile: pytest.ini 2025-12-04T12:05:01.7347928Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7348156Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7349103Z stepcurrent: skipping 70 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7349222Z Running 1 items in this shard 2025-12-04T12:05:01.7349228Z 2025-12-04T12:05:01.7349931Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9022s] [100%] 2025-12-04T12:05:01.7350616Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2653s] [100%] 2025-12-04T12:05:01.7351222Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2633s] [100%] 2025-12-04T12:05:01.7351241Z 2025-12-04T12:05:01.7351421Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7351793Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7351960Z Traceback (most recent call last): 2025-12-04T12:05:01.7352435Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7352591Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7353123Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7353244Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7353446Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7353452Z 2025-12-04T12:05:01.7353702Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7354437Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7354443Z 2025-12-04T12:05:01.7354723Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7354946Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7355073Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7355187Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7355309Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7355541Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7355638Z graph_break [] 2025-12-04T12:05:01.7355852Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7356628Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7356736Z warnings.warn( 2025-12-04T12:05:01.7357123Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7357245Z Traceback (most recent call last): 2025-12-04T12:05:01.7357719Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7357887Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7358407Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7358528Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7358729Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7358734Z 2025-12-04T12:05:01.7358948Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7359693Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7359699Z 2025-12-04T12:05:01.7359965Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7360182Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7360304Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7360417Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7360547Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7360765Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7360861Z graph_break [] 2025-12-04T12:05:01.7361087Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7361818Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7361917Z warnings.warn( 2025-12-04T12:05:01.7362173Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7362288Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7362399Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7362658Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7362779Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7362889Z graph_break [] 2025-12-04T12:05:01.7363100Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7363827Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7363938Z warnings.warn( 2025-12-04T12:05:01.7364081Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7364488Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7364622Z Traceback (most recent call last): 2025-12-04T12:05:01.7365096Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7365264Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7365779Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7365898Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7366098Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7366104Z 2025-12-04T12:05:01.7366320Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7367060Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7367068Z 2025-12-04T12:05:01.7367363Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7367583Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7367699Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7367816Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7367934Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7368163Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7368259Z graph_break [] 2025-12-04T12:05:01.7368482Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7369216Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7369315Z warnings.warn( 2025-12-04T12:05:01.7369542Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7369657Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7369770Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7370005Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7370125Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7370234Z graph_break [] 2025-12-04T12:05:01.7370447Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7371182Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7371290Z warnings.warn( 2025-12-04T12:05:01.7371503Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7371612Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7371737Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7371956Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7372084Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7372196Z graph_break [] 2025-12-04T12:05:01.7372465Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7373202Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7373334Z warnings.warn( 2025-12-04T12:05:01.7374090Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c241afcfaa8e2579.xml - 2025-12-04T12:05:01.7374270Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7375109Z FAILED [0.2633s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7375152Z 2025-12-04T12:05:01.7375381Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7376115Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7376123Z 2025-12-04T12:05:01.7376388Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7376578Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7376772Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.7376880Z Got exit code 1 2025-12-04T12:05:01.7377527Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7377936Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7378641Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dbee4fdc7ea3ecd0.xml 2025-12-04T12:05:01.7378806Z ============================= test session starts ============================== 2025-12-04T12:05:01.7379169Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7379280Z cachedir: .pytest_cache 2025-12-04T12:05:01.7379802Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7379937Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7380047Z configfile: pytest.ini 2025-12-04T12:05:01.7380638Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7380884Z collecting ... collected 571 items / 71 deselected / 500 selected 2025-12-04T12:05:01.7381031Z stepcurrent: skipping 71 already run items. 2025-12-04T12:05:01.7381158Z Running 224 items in this shard 2025-12-04T12:05:01.7381166Z 2025-12-04T12:05:01.7381778Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_float16_cuda PASSED [3.3563s] [ 0%] 2025-12-04T12:05:01.7382461Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0924s] [ 0%] 2025-12-04T12:05:01.7383144Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2735s] [ 0%] 2025-12-04T12:05:01.7383729Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2649s] [ 0%] 2025-12-04T12:05:01.7383735Z 2025-12-04T12:05:01.7383892Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7384256Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7384411Z Traceback (most recent call last): 2025-12-04T12:05:01.7384905Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7385093Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7385631Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7385756Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7385947Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7385952Z 2025-12-04T12:05:01.7386188Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7386908Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7386949Z 2025-12-04T12:05:01.7387237Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7387458Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7387571Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7387909Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7388136Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7388260Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7388375Z graph_break [] 2025-12-04T12:05:01.7388592Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7389346Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7389447Z warnings.warn( 2025-12-04T12:05:01.7389876Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7390015Z Traceback (most recent call last): 2025-12-04T12:05:01.7390492Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7390645Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7391182Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7391305Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7391509Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7391515Z 2025-12-04T12:05:01.7391728Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7392445Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7392465Z 2025-12-04T12:05:01.7392739Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7392958Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7393085Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7393202Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7393424Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7393556Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7393650Z graph_break [] 2025-12-04T12:05:01.7393868Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7394610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7394711Z warnings.warn( 2025-12-04T12:05:01.7394935Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7395047Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7395166Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7395449Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7395571Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7395667Z graph_break [] 2025-12-04T12:05:01.7395899Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7396672Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7396785Z warnings.warn( 2025-12-04T12:05:01.7396928Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7397285Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7397420Z Traceback (most recent call last): 2025-12-04T12:05:01.7397898Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7398100Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7398635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7398756Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7398961Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7398968Z 2025-12-04T12:05:01.7399181Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7399895Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7399900Z 2025-12-04T12:05:01.7400177Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7400473Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7400661Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7400777Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7401000Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7401134Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7401230Z graph_break [] 2025-12-04T12:05:01.7401449Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7402198Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7402297Z warnings.warn( 2025-12-04T12:05:01.7402521Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7402631Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7402744Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7402973Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7403097Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7403195Z graph_break [] 2025-12-04T12:05:01.7403420Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7404149Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7404260Z warnings.warn( 2025-12-04T12:05:01.7404475Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7404584Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7404707Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7404926Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7405045Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7405152Z graph_break [] 2025-12-04T12:05:01.7405364Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7406130Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7406243Z warnings.warn( 2025-12-04T12:05:01.7407006Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dbee4fdc7ea3ecd0.xml - 2025-12-04T12:05:01.7407221Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7408046Z FAILED [0.2649s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7408052Z 2025-12-04T12:05:01.7408268Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7408993Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7409030Z 2025-12-04T12:05:01.7409298Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7409487Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7409699Z ============= 1 failed, 1 passed, 71 deselected, 2 rerun in 5.67s ============== 2025-12-04T12:05:01.7409799Z Got exit code 1 2025-12-04T12:05:01.7409918Z Retrying single test... 2025-12-04T12:05:01.7410493Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-163cc6583cdf0c02.xml 2025-12-04T12:05:01.7410665Z ============================= test session starts ============================== 2025-12-04T12:05:01.7411018Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7411126Z cachedir: .pytest_cache 2025-12-04T12:05:01.7411708Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7411837Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7411945Z configfile: pytest.ini 2025-12-04T12:05:01.7412553Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7412782Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7413595Z stepcurrent: skipping 72 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7413708Z Running 1 items in this shard 2025-12-04T12:05:01.7413713Z 2025-12-04T12:05:01.7414390Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9074s] [100%] 2025-12-04T12:05:01.7415080Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2656s] [100%] 2025-12-04T12:05:01.7415664Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2620s] [100%] 2025-12-04T12:05:01.7415673Z 2025-12-04T12:05:01.7415823Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7416181Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7416314Z Traceback (most recent call last): 2025-12-04T12:05:01.7416790Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7416945Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7417482Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7417610Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7417834Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7417840Z 2025-12-04T12:05:01.7418067Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7418859Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7418901Z 2025-12-04T12:05:01.7419182Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7419401Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7419512Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7419638Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7419758Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7419978Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7420120Z graph_break [] 2025-12-04T12:05:01.7420341Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7421092Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7421194Z warnings.warn( 2025-12-04T12:05:01.7421551Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7421682Z Traceback (most recent call last): 2025-12-04T12:05:01.7422156Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7422322Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7422840Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7422961Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7423196Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7423202Z 2025-12-04T12:05:01.7423419Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7424133Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7424152Z 2025-12-04T12:05:01.7424417Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7424635Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7424757Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7424872Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7424991Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7425223Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7425324Z graph_break [] 2025-12-04T12:05:01.7425543Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7426289Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7426392Z warnings.warn( 2025-12-04T12:05:01.7426621Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7426731Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7426849Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7427086Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7427207Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7427304Z graph_break [] 2025-12-04T12:05:01.7427529Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7428257Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7428373Z warnings.warn( 2025-12-04T12:05:01.7428553Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7428910Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7429076Z Traceback (most recent call last): 2025-12-04T12:05:01.7429553Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7429709Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7430240Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7430363Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7430567Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7430573Z 2025-12-04T12:05:01.7430785Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7431529Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7431534Z 2025-12-04T12:05:01.7431813Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7432034Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7432157Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7432269Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7432391Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7432622Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7432720Z graph_break [] 2025-12-04T12:05:01.7432933Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7433710Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7433814Z warnings.warn( 2025-12-04T12:05:01.7434043Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7434152Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7434265Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7434499Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7434622Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7434721Z graph_break [] 2025-12-04T12:05:01.7434950Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7435679Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7435793Z warnings.warn( 2025-12-04T12:05:01.7436006Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7436124Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7436252Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7436473Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7436596Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7436709Z graph_break [] 2025-12-04T12:05:01.7436927Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7437657Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7437773Z warnings.warn( 2025-12-04T12:05:01.7438523Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-163cc6583cdf0c02.xml - 2025-12-04T12:05:01.7438708Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7439571Z FAILED [0.2620s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7439577Z 2025-12-04T12:05:01.7439806Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7440552Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7440558Z 2025-12-04T12:05:01.7440819Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7441016Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7441217Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.7441317Z Got exit code 1 2025-12-04T12:05:01.7441441Z Retrying single test... 2025-12-04T12:05:01.7442024Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a5ca9a61ebc16b36.xml 2025-12-04T12:05:01.7442302Z ============================= test session starts ============================== 2025-12-04T12:05:01.7442652Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7442764Z cachedir: .pytest_cache 2025-12-04T12:05:01.7443300Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7443423Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7443533Z configfile: pytest.ini 2025-12-04T12:05:01.7444140Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7444364Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7445216Z stepcurrent: skipping 72 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7445337Z Running 1 items in this shard 2025-12-04T12:05:01.7445342Z 2025-12-04T12:05:01.7446018Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9146s] [100%] 2025-12-04T12:05:01.7446705Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2664s] [100%] 2025-12-04T12:05:01.7447290Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2533s] [100%] 2025-12-04T12:05:01.7447296Z 2025-12-04T12:05:01.7447448Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7447799Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7447935Z Traceback (most recent call last): 2025-12-04T12:05:01.7448411Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7448568Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7449101Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7449222Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7449412Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7449417Z 2025-12-04T12:05:01.7449640Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7450352Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7450357Z 2025-12-04T12:05:01.7450635Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7450888Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7451006Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7451129Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7451251Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7451514Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7451614Z graph_break [] 2025-12-04T12:05:01.7451830Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7452577Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7452681Z warnings.warn( 2025-12-04T12:05:01.7453036Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7453226Z Traceback (most recent call last): 2025-12-04T12:05:01.7453709Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7453875Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7454395Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7454523Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7454725Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7454730Z 2025-12-04T12:05:01.7454946Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7455659Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7455677Z 2025-12-04T12:05:01.7455942Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7456194Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7456324Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7456440Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7456563Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7456796Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7456896Z graph_break [] 2025-12-04T12:05:01.7457108Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7457850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7457951Z warnings.warn( 2025-12-04T12:05:01.7458245Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7458357Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7458470Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7458708Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7458831Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7458941Z graph_break [] 2025-12-04T12:05:01.7459166Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7459898Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7460012Z warnings.warn( 2025-12-04T12:05:01.7460160Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7460520Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7460654Z Traceback (most recent call last): 2025-12-04T12:05:01.7461132Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7461289Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7461866Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7461991Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7462194Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7462230Z 2025-12-04T12:05:01.7462446Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7463161Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7463177Z 2025-12-04T12:05:01.7463442Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7463658Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7463781Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7463923Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7464045Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7464279Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7464376Z graph_break [] 2025-12-04T12:05:01.7464589Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7465335Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7465433Z warnings.warn( 2025-12-04T12:05:01.7465658Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7465767Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7465880Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7466110Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7466230Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7466328Z graph_break [] 2025-12-04T12:05:01.7466590Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7467319Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7467433Z warnings.warn( 2025-12-04T12:05:01.7467646Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7467755Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7467880Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7468099Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7468218Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7468326Z graph_break [] 2025-12-04T12:05:01.7468538Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7469269Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7469389Z warnings.warn( 2025-12-04T12:05:01.7470144Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a5ca9a61ebc16b36.xml - 2025-12-04T12:05:01.7470330Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7471157Z FAILED [0.2533s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7471164Z 2025-12-04T12:05:01.7471389Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7472101Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7472106Z 2025-12-04T12:05:01.7472372Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7472602Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7472800Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.7472932Z Got exit code 1 2025-12-04T12:05:01.7473573Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7473984Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7474567Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4c3d903e4063d32.xml 2025-12-04T12:05:01.7474730Z ============================= test session starts ============================== 2025-12-04T12:05:01.7475083Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7475247Z cachedir: .pytest_cache 2025-12-04T12:05:01.7475847Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7475983Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7476092Z configfile: pytest.ini 2025-12-04T12:05:01.7476683Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7476917Z collecting ... collected 571 items / 73 deselected / 498 selected 2025-12-04T12:05:01.7477059Z stepcurrent: skipping 73 already run items. 2025-12-04T12:05:01.7477173Z Running 222 items in this shard 2025-12-04T12:05:01.7477178Z 2025-12-04T12:05:01.7477785Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_float16_cuda PASSED [3.4744s] [ 0%] 2025-12-04T12:05:01.7478513Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0925s] [ 0%] 2025-12-04T12:05:01.7479188Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2687s] [ 0%] 2025-12-04T12:05:01.7479770Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2658s] [ 0%] 2025-12-04T12:05:01.7479776Z 2025-12-04T12:05:01.7479934Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7480286Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7480409Z Traceback (most recent call last): 2025-12-04T12:05:01.7480898Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7481054Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7481583Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7481720Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7481912Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7481919Z 2025-12-04T12:05:01.7482146Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7482855Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7482861Z 2025-12-04T12:05:01.7483126Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7483357Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7483470Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7483599Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7483823Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7483977Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7484086Z graph_break [] 2025-12-04T12:05:01.7484303Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7485070Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7485184Z warnings.warn( 2025-12-04T12:05:01.7485539Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7485673Z Traceback (most recent call last): 2025-12-04T12:05:01.7486149Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7486303Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7486868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7486993Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7487183Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7487201Z 2025-12-04T12:05:01.7487419Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7488575Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7488595Z 2025-12-04T12:05:01.7488915Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7489141Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7489252Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7489381Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7489609Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7490370Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7490480Z graph_break [] 2025-12-04T12:05:01.7490702Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7491455Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7491561Z warnings.warn( 2025-12-04T12:05:01.7491777Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7491901Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7492018Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7492256Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7492378Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7492478Z graph_break [] 2025-12-04T12:05:01.7492705Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7493444Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7493543Z warnings.warn( 2025-12-04T12:05:01.7493700Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7494056Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7494192Z Traceback (most recent call last): 2025-12-04T12:05:01.7494672Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7494830Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7495363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7495487Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7495684Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7495690Z 2025-12-04T12:05:01.7495976Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7496689Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7496740Z 2025-12-04T12:05:01.7497021Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7497238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7497353Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7497484Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7497706Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7497840Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7497940Z graph_break [] 2025-12-04T12:05:01.7498284Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7499038Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7499140Z warnings.warn( 2025-12-04T12:05:01.7499356Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7499479Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7499593Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7499812Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7499946Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7500042Z graph_break [] 2025-12-04T12:05:01.7500264Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7500995Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7501134Z warnings.warn( 2025-12-04T12:05:01.7501358Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7501465Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7501579Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7501812Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7501936Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7502390Z graph_break [] 2025-12-04T12:05:01.7502768Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7503858Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7504836Z warnings.warn( 2025-12-04T12:05:01.7505759Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4c3d903e4063d32.xml - 2025-12-04T12:05:01.7506830Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7507974Z FAILED [0.2658s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7508936Z 2025-12-04T12:05:01.7509157Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7510226Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7511114Z 2025-12-04T12:05:01.7511393Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7511973Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7512514Z ============= 1 failed, 1 passed, 73 deselected, 2 rerun in 5.80s ============== 2025-12-04T12:05:01.7512978Z Got exit code 1 2025-12-04T12:05:01.7513233Z Retrying single test... 2025-12-04T12:05:01.7514039Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-967683d4f4b3f6ea.xml 2025-12-04T12:05:01.7514925Z ============================= test session starts ============================== 2025-12-04T12:05:01.7515625Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7516226Z cachedir: .pytest_cache 2025-12-04T12:05:01.7516946Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7517734Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7518086Z configfile: pytest.ini 2025-12-04T12:05:01.7518856Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7519852Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7521013Z stepcurrent: skipping 74 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7522054Z Running 1 items in this shard 2025-12-04T12:05:01.7522277Z 2025-12-04T12:05:01.7522948Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9110s] [100%] 2025-12-04T12:05:01.7524428Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2697s] [100%] 2025-12-04T12:05:01.7525821Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2591s] [100%] 2025-12-04T12:05:01.7526541Z 2025-12-04T12:05:01.7526727Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7527365Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7527987Z Traceback (most recent call last): 2025-12-04T12:05:01.7528698Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7529460Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7530281Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7531067Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7531509Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7531827Z 2025-12-04T12:05:01.7532044Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7533108Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7533967Z 2025-12-04T12:05:01.7534237Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7534871Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7535338Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7535674Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7536011Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7536460Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7543063Z graph_break [] 2025-12-04T12:05:01.7543485Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7544610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7545608Z warnings.warn( 2025-12-04T12:05:01.7546127Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7546849Z Traceback (most recent call last): 2025-12-04T12:05:01.7547576Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7548378Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7549197Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7549988Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7550444Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7550764Z 2025-12-04T12:05:01.7550981Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7552046Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7552941Z 2025-12-04T12:05:01.7553219Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7553855Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7554321Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7554663Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7555008Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7555461Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7555926Z graph_break [] 2025-12-04T12:05:01.7556302Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7557391Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7558371Z warnings.warn( 2025-12-04T12:05:01.7558749Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7559290Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7559623Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7560061Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7560542Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7560870Z graph_break [] 2025-12-04T12:05:01.7561243Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7562338Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7563307Z warnings.warn( 2025-12-04T12:05:01.7563608Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7564261Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7564881Z Traceback (most recent call last): 2025-12-04T12:05:01.7565584Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7566365Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7567177Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7567965Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7568398Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7568728Z 2025-12-04T12:05:01.7568945Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7570012Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7570855Z 2025-12-04T12:05:01.7571131Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7571754Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7572231Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7572599Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7572926Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7573390Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7573889Z graph_break [] 2025-12-04T12:05:01.7574256Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7575357Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7576481Z warnings.warn( 2025-12-04T12:05:01.7576871Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7577334Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7577672Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7578220Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7578698Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7579044Z graph_break [] 2025-12-04T12:05:01.7579445Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7580541Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7581509Z warnings.warn( 2025-12-04T12:05:01.7581887Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7582359Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7582680Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7583115Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7583598Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7583925Z graph_break [] 2025-12-04T12:05:01.7584296Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7585443Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7586412Z warnings.warn( 2025-12-04T12:05:01.7587320Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-967683d4f4b3f6ea.xml - 2025-12-04T12:05:01.7588611Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7589757Z FAILED [0.2591s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7590707Z 2025-12-04T12:05:01.7590936Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7591998Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7592858Z 2025-12-04T12:05:01.7593125Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7593716Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7594242Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.7594676Z Got exit code 1 2025-12-04T12:05:01.7594945Z Retrying single test... 2025-12-04T12:05:01.7595717Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dd59b77eb3bfc158.xml 2025-12-04T12:05:01.7596592Z ============================= test session starts ============================== 2025-12-04T12:05:01.7597258Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7597860Z cachedir: .pytest_cache 2025-12-04T12:05:01.7598668Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7599448Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7599802Z configfile: pytest.ini 2025-12-04T12:05:01.7600583Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7601581Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7602746Z stepcurrent: skipping 74 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7603790Z Running 1 items in this shard 2025-12-04T12:05:01.7604004Z 2025-12-04T12:05:01.7604699Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9166s] [100%] 2025-12-04T12:05:01.7606232Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2625s] [100%] 2025-12-04T12:05:01.7607605Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2583s] [100%] 2025-12-04T12:05:01.7608342Z 2025-12-04T12:05:01.7608486Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7609137Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7609754Z Traceback (most recent call last): 2025-12-04T12:05:01.7610464Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7611237Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7612107Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7612890Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7613338Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7613670Z 2025-12-04T12:05:01.7613885Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7614952Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7615801Z 2025-12-04T12:05:01.7616065Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7616700Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7617178Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7617514Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7617843Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7618398Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7618868Z graph_break [] 2025-12-04T12:05:01.7619231Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7620340Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7621321Z warnings.warn( 2025-12-04T12:05:01.7621838Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7622444Z Traceback (most recent call last): 2025-12-04T12:05:01.7623154Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7623922Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7624727Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7625519Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7626011Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7626328Z 2025-12-04T12:05:01.7626556Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7627611Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7628511Z 2025-12-04T12:05:01.7628785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7629418Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7629900Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7630229Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7630568Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7631032Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7631515Z graph_break [] 2025-12-04T12:05:01.7631889Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7632991Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7633962Z warnings.warn( 2025-12-04T12:05:01.7634346Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7634814Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7635146Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7635572Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7636052Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7636393Z graph_break [] 2025-12-04T12:05:01.7636754Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7637888Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7638862Z warnings.warn( 2025-12-04T12:05:01.7639173Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7639813Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7640436Z Traceback (most recent call last): 2025-12-04T12:05:01.7641161Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7641928Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7642742Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7643523Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7643967Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7644288Z 2025-12-04T12:05:01.7644506Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7645573Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7646419Z 2025-12-04T12:05:01.7646696Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7647326Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7647789Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7648128Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7648464Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7648913Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7649374Z graph_break [] 2025-12-04T12:05:01.7649746Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7650835Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7651847Z warnings.warn( 2025-12-04T12:05:01.7652228Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7652695Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7653048Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7653483Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7653965Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7654290Z graph_break [] 2025-12-04T12:05:01.7654661Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7655754Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7656726Z warnings.warn( 2025-12-04T12:05:01.7657148Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7657617Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7657953Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7658453Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7658992Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7659337Z graph_break [] 2025-12-04T12:05:01.7659694Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7660792Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7661761Z warnings.warn( 2025-12-04T12:05:01.7662683Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dd59b77eb3bfc158.xml - 2025-12-04T12:05:01.7663735Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7664915Z FAILED [0.2583s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7665875Z 2025-12-04T12:05:01.7666098Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7667163Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7668006Z 2025-12-04T12:05:01.7668272Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7668870Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7669403Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.7669848Z Got exit code 1 2025-12-04T12:05:01.7670637Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7671828Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7672956Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-16ca540d5e1f852f.xml 2025-12-04T12:05:01.7673842Z ============================= test session starts ============================== 2025-12-04T12:05:01.7674492Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7675096Z cachedir: .pytest_cache 2025-12-04T12:05:01.7675813Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7676600Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7676938Z configfile: pytest.ini 2025-12-04T12:05:01.7677725Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7678718Z collecting ... collected 571 items / 75 deselected / 496 selected 2025-12-04T12:05:01.7679211Z stepcurrent: skipping 75 already run items. 2025-12-04T12:05:01.7679630Z Running 220 items in this shard 2025-12-04T12:05:01.7679842Z 2025-12-04T12:05:01.7680537Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9199s] [ 0%] 2025-12-04T12:05:01.7682051Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2695s] [ 0%] 2025-12-04T12:05:01.7683455Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2614s] [ 0%] 2025-12-04T12:05:01.7684222Z 2025-12-04T12:05:01.7684366Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7685024Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7685654Z Traceback (most recent call last): 2025-12-04T12:05:01.7686361Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7687134Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7688140Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7688930Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7689361Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7689690Z 2025-12-04T12:05:01.7689907Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7691051Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7691908Z 2025-12-04T12:05:01.7692180Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7692819Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7693295Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7693630Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7693953Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7694411Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7694867Z graph_break [] 2025-12-04T12:05:01.7695230Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7696329Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7697301Z warnings.warn( 2025-12-04T12:05:01.7697830Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7698518Z Traceback (most recent call last): 2025-12-04T12:05:01.7699228Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7699998Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7700794Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7701583Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7702024Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7702341Z 2025-12-04T12:05:01.7702564Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7703622Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7704491Z 2025-12-04T12:05:01.7704827Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7705457Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7705933Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7706304Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7706638Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7707107Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7707557Z graph_break [] 2025-12-04T12:05:01.7707934Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7709038Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7710010Z warnings.warn( 2025-12-04T12:05:01.7710374Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7710892Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7711226Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7711647Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7712129Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7712470Z graph_break [] 2025-12-04T12:05:01.7712827Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7713925Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7714898Z warnings.warn( 2025-12-04T12:05:01.7715207Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7715845Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7716481Z Traceback (most recent call last): 2025-12-04T12:05:01.7717229Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7717991Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7718801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7719588Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7720026Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7720340Z 2025-12-04T12:05:01.7720559Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7721625Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7722499Z 2025-12-04T12:05:01.7722769Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7723404Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7723865Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7724201Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7724540Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7724991Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7725446Z graph_break [] 2025-12-04T12:05:01.7725815Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7726913Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7727876Z warnings.warn( 2025-12-04T12:05:01.7728256Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7728724Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7729043Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7729478Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7729956Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7730321Z graph_break [] 2025-12-04T12:05:01.7730692Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7731783Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7732787Z warnings.warn( 2025-12-04T12:05:01.7733152Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7733622Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7733959Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7734381Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7734869Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7735211Z graph_break [] 2025-12-04T12:05:01.7735608Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7736701Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7737669Z warnings.warn( 2025-12-04T12:05:01.7738663Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-16ca540d5e1f852f.xml - 2025-12-04T12:05:01.7739716Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7740874Z FAILED [0.2614s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7741846Z 2025-12-04T12:05:01.7742064Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7743179Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7744030Z 2025-12-04T12:05:01.7744311Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7744888Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7745413Z ================== 1 failed, 75 deselected, 2 rerun in 5.14s =================== 2025-12-04T12:05:01.7745854Z Got exit code 1 2025-12-04T12:05:01.7746107Z Retrying single test... 2025-12-04T12:05:01.7746873Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3fca4d8c1020266e.xml 2025-12-04T12:05:01.7747752Z ============================= test session starts ============================== 2025-12-04T12:05:01.7748412Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7749000Z cachedir: .pytest_cache 2025-12-04T12:05:01.7749719Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7750498Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7750833Z configfile: pytest.ini 2025-12-04T12:05:01.7751606Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7752561Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7753734Z stepcurrent: skipping 75 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7754769Z Running 1 items in this shard 2025-12-04T12:05:01.7754984Z 2025-12-04T12:05:01.7755668Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9223s] [100%] 2025-12-04T12:05:01.7757225Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2653s] [100%] 2025-12-04T12:05:01.7758643Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2589s] [100%] 2025-12-04T12:05:01.7759433Z 2025-12-04T12:05:01.7759589Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7760230Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7760851Z Traceback (most recent call last): 2025-12-04T12:05:01.7761566Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7762327Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7763142Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7763963Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7764401Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7764720Z 2025-12-04T12:05:01.7764934Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7766009Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7766863Z 2025-12-04T12:05:01.7767138Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7767765Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7768231Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7768569Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7768900Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7769389Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7769848Z graph_break [] 2025-12-04T12:05:01.7770221Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7771308Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7772280Z warnings.warn( 2025-12-04T12:05:01.7772802Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7773439Z Traceback (most recent call last): 2025-12-04T12:05:01.7774139Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7774911Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7775719Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7776505Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7776945Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7777280Z 2025-12-04T12:05:01.7777493Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7778650Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7779504Z 2025-12-04T12:05:01.7779772Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7780400Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7780878Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7781209Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7781539Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7781992Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7782456Z graph_break [] 2025-12-04T12:05:01.7782858Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7783956Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7784981Z warnings.warn( 2025-12-04T12:05:01.7785363Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7785817Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7786152Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7786581Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7787049Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7787389Z graph_break [] 2025-12-04T12:05:01.7787962Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7789054Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7790099Z warnings.warn( 2025-12-04T12:05:01.7790403Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7791060Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7791680Z Traceback (most recent call last): 2025-12-04T12:05:01.7792395Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7793166Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7793975Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7794743Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7795185Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7795505Z 2025-12-04T12:05:01.7795784Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7796850Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7797712Z 2025-12-04T12:05:01.7797979Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7798621Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7799101Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7799429Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7799772Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7800239Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7800688Z graph_break [] 2025-12-04T12:05:01.7801068Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7802179Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7803160Z warnings.warn( 2025-12-04T12:05:01.7803529Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7804004Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7804347Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7804773Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7805262Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7805604Z graph_break [] 2025-12-04T12:05:01.7805964Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7807065Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7808041Z warnings.warn( 2025-12-04T12:05:01.7808429Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7808937Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7809270Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7809706Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7810217Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7810552Z graph_break [] 2025-12-04T12:05:01.7810921Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7812011Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7812969Z warnings.warn( 2025-12-04T12:05:01.7813890Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3fca4d8c1020266e.xml - 2025-12-04T12:05:01.7814959Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7816151Z FAILED [0.2589s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7817115Z 2025-12-04T12:05:01.7817338Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7818487Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7819348Z 2025-12-04T12:05:01.7819616Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7820205Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7820720Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.7821170Z Got exit code 1 2025-12-04T12:05:01.7821440Z Retrying single test... 2025-12-04T12:05:01.7822233Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d72e902ee6d4abee.xml 2025-12-04T12:05:01.7823125Z ============================= test session starts ============================== 2025-12-04T12:05:01.7823789Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7824390Z cachedir: .pytest_cache 2025-12-04T12:05:01.7825090Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7825881Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7826232Z configfile: pytest.ini 2025-12-04T12:05:01.7826997Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7827955Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7829128Z stepcurrent: skipping 75 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7830181Z Running 1 items in this shard 2025-12-04T12:05:01.7830391Z 2025-12-04T12:05:01.7831074Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9453s] [100%] 2025-12-04T12:05:01.7832577Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2665s] [100%] 2025-12-04T12:05:01.7833991Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2686s] [100%] 2025-12-04T12:05:01.7834719Z 2025-12-04T12:05:01.7834873Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7835531Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7836185Z Traceback (most recent call last): 2025-12-04T12:05:01.7836903Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7837709Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7838509Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7839290Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7839736Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7840054Z 2025-12-04T12:05:01.7840281Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7841337Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7842237Z 2025-12-04T12:05:01.7842508Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7843137Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7843611Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7843939Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7844276Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7844746Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7845198Z graph_break [] 2025-12-04T12:05:01.7845569Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7846674Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7847654Z warnings.warn( 2025-12-04T12:05:01.7848213Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7848849Z Traceback (most recent call last): 2025-12-04T12:05:01.7849566Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7850326Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7851143Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7851928Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7852375Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7852693Z 2025-12-04T12:05:01.7852912Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7853981Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7854836Z 2025-12-04T12:05:01.7855115Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7855746Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7856209Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7856334Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7856459Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7856680Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7856788Z graph_break [] 2025-12-04T12:05:01.7857003Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7857741Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7857853Z warnings.warn( 2025-12-04T12:05:01.7858066Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7858291Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7858413Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7858710Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7858845Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7858941Z graph_break [] 2025-12-04T12:05:01.7859192Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7859936Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7860034Z warnings.warn( 2025-12-04T12:05:01.7860190Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7860550Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7860672Z Traceback (most recent call last): 2025-12-04T12:05:01.7861160Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7861352Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7861873Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7862011Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7862203Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7862208Z 2025-12-04T12:05:01.7862434Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7863156Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7863162Z 2025-12-04T12:05:01.7863429Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7863660Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7863821Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7863950Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7864072Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7864292Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7864404Z graph_break [] 2025-12-04T12:05:01.7864620Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7865355Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7865471Z warnings.warn( 2025-12-04T12:05:01.7865687Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7865813Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7865927Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7866148Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7866289Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7866385Z graph_break [] 2025-12-04T12:05:01.7866600Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7867342Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7867446Z warnings.warn( 2025-12-04T12:05:01.7867658Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7867784Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7867898Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7868131Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7868252Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7868348Z graph_break [] 2025-12-04T12:05:01.7868578Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7869336Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7869439Z warnings.warn( 2025-12-04T12:05:01.7870211Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d72e902ee6d4abee.xml - 2025-12-04T12:05:01.7870416Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7871265Z FAILED [0.2686s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7871272Z 2025-12-04T12:05:01.7871490Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7872225Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7872270Z 2025-12-04T12:05:01.7872535Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7872714Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7872927Z ================== 1 failed, 294 deselected, 2 rerun in 5.17s ================== 2025-12-04T12:05:01.7873023Z Got exit code 1 2025-12-04T12:05:01.7873662Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7874084Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7874661Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ec9f5a8da88c6473.xml 2025-12-04T12:05:01.7874834Z ============================= test session starts ============================== 2025-12-04T12:05:01.7875222Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7875333Z cachedir: .pytest_cache 2025-12-04T12:05:01.7875868Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7875993Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7876114Z configfile: pytest.ini 2025-12-04T12:05:01.7876707Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7876929Z collecting ... collected 571 items / 76 deselected / 495 selected 2025-12-04T12:05:01.7877084Z stepcurrent: skipping 76 already run items. 2025-12-04T12:05:01.7877196Z Running 219 items in this shard 2025-12-04T12:05:01.7877201Z 2025-12-04T12:05:01.7877798Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_float16_cuda PASSED [3.4395s] [ 0%] 2025-12-04T12:05:01.7878492Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0939s] [ 0%] 2025-12-04T12:05:01.7879166Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2663s] [ 0%] 2025-12-04T12:05:01.7879768Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2615s] [ 0%] 2025-12-04T12:05:01.7879774Z 2025-12-04T12:05:01.7879913Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7880284Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7880406Z Traceback (most recent call last): 2025-12-04T12:05:01.7880886Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7881085Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7881607Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7881757Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7881962Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7881967Z 2025-12-04T12:05:01.7882184Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7882908Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7882913Z 2025-12-04T12:05:01.7883183Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7883404Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7883562Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7883677Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7883913Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7884034Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7884132Z graph_break [] 2025-12-04T12:05:01.7884363Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7885097Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7885195Z warnings.warn( 2025-12-04T12:05:01.7885567Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7885686Z Traceback (most recent call last): 2025-12-04T12:05:01.7886174Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7886365Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7886890Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7887022Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7887214Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7887219Z 2025-12-04T12:05:01.7887432Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7888358Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7888365Z 2025-12-04T12:05:01.7888635Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7888866Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7888975Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7889097Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7889335Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7889456Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7889570Z graph_break [] 2025-12-04T12:05:01.7889787Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7890525Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7890643Z warnings.warn( 2025-12-04T12:05:01.7890856Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7890966Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7891092Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7891312Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7891433Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7891544Z graph_break [] 2025-12-04T12:05:01.7891826Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7892575Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7892722Z warnings.warn( 2025-12-04T12:05:01.7892868Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7893241Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7893363Z Traceback (most recent call last): 2025-12-04T12:05:01.7893841Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7894012Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7894530Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7894711Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7894905Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7894910Z 2025-12-04T12:05:01.7895124Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7895849Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7895855Z 2025-12-04T12:05:01.7896120Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7896347Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7896457Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7896569Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7896800Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7896920Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7897082Z graph_break [] 2025-12-04T12:05:01.7897315Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7898047Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7898226Z warnings.warn( 2025-12-04T12:05:01.7898441Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7898551Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7898678Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7898899Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7899019Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7899132Z graph_break [] 2025-12-04T12:05:01.7899344Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7900092Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7900192Z warnings.warn( 2025-12-04T12:05:01.7900405Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7900527Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7900640Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7900857Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7900990Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7901085Z graph_break [] 2025-12-04T12:05:01.7901296Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7902032Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7902131Z warnings.warn( 2025-12-04T12:05:01.7902935Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ec9f5a8da88c6473.xml - 2025-12-04T12:05:01.7903106Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7903928Z FAILED [0.2615s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7903975Z 2025-12-04T12:05:01.7904192Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7904909Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7904914Z 2025-12-04T12:05:01.7905191Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7905366Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7905625Z ============= 1 failed, 1 passed, 76 deselected, 2 rerun in 5.74s ============== 2025-12-04T12:05:01.7905727Z Got exit code 1 2025-12-04T12:05:01.7905836Z Retrying single test... 2025-12-04T12:05:01.7906427Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cd768c9b8fb615f9.xml 2025-12-04T12:05:01.7906591Z ============================= test session starts ============================== 2025-12-04T12:05:01.7906940Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7907060Z cachedir: .pytest_cache 2025-12-04T12:05:01.7907584Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7907718Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7907826Z configfile: pytest.ini 2025-12-04T12:05:01.7908449Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7908696Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7909497Z stepcurrent: skipping 77 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7909613Z Running 1 items in this shard 2025-12-04T12:05:01.7909631Z 2025-12-04T12:05:01.7910311Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9101s] [100%] 2025-12-04T12:05:01.7910988Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2639s] [100%] 2025-12-04T12:05:01.7911592Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2607s] [100%] 2025-12-04T12:05:01.7911599Z 2025-12-04T12:05:01.7911744Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7912113Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7912237Z Traceback (most recent call last): 2025-12-04T12:05:01.7912713Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7912881Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7913406Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7913526Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7913730Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7913736Z 2025-12-04T12:05:01.7913952Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7914717Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7914723Z 2025-12-04T12:05:01.7914989Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7915238Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7915363Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7915477Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7915611Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7915830Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7915935Z graph_break [] 2025-12-04T12:05:01.7916163Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7916897Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7917042Z warnings.warn( 2025-12-04T12:05:01.7917417Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7917541Z Traceback (most recent call last): 2025-12-04T12:05:01.7918033Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7918190Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7918710Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7918848Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7919041Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7919046Z 2025-12-04T12:05:01.7919261Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7920023Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7920030Z 2025-12-04T12:05:01.7920295Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7920529Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7920644Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7920761Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7920897Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7921118Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7921232Z graph_break [] 2025-12-04T12:05:01.7921450Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7922187Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7922308Z warnings.warn( 2025-12-04T12:05:01.7922526Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7922638Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7922767Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7922989Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7923117Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7923230Z graph_break [] 2025-12-04T12:05:01.7923442Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7924183Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7924286Z warnings.warn( 2025-12-04T12:05:01.7924432Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7924806Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7924931Z Traceback (most recent call last): 2025-12-04T12:05:01.7925451Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7925608Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7926154Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7926295Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7926486Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7926491Z 2025-12-04T12:05:01.7926707Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7927436Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7927471Z 2025-12-04T12:05:01.7927740Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7927969Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7928080Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7928195Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7928329Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7928550Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7928647Z graph_break [] 2025-12-04T12:05:01.7928875Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7929607Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7929720Z warnings.warn( 2025-12-04T12:05:01.7929932Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7930045Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7930200Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7930423Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7930542Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7930653Z graph_break [] 2025-12-04T12:05:01.7930867Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7931614Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7931717Z warnings.warn( 2025-12-04T12:05:01.7931928Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7932049Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7932167Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7932386Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7932519Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7932620Z graph_break [] 2025-12-04T12:05:01.7932851Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7933582Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7933685Z warnings.warn( 2025-12-04T12:05:01.7934454Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cd768c9b8fb615f9.xml - 2025-12-04T12:05:01.7934627Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7935453Z FAILED [0.2607s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7935470Z 2025-12-04T12:05:01.7935690Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7936438Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7936444Z 2025-12-04T12:05:01.7936723Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7936931Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7937140Z ================== 1 failed, 294 deselected, 2 rerun in 5.11s ================== 2025-12-04T12:05:01.7937242Z Got exit code 1 2025-12-04T12:05:01.7937348Z Retrying single test... 2025-12-04T12:05:01.7937941Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eed072ccebef54d5.xml 2025-12-04T12:05:01.7938101Z ============================= test session starts ============================== 2025-12-04T12:05:01.7938580Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7938703Z cachedir: .pytest_cache 2025-12-04T12:05:01.7939228Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7939365Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7939475Z configfile: pytest.ini 2025-12-04T12:05:01.7940072Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7940313Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.7941113Z stepcurrent: skipping 77 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7941229Z Running 1 items in this shard 2025-12-04T12:05:01.7941249Z 2025-12-04T12:05:01.7941964Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9096s] [100%] 2025-12-04T12:05:01.7942641Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2637s] [100%] 2025-12-04T12:05:01.7943247Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2609s] [100%] 2025-12-04T12:05:01.7943253Z 2025-12-04T12:05:01.7943395Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7943764Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7943887Z Traceback (most recent call last): 2025-12-04T12:05:01.7944367Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7944540Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7945067Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7945189Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7945392Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7945399Z 2025-12-04T12:05:01.7945614Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7946343Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7946348Z 2025-12-04T12:05:01.7946614Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7946832Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7946957Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7947077Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7947211Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7947471Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7947572Z graph_break [] 2025-12-04T12:05:01.7947801Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7948566Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7948666Z warnings.warn( 2025-12-04T12:05:01.7949035Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7949156Z Traceback (most recent call last): 2025-12-04T12:05:01.7949643Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7949799Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7950351Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7950486Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7950676Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7950682Z 2025-12-04T12:05:01.7950898Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7951622Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7951627Z 2025-12-04T12:05:01.7951893Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7952126Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7952239Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7952354Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7952490Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7952739Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7952854Z graph_break [] 2025-12-04T12:05:01.7953068Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7953804Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7953922Z warnings.warn( 2025-12-04T12:05:01.7954135Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7954246Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7954371Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7954589Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7954721Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7954818Z graph_break [] 2025-12-04T12:05:01.7955031Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7955775Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7955877Z warnings.warn( 2025-12-04T12:05:01.7956020Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7956390Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.7956509Z Traceback (most recent call last): 2025-12-04T12:05:01.7956998Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7957151Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7957672Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7957805Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7958001Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7958006Z 2025-12-04T12:05:01.7958252Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7958978Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7959014Z 2025-12-04T12:05:01.7959279Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7959511Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7959623Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7959737Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7959872Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7960092Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7960190Z graph_break [] 2025-12-04T12:05:01.7960448Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7961187Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7961302Z warnings.warn( 2025-12-04T12:05:01.7961517Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7961626Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7961752Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7961969Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7962089Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7962196Z graph_break [] 2025-12-04T12:05:01.7962411Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7963151Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7963284Z warnings.warn( 2025-12-04T12:05:01.7963504Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7963624Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7963737Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7963957Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7964088Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7964187Z graph_break [] 2025-12-04T12:05:01.7964414Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7965138Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7965238Z warnings.warn( 2025-12-04T12:05:01.7966013Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eed072ccebef54d5.xml - 2025-12-04T12:05:01.7966186Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7967008Z FAILED [0.2609s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7967028Z 2025-12-04T12:05:01.7967243Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7967958Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7967963Z 2025-12-04T12:05:01.7968239Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7968418Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.7968628Z ================== 1 failed, 294 deselected, 2 rerun in 5.12s ================== 2025-12-04T12:05:01.7968734Z Got exit code 1 2025-12-04T12:05:01.7969398Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.7969823Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.7970429Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e17832bbca358cbf.xml 2025-12-04T12:05:01.7970591Z ============================= test session starts ============================== 2025-12-04T12:05:01.7970957Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.7971067Z cachedir: .pytest_cache 2025-12-04T12:05:01.7971601Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.7971758Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.7971865Z configfile: pytest.ini 2025-12-04T12:05:01.7972471Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.7972698Z collecting ... collected 571 items / 78 deselected / 493 selected 2025-12-04T12:05:01.7972859Z stepcurrent: skipping 78 already run items. 2025-12-04T12:05:01.7972976Z Running 217 items in this shard 2025-12-04T12:05:01.7972981Z 2025-12-04T12:05:01.7973579Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sqrt_load_upcast_to_fp32_False_float16_cuda PASSED [3.4320s] [ 0%] 2025-12-04T12:05:01.7974269Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0935s] [ 0%] 2025-12-04T12:05:01.7974976Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2653s] [ 0%] 2025-12-04T12:05:01.7975579Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2555s] [ 0%] 2025-12-04T12:05:01.7975585Z 2025-12-04T12:05:01.7975729Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.7976086Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7976223Z Traceback (most recent call last): 2025-12-04T12:05:01.7976705Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7976875Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7977399Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7977526Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7977734Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7977740Z 2025-12-04T12:05:01.7977955Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7978754Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7978777Z 2025-12-04T12:05:01.7979047Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7979267Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7979397Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7979512Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7979735Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7979871Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7979968Z graph_break [] 2025-12-04T12:05:01.7980189Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7980977Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7981082Z warnings.warn( 2025-12-04T12:05:01.7981494Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7981617Z Traceback (most recent call last): 2025-12-04T12:05:01.7982094Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7982262Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7982780Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7982916Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7983105Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7983143Z 2025-12-04T12:05:01.7983359Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7984086Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7984093Z 2025-12-04T12:05:01.7984359Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7984587Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7984700Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7984813Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7985048Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7985169Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7985265Z graph_break [] 2025-12-04T12:05:01.7985491Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7986257Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7986361Z warnings.warn( 2025-12-04T12:05:01.7986586Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7986699Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7986823Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7987043Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7987163Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7987270Z graph_break [] 2025-12-04T12:05:01.7987482Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7988431Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7988555Z warnings.warn( 2025-12-04T12:05:01.7988704Z =================================== FAILURES =================================== 2025-12-04T12:05:01.7989074Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.7989196Z Traceback (most recent call last): 2025-12-04T12:05:01.7989677Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.7989846Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.7990362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.7990489Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.7990692Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7990698Z 2025-12-04T12:05:01.7990915Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.7991711Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.7991718Z 2025-12-04T12:05:01.7991983Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.7992199Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7992368Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7992483Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7992716Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7992837Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7992934Z graph_break [] 2025-12-04T12:05:01.7993161Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7993892Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7994036Z warnings.warn( 2025-12-04T12:05:01.7994265Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7994376Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7994502Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7994721Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7994845Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7994957Z graph_break [] 2025-12-04T12:05:01.7995170Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7995904Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7996018Z warnings.warn( 2025-12-04T12:05:01.7996232Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.7996357Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.7996517Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.7996740Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.7996875Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.7996972Z graph_break [] 2025-12-04T12:05:01.7997186Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.7997928Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.7998028Z warnings.warn( 2025-12-04T12:05:01.7998789Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e17832bbca358cbf.xml - 2025-12-04T12:05:01.7998973Z =========================== short test summary info ============================ 2025-12-04T12:05:01.7999807Z FAILED [0.2555s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.7999817Z 2025-12-04T12:05:01.8000046Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8000758Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8000766Z 2025-12-04T12:05:01.8001044Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8001224Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8001436Z ============= 1 failed, 1 passed, 78 deselected, 2 rerun in 5.72s ============== 2025-12-04T12:05:01.8001548Z Got exit code 1 2025-12-04T12:05:01.8001655Z Retrying single test... 2025-12-04T12:05:01.8002228Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2596f02ad0864233.xml 2025-12-04T12:05:01.8002441Z ============================= test session starts ============================== 2025-12-04T12:05:01.8002795Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8002920Z cachedir: .pytest_cache 2025-12-04T12:05:01.8003471Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8003599Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8003720Z configfile: pytest.ini 2025-12-04T12:05:01.8004315Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8004556Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8005363Z stepcurrent: skipping 79 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8005508Z Running 1 items in this shard 2025-12-04T12:05:01.8005516Z 2025-12-04T12:05:01.8006205Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9171s] [100%] 2025-12-04T12:05:01.8006882Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2687s] [100%] 2025-12-04T12:05:01.8007480Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2615s] [100%] 2025-12-04T12:05:01.8007486Z 2025-12-04T12:05:01.8007632Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8007987Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8008124Z Traceback (most recent call last): 2025-12-04T12:05:01.8008716Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8008890Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8009412Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8009539Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8009743Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8009749Z 2025-12-04T12:05:01.8009964Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8010682Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8010700Z 2025-12-04T12:05:01.8010964Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8011188Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8011316Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8011429Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8011550Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8011782Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8011885Z graph_break [] 2025-12-04T12:05:01.8012101Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8012850Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8012953Z warnings.warn( 2025-12-04T12:05:01.8013324Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8013446Z Traceback (most recent call last): 2025-12-04T12:05:01.8013925Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8014125Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8014646Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8014812Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8015005Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8015011Z 2025-12-04T12:05:01.8015227Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8015957Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8015962Z 2025-12-04T12:05:01.8016231Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8016465Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8016610Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8016726Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8016859Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8017082Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8017183Z graph_break [] 2025-12-04T12:05:01.8017413Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8018221Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8018343Z warnings.warn( 2025-12-04T12:05:01.8018560Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8018670Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8018797Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8019016Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8019174Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8019285Z graph_break [] 2025-12-04T12:05:01.8019501Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8020233Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8020348Z warnings.warn( 2025-12-04T12:05:01.8020490Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8020861Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8020982Z Traceback (most recent call last): 2025-12-04T12:05:01.8021460Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8021626Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8022162Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8022288Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8022492Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8022498Z 2025-12-04T12:05:01.8022717Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8023450Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8023455Z 2025-12-04T12:05:01.8023720Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8023938Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8024067Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8024183Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8024324Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8024548Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8024697Z graph_break [] 2025-12-04T12:05:01.8024927Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8025660Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8025791Z warnings.warn( 2025-12-04T12:05:01.8026022Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8026132Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8026258Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8026479Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8026601Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8026713Z graph_break [] 2025-12-04T12:05:01.8026926Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8027691Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8027807Z warnings.warn( 2025-12-04T12:05:01.8028021Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8028149Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8028263Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8028484Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8028621Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8028720Z graph_break [] 2025-12-04T12:05:01.8028935Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8029680Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8029811Z warnings.warn( 2025-12-04T12:05:01.8030579Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2596f02ad0864233.xml - 2025-12-04T12:05:01.8030751Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8031574Z FAILED [0.2615s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8031580Z 2025-12-04T12:05:01.8031811Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8032529Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8032534Z 2025-12-04T12:05:01.8032814Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8032998Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8033198Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.8033311Z Got exit code 1 2025-12-04T12:05:01.8033419Z Retrying single test... 2025-12-04T12:05:01.8033995Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-35234e0a942debc1.xml 2025-12-04T12:05:01.8034171Z ============================= test session starts ============================== 2025-12-04T12:05:01.8034523Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8034647Z cachedir: .pytest_cache 2025-12-04T12:05:01.8035173Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8035296Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8035416Z configfile: pytest.ini 2025-12-04T12:05:01.8036046Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8036290Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8037122Z stepcurrent: skipping 79 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8037236Z Running 1 items in this shard 2025-12-04T12:05:01.8037242Z 2025-12-04T12:05:01.8037930Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9276s] [100%] 2025-12-04T12:05:01.8038599Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2668s] [100%] 2025-12-04T12:05:01.8039232Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2567s] [100%] 2025-12-04T12:05:01.8039238Z 2025-12-04T12:05:01.8039380Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8039738Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8039873Z Traceback (most recent call last): 2025-12-04T12:05:01.8040346Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8040520Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8041044Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8041168Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8041370Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8041378Z 2025-12-04T12:05:01.8041642Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8042365Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8042383Z 2025-12-04T12:05:01.8042648Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8042864Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8042988Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8043101Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8043221Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8043454Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8043549Z graph_break [] 2025-12-04T12:05:01.8043769Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8044523Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8044624Z warnings.warn( 2025-12-04T12:05:01.8044986Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8045110Z Traceback (most recent call last): 2025-12-04T12:05:01.8045585Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8045750Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8046267Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8046403Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8046595Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8046601Z 2025-12-04T12:05:01.8046816Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8047575Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8047580Z 2025-12-04T12:05:01.8047843Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8048101Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8048211Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8048325Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8048458Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8048677Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8048773Z graph_break [] 2025-12-04T12:05:01.8049000Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8049737Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8049885Z warnings.warn( 2025-12-04T12:05:01.8050102Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8050212Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8050340Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8050558Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8050676Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8050784Z graph_break [] 2025-12-04T12:05:01.8050997Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8051724Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8051837Z warnings.warn( 2025-12-04T12:05:01.8051980Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8052388Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8052512Z Traceback (most recent call last): 2025-12-04T12:05:01.8052988Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8053157Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8053675Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8053809Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8054001Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8054007Z 2025-12-04T12:05:01.8054221Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8054948Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8054956Z 2025-12-04T12:05:01.8055225Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8055453Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8055563Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8055678Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8055810Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8056028Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8056128Z graph_break [] 2025-12-04T12:05:01.8056355Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8057087Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8057186Z warnings.warn( 2025-12-04T12:05:01.8057416Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8057526Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8057682Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8057902Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8058023Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8058232Z graph_break [] 2025-12-04T12:05:01.8058451Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8059181Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8059295Z warnings.warn( 2025-12-04T12:05:01.8059509Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8059635Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8059749Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8060009Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8060143Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8060241Z graph_break [] 2025-12-04T12:05:01.8060454Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8061194Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8061296Z warnings.warn( 2025-12-04T12:05:01.8062063Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-35234e0a942debc1.xml - 2025-12-04T12:05:01.8062233Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8063056Z FAILED [0.2567s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8063111Z 2025-12-04T12:05:01.8063342Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8064059Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8064066Z 2025-12-04T12:05:01.8064342Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8064520Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8064720Z ================== 1 failed, 294 deselected, 2 rerun in 5.14s ================== 2025-12-04T12:05:01.8064830Z Got exit code 1 2025-12-04T12:05:01.8065464Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8065888Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.8066462Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-11b937d6d736747a.xml 2025-12-04T12:05:01.8066624Z ============================= test session starts ============================== 2025-12-04T12:05:01.8066990Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8067098Z cachedir: .pytest_cache 2025-12-04T12:05:01.8067622Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8067756Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8067863Z configfile: pytest.ini 2025-12-04T12:05:01.8068471Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8068697Z collecting ... collected 571 items / 80 deselected / 491 selected 2025-12-04T12:05:01.8068843Z stepcurrent: skipping 80 already run items. 2025-12-04T12:05:01.8069000Z Running 215 items in this shard 2025-12-04T12:05:01.8069006Z 2025-12-04T12:05:01.8069592Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_True_float16_cuda PASSED [3.4719s] [ 0%] 2025-12-04T12:05:01.8070319Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0932s] [ 0%] 2025-12-04T12:05:01.8070997Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2636s] [ 0%] 2025-12-04T12:05:01.8071587Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2583s] [ 0%] 2025-12-04T12:05:01.8071593Z 2025-12-04T12:05:01.8071748Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8072150Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8072285Z Traceback (most recent call last): 2025-12-04T12:05:01.8072763Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8072923Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8079973Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8080149Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8080353Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8080361Z 2025-12-04T12:05:01.8080593Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8081417Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8081431Z 2025-12-04T12:05:01.8081720Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8081948Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8082066Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8082194Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8082420Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8082544Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8082654Z graph_break [] 2025-12-04T12:05:01.8082874Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8083632Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8083733Z warnings.warn( 2025-12-04T12:05:01.8084101Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8084240Z Traceback (most recent call last): 2025-12-04T12:05:01.8084719Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8084876Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8085414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8085539Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8085743Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8085749Z 2025-12-04T12:05:01.8085970Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8086687Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8086707Z 2025-12-04T12:05:01.8086977Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8087236Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8087361Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8087478Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8088001Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8088144Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8088240Z graph_break [] 2025-12-04T12:05:01.8088460Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8089210Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8089311Z warnings.warn( 2025-12-04T12:05:01.8089543Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8089732Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8089846Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8090086Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8090208Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8090307Z graph_break [] 2025-12-04T12:05:01.8090536Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8091264Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8091381Z warnings.warn( 2025-12-04T12:05:01.8091523Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8091885Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8092016Z Traceback (most recent call last): 2025-12-04T12:05:01.8092543Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8092704Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8093242Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8093367Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8093572Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8093578Z 2025-12-04T12:05:01.8093792Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8094508Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8094514Z 2025-12-04T12:05:01.8094791Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8095008Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8095138Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8095253Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8095472Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8095604Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8095704Z graph_break [] 2025-12-04T12:05:01.8095916Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8096655Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8096755Z warnings.warn( 2025-12-04T12:05:01.8096985Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8097096Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8097209Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8097438Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8097563Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8097659Z graph_break [] 2025-12-04T12:05:01.8097926Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8098722Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8098891Z warnings.warn( 2025-12-04T12:05:01.8099104Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8099213Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8099336Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8099557Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8099675Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8099782Z graph_break [] 2025-12-04T12:05:01.8099993Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8100759Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8100873Z warnings.warn( 2025-12-04T12:05:01.8101628Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-11b937d6d736747a.xml - 2025-12-04T12:05:01.8101811Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8102645Z FAILED [0.2583s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8102652Z 2025-12-04T12:05:01.8102868Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8103624Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8103633Z 2025-12-04T12:05:01.8103902Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8104092Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8104308Z ============= 1 failed, 1 passed, 80 deselected, 2 rerun in 5.77s ============== 2025-12-04T12:05:01.8104408Z Got exit code 1 2025-12-04T12:05:01.8104526Z Retrying single test... 2025-12-04T12:05:01.8105100Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cef76ce1b71d0a4.xml 2025-12-04T12:05:01.8105275Z ============================= test session starts ============================== 2025-12-04T12:05:01.8105624Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8105734Z cachedir: .pytest_cache 2025-12-04T12:05:01.8106271Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8106396Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8106504Z configfile: pytest.ini 2025-12-04T12:05:01.8107109Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8107339Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8108157Z stepcurrent: skipping 81 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8108271Z Running 1 items in this shard 2025-12-04T12:05:01.8108276Z 2025-12-04T12:05:01.8108958Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9209s] [100%] 2025-12-04T12:05:01.8109686Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2731s] [100%] 2025-12-04T12:05:01.8110282Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2659s] [100%] 2025-12-04T12:05:01.8110319Z 2025-12-04T12:05:01.8110475Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8110834Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8110966Z Traceback (most recent call last): 2025-12-04T12:05:01.8111442Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8111598Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8112136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8112309Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8112501Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8112507Z 2025-12-04T12:05:01.8112735Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8113457Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8113463Z 2025-12-04T12:05:01.8113742Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8113962Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8114074Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8114205Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8114325Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8114546Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8114689Z graph_break [] 2025-12-04T12:05:01.8114910Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8115659Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8115762Z warnings.warn( 2025-12-04T12:05:01.8116123Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8116258Z Traceback (most recent call last): 2025-12-04T12:05:01.8116732Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8116889Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8117421Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8117549Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8117750Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8117757Z 2025-12-04T12:05:01.8117971Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8118688Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8118705Z 2025-12-04T12:05:01.8118968Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8119185Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8119306Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8119419Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8119539Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8119773Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8119873Z graph_break [] 2025-12-04T12:05:01.8120088Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8120864Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8120996Z warnings.warn( 2025-12-04T12:05:01.8121223Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8121333Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8121447Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8121675Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8121794Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8121889Z graph_break [] 2025-12-04T12:05:01.8122114Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8122845Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8122991Z warnings.warn( 2025-12-04T12:05:01.8123134Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8123497Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8123632Z Traceback (most recent call last): 2025-12-04T12:05:01.8124109Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8124266Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8124796Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8124918Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8125119Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8125124Z 2025-12-04T12:05:01.8125370Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8126095Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8126100Z 2025-12-04T12:05:01.8126376Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8126595Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8126721Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8126833Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8126955Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8127186Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8127282Z graph_break [] 2025-12-04T12:05:01.8127494Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8128243Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8128346Z warnings.warn( 2025-12-04T12:05:01.8128570Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8128679Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8128794Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8129026Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8129145Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8129242Z graph_break [] 2025-12-04T12:05:01.8129467Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8130197Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8130310Z warnings.warn( 2025-12-04T12:05:01.8130518Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8130633Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8130791Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8131011Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8131131Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8131268Z graph_break [] 2025-12-04T12:05:01.8131481Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8132208Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8132320Z warnings.warn( 2025-12-04T12:05:01.8133081Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cef76ce1b71d0a4.xml - 2025-12-04T12:05:01.8133262Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8134129Z FAILED [0.2659s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8134135Z 2025-12-04T12:05:01.8134350Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8135086Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8135092Z 2025-12-04T12:05:01.8135357Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8135548Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8135745Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.8135843Z Got exit code 1 2025-12-04T12:05:01.8135962Z Retrying single test... 2025-12-04T12:05:01.8136577Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f5efe4ba459d5b44.xml 2025-12-04T12:05:01.8136753Z ============================= test session starts ============================== 2025-12-04T12:05:01.8137106Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8137215Z cachedir: .pytest_cache 2025-12-04T12:05:01.8137752Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8137876Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8137984Z configfile: pytest.ini 2025-12-04T12:05:01.8138670Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8138902Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8139728Z stepcurrent: skipping 81 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8139842Z Running 1 items in this shard 2025-12-04T12:05:01.8139847Z 2025-12-04T12:05:01.8140529Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9203s] [100%] 2025-12-04T12:05:01.8141223Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2705s] [100%] 2025-12-04T12:05:01.8141815Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda FAILED [0.2607s] [100%] 2025-12-04T12:05:01.8141821Z 2025-12-04T12:05:01.8141973Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8142342Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8142519Z Traceback (most recent call last): 2025-12-04T12:05:01.8142999Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8143155Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8143723Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8143843Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8144036Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8144041Z 2025-12-04T12:05:01.8144268Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8144985Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8145024Z 2025-12-04T12:05:01.8145306Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8145525Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8145640Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8145766Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8145887Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8146108Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8146218Z graph_break [] 2025-12-04T12:05:01.8146432Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8147178Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8147279Z warnings.warn( 2025-12-04T12:05:01.8147641Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8147805Z Traceback (most recent call last): 2025-12-04T12:05:01.8148286Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8148449Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8148968Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8149090Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8149291Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8149296Z 2025-12-04T12:05:01.8149509Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8150226Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8150242Z 2025-12-04T12:05:01.8150506Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8150728Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8150849Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8150960Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8151084Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8151318Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8151416Z graph_break [] 2025-12-04T12:05:01.8151629Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8152376Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8152478Z warnings.warn( 2025-12-04T12:05:01.8152699Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8152809Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8152925Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8153188Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8153311Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8153407Z graph_break [] 2025-12-04T12:05:01.8153632Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8154413Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8154526Z warnings.warn( 2025-12-04T12:05:01.8154670Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8155030Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda _ 2025-12-04T12:05:01.8155159Z Traceback (most recent call last): 2025-12-04T12:05:01.8155638Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8155828Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8156359Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8156482Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8156685Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8156691Z 2025-12-04T12:05:01.8156906Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8157625Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8157630Z 2025-12-04T12:05:01.8157909Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8158125Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8158254Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8158398Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8158521Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8158755Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8158853Z graph_break [] 2025-12-04T12:05:01.8159067Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8159812Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8159912Z warnings.warn( 2025-12-04T12:05:01.8160137Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8160246Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8160358Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8160588Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8160712Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8160811Z graph_break [] 2025-12-04T12:05:01.8161036Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8161764Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8161875Z warnings.warn( 2025-12-04T12:05:01.8162086Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8162194Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8162321Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8162539Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8162658Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8162766Z graph_break [] 2025-12-04T12:05:01.8162977Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8163706Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8163857Z warnings.warn( 2025-12-04T12:05:01.8164617Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f5efe4ba459d5b44.xml - 2025-12-04T12:05:01.8164831Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8165664Z FAILED [0.2607s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8165670Z 2025-12-04T12:05:01.8165897Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8166616Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8166653Z 2025-12-04T12:05:01.8166921Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8167116Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8167314Z ================== 1 failed, 294 deselected, 2 rerun in 5.13s ================== 2025-12-04T12:05:01.8167415Z Got exit code 1 2025-12-04T12:05:01.8168066Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda 2025-12-04T12:05:01.8168477Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.8169064Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-92f9442ed796f73f.xml 2025-12-04T12:05:01.8169226Z ============================= test session starts ============================== 2025-12-04T12:05:01.8169609Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8169731Z cachedir: .pytest_cache 2025-12-04T12:05:01.8170252Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8170386Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8170492Z configfile: pytest.ini 2025-12-04T12:05:01.8171083Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8171319Z collecting ... collected 571 items / 82 deselected / 489 selected 2025-12-04T12:05:01.8171459Z stepcurrent: skipping 82 already run items. 2025-12-04T12:05:01.8171571Z Running 213 items in this shard 2025-12-04T12:05:01.8171577Z 2025-12-04T12:05:01.8172178Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_float16_cuda PASSED [3.4368s] [ 0%] 2025-12-04T12:05:01.8172861Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.0922s] [ 0%] 2025-12-04T12:05:01.8173543Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2728s] [ 0%] 2025-12-04T12:05:01.8174131Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2713s] [ 0%] 2025-12-04T12:05:01.8174137Z 2025-12-04T12:05:01.8174290Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8174646Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8174768Z Traceback (most recent call last): 2025-12-04T12:05:01.8175258Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8175420Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8175970Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8176109Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8176332Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8176338Z 2025-12-04T12:05:01.8176565Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8177280Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8177286Z 2025-12-04T12:05:01.8177548Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8177778Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8177920Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8178047Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8178348Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8178473Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8178582Z graph_break [] 2025-12-04T12:05:01.8178801Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8179532Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8179646Z warnings.warn( 2025-12-04T12:05:01.8180003Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8180137Z Traceback (most recent call last): 2025-12-04T12:05:01.8180615Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8180773Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8181343Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8181466Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8181656Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8181675Z 2025-12-04T12:05:01.8181891Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8182607Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8182613Z 2025-12-04T12:05:01.8182886Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8183103Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8183212Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8183337Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8183561Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8183695Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8183791Z graph_break [] 2025-12-04T12:05:01.8184007Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8184754Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8184854Z warnings.warn( 2025-12-04T12:05:01.8185065Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8185185Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8185302Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8185525Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8185640Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8185737Z graph_break [] 2025-12-04T12:05:01.8185955Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8186713Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8186840Z warnings.warn( 2025-12-04T12:05:01.8186987Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8187342Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8187467Z Traceback (most recent call last): 2025-12-04T12:05:01.8188132Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8188287Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8188811Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8189072Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8189264Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8189270Z 2025-12-04T12:05:01.8189493Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8190207Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8190215Z 2025-12-04T12:05:01.8190489Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8190707Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8190817Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8190937Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8191156Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8191287Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8191383Z graph_break [] 2025-12-04T12:05:01.8191639Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8192388Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8192492Z warnings.warn( 2025-12-04T12:05:01.8192705Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8192819Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8192929Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8193158Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8193279Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8193374Z graph_break [] 2025-12-04T12:05:01.8193593Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8194325Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8194429Z warnings.warn( 2025-12-04T12:05:01.8194643Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8194752Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8194867Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8195089Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8195208Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8195310Z graph_break [] 2025-12-04T12:05:01.8195522Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8196247Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8196354Z warnings.warn( 2025-12-04T12:05:01.8197112Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-92f9442ed796f73f.xml - 2025-12-04T12:05:01.8197347Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8198184Z FAILED [0.2713s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8198228Z 2025-12-04T12:05:01.8198443Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8199166Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8199172Z 2025-12-04T12:05:01.8199436Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8199621Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8199863Z ============= 1 failed, 1 passed, 82 deselected, 2 rerun in 5.76s ============== 2025-12-04T12:05:01.8199958Z Got exit code 1 2025-12-04T12:05:01.8200074Z Retrying single test... 2025-12-04T12:05:01.8200651Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bcaaee0a01e62e43.xml 2025-12-04T12:05:01.8200812Z ============================= test session starts ============================== 2025-12-04T12:05:01.8201168Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8201277Z cachedir: .pytest_cache 2025-12-04T12:05:01.8201808Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8201927Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8202031Z configfile: pytest.ini 2025-12-04T12:05:01.8202660Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8202891Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8203693Z stepcurrent: skipping 83 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8203805Z Running 1 items in this shard 2025-12-04T12:05:01.8203810Z 2025-12-04T12:05:01.8204483Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9356s] [100%] 2025-12-04T12:05:01.8205162Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2711s] [100%] 2025-12-04T12:05:01.8205744Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2744s] [100%] 2025-12-04T12:05:01.8205755Z 2025-12-04T12:05:01.8205905Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8206258Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8206376Z Traceback (most recent call last): 2025-12-04T12:05:01.8206861Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8207016Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8207543Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8207660Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8207847Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8207853Z 2025-12-04T12:05:01.8208073Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8208821Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8208828Z 2025-12-04T12:05:01.8209109Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8209354Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8209463Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8209582Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8209697Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8209914Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8210016Z graph_break [] 2025-12-04T12:05:01.8210228Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8210962Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8211101Z warnings.warn( 2025-12-04T12:05:01.8211460Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8211584Z Traceback (most recent call last): 2025-12-04T12:05:01.8212057Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8212218Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8212740Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8212862Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8213060Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8213066Z 2025-12-04T12:05:01.8213276Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8214018Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8214027Z 2025-12-04T12:05:01.8214304Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8214520Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8214643Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8214753Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8214871Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8215102Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8215199Z graph_break [] 2025-12-04T12:05:01.8215413Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8216152Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8216253Z warnings.warn( 2025-12-04T12:05:01.8216469Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8216588Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8216700Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8216922Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8217044Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8217137Z graph_break [] 2025-12-04T12:05:01.8217354Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8218083Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8218250Z warnings.warn( 2025-12-04T12:05:01.8218402Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8218758Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8218896Z Traceback (most recent call last): 2025-12-04T12:05:01.8219413Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8219571Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8220131Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8220251Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8220441Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8220456Z 2025-12-04T12:05:01.8220666Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8221378Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8221383Z 2025-12-04T12:05:01.8221659Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8221912Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8222023Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8222140Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8222260Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8222490Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8222588Z graph_break [] 2025-12-04T12:05:01.8222800Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8223539Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8223638Z warnings.warn( 2025-12-04T12:05:01.8223848Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8223963Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8224077Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8224344Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8224464Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8224557Z graph_break [] 2025-12-04T12:05:01.8224779Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8225505Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8225600Z warnings.warn( 2025-12-04T12:05:01.8225815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8225922Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8226034Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8226260Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8226377Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8226480Z graph_break [] 2025-12-04T12:05:01.8226694Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8227419Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8227528Z warnings.warn( 2025-12-04T12:05:01.8228287Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bcaaee0a01e62e43.xml - 2025-12-04T12:05:01.8228460Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8229276Z FAILED [0.2744s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8229281Z 2025-12-04T12:05:01.8229493Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8230247Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8230253Z 2025-12-04T12:05:01.8230519Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8230733Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8230927Z ================== 1 failed, 294 deselected, 2 rerun in 5.17s ================== 2025-12-04T12:05:01.8231023Z Got exit code 1 2025-12-04T12:05:01.8231135Z Retrying single test... 2025-12-04T12:05:01.8231710Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-76ac61a88acbe22b.xml 2025-12-04T12:05:01.8231869Z ============================= test session starts ============================== 2025-12-04T12:05:01.8232226Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8232372Z cachedir: .pytest_cache 2025-12-04T12:05:01.8232904Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8233021Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8233131Z configfile: pytest.ini 2025-12-04T12:05:01.8233730Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8233953Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8234760Z stepcurrent: skipping 83 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8234869Z Running 1 items in this shard 2025-12-04T12:05:01.8234875Z 2025-12-04T12:05:01.8235593Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [2.9213s] [100%] 2025-12-04T12:05:01.8236276Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda ('RERUN', {'yellow': True}) [0.2646s] [100%] 2025-12-04T12:05:01.8236862Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda FAILED [0.2675s] [100%] 2025-12-04T12:05:01.8236868Z 2025-12-04T12:05:01.8237014Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8237365Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8237487Z Traceback (most recent call last): 2025-12-04T12:05:01.8237971Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8238127Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8238661Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8238780Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8238969Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8238977Z 2025-12-04T12:05:01.8239198Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8239914Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8239919Z 2025-12-04T12:05:01.8240190Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8240406Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8240517Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8240639Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8240766Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8241023Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8241129Z graph_break [] 2025-12-04T12:05:01.8241344Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8242109Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8242219Z warnings.warn( 2025-12-04T12:05:01.8242573Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8242699Z Traceback (most recent call last): 2025-12-04T12:05:01.8243178Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8243332Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8243861Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8244018Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8244216Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8244222Z 2025-12-04T12:05:01.8244438Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8245150Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8245156Z 2025-12-04T12:05:01.8245434Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8245652Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8245768Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8245882Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8246001Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8246262Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8246360Z graph_break [] 2025-12-04T12:05:01.8246571Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8247312Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8247414Z warnings.warn( 2025-12-04T12:05:01.8247623Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8247744Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8247857Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8248083Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8248205Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8248298Z graph_break [] 2025-12-04T12:05:01.8248516Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8249250Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8249344Z warnings.warn( 2025-12-04T12:05:01.8249495Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8249854Z _ TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda _ 2025-12-04T12:05:01.8249980Z Traceback (most recent call last): 2025-12-04T12:05:01.8250458Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 224, in test_dtype_aware_codegen 2025-12-04T12:05:01.8250608Z code = run_and_get_triton_code(compiled, *inps) 2025-12-04T12:05:01.8251135Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/utils.py", line 2492, in run_and_get_triton_code 2025-12-04T12:05:01.8251256Z assert 1 <= len(source_codes) <= 2, ( 2025-12-04T12:05:01.8251444Z AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8251462Z 2025-12-04T12:05:01.8251703Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8252413Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8252449Z 2025-12-04T12:05:01.8252721Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8252937Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8253044Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8253169Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8253286Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8253511Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8253606Z graph_break [] 2025-12-04T12:05:01.8253813Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8254590Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8254689Z warnings.warn( 2025-12-04T12:05:01.8254896Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8255019Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8255133Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8255354Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8255474Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8255564Z graph_break [] 2025-12-04T12:05:01.8255782Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8256507Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8256603Z warnings.warn( 2025-12-04T12:05:01.8256853Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8256960Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8257071Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8257295Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8257410Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8257514Z graph_break [] 2025-12-04T12:05:01.8257726Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8258524Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8258631Z warnings.warn( 2025-12-04T12:05:01.8259385Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-76ac61a88acbe22b.xml - 2025-12-04T12:05:01.8259564Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8260393Z FAILED [0.2675s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda - AssertionError: expected one or two code outputs got 0 2025-12-04T12:05:01.8260401Z 2025-12-04T12:05:01.8260613Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8261332Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8261337Z 2025-12-04T12:05:01.8261601Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8261784Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8261981Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.8262078Z Got exit code 1 2025-12-04T12:05:01.8262754Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda 2025-12-04T12:05:01.8263169Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.8263777Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8647c42278291c82.xml 2025-12-04T12:05:01.8263940Z ============================= test session starts ============================== 2025-12-04T12:05:01.8264289Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8264402Z cachedir: .pytest_cache 2025-12-04T12:05:01.8264925Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8265047Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8265196Z configfile: pytest.ini 2025-12-04T12:05:01.8265790Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8266022Z collecting ... collected 571 items / 84 deselected / 487 selected 2025-12-04T12:05:01.8266165Z stepcurrent: skipping 84 already run items. 2025-12-04T12:05:01.8266279Z Running 211 items in this shard 2025-12-04T12:05:01.8266284Z 2025-12-04T12:05:01.8266889Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_trunc_load_upcast_to_fp32_False_float16_cuda PASSED [3.4255s] [ 0%] 2025-12-04T12:05:01.8267321Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_bool PASSED [0.2357s] [ 0%] 2025-12-04T12:05:01.8267770Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_float32 PASSED [0.2189s] [ 1%] 2025-12-04T12:05:01.8268243Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_float64 PASSED [0.2205s] [ 1%] 2025-12-04T12:05:01.8268679Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_abs_cuda_int64 PASSED [0.2215s] [ 2%] 2025-12-04T12:05:01.8269113Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_bool PASSED [0.5042s] [ 2%] 2025-12-04T12:05:01.8269558Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_float32 PASSED [0.4959s] [ 3%] 2025-12-04T12:05:01.8270001Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acos_cuda_int64 PASSED [0.5026s] [ 3%] 2025-12-04T12:05:01.8270435Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_bool PASSED [0.5161s] [ 4%] 2025-12-04T12:05:01.8270885Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_float32 PASSED [0.8887s] [ 4%] 2025-12-04T12:05:01.8271331Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_acosh_cuda_int64 PASSED [0.8956s] [ 5%] 2025-12-04T12:05:01.8271763Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_bool PASSED [2.9501s] [ 5%] 2025-12-04T12:05:01.8272200Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_int32 PASSED [2.2693s] [ 6%] 2025-12-04T12:05:01.8272630Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_add_cuda_int64 PASSED [1.9173s] [ 6%] 2025-12-04T12:05:01.8273078Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_float32 PASSED [0.3402s] [ 7%] 2025-12-04T12:05:01.8273521Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_angle_cuda_int64 PASSED [0.2482s] [ 7%] 2025-12-04T12:05:01.8273963Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_float32 PASSED [0.3112s] [ 8%] 2025-12-04T12:05:01.8274409Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_float64 PASSED [0.3200s] [ 8%] 2025-12-04T12:05:01.8274885Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asin_cuda_int32 PASSED [0.3152s] [ 9%] 2025-12-04T12:05:01.8275326Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_asinh_cuda_int64 PASSED [0.3384s] [ 9%] 2025-12-04T12:05:01.8275768Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_bool PASSED [2.2145s] [ 9%] 2025-12-04T12:05:01.8276248Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_float32 PASSED [2.1447s] [ 10%] 2025-12-04T12:05:01.8276695Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_float64 PASSED [2.1225s] [ 10%] 2025-12-04T12:05:01.8277136Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_int32 PASSED [2.4301s] [ 11%] 2025-12-04T12:05:01.8277571Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan2_cuda_int64 PASSED [2.1858s] [ 11%] 2025-12-04T12:05:01.8278056Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_bool PASSED [0.3211s] [ 12%] 2025-12-04T12:05:01.8278499Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atan_cuda_float32 PASSED [0.3179s] [ 12%] 2025-12-04T12:05:01.8278933Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_bool PASSED [0.3203s] [ 13%] 2025-12-04T12:05:01.8279378Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_atanh_cuda_int32 PASSED [0.3174s] [ 13%] 2025-12-04T12:05:01.8279848Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_bool PASSED [1.5736s] [ 14%] 2025-12-04T12:05:01.8280326Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_int32 PASSED [1.5413s] [ 14%] 2025-12-04T12:05:01.8280792Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_and_cuda_int64 PASSED [1.5438s] [ 15%] 2025-12-04T12:05:01.8281289Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_bool PASSED [0.4039s] [ 15%] 2025-12-04T12:05:01.8281769Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_not_cuda_int64 PASSED [0.3946s] [ 16%] 2025-12-04T12:05:01.8282231Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_or_cuda_int64 PASSED [1.5359s] [ 16%] 2025-12-04T12:05:01.8282755Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_right_shift_cuda_int32 PASSED [1.5437s] [ 17%] 2025-12-04T12:05:01.8283266Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_right_shift_cuda_int64 PASSED [1.5545s] [ 17%] 2025-12-04T12:05:01.8283734Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_bitwise_xor_cuda_int64 PASSED [1.5680s] [ 18%] 2025-12-04T12:05:01.8284183Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_float32 PASSED [0.2988s] [ 18%] 2025-12-04T12:05:01.8284621Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ceil_cuda_int64 PASSED [0.2115s] [ 18%] 2025-12-04T12:05:01.8285085Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_bool PASSED [1.6603s] [ 19%] 2025-12-04T12:05:01.8285550Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_float64 PASSED [1.9055s] [ 19%] 2025-12-04T12:05:01.8286009Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_max_cuda_int64 PASSED [1.6372s] [ 20%] 2025-12-04T12:05:01.8286481Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_float64 PASSED [1.6302s] [ 20%] 2025-12-04T12:05:01.8286936Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clamp_min_cuda_int64 PASSED [1.6206s] [ 21%] 2025-12-04T12:05:01.8287392Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_float32 PASSED [0.3547s] [ 21%] 2025-12-04T12:05:01.8288007Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_clone_cuda_int32 PASSED [0.3523s] [ 22%] 2025-12-04T12:05:01.8288537Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_bool PASSED [2.0271s] [ 22%] 2025-12-04T12:05:01.8289015Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_float32 PASSED [2.1628s] [ 23%] 2025-12-04T12:05:01.8289509Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_int32 PASSED [2.0106s] [ 23%] 2025-12-04T12:05:01.8289975Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_copysign_cuda_int64 PASSED [2.0212s] [ 24%] 2025-12-04T12:05:01.8290409Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_float32 PASSED [0.5140s] [ 24%] 2025-12-04T12:05:01.8290834Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cos_cuda_int64 PASSED [0.5161s] [ 25%] 2025-12-04T12:05:01.8291316Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_bool PASSED [0.4891s] [ 25%] 2025-12-04T12:05:01.8291747Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_int32 PASSED [0.4905s] [ 26%] 2025-12-04T12:05:01.8292188Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_cosh_cuda_int64 PASSED [0.4877s] [ 26%] 2025-12-04T12:05:01.8292633Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_bool PASSED [0.8599s] [ 27%] 2025-12-04T12:05:01.8293080Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_digamma_cuda_int32 PASSED [0.4036s] [ 27%] 2025-12-04T12:05:01.8293600Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_float32 PASSED [2.0381s] [ 27%] 2025-12-04T12:05:01.8294108Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_float64 PASSED [2.0976s] [ 28%] 2025-12-04T12:05:01.8294660Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_int32 PASSED [1.9211s] [ 28%] 2025-12-04T12:05:01.8295163Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_floor_rounding_cuda_int64 PASSED [1.7329s] [ 29%] 2025-12-04T12:05:01.8295679Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_float32 PASSED [1.5721s] [ 29%] 2025-12-04T12:05:01.8296208Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_no_rounding_mode_cuda_float64 PASSED [1.6320s] [ 30%] 2025-12-04T12:05:01.8296720Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_float64 PASSED [2.0625s] [ 30%] 2025-12-04T12:05:01.8297228Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_int32 PASSED [1.5660s] [ 31%] 2025-12-04T12:05:01.8297723Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_div_trunc_rounding_cuda_int64 PASSED [1.6307s] [ 31%] 2025-12-04T12:05:01.8298204Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_eq_cuda_bool PASSED [1.7344s] [ 32%] 2025-12-04T12:05:01.8298641Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erf_cuda_int64 PASSED [0.3148s] [ 32%] 2025-12-04T12:05:01.8299069Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_int32 PASSED [0.5027s] [ 33%] 2025-12-04T12:05:01.8299509Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfc_cuda_int64 PASSED [0.4956s] [ 33%] 2025-12-04T12:05:01.8299947Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_bool PASSED [0.5581s] [ 34%] 2025-12-04T12:05:01.8300397Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_float64 PASSED [0.5563s] [ 34%] 2025-12-04T12:05:01.8300849Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_erfinv_cuda_int32 PASSED [0.3269s] [ 35%] 2025-12-04T12:05:01.8301295Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_float64 PASSED [0.5018s] [ 35%] 2025-12-04T12:05:01.8301772Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp2_cuda_int64 PASSED [0.4847s] [ 36%] 2025-12-04T12:05:01.8302201Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_bool PASSED [0.4887s] [ 36%] 2025-12-04T12:05:01.8302670Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_float32 PASSED [0.4806s] [ 36%] 2025-12-04T12:05:01.8303107Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_exp_cuda_int64 PASSED [0.4859s] [ 37%] 2025-12-04T12:05:01.8303552Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_float32 PASSED [0.2957s] [ 37%] 2025-12-04T12:05:01.8304012Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_float64 PASSED [0.3018s] [ 38%] 2025-12-04T12:05:01.8304485Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_int32 PASSED [0.2118s] [ 38%] 2025-12-04T12:05:01.8304926Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_floor_cuda_int64 PASSED [0.2113s] [ 39%] 2025-12-04T12:05:01.8305377Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_float32 PASSED [2.1435s] [ 39%] 2025-12-04T12:05:01.8305825Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_float64 PASSED [2.5113s] [ 40%] 2025-12-04T12:05:01.8306271Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_int32 PASSED [1.5647s] [ 40%] 2025-12-04T12:05:01.8306703Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_fmod_cuda_int64 PASSED [1.6362s] [ 41%] 2025-12-04T12:05:01.8307156Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_frexp_cuda_float32 PASSED [0.5325s] [ 41%] 2025-12-04T12:05:01.8307654Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_frexp_cuda_float64 PASSED [0.5469s] [ 42%] 2025-12-04T12:05:01.8308087Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gcd_cuda_int32 PASSED [1.7004s] [ 42%] 2025-12-04T12:05:01.8308522Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_bool PASSED [1.5848s] [ 43%] 2025-12-04T12:05:01.8308953Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_float64 PASSED [1.5598s] [ 43%] 2025-12-04T12:05:01.8309374Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ge_cuda_int64 PASSED [1.5560s] [ 44%] 2025-12-04T12:05:01.8309821Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_float32 PASSED [1.5528s] [ 44%] 2025-12-04T12:05:01.8310243Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_int32 PASSED [1.5697s] [ 45%] 2025-12-04T12:05:01.8310679Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_gt_cuda_int64 PASSED [1.5591s] [ 45%] 2025-12-04T12:05:01.8311138Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_hypot_cuda_float64 PASSED [2.0013s] [ 45%] 2025-12-04T12:05:01.8311562Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_int32 PASSED [0.7384s] [ 46%] 2025-12-04T12:05:01.8312001Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_i0_cuda_int64 PASSED [0.4028s] [ 46%] 2025-12-04T12:05:01.8312455Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_igamma_cuda_float32 PASSED [0.9838s] [ 47%] 2025-12-04T12:05:01.8312900Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_bool PASSED [0.2085s] [ 47%] 2025-12-04T12:05:01.8313352Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_float64 PASSED [0.3041s] [ 48%] 2025-12-04T12:05:01.8313792Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isinf_cuda_int64 PASSED [0.2083s] [ 48%] 2025-12-04T12:05:01.8314291Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_float64 PASSED [0.3005s] [ 49%] 2025-12-04T12:05:01.8314733Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_int32 PASSED [0.2083s] [ 49%] 2025-12-04T12:05:01.8315182Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_isnan_cuda_int64 PASSED [0.2102s] [ 50%] 2025-12-04T12:05:01.8315646Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_float64 PASSED [1.5528s] [ 50%] 2025-12-04T12:05:01.8316073Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_le_cuda_int32 PASSED [1.8981s] [ 51%] 2025-12-04T12:05:01.8316539Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_float32 PASSED [0.5459s] [ 51%] 2025-12-04T12:05:01.8316996Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_float64 PASSED [0.5724s] [ 52%] 2025-12-04T12:05:01.8317485Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lgamma_cuda_int64 PASSED [0.5472s] [ 52%] 2025-12-04T12:05:01.8317921Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_bool PASSED [0.4988s] [ 53%] 2025-12-04T12:05:01.8318370Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_float32 PASSED [0.4920s] [ 53%] 2025-12-04T12:05:01.8318832Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_float64 PASSED [0.5059s] [ 54%] 2025-12-04T12:05:01.8319270Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_int32 PASSED [0.4958s] [ 54%] 2025-12-04T12:05:01.8319706Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log10_cuda_int64 PASSED [0.4976s] [ 54%] 2025-12-04T12:05:01.8320165Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_float32 PASSED [0.3129s] [ 55%] 2025-12-04T12:05:01.8320646Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_float64 PASSED [0.3418s] [ 55%] 2025-12-04T12:05:01.8321099Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log1p_cuda_int64 PASSED [0.3166s] [ 56%] 2025-12-04T12:05:01.8321530Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_bool PASSED [0.5025s] [ 56%] 2025-12-04T12:05:01.8321978Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_float32 PASSED [0.4915s] [ 57%] 2025-12-04T12:05:01.8322436Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log2_cuda_float64 PASSED [0.4971s] [ 57%] 2025-12-04T12:05:01.8322874Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_float32 PASSED [0.4947s] [ 58%] 2025-12-04T12:05:01.8323326Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_log_cuda_float64 PASSED [0.4966s] [ 58%] 2025-12-04T12:05:01.8323791Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_bool PASSED [1.5920s] [ 59%] 2025-12-04T12:05:01.8324278Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_float64 PASSED [1.6139s] [ 59%] 2025-12-04T12:05:01.8324762Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_and_cuda_int32 PASSED [1.6173s] [ 60%] 2025-12-04T12:05:01.8325228Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_bool PASSED [0.4017s] [ 60%] 2025-12-04T12:05:01.8325718Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_float64 PASSED [0.4055s] [ 61%] 2025-12-04T12:05:01.8326186Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_not_cuda_int32 PASSED [0.4098s] [ 61%] 2025-12-04T12:05:01.8326660Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_float32 PASSED [1.5908s] [ 62%] 2025-12-04T12:05:01.8327150Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_float64 PASSED [1.6044s] [ 62%] 2025-12-04T12:05:01.8327647Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_or_cuda_int64 PASSED [1.6016s] [ 63%] 2025-12-04T12:05:01.8328147Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_logical_xor_cuda_float32 PASSED [1.6122s] [ 63%] 2025-12-04T12:05:01.8328619Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_lt_cuda_int32 PASSED [1.5611s] [ 63%] 2025-12-04T12:05:01.8329080Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_max_binary_cuda_int32 PASSED [1.9703s] [ 64%] 2025-12-04T12:05:01.8329536Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_int32 PASSED [1.6154s] [ 64%] 2025-12-04T12:05:01.8329986Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_maximum_cuda_int64 PASSED [1.6150s] [ 65%] 2025-12-04T12:05:01.8330459Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_bool PASSED [1.6541s] [ 65%] 2025-12-04T12:05:01.8330954Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_int32 PASSED [1.6062s] [ 66%] 2025-12-04T12:05:01.8331414Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_min_binary_cuda_int64 PASSED [1.6147s] [ 66%] 2025-12-04T12:05:01.8331870Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_bool PASSED [1.6261s] [ 67%] 2025-12-04T12:05:01.8332320Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_int32 PASSED [1.6018s] [ 67%] 2025-12-04T12:05:01.8332781Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_minimum_cuda_int64 PASSED [1.5949s] [ 68%] 2025-12-04T12:05:01.8333210Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_int32 PASSED [1.5463s] [ 68%] 2025-12-04T12:05:01.8333668Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_mul_cuda_int64 PASSED [1.5629s] [ 69%] 2025-12-04T12:05:01.8334105Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_bool PASSED [1.5906s] [ 69%] 2025-12-04T12:05:01.8334538Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_float32 PASSED [1.5722s] [ 70%] 2025-12-04T12:05:01.8334984Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_float64 PASSED [1.5603s] [ 70%] 2025-12-04T12:05:01.8335408Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_int32 PASSED [1.5602s] [ 71%] 2025-12-04T12:05:01.8335834Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_ne_cuda_int64 PASSED [1.9390s] [ 71%] 2025-12-04T12:05:01.8336269Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_int32 PASSED [0.2174s] [ 72%] 2025-12-04T12:05:01.8336695Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_neg_cuda_int64 PASSED [0.2202s] [ 72%] 2025-12-04T12:05:01.8337182Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_nextafter_cuda_float64 PASSED [2.0379s] [ 72%] 2025-12-04T12:05:01.8337729Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_float32 PASSED [1.0570s] [ 73%] 2025-12-04T12:05:01.8338337Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_int32 PASSED [1.2439s] [ 73%] 2025-12-04T12:05:01.8338881Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_0_cuda_int64 PASSED [1.2515s] [ 74%] 2025-12-04T12:05:01.8339414Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_bool PASSED [1.2424s] [ 74%] 2025-12-04T12:05:01.8339972Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_float32 PASSED [1.0375s] [ 75%] 2025-12-04T12:05:01.8340520Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_float64 PASSED [1.0516s] [ 75%] 2025-12-04T12:05:01.8341090Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_1_cuda_int64 PASSED [1.2498s] [ 76%] 2025-12-04T12:05:01.8341645Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_float32 PASSED [1.0352s] [ 76%] 2025-12-04T12:05:01.8342209Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_2_cuda_int32 PASSED [1.2412s] [ 77%] 2025-12-04T12:05:01.8342751Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_bool PASSED [1.2380s] [ 77%] 2025-12-04T12:05:01.8343292Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_float32 PASSED [1.0377s] [ 78%] 2025-12-04T12:05:01.8343838Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_float64 PASSED [1.0350s] [ 78%] 2025-12-04T12:05:01.8344421Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_3_cuda_int32 PASSED [1.2292s] [ 79%] 2025-12-04T12:05:01.8344948Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_polygamma_polygamma_n_4_cuda_bool PASSED [1.2378s] [ 79%] 2025-12-04T12:05:01.8345401Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_float64 PASSED [2.6947s] [ 80%] 2025-12-04T12:05:01.8345835Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_int32 PASSED [0.9954s] [ 80%] 2025-12-04T12:05:01.8346258Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_pow_cuda_int64 PASSED [1.0026s] [ 81%] 2025-12-04T12:05:01.8346751Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_float32 PASSED [0.4157s] [ 81%] 2025-12-04T12:05:01.8347261Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_float64 PASSED [0.4290s] [ 81%] 2025-12-04T12:05:01.8347746Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_reciprocal_cuda_int32 PASSED [0.4176s] [ 82%] 2025-12-04T12:05:01.8348186Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_cuda_int32 PASSED [0.2131s] [ 82%] 2025-12-04T12:05:01.8348692Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_0_cuda_float32 PASSED [0.5159s] [ 83%] 2025-12-04T12:05:01.8349214Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_3_cuda_float64 PASSED [0.5239s] [ 83%] 2025-12-04T12:05:01.8349739Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_round_decimals_neg_3_cuda_float32 PASSED [0.5185s] [ 84%] 2025-12-04T12:05:01.8350188Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_int32 PASSED [0.4874s] [ 84%] 2025-12-04T12:05:01.8350626Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_rsqrt_cuda_int64 PASSED [0.4866s] [ 85%] 2025-12-04T12:05:01.8351087Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_float64 PASSED [0.5221s] [ 85%] 2025-12-04T12:05:01.8351551Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_int32 PASSED [0.4147s] [ 86%] 2025-12-04T12:05:01.8352005Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sigmoid_cuda_int64 PASSED [0.4206s] [ 86%] 2025-12-04T12:05:01.8352457Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sign_cuda_float64 PASSED [0.2313s] [ 87%] 2025-12-04T12:05:01.8352905Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_signbit_cuda_int32 PASSED [0.2186s] [ 87%] 2025-12-04T12:05:01.8353341Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_float32 PASSED [0.3444s] [ 88%] 2025-12-04T12:05:01.8353781Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sin_cuda_int64 PASSED [0.3464s] [ 88%] 2025-12-04T12:05:01.8354248Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sinh_cuda_int64 PASSED [0.3182s] [ 89%] 2025-12-04T12:05:01.8354691Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_bool PASSED [0.3130s] [ 89%] 2025-12-04T12:05:01.8355164Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_float32 PASSED [0.3040s] [ 90%] 2025-12-04T12:05:01.8355608Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_float64 PASSED [0.3065s] [ 90%] 2025-12-04T12:05:01.8356047Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sqrt_cuda_int32 PASSED [0.3060s] [ 90%] 2025-12-04T12:05:01.8356487Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_bool PASSED [0.4054s] [ 91%] 2025-12-04T12:05:01.8356953Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_float64 PASSED [0.4006s] [ 91%] 2025-12-04T12:05:01.8357434Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_square_cuda_int32 PASSED [0.4004s] [ 92%] 2025-12-04T12:05:01.8357874Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_sub_cuda_float64 PASSED [1.8832s] [ 92%] 2025-12-04T12:05:01.8358312Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_bool PASSED [0.3491s] [ 93%] 2025-12-04T12:05:01.8358740Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tan_cuda_int32 PASSED [0.3467s] [ 93%] 2025-12-04T12:05:01.8359192Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_float64 PASSED [0.3147s] [ 94%] 2025-12-04T12:05:01.8359623Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_int32 PASSED [0.3096s] [ 94%] 2025-12-04T12:05:01.8360059Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_tanh_cuda_int64 PASSED [0.3123s] [ 95%] 2025-12-04T12:05:01.8360574Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_bool PASSED [1.6271s] [ 95%] 2025-12-04T12:05:01.8361054Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_float32 PASSED [1.5737s] [ 96%] 2025-12-04T12:05:01.8361548Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_float64 PASSED [1.6315s] [ 96%] 2025-12-04T12:05:01.8362019Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_int32 PASSED [1.6009s] [ 97%] 2025-12-04T12:05:01.8362486Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_true_divide_cuda_int64 PASSED [1.6041s] [ 97%] 2025-12-04T12:05:01.8362948Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_float32 PASSED [0.2971s] [ 98%] 2025-12-04T12:05:01.8363384Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_int32 PASSED [0.2109s] [ 98%] 2025-12-04T12:05:01.8363838Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_propagation_trunc_cuda_int64 PASSED [0.2117s] [ 99%] 2025-12-04T12:05:01.8364210Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_op_dtype_support_cuda PASSED [0.0037s] [ 99%] 2025-12-04T12:05:01.8364838Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.1010s] [100%] 2025-12-04T12:05:01.8365471Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.5341s] [100%] 2025-12-04T12:05:01.8366006Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 FAILED [0.4938s] [100%] 2025-12-04T12:05:01.8366012Z 2025-12-04T12:05:01.8366166Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8366472Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8366599Z Traceback (most recent call last): 2025-12-04T12:05:01.8367117Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8367287Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8367537Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8367546Z 2025-12-04T12:05:01.8367761Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8368421Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8368427Z 2025-12-04T12:05:01.8368698Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8368918Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8369041Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8369188Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8369411Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8369541Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8369639Z graph_break [] 2025-12-04T12:05:01.8369856Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8370614Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8370721Z warnings.warn( 2025-12-04T12:05:01.8371037Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8371159Z Traceback (most recent call last): 2025-12-04T12:05:01.8371630Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8371804Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8372041Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8372047Z 2025-12-04T12:05:01.8372261Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8372933Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8372940Z 2025-12-04T12:05:01.8373210Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8373438Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8373550Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8373663Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8373890Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8374010Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8374107Z graph_break [] 2025-12-04T12:05:01.8374337Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8375073Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8375183Z warnings.warn( 2025-12-04T12:05:01.8375397Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8375506Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8375625Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8375841Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8375959Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8376063Z graph_break [] 2025-12-04T12:05:01.8376273Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8377013Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8377116Z warnings.warn( 2025-12-04T12:05:01.8377301Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8377613Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8377838Z Traceback (most recent call last): 2025-12-04T12:05:01.8378376Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8378558Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8378766Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8378772Z 2025-12-04T12:05:01.8378998Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8379658Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8379701Z 2025-12-04T12:05:01.8379969Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8380203Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8380316Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8380439Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8380661Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8380782Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8380890Z graph_break [] 2025-12-04T12:05:01.8381102Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8381833Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8381944Z warnings.warn( 2025-12-04T12:05:01.8382157Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8382281Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8382428Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8382652Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8382782Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8382878Z graph_break [] 2025-12-04T12:05:01.8383093Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8383834Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8383932Z warnings.warn( 2025-12-04T12:05:01.8384145Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8384267Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8384379Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8384606Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8384727Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8384828Z graph_break [] 2025-12-04T12:05:01.8385055Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8385787Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8385890Z warnings.warn( 2025-12-04T12:05:01.8386655Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8647c42278291c82.xml - 2025-12-04T12:05:01.8386824Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8387631Z FAILED [0.4938s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 - ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8387637Z 2025-12-04T12:05:01.8388063Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8388794Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8388814Z 2025-12-04T12:05:01.8389078Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8389296Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8389542Z ====== 1 failed, 210 passed, 84 deselected, 2 rerun in 211.95s (0:03:31) ======= 2025-12-04T12:05:01.8389640Z Got exit code 1 2025-12-04T12:05:01.8389746Z Retrying single test... 2025-12-04T12:05:01.8390341Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a0ddea104bebcf1c.xml 2025-12-04T12:05:01.8390498Z ============================= test session starts ============================== 2025-12-04T12:05:01.8390863Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8391009Z cachedir: .pytest_cache 2025-12-04T12:05:01.8391533Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8391668Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8391778Z configfile: pytest.ini 2025-12-04T12:05:01.8392373Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8392609Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8393364Z stepcurrent: skipping 294 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8393488Z Running 1 items in this shard 2025-12-04T12:05:01.8393493Z 2025-12-04T12:05:01.8394161Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 ('RERUN', {'yellow': True}) [2.9067s] [100%] 2025-12-04T12:05:01.8394786Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.2795s] [100%] 2025-12-04T12:05:01.8395332Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 FAILED [0.2838s] [100%] 2025-12-04T12:05:01.8395337Z 2025-12-04T12:05:01.8395476Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8395789Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8395908Z Traceback (most recent call last): 2025-12-04T12:05:01.8396381Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8396558Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8396769Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8396775Z 2025-12-04T12:05:01.8396996Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8397656Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8397664Z 2025-12-04T12:05:01.8397931Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8398159Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8398270Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8398391Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8398512Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8398732Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8398839Z graph_break [] 2025-12-04T12:05:01.8399056Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8399821Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8399928Z warnings.warn( 2025-12-04T12:05:01.8400269Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8400398Z Traceback (most recent call last): 2025-12-04T12:05:01.8400866Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8401031Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8401253Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8401258Z 2025-12-04T12:05:01.8401468Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8402128Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8402170Z 2025-12-04T12:05:01.8402438Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8402653Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8402773Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8402885Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8403005Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8403234Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8403330Z graph_break [] 2025-12-04T12:05:01.8403545Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8404283Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8404383Z warnings.warn( 2025-12-04T12:05:01.8404637Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8404753Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8404866Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8405092Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8405215Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8405310Z graph_break [] 2025-12-04T12:05:01.8405530Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8406261Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8406370Z warnings.warn( 2025-12-04T12:05:01.8406516Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8406821Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8406954Z Traceback (most recent call last): 2025-12-04T12:05:01.8407429Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8407595Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8407813Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8407820Z 2025-12-04T12:05:01.8408031Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8408699Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8408705Z 2025-12-04T12:05:01.8408966Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8409179Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8409300Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8409414Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8409547Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8409794Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8409890Z graph_break [] 2025-12-04T12:05:01.8410113Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8410876Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8410975Z warnings.warn( 2025-12-04T12:05:01.8411195Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8411304Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8411425Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8411644Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8411764Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8411901Z graph_break [] 2025-12-04T12:05:01.8412115Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8412845Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8412955Z warnings.warn( 2025-12-04T12:05:01.8413166Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8413283Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8413394Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8413613Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8413739Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8413833Z graph_break [] 2025-12-04T12:05:01.8414046Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8414834Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8414936Z warnings.warn( 2025-12-04T12:05:01.8415707Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a0ddea104bebcf1c.xml - 2025-12-04T12:05:01.8415876Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8416669Z FAILED [0.2838s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 - ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8416676Z 2025-12-04T12:05:01.8416897Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8417554Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8417560Z 2025-12-04T12:05:01.8417838Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8418012Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8418283Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.8418392Z Got exit code 1 2025-12-04T12:05:01.8418498Z Retrying single test... 2025-12-04T12:05:01.8419073Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c80d90eb9dbd0365.xml 2025-12-04T12:05:01.8419243Z ============================= test session starts ============================== 2025-12-04T12:05:01.8419595Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8419711Z cachedir: .pytest_cache 2025-12-04T12:05:01.8420235Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8420359Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8420479Z configfile: pytest.ini 2025-12-04T12:05:01.8421109Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8421336Z collecting ... collected 571 items / 294 deselected / 277 selected 2025-12-04T12:05:01.8422126Z stepcurrent: skipping 294 already run items. Running only test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8422239Z Running 1 items in this shard 2025-12-04T12:05:01.8422244Z 2025-12-04T12:05:01.8422869Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 ('RERUN', {'yellow': True}) [2.8998s] [100%] 2025-12-04T12:05:01.8423488Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 ('RERUN', {'yellow': True}) [0.2812s] [100%] 2025-12-04T12:05:01.8424074Z inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 FAILED [0.2780s] [100%] 2025-12-04T12:05:01.8424079Z 2025-12-04T12:05:01.8424220Z ==================================== RERUNS ==================================== 2025-12-04T12:05:01.8424523Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8424653Z Traceback (most recent call last): 2025-12-04T12:05:01.8425128Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8425295Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8425514Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8425520Z 2025-12-04T12:05:01.8425730Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8426430Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8426438Z 2025-12-04T12:05:01.8426707Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8426926Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8427050Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8427162Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8427291Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8427511Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8427607Z graph_break [] 2025-12-04T12:05:01.8427833Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8428570Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8428671Z warnings.warn( 2025-12-04T12:05:01.8428983Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8429104Z Traceback (most recent call last): 2025-12-04T12:05:01.8429581Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8429748Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8429955Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8429961Z 2025-12-04T12:05:01.8430182Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8430840Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8430845Z 2025-12-04T12:05:01.8431116Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8431328Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8431453Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8431579Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8431733Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8431960Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8432073Z graph_break [] 2025-12-04T12:05:01.8432320Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8433062Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8433164Z warnings.warn( 2025-12-04T12:05:01.8433379Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8433505Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8433620Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8433840Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8434019Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8434119Z graph_break [] 2025-12-04T12:05:01.8434335Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8435084Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8435186Z warnings.warn( 2025-12-04T12:05:01.8435343Z =================================== FAILURES =================================== 2025-12-04T12:05:01.8435648Z _ TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 _ 2025-12-04T12:05:01.8435772Z Traceback (most recent call last): 2025-12-04T12:05:01.8436259Z File "/var/lib/jenkins/workspace/test/inductor/test_op_dtype_prop.py", line 326, in test_upcast_rank_0_cpu 2025-12-04T12:05:01.8436426Z result, (code,) = run_and_get_code(compiled, *inps) 2025-12-04T12:05:01.8436651Z ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8436691Z 2025-12-04T12:05:01.8436910Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8437568Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8437575Z 2025-12-04T12:05:01.8437853Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8438069Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8438194Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8438310Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8438433Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8438667Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8438767Z graph_break [] 2025-12-04T12:05:01.8438979Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8439737Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8439839Z warnings.warn( 2025-12-04T12:05:01.8440051Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8440174Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8440289Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8440518Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8440639Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8440737Z graph_break [] 2025-12-04T12:05:01.8440963Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8441696Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8441797Z warnings.warn( 2025-12-04T12:05:01.8442019Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:05:01.8442163Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:05:01.8442294Z stats [('calls_captured', 1)] 2025-12-04T12:05:01.8442514Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('not_ok', 1)] 2025-12-04T12:05:01.8442668Z inductor [('fxgraph_cache_miss', 1)] 2025-12-04T12:05:01.8442776Z graph_break [] 2025-12-04T12:05:01.8442986Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:05:01.8443711Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:05:01.8443822Z warnings.warn( 2025-12-04T12:05:01.8444578Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c80d90eb9dbd0365.xml - 2025-12-04T12:05:01.8444797Z =========================== short test summary info ============================ 2025-12-04T12:05:01.8445595Z FAILED [0.2780s] inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 - ValueError: not enough values to unpack (expected 1, got 0) 2025-12-04T12:05:01.8445604Z 2025-12-04T12:05:01.8445823Z To execute this test, run the following from the base repo dir: 2025-12-04T12:05:01.8446495Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_op_dtype_prop.py TestCaseCUDA.test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8446500Z 2025-12-04T12:05:01.8446762Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:05:01.8446955Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:05:01.8447156Z ================== 1 failed, 294 deselected, 2 rerun in 5.15s ================== 2025-12-04T12:05:01.8447257Z Got exit code 1 2025-12-04T12:05:01.8447884Z FAILED CONSISTENTLY: test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16 2025-12-04T12:05:01.8448296Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:05:01.8448880Z Test results will be stored in test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-98804e2c83f300d6.xml 2025-12-04T12:05:01.8449041Z ============================= test session starts ============================== 2025-12-04T12:05:01.8449393Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:05:01.8449513Z cachedir: .pytest_cache 2025-12-04T12:05:01.8450032Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:05:01.8450154Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:05:01.8450276Z configfile: pytest.ini 2025-12-04T12:05:01.8450873Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:05:01.8451111Z collecting ... collected 571 items / 295 deselected / 276 selected 2025-12-04T12:05:01.8451262Z stepcurrent: skipping 295 already run items. 2025-12-04T12:05:01.8451376Z Running 0 items in this shard 2025-12-04T12:05:01.8451381Z 2025-12-04T12:05:01.8452145Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-98804e2c83f300d6.xml - 2025-12-04T12:05:01.8452313Z =========================== 295 deselected in 0.05s ============================ 2025-12-04T12:05:01.8479471Z The following tests failed consistently: ['test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_abs_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acos_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_acosh_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_asinh_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atan_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_atanh_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_ceil_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_copysign_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_cosh_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfc_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_erfinv_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_exp_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_expm1_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_floor_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_fmod_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isinf_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_isnan_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_lgamma_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log10_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log1p_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log2_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_log_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_nextafter_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_pow_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_round_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_rsqrt_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sigmoid_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sin_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_sinh_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tan_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_False_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_dtype_aware_codegen_op_name_tanh_load_upcast_to_fp32_True_bfloat16_cuda', 'test/inductor/test_op_dtype_prop.py::TestCaseCUDA::test_upcast_rank_0_cpu_upcast_to_fp32_True_bfloat16_cuda_bfloat16'] 2025-12-04T12:05:01.8479633Z 2025-12-04T12:05:01.8480197Z FINISHED PRINTING LOG FILE of inductor/test_op_dtype_prop 1/2 (test/test-reports/inductor.test_op_dtype_prop_1.2_88d9121bb179f5e4_.log) 2025-12-04T12:05:01.8480203Z 2025-12-04T12:05:01.8480558Z Finished inductor/test_op_dtype_prop 1/2 ... [2025-12-04 12:05:01.308230][10305.42944464], took 47.56min 2025-12-04T12:05:01.8481416Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-826c2562e296e7cd.xml 2025-12-04T12:05:01.8482270Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-738ee4f0dcb0d1ea.xml 2025-12-04T12:05:01.8483087Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a05156439a0d1496.xml 2025-12-04T12:05:01.8483890Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a884406c7304f17c.xml 2025-12-04T12:05:01.8484706Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c03b63e1d9e14442.xml 2025-12-04T12:05:01.8485520Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a7e12d722ef68d61.xml 2025-12-04T12:05:01.8486329Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-758c1dbc5b36fbbe.xml 2025-12-04T12:05:01.8487141Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1356c8f9e8cb48ab.xml 2025-12-04T12:05:01.8488155Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3d33fda2cceb2cbd.xml 2025-12-04T12:05:01.8488975Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0982db42fabefc11.xml 2025-12-04T12:05:01.8489777Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e365e000dec26b6e.xml 2025-12-04T12:05:01.8490661Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c05ba049f6d7f244.xml 2025-12-04T12:05:01.8491463Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e64a3e20d6733c67.xml 2025-12-04T12:05:01.8707771Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-60656dccead73746.xml 2025-12-04T12:05:01.9052018Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-74b07a56cd02ebb4.xml 2025-12-04T12:05:01.9399158Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97c1aae6502fe9a4.xml 2025-12-04T12:05:01.9697232Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9520e13942c1e127.xml 2025-12-04T12:05:02.0572596Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5b20aba83da29ea2.xml 2025-12-04T12:05:02.0864558Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4f43dfc4bd691e8.xml 2025-12-04T12:05:02.1192691Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1ceca9168882f805.xml 2025-12-04T12:05:02.1502544Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c6f480cabc56b496.xml 2025-12-04T12:05:02.1805892Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9497acc525242c8a.xml 2025-12-04T12:05:02.2131184Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-158a50fa606ffe95.xml 2025-12-04T12:05:02.2429946Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-27359545a6656d51.xml 2025-12-04T12:05:02.2773192Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1f2cd5255ddbd20d.xml 2025-12-04T12:05:02.3081196Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6e021b62864c2856.xml 2025-12-04T12:05:02.3383591Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-49627e719c8e890b.xml 2025-12-04T12:05:02.4213139Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66537c67c1172e95.xml 2025-12-04T12:05:02.4589553Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b567c9f6400d034b.xml 2025-12-04T12:05:02.4881480Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2dff14642abe4d78.xml 2025-12-04T12:05:02.5204966Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65b5f001c721490a.xml 2025-12-04T12:05:02.5708552Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-51dd6a289fe92b8c.xml 2025-12-04T12:05:02.6049946Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-fcdeca5855991d94.xml 2025-12-04T12:05:02.6390833Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97fc9d1afafb45ba.xml 2025-12-04T12:05:02.6657021Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58940be16b8df1f1.xml 2025-12-04T12:05:02.6946314Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9349febb39593fe3.xml 2025-12-04T12:05:02.7274999Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-610ad64c0d2affb1.xml 2025-12-04T12:05:02.7569520Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-535142d7b2475ad0.xml 2025-12-04T12:05:02.7882662Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-44b8ebfaa38db266.xml 2025-12-04T12:05:02.8205027Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bba5c35945b28810.xml 2025-12-04T12:05:02.8519582Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a3c90c249d149d10.xml 2025-12-04T12:05:02.8838053Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c896d4e441aec272.xml 2025-12-04T12:05:02.9171737Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0f626bbdc83148be.xml 2025-12-04T12:05:02.9513989Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e68881511844a645.xml 2025-12-04T12:05:02.9828522Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a161840b55d48987.xml 2025-12-04T12:05:03.0141269Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-84da9e83d0a63a49.xml 2025-12-04T12:05:03.0447315Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5ba1ce04884dd264.xml 2025-12-04T12:05:03.0733960Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7aa6851dee05bf26.xml 2025-12-04T12:05:03.1050592Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cff0107c6e55c745.xml 2025-12-04T12:05:03.1339876Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65ccdb61caed4adb.xml 2025-12-04T12:05:03.1634704Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-332efb6825e8ee00.xml 2025-12-04T12:05:03.1889512Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cf5b975b5ed0b41.xml 2025-12-04T12:05:03.2186538Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66c6826e1f29c7ae.xml 2025-12-04T12:05:03.2473625Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2a970c00eea5bec9.xml 2025-12-04T12:05:03.2778037Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-284ef1fed3dcf877.xml 2025-12-04T12:05:03.3093443Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5553e88673e5c7d3.xml 2025-12-04T12:05:03.3373015Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-93bd8f6af98a9b4c.xml 2025-12-04T12:05:03.3785914Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58f36c545585b24b.xml 2025-12-04T12:05:03.4072808Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6ef09e48fb8b226a.xml 2025-12-04T12:05:03.4357192Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e71119cdd1d43b1e.xml 2025-12-04T12:05:03.4637316Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c8dc22bc4fac83a2.xml 2025-12-04T12:05:03.4920091Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8170ae01b738ee47.xml 2025-12-04T12:05:03.5197033Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0595c5125e4115a2.xml 2025-12-04T12:05:03.5469555Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ea830dd4e98e3dce.xml 2025-12-04T12:05:03.5742747Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0ddafb1da2805e03.xml 2025-12-04T12:05:03.6058701Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3b676b001d5055f4.xml 2025-12-04T12:05:03.6364709Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-15a19448251eca38.xml 2025-12-04T12:05:03.6700202Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7dd506857cedd350.xml 2025-12-04T12:05:03.7266170Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a068bf6354320ba5.xml 2025-12-04T12:05:03.7572785Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0bf202cdb375dae7.xml 2025-12-04T12:05:03.7885417Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-796bdc853aa8ac88.xml 2025-12-04T12:05:03.8235182Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-abef13434279995d.xml 2025-12-04T12:05:03.8553499Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-47824ae1e1d0f6cb.xml 2025-12-04T12:05:03.8859767Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4038060382adbf9a.xml 2025-12-04T12:05:03.9275806Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d0c6827a4c24a96e.xml 2025-12-04T12:05:03.9558618Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eb0f3705043be69a.xml 2025-12-04T12:05:03.9869321Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b4b4a4cac36fd9fb.xml 2025-12-04T12:05:04.0142888Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5f059c7d0f50bda4.xml 2025-12-04T12:05:04.0465171Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c9ff80f1a6820ed2.xml 2025-12-04T12:05:04.0757538Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c2ef3587eca8dd3f.xml 2025-12-04T12:05:04.1030800Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f520a42aa1f82bba.xml 2025-12-04T12:05:04.1324558Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-26d6067018821038.xml 2025-12-04T12:05:04.1714302Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-323433d4d00bc615.xml 2025-12-04T12:05:04.2000704Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-171ced1c6f8950ca.xml 2025-12-04T12:05:04.2263554Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-25dfec8b4de20cd4.xml 2025-12-04T12:05:04.2547353Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-301e5b291449627e.xml 2025-12-04T12:05:04.2844797Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0b50a11bd83abca6.xml 2025-12-04T12:05:04.3088169Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28422965aaa270fa.xml 2025-12-04T12:05:04.3412378Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cc1d7ca9ed71a647.xml 2025-12-04T12:05:04.3690551Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28fdda0d819ffd5b.xml 2025-12-04T12:05:04.3969251Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2efd0ac46f0ffe8c.xml 2025-12-04T12:05:04.4294660Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a06616e9f3ca63a0.xml 2025-12-04T12:05:04.4581013Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ac835e731ca0e136.xml 2025-12-04T12:05:04.4861001Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b97506f71fc95dd3.xml 2025-12-04T12:05:04.5191431Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e0e0cb20cc598f84.xml 2025-12-04T12:05:04.5518477Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7ccf4f5fc7381dbf.xml 2025-12-04T12:05:04.5824443Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9c9073ed00c82380.xml 2025-12-04T12:05:04.6136059Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-24dd415c882be612.xml 2025-12-04T12:05:04.6432378Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ebb637005a79032b.xml 2025-12-04T12:05:04.6779298Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-64323540e9247afe.xml 2025-12-04T12:05:04.7067241Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a52cc93387944e77.xml 2025-12-04T12:05:04.7852166Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c3ebbd7913c9335f.xml 2025-12-04T12:05:04.8154286Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4acaad8a10b8235e.xml 2025-12-04T12:05:04.8490188Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7532210fc6e9ae4a.xml 2025-12-04T12:05:04.8766336Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d5d3164b3ad67fad.xml 2025-12-04T12:05:04.9071301Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4e5ad1e60a526460.xml 2025-12-04T12:05:04.9355036Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4c3c91d6f895ec60.xml 2025-12-04T12:05:04.9761418Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-95a17199eb5946d1.xml 2025-12-04T12:05:05.0076658Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-81b9e43a20c07e3d.xml 2025-12-04T12:05:05.0374404Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1de8202a5dd1888a.xml 2025-12-04T12:05:05.0666624Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c241afcfaa8e2579.xml 2025-12-04T12:05:05.0978934Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dbee4fdc7ea3ecd0.xml 2025-12-04T12:05:05.1259712Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-163cc6583cdf0c02.xml 2025-12-04T12:05:05.1569366Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a5ca9a61ebc16b36.xml 2025-12-04T12:05:05.1864385Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4c3d903e4063d32.xml 2025-12-04T12:05:05.2151128Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-967683d4f4b3f6ea.xml 2025-12-04T12:05:05.2451997Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dd59b77eb3bfc158.xml 2025-12-04T12:05:05.2749315Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-16ca540d5e1f852f.xml 2025-12-04T12:05:05.3029049Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3fca4d8c1020266e.xml 2025-12-04T12:05:05.3361441Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d72e902ee6d4abee.xml 2025-12-04T12:05:05.3683287Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ec9f5a8da88c6473.xml 2025-12-04T12:05:05.3981384Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cd768c9b8fb615f9.xml 2025-12-04T12:05:05.4407485Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eed072ccebef54d5.xml 2025-12-04T12:05:05.4703106Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e17832bbca358cbf.xml 2025-12-04T12:05:05.5015769Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2596f02ad0864233.xml 2025-12-04T12:05:05.5303704Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-35234e0a942debc1.xml 2025-12-04T12:05:05.5586847Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-11b937d6d736747a.xml 2025-12-04T12:05:05.6047833Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cef76ce1b71d0a4.xml 2025-12-04T12:05:05.6347320Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f5efe4ba459d5b44.xml 2025-12-04T12:05:05.6668036Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-92f9442ed796f73f.xml 2025-12-04T12:05:05.7008602Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bcaaee0a01e62e43.xml 2025-12-04T12:05:05.7296179Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-76ac61a88acbe22b.xml 2025-12-04T12:05:05.7577858Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8647c42278291c82.xml 2025-12-04T12:05:05.7950581Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a0ddea104bebcf1c.xml 2025-12-04T12:05:05.8266226Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c80d90eb9dbd0365.xml 2025-12-04T12:05:05.8827073Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-98804e2c83f300d6.xml 2025-12-04T12:05:06.2814738Z Uploading logs for 57119749259 to S3 2025-12-04T12:05:06.4062684Z Uploading artifacts took 0.50 seconds 2025-12-04T12:05:06.4063127Z inductor/test_op_dtype_prop 1/2 failed! 2025-12-04T12:05:06.4067343Z Running dynamo/test_callback 1/1 ... [2025-12-04 12:05:06.406572][10310.527791078] 2025-12-04T12:05:06.4067913Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:05:06.4072324Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_callback.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:05:06.407006] 2025-12-04T12:05:25.4486459Z 2025-12-04T12:05:25.4487986Z dynamo/test_callback 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_callback_1.1_3d6aff0243b6db4a_.log 2025-12-04T12:05:25.4490989Z Running 4 items in this shard: test/dynamo/test_callback.py::CallbackTests::test_callbacks_with_duplicate_prevention, test/dynamo/test_callback.py::CallbackTests::test_counter, test/dynamo/test_callback.py::CallbackTests::test_counter_assertion, test/dynamo/test_callback.py::CallbackTests::test_triggers 2025-12-04T12:05:25.4492562Z 2025-12-04T12:05:25.4492904Z Finished dynamo/test_callback 1/1 ... [2025-12-04 12:05:25.448458][10329.569676514], took 0.32min 2025-12-04T12:05:25.4745888Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_callback/dynamo.test_callback-042e319b1bc49e9e.xml 2025-12-04T12:05:25.5593712Z Running dynamo/test_compiler_bisector 1/1 ... [2025-12-04 12:05:25.559049][10329.680269378] 2025-12-04T12:05:25.5594325Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:05:25.5597338Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_compiler_bisector.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:05:25.559465] 2025-12-04T12:06:42.2745178Z 2025-12-04T12:06:42.2746391Z PRINTING LOG FILE of dynamo/test_compiler_bisector 1/1 (test/test-reports/dynamo.test_compiler_bisector_1.1_331143819e86e237_.log) 2025-12-04T12:06:42.2748328Z Test results will be stored in test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-9d75f5d2563f16c8.xml 2025-12-04T12:06:42.2749870Z ============================= test session starts ============================== 2025-12-04T12:06:42.2751117Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:06:42.2751888Z cachedir: .pytest_cache 2025-12-04T12:06:42.2752686Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:06:42.2754052Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:06:42.2754539Z configfile: pytest.ini 2025-12-04T12:06:42.2755330Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:06:42.2756175Z collecting ... collected 9 items 2025-12-04T12:06:42.2756586Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T12:06:42.2761799Z Running 9 items in this shard: test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_bad_decomp, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_bad_lowering, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_bisect_pre_grad_graph, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_crossref, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_eager_backend, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_joint_graph, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_pre_grad, test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_rng 2025-12-04T12:06:42.2765854Z 2025-12-04T12:06:42.2766257Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_bad_decomp PASSED [2.1923s] [ 11%] 2025-12-04T12:06:42.2767191Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_bad_lowering PASSED [4.3306s] [ 22%] 2025-12-04T12:06:42.2769116Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_bisect_pre_grad_graph W1204 12:05:41.747000 134425 site-packages/torch/_inductor/standalone_compile.py:435] [0/0] standalone_compile artifact generation failed, cannot save. Run with TORCH_LOGS=+torch._inductor.codecache to identify the problem 2025-12-04T12:06:42.2771641Z W1204 12:05:41.749000 134425 site-packages/torch/_inductor/standalone_compile.py:435] [0/0] standalone_compile artifact generation failed, cannot save. Run with TORCH_LOGS=+torch._inductor.codecache to identify the problem 2025-12-04T12:06:42.2773671Z W1204 12:05:42.144000 134425 site-packages/torch/_inductor/standalone_compile.py:435] [0/0] standalone_compile artifact generation failed, cannot save. Run with TORCH_LOGS=+torch._inductor.codecache to identify the problem 2025-12-04T12:06:42.2774842Z PASSED [1.2611s] [ 33%] 2025-12-04T12:06:42.2775413Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_crossref PASSED [0.2580s] [ 44%] 2025-12-04T12:06:42.2776341Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_eager_backend PASSED [0.0027s] [ 55%] 2025-12-04T12:06:42.2777653Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts ('RERUN', {'yellow': True}) [0.5374s] [ 66%] 2025-12-04T12:06:42.2778981Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts ('RERUN', {'yellow': True}) [0.3445s] [ 66%] 2025-12-04T12:06:42.2780131Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts FAILED [0.3371s] [ 66%] 2025-12-04T12:06:42.2780739Z 2025-12-04T12:06:42.2780885Z ==================================== RERUNS ==================================== 2025-12-04T12:06:42.2781463Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.2781999Z Traceback (most recent call last): 2025-12-04T12:06:42.2782766Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.2783573Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.2784370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.2785112Z if fn(): 2025-12-04T12:06:42.2785660Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.2786378Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.2787170Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.2788278Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.2789191Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.2790041Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.2790873Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.2791680Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.2792507Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.2793596Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.2794579Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.2795426Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.2796230Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.2797036Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.2797767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.2798536Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.2799058Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2799496Z 2025-12-04T12:06:42.2800220Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2801077Z 2025-12-04T12:06:42.2801082Z 2025-12-04T12:06:42.2801298Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2802209Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2802894Z 2025-12-04T12:06:42.2803177Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2803808Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2804294Z No bisection status found. 2025-12-04T12:06:42.2804656Z Starting bisection process with system: eager 2025-12-04T12:06:42.2805066Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2805527Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2806051Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2806529Z Moving to the next system: inductor 2025-12-04T12:06:42.2806974Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2808086Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2809060Z warnings.warn( 2025-12-04T12:06:42.2809506Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.2810045Z Traceback (most recent call last): 2025-12-04T12:06:42.2810801Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.2811600Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.2812376Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.2813120Z if fn(): 2025-12-04T12:06:42.2813661Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.2814376Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.2815165Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.2816042Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.2816947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.2817779Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.2818684Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.2819497Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.2820370Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.2821360Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.2822391Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.2823185Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.2823990Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.2824795Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.2825529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.2826298Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.2826850Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2827250Z 2025-12-04T12:06:42.2827966Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2828825Z 2025-12-04T12:06:42.2828832Z 2025-12-04T12:06:42.2829047Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2829956Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2830642Z 2025-12-04T12:06:42.2830921Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2831543Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2832023Z No bisection status found. 2025-12-04T12:06:42.2832381Z Starting bisection process with system: eager 2025-12-04T12:06:42.2832776Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2833221Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2833747Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2834218Z Moving to the next system: inductor 2025-12-04T12:06:42.2834662Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2835776Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2836750Z warnings.warn( 2025-12-04T12:06:42.2837125Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2837604Z Bisection status deleted. 2025-12-04T12:06:42.2837962Z Starting bisection process with system: eager 2025-12-04T12:06:42.2838356Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2838773Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2839298Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2839770Z Moving to the next system: inductor 2025-12-04T12:06:42.2840213Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2841313Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2842290Z warnings.warn( 2025-12-04T12:06:42.2842587Z =================================== FAILURES =================================== 2025-12-04T12:06:42.2843160Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.2843704Z Traceback (most recent call last): 2025-12-04T12:06:42.2844463Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.2845254Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.2846042Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.2846825Z if fn(): 2025-12-04T12:06:42.2847353Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.2848117Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.2848922Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.2849799Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.2850688Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.2851542Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.2852381Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.2853248Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.2854059Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.2855070Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.2856070Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.2856867Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.2857659Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.2858556Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.2859292Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.2860054Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.2860639Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2861044Z 2025-12-04T12:06:42.2861767Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2862617Z 2025-12-04T12:06:42.2862621Z 2025-12-04T12:06:42.2862853Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2863746Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2864450Z 2025-12-04T12:06:42.2864715Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2865353Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2865833Z No bisection status found. 2025-12-04T12:06:42.2866178Z Starting bisection process with system: eager 2025-12-04T12:06:42.2866590Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2867006Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2867512Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2867979Z Moving to the next system: inductor 2025-12-04T12:06:42.2868439Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2869537Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2870502Z warnings.warn( 2025-12-04T12:06:42.2870889Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2871365Z Bisection status deleted. 2025-12-04T12:06:42.2871707Z Starting bisection process with system: eager 2025-12-04T12:06:42.2872116Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2872538Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2873065Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2873563Z Moving to the next system: inductor 2025-12-04T12:06:42.2874019Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2875159Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2876116Z warnings.warn( 2025-12-04T12:06:42.2876496Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2876968Z Bisection status deleted. 2025-12-04T12:06:42.2877318Z Starting bisection process with system: eager 2025-12-04T12:06:42.2877712Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2878125Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2878643Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2879141Z Moving to the next system: inductor 2025-12-04T12:06:42.2879602Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2880698Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2881671Z warnings.warn( 2025-12-04T12:06:42.2882607Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-9d75f5d2563f16c8.xml - 2025-12-04T12:06:42.2883704Z =========================== short test summary info ============================ 2025-12-04T12:06:42.2884794Z FAILED [0.3371s] dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2885678Z 2025-12-04T12:06:42.2886434Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2887284Z 2025-12-04T12:06:42.2887289Z 2025-12-04T12:06:42.2887505Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2888613Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2889318Z 2025-12-04T12:06:42.2889587Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2890185Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:06:42.2890688Z ===================== 1 failed, 5 passed, 2 rerun in 9.31s ===================== 2025-12-04T12:06:42.2891139Z Bisection status deleted. 2025-12-04T12:06:42.2891456Z No bisection status found. 2025-12-04T12:06:42.2891759Z No bisection status found. 2025-12-04T12:06:42.2892064Z Got exit code 1 2025-12-04T12:06:42.2892338Z Retrying single test... 2025-12-04T12:06:42.2893131Z Test results will be stored in test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-704b849fb73fff94.xml 2025-12-04T12:06:42.2894023Z ============================= test session starts ============================== 2025-12-04T12:06:42.2894694Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:06:42.2895298Z cachedir: .pytest_cache 2025-12-04T12:06:42.2895999Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:06:42.2896788Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:06:42.2897138Z configfile: pytest.ini 2025-12-04T12:06:42.2897914Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:06:42.2898911Z collecting ... collected 9 items / 8 deselected / 1 selected 2025-12-04T12:06:42.2899976Z stepcurrent: skipping 5 already run items. Running only test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts 2025-12-04T12:06:42.2900863Z Running 1 items in this shard 2025-12-04T12:06:42.2901074Z 2025-12-04T12:06:42.2901690Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts ('RERUN', {'yellow': True}) [3.0493s] [100%] 2025-12-04T12:06:42.2902913Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts ('RERUN', {'yellow': True}) [0.3427s] [100%] 2025-12-04T12:06:42.2904053Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts FAILED [0.3370s] [100%] 2025-12-04T12:06:42.2904635Z 2025-12-04T12:06:42.2904792Z ==================================== RERUNS ==================================== 2025-12-04T12:06:42.2905366Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.2905956Z Traceback (most recent call last): 2025-12-04T12:06:42.2906725Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.2907524Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.2908307Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.2909053Z if fn(): 2025-12-04T12:06:42.2909593Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.2910310Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.2911095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.2911970Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.2912916Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.2913751Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.2914591Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.2915390Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.2916211Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.2917204Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.2918203Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.2918999Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.2919809Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.2920617Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.2921355Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.2922133Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.2922648Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2923049Z 2025-12-04T12:06:42.2923772Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2924637Z 2025-12-04T12:06:42.2924642Z 2025-12-04T12:06:42.2924860Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2925776Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2926461Z 2025-12-04T12:06:42.2926750Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2927408Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2927895Z No bisection status found. 2025-12-04T12:06:42.2928256Z Starting bisection process with system: eager 2025-12-04T12:06:42.2928684Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2929103Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2929618Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2930076Z Moving to the next system: inductor 2025-12-04T12:06:42.2930531Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2931633Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2932606Z warnings.warn( 2025-12-04T12:06:42.2933038Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.2933620Z Traceback (most recent call last): 2025-12-04T12:06:42.2934378Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.2935176Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.2935948Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.2936689Z if fn(): 2025-12-04T12:06:42.2937223Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.2937921Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.2938781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.2939664Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.2940615Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.2941460Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.2942297Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.2943099Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.2943907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.2944909Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.2945899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.2946689Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.2947479Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.2948299Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.2949033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.2949801Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.2950312Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2950709Z 2025-12-04T12:06:42.2951422Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2952284Z 2025-12-04T12:06:42.2952289Z 2025-12-04T12:06:42.2952504Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2953411Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2954093Z 2025-12-04T12:06:42.2954361Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2955026Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2955509Z No bisection status found. 2025-12-04T12:06:42.2955865Z Starting bisection process with system: eager 2025-12-04T12:06:42.2956312Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2956728Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2957244Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2957703Z Moving to the next system: inductor 2025-12-04T12:06:42.2958165Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2959265Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2960277Z warnings.warn( 2025-12-04T12:06:42.2960648Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2961125Z Bisection status deleted. 2025-12-04T12:06:42.2961480Z Starting bisection process with system: eager 2025-12-04T12:06:42.2961871Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2962288Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2962803Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2963263Z Moving to the next system: inductor 2025-12-04T12:06:42.2963717Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2964814Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2965787Z warnings.warn( 2025-12-04T12:06:42.2966084Z =================================== FAILURES =================================== 2025-12-04T12:06:42.2966697Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.2967247Z Traceback (most recent call last): 2025-12-04T12:06:42.2967991Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.2968797Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.2969581Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.2970328Z if fn(): 2025-12-04T12:06:42.2970850Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.2971566Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.2972363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.2973231Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.2974143Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.2974989Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.2975828Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.2976615Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.2977431Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.2978512Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.2979506Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.2980284Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.2981094Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.2981950Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.2982672Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.2983486Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.2984010Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.2984398Z 2025-12-04T12:06:42.2985141Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.2985992Z 2025-12-04T12:06:42.2985997Z 2025-12-04T12:06:42.2986228Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.2987125Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.2988079Z 2025-12-04T12:06:42.2988355Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.2988995Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2989481Z No bisection status found. 2025-12-04T12:06:42.2989829Z Starting bisection process with system: eager 2025-12-04T12:06:42.2990238Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2990661Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2991176Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2991649Z Moving to the next system: inductor 2025-12-04T12:06:42.2992112Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2993200Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2994180Z warnings.warn( 2025-12-04T12:06:42.2994640Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.2995124Z Bisection status deleted. 2025-12-04T12:06:42.2995464Z Starting bisection process with system: eager 2025-12-04T12:06:42.2995869Z Moving to the next system: aot_eager 2025-12-04T12:06:42.2996289Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.2996797Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.2997269Z Moving to the next system: inductor 2025-12-04T12:06:42.2997724Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.2998805Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.2999772Z warnings.warn( 2025-12-04T12:06:42.3000150Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3000627Z Bisection status deleted. 2025-12-04T12:06:42.3000973Z Starting bisection process with system: eager 2025-12-04T12:06:42.3001374Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3001797Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3002306Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3002774Z Moving to the next system: inductor 2025-12-04T12:06:42.3003226Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3004322Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3005284Z warnings.warn( 2025-12-04T12:06:42.3006231Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-704b849fb73fff94.xml - 2025-12-04T12:06:42.3007338Z =========================== short test summary info ============================ 2025-12-04T12:06:42.3008472Z FAILED [0.3370s] dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.3009409Z 2025-12-04T12:06:42.3010120Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.3010979Z 2025-12-04T12:06:42.3010983Z 2025-12-04T12:06:42.3011200Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.3012109Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.3012793Z 2025-12-04T12:06:42.3013072Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.3013706Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:06:42.3014234Z =================== 1 failed, 8 deselected, 2 rerun in 3.76s =================== 2025-12-04T12:06:42.3014692Z Bisection status deleted. 2025-12-04T12:06:42.3015007Z No bisection status found. 2025-12-04T12:06:42.3015307Z No bisection status found. 2025-12-04T12:06:42.3015606Z Got exit code 1 2025-12-04T12:06:42.3015873Z Retrying single test... 2025-12-04T12:06:42.3016649Z Test results will be stored in test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-64c9d19a2b1bbb58.xml 2025-12-04T12:06:42.3017557Z ============================= test session starts ============================== 2025-12-04T12:06:42.3018284Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:06:42.3018882Z cachedir: .pytest_cache 2025-12-04T12:06:42.3019600Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:06:42.3020439Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:06:42.3020800Z configfile: pytest.ini 2025-12-04T12:06:42.3021567Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:06:42.3022508Z collecting ... collected 9 items / 8 deselected / 1 selected 2025-12-04T12:06:42.3023497Z stepcurrent: skipping 5 already run items. Running only test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts 2025-12-04T12:06:42.3024384Z Running 1 items in this shard 2025-12-04T12:06:42.3024593Z 2025-12-04T12:06:42.3025137Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts ('RERUN', {'yellow': True}) [3.0757s] [100%] 2025-12-04T12:06:42.3026373Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts ('RERUN', {'yellow': True}) [0.3483s] [100%] 2025-12-04T12:06:42.3027517Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts FAILED [0.3465s] [100%] 2025-12-04T12:06:42.3028106Z 2025-12-04T12:06:42.3028261Z ==================================== RERUNS ==================================== 2025-12-04T12:06:42.3028819Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.3029369Z Traceback (most recent call last): 2025-12-04T12:06:42.3030123Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.3030911Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.3031699Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.3032440Z if fn(): 2025-12-04T12:06:42.3032974Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.3033672Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.3034501Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.3035378Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.3036318Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.3037150Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.3037983Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.3038778Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.3039582Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.3040587Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.3041622Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.3042421Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.3043214Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.3044036Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.3044768Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.3045539Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.3046049Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.3046452Z 2025-12-04T12:06:42.3047170Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.3048022Z 2025-12-04T12:06:42.3048073Z 2025-12-04T12:06:42.3048293Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.3049202Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.3049887Z 2025-12-04T12:06:42.3050152Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.3050782Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3051264Z No bisection status found. 2025-12-04T12:06:42.3051609Z Starting bisection process with system: eager 2025-12-04T12:06:42.3052014Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3052432Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3052956Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3053414Z Moving to the next system: inductor 2025-12-04T12:06:42.3053881Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3054994Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3055959Z warnings.warn( 2025-12-04T12:06:42.3056412Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.3056966Z Traceback (most recent call last): 2025-12-04T12:06:42.3057728Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.3058606Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.3059400Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.3060148Z if fn(): 2025-12-04T12:06:42.3060674Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.3061397Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.3062280Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.3063168Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.3064095Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.3064942Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.3065781Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.3066584Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.3067386Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.3068425Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.3069418Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.3070201Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.3071000Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.3071815Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.3072546Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.3073301Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.3073820Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.3074204Z 2025-12-04T12:06:42.3074970Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.3075822Z 2025-12-04T12:06:42.3075827Z 2025-12-04T12:06:42.3076061Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.3076953Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.3077655Z 2025-12-04T12:06:42.3077919Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.3078554Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3079037Z No bisection status found. 2025-12-04T12:06:42.3079382Z Starting bisection process with system: eager 2025-12-04T12:06:42.3079787Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3080205Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3080715Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3081185Z Moving to the next system: inductor 2025-12-04T12:06:42.3081646Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3082749Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3083710Z warnings.warn( 2025-12-04T12:06:42.3084094Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3084572Z Bisection status deleted. 2025-12-04T12:06:42.3084908Z Starting bisection process with system: eager 2025-12-04T12:06:42.3085315Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3085731Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3086255Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3086714Z Moving to the next system: inductor 2025-12-04T12:06:42.3087170Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3088556Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3089528Z warnings.warn( 2025-12-04T12:06:42.3089845Z =================================== FAILURES =================================== 2025-12-04T12:06:42.3090466Z ______________ TestCompilerBisector.test_emulate_precision_casts _______________ 2025-12-04T12:06:42.3091011Z Traceback (most recent call last): 2025-12-04T12:06:42.3091752Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 245, in test_emulate_precision_casts 2025-12-04T12:06:42.3092553Z out = CompilerBisector.do_bisect(test_fn) 2025-12-04T12:06:42.3093336Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compiler_bisector.py", line 578, in do_bisect 2025-12-04T12:06:42.3094063Z if fn(): 2025-12-04T12:06:42.3094646Z File "/var/lib/jenkins/workspace/test/dynamo/test_compiler_bisector.py", line 241, in test_fn 2025-12-04T12:06:42.3095356Z compile_scale = torch.compile(calculate_scale)(inp) 2025-12-04T12:06:42.3096155Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 940, in compile_wrapper 2025-12-04T12:06:42.3097021Z raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1 2025-12-04T12:06:42.3097926Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 992, in _compile_fx_inner 2025-12-04T12:06:42.3098834Z raise InductorError(e, currentframe()).with_traceback( 2025-12-04T12:06:42.3099658Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 988, in _compile_fx_inner 2025-12-04T12:06:42.3100458Z mb_compiled_graph = fx_codegen_and_compile( 2025-12-04T12:06:42.3101276Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1757, in fx_codegen_and_compile 2025-12-04T12:06:42.3102330Z return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs) 2025-12-04T12:06:42.3103310Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 1471, in codegen_and_compile 2025-12-04T12:06:42.3104097Z _check_triton_bf16_support(graph) 2025-12-04T12:06:42.3104902Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2911, in _check_triton_bf16_support 2025-12-04T12:06:42.3105718Z warn_and_skip(node.get_device()) 2025-12-04T12:06:42.3106437Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py", line 2894, in warn_and_skip 2025-12-04T12:06:42.3107209Z raise SkipFrame("BF16 is not supported") 2025-12-04T12:06:42.3107731Z torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.3108119Z 2025-12-04T12:06:42.3108850Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.3109703Z 2025-12-04T12:06:42.3109708Z 2025-12-04T12:06:42.3109922Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.3110828Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.3111536Z 2025-12-04T12:06:42.3111806Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.3112441Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3112908Z No bisection status found. 2025-12-04T12:06:42.3113265Z Starting bisection process with system: eager 2025-12-04T12:06:42.3113670Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3114074Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3114594Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3115072Z Moving to the next system: inductor 2025-12-04T12:06:42.3115549Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3116653Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3117662Z warnings.warn( 2025-12-04T12:06:42.3118046Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3118511Z Bisection status deleted. 2025-12-04T12:06:42.3118873Z Starting bisection process with system: eager 2025-12-04T12:06:42.3119282Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3119687Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3120213Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3120685Z Moving to the next system: inductor 2025-12-04T12:06:42.3121180Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3122271Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3123249Z warnings.warn( 2025-12-04T12:06:42.3123642Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:06:42.3124109Z Bisection status deleted. 2025-12-04T12:06:42.3124470Z Starting bisection process with system: eager 2025-12-04T12:06:42.3124880Z Moving to the next system: aot_eager 2025-12-04T12:06:42.3125297Z Moving to the next system: aot_eager_decomp_partition 2025-12-04T12:06:42.3125807Z Moving to the next system: aot_eager_decomp_partition_crossref 2025-12-04T12:06:42.3126284Z Moving to the next system: inductor 2025-12-04T12:06:42.3126742Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:06:42.3127861Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx.py:2891: UserWarning: Tesla T4 does not support bfloat16 compilation natively, skipping 2025-12-04T12:06:42.3128830Z warnings.warn( 2025-12-04T12:06:42.3129787Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-64c9d19a2b1bbb58.xml - 2025-12-04T12:06:42.3130887Z =========================== short test summary info ============================ 2025-12-04T12:06:42.3131963Z FAILED [0.3465s] dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts - torch._inductor.exc.InductorError: SkipFrame: BF16 is not supported 2025-12-04T12:06:42.3132865Z 2025-12-04T12:06:42.3133574Z Set TORCHDYNAMO_VERBOSE=1 for the internal stack trace (please do this especially if you're reporting a bug to PyTorch). For even more developer context, set TORCH_LOGS="+dynamo" 2025-12-04T12:06:42.3134432Z 2025-12-04T12:06:42.3134437Z 2025-12-04T12:06:42.3134659Z To execute this test, run the following from the base repo dir: 2025-12-04T12:06:42.3135564Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/dynamo/test_compiler_bisector.py TestCompilerBisector.test_emulate_precision_casts 2025-12-04T12:06:42.3136248Z 2025-12-04T12:06:42.3136525Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:06:42.3137106Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:06:42.3137632Z =================== 1 failed, 8 deselected, 2 rerun in 3.80s =================== 2025-12-04T12:06:42.3138087Z Bisection status deleted. 2025-12-04T12:06:42.3138463Z No bisection status found. 2025-12-04T12:06:42.3138784Z No bisection status found. 2025-12-04T12:06:42.3139087Z Got exit code 1 2025-12-04T12:06:42.3139709Z FAILED CONSISTENTLY: test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts 2025-12-04T12:06:42.3140735Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T12:06:42.3141930Z Test results will be stored in test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-c370485b718ab3d2.xml 2025-12-04T12:06:42.3142839Z ============================= test session starts ============================== 2025-12-04T12:06:42.3143529Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:06:42.3144133Z cachedir: .pytest_cache 2025-12-04T12:06:42.3144849Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:06:42.3145636Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:06:42.3145974Z configfile: pytest.ini 2025-12-04T12:06:42.3146753Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:06:42.3147695Z collecting ... collected 9 items / 6 deselected / 3 selected 2025-12-04T12:06:42.3148210Z stepcurrent: skipping 6 already run items. 2025-12-04T12:06:42.3148594Z Running 3 items in this shard 2025-12-04T12:06:42.3148800Z 2025-12-04T12:06:42.3149212Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_joint_graph PASSED [5.4012s] [ 33%] 2025-12-04T12:06:42.3151544Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_pre_grad SKIPPED [0.0008s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/164462 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 66%] 2025-12-04T12:06:42.3155233Z dynamo/test_compiler_bisector.py::TestCompilerBisector::test_rng SKIPPED [0.0007s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/139590 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [100%] 2025-12-04T12:06:42.3157090Z 2025-12-04T12:06:42.3157895Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-c370485b718ab3d2.xml - 2025-12-04T12:06:42.3159021Z ================== 1 passed, 2 skipped, 6 deselected in 5.43s ================== 2025-12-04T12:06:42.3159956Z The following tests failed consistently: ['test/dynamo/test_compiler_bisector.py::TestCompilerBisector::test_emulate_precision_casts'] 2025-12-04T12:06:42.3160667Z 2025-12-04T12:06:42.3161260Z FINISHED PRINTING LOG FILE of dynamo/test_compiler_bisector 1/1 (test/test-reports/dynamo.test_compiler_bisector_1.1_331143819e86e237_.log) 2025-12-04T12:06:42.3161974Z 2025-12-04T12:06:42.3162365Z Finished dynamo/test_compiler_bisector 1/1 ... [2025-12-04 12:06:42.274728][10406.395945347], took 1.28min 2025-12-04T12:06:42.3163736Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-9d75f5d2563f16c8.xml 2025-12-04T12:06:42.3774042Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-704b849fb73fff94.xml 2025-12-04T12:06:42.4062044Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-64c9d19a2b1bbb58.xml 2025-12-04T12:06:42.4386417Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-c370485b718ab3d2.xml 2025-12-04T12:06:43.0361105Z Uploading logs for 57119749259 to S3 2025-12-04T12:06:43.1455792Z Uploading artifacts took 0.67 seconds 2025-12-04T12:06:43.1456223Z dynamo/test_compiler_bisector 1/1 failed! 2025-12-04T12:06:43.1461318Z Running inductor/test_cudagraph_trees_expandable_segments 1/1 ... [2025-12-04 12:06:43.145937][10407.26715693] 2025-12-04T12:06:43.1462269Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:06:43.1466349Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cudagraph_trees_expandable_segments.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:06:43.146390] 2025-12-04T12:10:04.0318392Z 2025-12-04T12:10:04.0320081Z PRINTING LOG FILE of inductor/test_cudagraph_trees_expandable_segments 1/1 (test/test-reports/inductor.test_cudagraph_trees_expandable_segments_1.1_4e86984065370f2d_.log) 2025-12-04T12:10:04.0323254Z /var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees_expandable_segments.py:43: FutureWarning: torch.cuda._set_allocator_settings is deprecated. Use torch._C._accelerator_setAllocatorSettings instead. 2025-12-04T12:10:04.0326136Z torch.cuda.memory._set_allocator_settings("expandable_segments:True") 2025-12-04T12:10:04.0327911Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to ./.pytorch-disabled-tests.json 2025-12-04T12:10:04.0330812Z Test results will be stored in test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-cfe1d81d1ed43800.xml 2025-12-04T12:10:04.0332990Z ============================= test session starts ============================== 2025-12-04T12:10:04.0334230Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:04.0335365Z cachedir: .pytest_cache 2025-12-04T12:10:04.0336735Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:04.0338355Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:10:04.0339014Z configfile: pytest.ini 2025-12-04T12:10:04.0340699Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:10:04.0342296Z collecting ... collected 152 items 2025-12-04T12:10:04.0343088Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T12:10:04.0518858Z Running 152 items in this shard: test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_accumulate_grad, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_accumulate_multiple_recordings, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_alias_of_parameter, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliased_output_checkpoint, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliased_static_parameter, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliased_storage_single_weakref, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliasing_static_ref, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_amp_cache_disabled, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_backward_gets_cached_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cache_hit_forward_miss_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cached_boxed_forward_device_index, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cached_forward_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_checkpoint_shared_output_storage_deallocation, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_checkpointing_resets_persistent_refs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cleanup, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_compiled_autograd_static_input_params, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_constant_output, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_conv_benchmark, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cpp_wrapper, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_capture_sizes, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_capture_sizes1, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_capture_sizes2, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_or_error, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_dynamic_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_dynamic_warmup, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_empty_cpu_tensor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_empty_storage, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_end_recording_early, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_error_on_dealloc_use, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_error_on_dealloc_use2, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_execution_into_recording, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_expanded_inputs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times_due_to_cudagraph_managed_tensor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times_warn_only_once, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_backward_not_called_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_backward_not_called_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_generation, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_with_skipped_cudagraphed_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_frozen_fn, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_function_compiled_multiple_times, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_buffer_reuse, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_condition_op, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_only, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_op_and_dynamic_shapes, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar1, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar2, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar3, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar4, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar_device_put, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar_multiple, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar_mutation, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_tensor_symints, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_dynamoc_shapes, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_mutation, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_mutation_late_free, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_no_split, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_rule, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_dynamic_scalar_inputs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_dynamic_shapes, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_foreach_op, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_forward_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_forward_backward_not_called, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_forward_with_skipped_cudagraphed_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_fused_scheduler_node, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_gc, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_item, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_kernel_reuse_autotune_at_compile_time_False, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_kernel_reuse_autotune_at_compile_time_True, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_log_message, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_multiple_devices_msg, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reduce_overhead_mode_effectiveness, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_cpu_and_gpu, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_cpu_and_gpu_interleave, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_custom_op_with_no_dependency, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_custom_op_with_no_dependency1, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_simple, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint_cat_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint_from_mutation_index, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint_from_nested_indirect_indexing, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_unbacked_symint, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_unbacked_symint_multi_output_layout, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_view_fallback, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_with_memory_plan_reuse, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_item, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero_backend, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero_graph_breaks, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_index_put, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_live_outputs_multiple_graphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_manager_per_device, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mark_step, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_meta_tensor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_child_node, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_custom_module, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_custom_module_buffer, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_parent_node, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_builtin_module, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_builtin_module_buffers, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_param_inputs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multinomial, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multiple_devices_msg_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multiple_devices_msg_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multiple_insert_removal_caching, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_only_once_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_only_once_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_config_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_config_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_on_inp_backend_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_on_inp_backend_inductor, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_reinplaced, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_no_rerecord_with_mark_static_address, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_not_fallback_to_eager_if_have_not_recompiling_too_many_times, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_output_alias, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_peristed_output_livenes, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_remove_hooks_on_cached_tensors, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_rerecord_if_static_input_address_changed, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_rng_non_trees, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_rng_trees, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_run_simple, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_separate_recordings, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_side_stream_memory_allocation, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_single_stream_use, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_cpp_wrapper, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_cudagraph_unsafe_ops, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_if_dynamic_shape_limit_reached1, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_if_dynamic_shape_limit_reached2, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_symbolic, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_sparsity, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_static_inputs_address_mutation_log, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_storage_access_error, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_tensor_constant_mutation, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_tensor_dies_between_checkpoint, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_tensor_no_longer_in_pool, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_input_no_cudagraphs, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_input_non_trees, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_input_trees, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_parameter, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unstable_ptr, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_warmup_stream_sync, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_warn_on_pending_backward, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_warn_once_if_dynamic_shape_limit_reached, test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_workspace_allocation_error 2025-12-04T12:10:04.0643124Z 2025-12-04T12:10:04.0643885Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_accumulate_grad <- test/inductor/test_cudagraph_trees.py PASSED [4.7967s] [ 0%] 2025-12-04T12:10:04.0645685Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_accumulate_multiple_recordings <- test/inductor/test_cudagraph_trees.py PASSED [1.5749s] [ 1%] 2025-12-04T12:10:04.0647347Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_alias_of_parameter <- test/inductor/test_cudagraph_trees.py PASSED [0.3972s] [ 1%] 2025-12-04T12:10:04.0649014Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliased_output_checkpoint <- test/inductor/test_cudagraph_trees.py PASSED [0.1917s] [ 2%] 2025-12-04T12:10:04.0650669Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliased_static_parameter <- test/inductor/test_cudagraph_trees.py PASSED [0.1871s] [ 3%] 2025-12-04T12:10:04.0652886Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliased_storage_single_weakref <- test/inductor/test_cudagraph_trees.py W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] Graph break from `Tensor.item()`, consider setting: 2025-12-04T12:10:04.0654968Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] torch._dynamo.config.capture_scalar_outputs = True 2025-12-04T12:10:04.0655998Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] or: 2025-12-04T12:10:04.0656976Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] env TORCHDYNAMO_CAPTURE_SCALAR_OUTPUTS=1 2025-12-04T12:10:04.0658183Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] to include these operations in the captured graph. 2025-12-04T12:10:04.0659251Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] 2025-12-04T12:10:04.0660174Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] Graph break: from user code at: 2025-12-04T12:10:04.0661772Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 1280, in torch_dynamo_resume_in_foo_at_1278 2025-12-04T12:10:04.0663264Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] x_alias2 = x[ind:] 2025-12-04T12:10:04.0664143Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] 2025-12-04T12:10:04.0664902Z W1204 12:06:59.495000 135215 site-packages/torch/_dynamo/variables/tensor.py:1073] [1/0] 2025-12-04T12:10:04.0665488Z PASSED [0.4223s] [ 3%] 2025-12-04T12:10:04.0666859Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_aliasing_static_ref <- test/inductor/test_cudagraph_trees.py W1204 12:07:00.806000 135215 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T12:10:04.0668284Z PASSED [1.5155s] [ 4%] 2025-12-04T12:10:04.0669181Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_amp_cache_disabled <- test/inductor/test_cudagraph_trees.py PASSED [0.7723s] [ 5%] 2025-12-04T12:10:04.0670835Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_backward_gets_cached_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [1.8833s] [ 5%] 2025-12-04T12:10:04.0672539Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cache_hit_forward_miss_backward <- test/inductor/test_cudagraph_trees.py PASSED [1.8263s] [ 6%] 2025-12-04T12:10:04.0674409Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cached_boxed_forward_device_index <- test/inductor/test_cudagraph_trees.py SKIPPED [0.0004s] (requires multiple cuda devices) [ 7%] 2025-12-04T12:10:04.0676236Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cached_forward_backward <- test/inductor/test_cudagraph_trees.py PASSED [1.3471s] [ 7%] 2025-12-04T12:10:04.0677990Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_checkpoint_shared_output_storage_deallocation <- test/inductor/test_cudagraph_trees.py PASSED [0.1923s] [ 8%] 2025-12-04T12:10:04.0679858Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_checkpointing_resets_persistent_refs <- test/inductor/test_cudagraph_trees.py PASSED [0.4298s] [ 9%] 2025-12-04T12:10:04.0681528Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cleanup <- test/inductor/test_cudagraph_trees.py PASSED [0.6767s] [ 9%] 2025-12-04T12:10:04.0683156Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_compiled_autograd_static_input_params <- test/inductor/test_cudagraph_trees.py PASSED [1.0461s] [ 10%] 2025-12-04T12:10:04.0684810Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_constant_output <- test/inductor/test_cudagraph_trees.py PASSED [0.7222s] [ 11%] 2025-12-04T12:10:04.0686367Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_conv_benchmark <- test/inductor/test_cudagraph_trees.py PASSED [2.3406s] [ 11%] 2025-12-04T12:10:04.0688113Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cpp_wrapper <- test/inductor/test_cudagraph_trees.py PASSED [2.3680s] [ 12%] 2025-12-04T12:10:04.0689696Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_capture_sizes <- test/inductor/test_cudagraph_trees.py PASSED [1.1034s] [ 13%] 2025-12-04T12:10:04.0691338Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_capture_sizes1 <- test/inductor/test_cudagraph_trees.py PASSED [0.5318s] [ 13%] 2025-12-04T12:10:04.0692959Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_capture_sizes2 <- test/inductor/test_cudagraph_trees.py PASSED [0.5527s] [ 14%] 2025-12-04T12:10:04.0694573Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_cudagraph_or_error <- test/inductor/test_cudagraph_trees.py PASSED [0.3690s] [ 15%] 2025-12-04T12:10:04.0696211Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_dynamic_backward <- test/inductor/test_cudagraph_trees.py PASSED [1.5916s] [ 15%] 2025-12-04T12:10:04.0697763Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_dynamic_warmup <- test/inductor/test_cudagraph_trees.py PASSED [0.2264s] [ 16%] 2025-12-04T12:10:04.0699351Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_empty_cpu_tensor <- test/inductor/test_cudagraph_trees.py PASSED [0.4095s] [ 17%] 2025-12-04T12:10:04.0700898Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_empty_storage <- test/inductor/test_cudagraph_trees.py PASSED [0.7205s] [ 17%] 2025-12-04T12:10:04.0702451Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_end_recording_early <- test/inductor/test_cudagraph_trees.py PASSED [0.7340s] [ 18%] 2025-12-04T12:10:04.0704038Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_error_on_dealloc_use <- test/inductor/test_cudagraph_trees.py PASSED [0.3899s] [ 19%] 2025-12-04T12:10:04.0705623Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_error_on_dealloc_use2 <- test/inductor/test_cudagraph_trees.py PASSED [0.3915s] [ 19%] 2025-12-04T12:10:04.0707221Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_execution_into_recording <- test/inductor/test_cudagraph_trees.py PASSED [0.7539s] [ 20%] 2025-12-04T12:10:04.0708818Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_expanded_inputs <- test/inductor/test_cudagraph_trees.py PASSED [0.4174s] [ 21%] 2025-12-04T12:10:04.0710511Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times <- test/inductor/test_cudagraph_trees.py PASSED [0.4779s] [ 21%] 2025-12-04T12:10:04.0712545Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times_due_to_cudagraph_managed_tensor <- test/inductor/test_cudagraph_trees.py PASSED [0.5670s] [ 22%] 2025-12-04T12:10:04.0714595Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times_warn_only_once <- test/inductor/test_cudagraph_trees.py PASSED [0.4830s] [ 23%] 2025-12-04T12:10:04.0716400Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_backward <- test/inductor/test_cudagraph_trees.py PASSED [0.8064s] [ 23%] 2025-12-04T12:10:04.0718107Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_backward_not_called_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.4042s] [ 24%] 2025-12-04T12:10:04.0719943Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_backward_not_called_backend_inductor <- test/inductor/test_cudagraph_trees.py PASSED [0.6590s] [ 25%] 2025-12-04T12:10:04.0721701Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_generation <- test/inductor/test_cudagraph_trees.py PASSED [0.8763s] [ 25%] 2025-12-04T12:10:04.0723405Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_forward_with_skipped_cudagraphed_backward <- test/inductor/test_cudagraph_trees.py PASSED [0.5272s] [ 26%] 2025-12-04T12:10:04.0725049Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_frozen_fn <- test/inductor/test_cudagraph_trees.py PASSED [0.3894s] [ 26%] 2025-12-04T12:10:04.0726648Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_function_compiled_multiple_times <- test/inductor/test_cudagraph_trees.py PASSED [0.6852s] [ 27%] 2025-12-04T12:10:04.0728718Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition <- test/inductor/test_cudagraph_trees.py W1204 12:07:29.116000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0730366Z W1204 12:07:29.118000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0731002Z PASSED [1.0952s] [ 28%] 2025-12-04T12:10:04.0732346Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_buffer_reuse <- test/inductor/test_cudagraph_trees.py W1204 12:07:30.268000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0734041Z W1204 12:07:30.270000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0734986Z W1204 12:07:30.274000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0735913Z W1204 12:07:30.276000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0736561Z PASSED [1.1863s] [ 28%] 2025-12-04T12:10:04.0737522Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_condition_op <- test/inductor/test_cudagraph_trees.py PASSED [1.0587s] [ 29%] 2025-12-04T12:10:04.0739254Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_only <- test/inductor/test_cudagraph_trees.py PASSED [1.7690s] [ 30%] 2025-12-04T12:10:04.0741350Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_op_and_dynamic_shapes <- test/inductor/test_cudagraph_trees.py W1204 12:07:34.231000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0743100Z W1204 12:07:34.233000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0744045Z W1204 12:07:35.210000 135215 site-packages/torch/_inductor/utils.py:2565] [0/1] DeviceCopy in input program 2025-12-04T12:10:04.0744993Z W1204 12:07:35.212000 135215 site-packages/torch/_inductor/utils.py:2565] [0/1] DeviceCopy in input program 2025-12-04T12:10:04.0745645Z PASSED [2.2635s] [ 30%] 2025-12-04T12:10:04.0747014Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar1 <- test/inductor/test_cudagraph_trees.py W1204 12:07:36.413000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0748432Z PASSED [0.9827s] [ 31%] 2025-12-04T12:10:04.0749754Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar2 <- test/inductor/test_cudagraph_trees.py W1204 12:07:37.402000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0751141Z PASSED [0.9902s] [ 32%] 2025-12-04T12:10:04.0752458Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar3 <- test/inductor/test_cudagraph_trees.py W1204 12:07:38.397000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0753881Z PASSED [0.9855s] [ 32%] 2025-12-04T12:10:04.0755211Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar4 <- test/inductor/test_cudagraph_trees.py W1204 12:07:39.384000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0756596Z PASSED [0.9851s] [ 33%] 2025-12-04T12:10:04.0757942Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar_device_put <- test/inductor/test_cudagraph_trees.py W1204 12:07:40.363000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0759668Z W1204 12:07:40.365000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0760316Z PASSED [0.9388s] [ 34%] 2025-12-04T12:10:04.0761717Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar_multiple <- test/inductor/test_cudagraph_trees.py W1204 12:07:41.312000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0763133Z PASSED [0.9942s] [ 34%] 2025-12-04T12:10:04.0764495Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_scalar_mutation <- test/inductor/test_cudagraph_trees.py W1204 12:07:42.299000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0766216Z W1204 12:07:42.301000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0766862Z PASSED [0.9627s] [ 35%] 2025-12-04T12:10:04.0767824Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_cpu_tensor_symints <- test/inductor/test_cudagraph_trees.py PASSED [2.0739s] [ 36%] 2025-12-04T12:10:04.0769515Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op <- test/inductor/test_cudagraph_trees.py PASSED [0.5967s] [ 36%] 2025-12-04T12:10:04.0771232Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_dynamoc_shapes <- test/inductor/test_cudagraph_trees.py PASSED [0.8829s] [ 37%] 2025-12-04T12:10:04.0772999Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_mutation <- test/inductor/test_cudagraph_trees.py PASSED [0.4827s] [ 38%] 2025-12-04T12:10:04.0774782Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_mutation_late_free <- test/inductor/test_cudagraph_trees.py PASSED [0.5923s] [ 38%] 2025-12-04T12:10:04.0776543Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_op_no_split <- test/inductor/test_cudagraph_trees.py PASSED [0.7704s] [ 39%] 2025-12-04T12:10:04.0778327Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_custom_rule <- test/inductor/test_cudagraph_trees.py PASSED [0.9113s] [ 40%] 2025-12-04T12:10:04.0780440Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_dynamic_scalar_inputs <- test/inductor/test_cudagraph_trees.py W1204 12:07:49.766000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0782212Z W1204 12:07:49.768000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0783161Z W1204 12:07:50.629000 135215 site-packages/torch/_inductor/utils.py:2565] [0/1] DeviceCopy in input program 2025-12-04T12:10:04.0784087Z W1204 12:07:50.631000 135215 site-packages/torch/_inductor/utils.py:2565] [0/1] DeviceCopy in input program 2025-12-04T12:10:04.0784732Z PASSED [1.5031s] [ 40%] 2025-12-04T12:10:04.0785705Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_dynamic_shapes <- test/inductor/test_cudagraph_trees.py PASSED [0.6007s] [ 41%] 2025-12-04T12:10:04.0787415Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_foreach_op <- test/inductor/test_cudagraph_trees.py PASSED [0.4389s] [ 42%] 2025-12-04T12:10:04.0789651Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_forward_backward <- test/inductor/test_cudagraph_trees.py W1204 12:07:52.228000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0791365Z W1204 12:07:52.234000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0792018Z PASSED [1.3294s] [ 42%] 2025-12-04T12:10:04.0801614Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_forward_backward_not_called <- test/inductor/test_cudagraph_trees.py PASSED [0.6834s] [ 43%] 2025-12-04T12:10:04.0803823Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_forward_with_skipped_cudagraphed_backward <- test/inductor/test_cudagraph_trees.py PASSED [0.5419s] [ 44%] 2025-12-04T12:10:04.0805701Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_fused_scheduler_node <- test/inductor/test_cudagraph_trees.py PASSED [0.4478s] [ 44%] 2025-12-04T12:10:04.0807377Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_gc <- test/inductor/test_cudagraph_trees.py PASSED [0.6281s] [ 45%] 2025-12-04T12:10:04.0808968Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_item <- test/inductor/test_cudagraph_trees.py PASSED [0.4194s] [ 46%] 2025-12-04T12:10:04.0811126Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_kernel_reuse_autotune_at_compile_time_False <- test/inductor/test_cudagraph_trees.py W1204 12:07:56.228000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0812988Z W1204 12:07:56.230000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0813944Z W1204 12:07:56.231000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0814602Z PASSED [1.0430s] [ 46%] 2025-12-04T12:10:04.0816094Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_kernel_reuse_autotune_at_compile_time_True <- test/inductor/test_cudagraph_trees.py W1204 12:07:57.271000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0817904Z W1204 12:07:57.273000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0818927Z W1204 12:07:57.274000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0819580Z PASSED [1.0607s] [ 47%] 2025-12-04T12:10:04.0820603Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_log_message <- test/inductor/test_cudagraph_trees.py PASSED [0.9842s] [ 48%] 2025-12-04T12:10:04.0822479Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_multiple_devices_msg <- test/inductor/test_cudagraph_trees.py SKIPPED [0.0003s] (requires multiple cuda devices) [ 48%] 2025-12-04T12:10:04.0824916Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reduce_overhead_mode_effectiveness <- test/inductor/test_cudagraph_trees.py W1204 12:07:59.377000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0826732Z W1204 12:07:59.379000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0827383Z PASSED [1.0710s] [ 49%] 2025-12-04T12:10:04.0828411Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_cpu_and_gpu <- test/inductor/test_cudagraph_trees.py PASSED [1.1311s] [ 50%] 2025-12-04T12:10:04.0830593Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_cpu_and_gpu_interleave <- test/inductor/test_cudagraph_trees.py W1204 12:08:01.642000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0832071Z PASSED [1.2217s] [ 50%] 2025-12-04T12:10:04.0833149Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_custom_op_with_no_dependency <- test/inductor/test_cudagraph_trees.py PASSED [0.8231s] [ 51%] 2025-12-04T12:10:04.0835051Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_reorder_custom_op_with_no_dependency1 <- test/inductor/test_cudagraph_trees.py PASSED [0.9056s] [ 51%] 2025-12-04T12:10:04.0837221Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_simple <- test/inductor/test_cudagraph_trees.py W1204 12:08:04.538000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0838883Z W1204 12:08:04.540000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0839535Z PASSED [1.1680s] [ 52%] 2025-12-04T12:10:04.0840845Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint <- test/inductor/test_cudagraph_trees.py W1204 12:08:05.708000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0842492Z W1204 12:08:05.710000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0843435Z W1204 12:08:06.640000 135215 site-packages/torch/_inductor/utils.py:2565] [0/1] DeviceCopy in input program 2025-12-04T12:10:04.0844376Z W1204 12:08:06.642000 135215 site-packages/torch/_inductor/utils.py:2565] [0/1] DeviceCopy in input program 2025-12-04T12:10:04.0845020Z PASSED [2.0960s] [ 53%] 2025-12-04T12:10:04.0846011Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint_cat_backward <- test/inductor/test_cudagraph_trees.py PASSED [1.5953s] [ 53%] 2025-12-04T12:10:04.0847795Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint_from_mutation_index <- test/inductor/test_cudagraph_trees.py PASSED [0.8344s] [ 54%] 2025-12-04T12:10:04.0849657Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_symint_from_nested_indirect_indexing <- test/inductor/test_cudagraph_trees.py PASSED [0.6403s] [ 55%] 2025-12-04T12:10:04.0851852Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_unbacked_symint <- test/inductor/test_cudagraph_trees.py W1204 12:08:10.941000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0853594Z W1204 12:08:10.943000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0854232Z PASSED [1.1734s] [ 55%] 2025-12-04T12:10:04.0855303Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_unbacked_symint_multi_output_layout <- test/inductor/test_cudagraph_trees.py PASSED [1.0146s] [ 56%] 2025-12-04T12:10:04.0857611Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse <- test/inductor/test_cudagraph_trees.py W1204 12:08:13.371000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0859470Z W1204 12:08:13.373000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0860137Z ('RERUN', {'yellow': True}) [1.4195s] [ 57%] 2025-12-04T12:10:04.0861716Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse <- test/inductor/test_cudagraph_trees.py W1204 12:08:14.501000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0863507Z W1204 12:08:14.503000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0864191Z ('RERUN', {'yellow': True}) [1.3067s] [ 57%] 2025-12-04T12:10:04.0865692Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse <- test/inductor/test_cudagraph_trees.py W1204 12:08:15.808000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0867481Z W1204 12:08:15.810000 135215 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.0868135Z FAILED [1.3074s] [ 57%] 2025-12-04T12:10:04.0869218Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse <- test/inductor/test_cudagraph_trees.py ERROR [0.0001s] [ 57%] 2025-12-04T12:10:04.0870215Z 2025-12-04T12:10:04.0870372Z ==================================== RERUNS ==================================== 2025-12-04T12:10:04.0870962Z ___ CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse ___ 2025-12-04T12:10:04.0871542Z Traceback (most recent call last): 2025-12-04T12:10:04.0872417Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4171, in test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.0873320Z self.assertEqual(eager_out, compiled_out) 2025-12-04T12:10:04.0874067Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T12:10:04.0874835Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T12:10:04.0875681Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T12:10:04.0876563Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T12:10:04.0877050Z AssertionError: Tensor-likes are not close! 2025-12-04T12:10:04.0877319Z 2025-12-04T12:10:04.0877457Z Mismatched elements: 64 / 128 (50.0%) 2025-12-04T12:10:04.0878020Z Greatest absolute difference: 2.7803521156311035 at index (65,) (up to 1e-05 allowed) 2025-12-04T12:10:04.0878728Z Greatest relative difference: inf at index (64,) (up to 1.3e-06 allowed) 2025-12-04T12:10:04.0879139Z 2025-12-04T12:10:04.0879261Z The failure occurred for item [0] 2025-12-04T12:10:04.0879489Z 2025-12-04T12:10:04.0879717Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:04.0880743Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cudagraph_trees.py CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.0881567Z 2025-12-04T12:10:04.0881837Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:04.0882516Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0882996Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0883361Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0884001Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0885123Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0886030Z graph_break [] 2025-12-04T12:10:04.0886397Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0886944Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0887647Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0888516Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0888767Z 2025-12-04T12:10:04.0888904Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0889367Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0890084Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0890716Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0890940Z 2025-12-04T12:10:04.0891071Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0891638Z ___ CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse ___ 2025-12-04T12:10:04.0892201Z Traceback (most recent call last): 2025-12-04T12:10:04.0893067Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4171, in test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.0893981Z self.assertEqual(eager_out, compiled_out) 2025-12-04T12:10:04.0894797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T12:10:04.0895565Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T12:10:04.0896413Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T12:10:04.0897304Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T12:10:04.0897781Z AssertionError: Tensor-likes are not close! 2025-12-04T12:10:04.0898069Z 2025-12-04T12:10:04.0898250Z Mismatched elements: 64 / 128 (50.0%) 2025-12-04T12:10:04.0898820Z Greatest absolute difference: 2.7356221675872803 at index (90,) (up to 1e-05 allowed) 2025-12-04T12:10:04.0899548Z Greatest relative difference: inf at index (64,) (up to 1.3e-06 allowed) 2025-12-04T12:10:04.0899946Z 2025-12-04T12:10:04.0900070Z The failure occurred for item [0] 2025-12-04T12:10:04.0900311Z 2025-12-04T12:10:04.0900525Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:04.0901571Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cudagraph_trees.py CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.0902379Z 2025-12-04T12:10:04.0902661Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:04.0903284Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0903767Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0904144Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0904755Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0905865Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0906775Z graph_break [] 2025-12-04T12:10:04.0907156Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0907698Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0908457Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0909116Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0909399Z 2025-12-04T12:10:04.0909545Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0909993Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0910711Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0911354Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0911565Z 2025-12-04T12:10:04.0911695Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0912173Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0912656Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0913036Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0913675Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0914794Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0915704Z graph_break [] 2025-12-04T12:10:04.0916066Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0916612Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0917307Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0917960Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0918196Z 2025-12-04T12:10:04.0918326Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0918784Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0919543Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0920176Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0920403Z 2025-12-04T12:10:04.0920533Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0920927Z ==================================== ERRORS ==================================== 2025-12-04T12:10:04.0921605Z _ ERROR at teardown of CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse _ 2025-12-04T12:10:04.0922239Z Traceback (most recent call last): 2025-12-04T12:10:04.0922879Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 174, in tearDown 2025-12-04T12:10:04.0923570Z self.assertEqual(all_live_block_count(), 0) 2025-12-04T12:10:04.0924308Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T12:10:04.0925079Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T12:10:04.0925920Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T12:10:04.0926814Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T12:10:04.0927275Z AssertionError: Scalars are not equal! 2025-12-04T12:10:04.0927536Z 2025-12-04T12:10:04.0927644Z Expected 0 but got 2. 2025-12-04T12:10:04.0927938Z Absolute difference: 2 2025-12-04T12:10:04.0928225Z Relative difference: inf 2025-12-04T12:10:04.0928641Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0929119Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0929496Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0930089Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0931216Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0932127Z graph_break [] 2025-12-04T12:10:04.0932532Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0933086Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0933785Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0934475Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0934713Z 2025-12-04T12:10:04.0934844Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0935305Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0936015Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0936647Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0936875Z 2025-12-04T12:10:04.0937006Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0937478Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0937981Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0938421Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0939028Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0940149Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0941055Z graph_break [] 2025-12-04T12:10:04.0941416Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0941960Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0942657Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0943295Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0943539Z 2025-12-04T12:10:04.0943670Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0944133Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0944868Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0945512Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0945743Z 2025-12-04T12:10:04.0945872Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0946342Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0946805Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0947189Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0947791Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0948901Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0949811Z graph_break [] 2025-12-04T12:10:04.0950188Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0950739Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0951425Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0952074Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0952309Z 2025-12-04T12:10:04.0952453Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0952911Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0953610Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0954253Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0954463Z 2025-12-04T12:10:04.0954604Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0954993Z =================================== FAILURES =================================== 2025-12-04T12:10:04.0955598Z ___ CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse ___ 2025-12-04T12:10:04.0956189Z Traceback (most recent call last): 2025-12-04T12:10:04.0957106Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4171, in test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.0958010Z self.assertEqual(eager_out, compiled_out) 2025-12-04T12:10:04.0958762Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/test_case.py", line 113, in assertEqual 2025-12-04T12:10:04.0959586Z return super().assertEqual(x, y, *args, **kwargs) 2025-12-04T12:10:04.0960421Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 4284, in assertEqual 2025-12-04T12:10:04.0961315Z raise error_metas.pop()[0].to_error( # type: ignore[index] 2025-12-04T12:10:04.0961810Z AssertionError: Tensor-likes are not close! 2025-12-04T12:10:04.0962080Z 2025-12-04T12:10:04.0962218Z Mismatched elements: 64 / 128 (50.0%) 2025-12-04T12:10:04.0962758Z Greatest absolute difference: 2.709859848022461 at index (126,) (up to 1e-05 allowed) 2025-12-04T12:10:04.0963519Z Greatest relative difference: inf at index (64,) (up to 1.3e-06 allowed) 2025-12-04T12:10:04.0963920Z 2025-12-04T12:10:04.0964055Z The failure occurred for item [0] 2025-12-04T12:10:04.0964281Z 2025-12-04T12:10:04.0964510Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:04.0965534Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cudagraph_trees.py CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.0966362Z 2025-12-04T12:10:04.0966631Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:04.0967268Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0967748Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0968116Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0968726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0969883Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0970779Z graph_break [] 2025-12-04T12:10:04.0971153Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0971700Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0972403Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0973043Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0973293Z 2025-12-04T12:10:04.0973423Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0973881Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0974583Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0975222Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0975437Z 2025-12-04T12:10:04.0975585Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0976061Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0976520Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0976896Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0977503Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0978678Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0979586Z graph_break [] 2025-12-04T12:10:04.0979966Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0980516Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0981200Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0981861Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0982093Z 2025-12-04T12:10:04.0982283Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0982728Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0983443Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0984204Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0984414Z 2025-12-04T12:10:04.0984558Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0985021Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T12:10:04.0985504Z frames [('total', 1), ('ok', 1)] 2025-12-04T12:10:04.0985885Z stats [('calls_captured', 7), ('unique_graphs', 1)] 2025-12-04T12:10:04.0986480Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T12:10:04.0987609Z inductor [('triton_bundler_save_kernel', 8), ('extern_calls', 4), ('pattern_matcher_count', 2), ('pattern_matcher_nodes', 2), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1)] 2025-12-04T12:10:04.0988738Z graph_break [] 2025-12-04T12:10:04.0989121Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T12:10:04.0989658Z cudagraph partition due to non gpu ops. Found from : 2025-12-04T12:10:04.0990361Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4159, in foo 2025-12-04T12:10:04.0991013Z output1_cpu = output1.cpu() + 1 2025-12-04T12:10:04.0991247Z 2025-12-04T12:10:04.0991378Z cudagraph partition due to non gpu ops 2025-12-04T12:10:04.0991839Z cudagraph partition due to DeviceCopy ops. Found from : 2025-12-04T12:10:04.0992549Z File "/var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees.py", line 4161, in foo 2025-12-04T12:10:04.0993188Z x2 = output1_cpu.to("cuda") 2025-12-04T12:10:04.0993397Z 2025-12-04T12:10:04.0993528Z cudagraph partition into 3 partitions 2025-12-04T12:10:04.0994862Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-cfe1d81d1ed43800.xml - 2025-12-04T12:10:04.0996176Z =========================== short test summary info ============================ 2025-12-04T12:10:04.0997355Z FAILED [1.3074s] inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse - AssertionError: Tensor-likes are not close! 2025-12-04T12:10:04.0998328Z 2025-12-04T12:10:04.0998448Z Mismatched elements: 64 / 128 (50.0%) 2025-12-04T12:10:04.0999005Z Greatest absolute difference: 2.709859848022461 at index (126,) (up to 1e-05 allowed) 2025-12-04T12:10:04.0999724Z Greatest relative difference: inf at index (64,) (up to 1.3e-06 allowed) 2025-12-04T12:10:04.1000119Z 2025-12-04T12:10:04.1000252Z The failure occurred for item [0] 2025-12-04T12:10:04.1000476Z 2025-12-04T12:10:04.1000698Z To execute this test, run the following from the base repo dir: 2025-12-04T12:10:04.1001729Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_cudagraph_trees.py CudaGraphTreeTests.test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.1002537Z 2025-12-04T12:10:04.1002814Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T12:10:04.1004048Z ERROR [0.0001s] inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse - AssertionError: Scalars are not equal! 2025-12-04T12:10:04.1004997Z 2025-12-04T12:10:04.1005102Z Expected 0 but got 2. 2025-12-04T12:10:04.1005390Z Absolute difference: 2 2025-12-04T12:10:04.1005686Z Relative difference: inf 2025-12-04T12:10:04.1006054Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 2 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:10:04.1006619Z ===== 1 failed, 84 passed, 2 skipped, 1 error, 2 rerun in 84.89s (0:01:24) ===== 2025-12-04T12:10:04.1007103Z Got exit code 1 2025-12-04T12:10:04.1007374Z Retrying single test... 2025-12-04T12:10:04.1008602Z /var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees_expandable_segments.py:43: FutureWarning: torch.cuda._set_allocator_settings is deprecated. Use torch._C._accelerator_setAllocatorSettings instead. 2025-12-04T12:10:04.1010041Z torch.cuda.memory._set_allocator_settings("expandable_segments:True") 2025-12-04T12:10:04.1010979Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to ./.pytorch-disabled-tests.json 2025-12-04T12:10:04.1012459Z Test results will be stored in test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-1783f83983666f0c.xml 2025-12-04T12:10:04.1013566Z ============================= test session starts ============================== 2025-12-04T12:10:04.1014235Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:04.1014887Z cachedir: .pytest_cache 2025-12-04T12:10:04.1015593Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:04.1016381Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:10:04.1016732Z configfile: pytest.ini 2025-12-04T12:10:04.1017514Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:10:04.1018537Z collecting ... collected 152 items / 151 deselected / 1 selected 2025-12-04T12:10:04.1019764Z stepcurrent: skipping 86 already run items. Running only test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse 2025-12-04T12:10:04.1020878Z Running 1 items in this shard 2025-12-04T12:10:04.1021087Z 2025-12-04T12:10:04.1022400Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse <- test/inductor/test_cudagraph_trees.py W1204 12:08:33.347000 137631 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.1024194Z W1204 12:08:33.350000 137631 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.1024853Z PASSED [6.0797s] [100%] 2025-12-04T12:10:04.1025047Z 2025-12-04T12:10:04.1026038Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-1783f83983666f0c.xml - 2025-12-04T12:10:04.1027363Z ====================== 1 passed, 151 deselected in 6.11s ======================= 2025-12-04T12:10:04.1027788Z Got exit code 0 2025-12-04T12:10:04.1028197Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T12:10:04.1029581Z /var/lib/jenkins/workspace/test/inductor/test_cudagraph_trees_expandable_segments.py:43: FutureWarning: torch.cuda._set_allocator_settings is deprecated. Use torch._C._accelerator_setAllocatorSettings instead. 2025-12-04T12:10:04.1030979Z torch.cuda.memory._set_allocator_settings("expandable_segments:True") 2025-12-04T12:10:04.1031897Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to ./.pytorch-disabled-tests.json 2025-12-04T12:10:04.1033391Z Test results will be stored in test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-04ae990c731b58f8.xml 2025-12-04T12:10:04.1034513Z ============================= test session starts ============================== 2025-12-04T12:10:04.1035183Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:10:04.1035776Z cachedir: .pytest_cache 2025-12-04T12:10:04.1036493Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:10:04.1037289Z rootdir: /var/lib/jenkins/workspace 2025-12-04T12:10:04.1037634Z configfile: pytest.ini 2025-12-04T12:10:04.1038446Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T12:10:04.1039403Z collecting ... collected 152 items / 87 deselected / 65 selected 2025-12-04T12:10:04.1039946Z stepcurrent: skipping 87 already run items. 2025-12-04T12:10:04.1040323Z Running 65 items in this shard 2025-12-04T12:10:04.1040549Z 2025-12-04T12:10:04.1041722Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_view_fallback <- test/inductor/test_cudagraph_trees.py W1204 12:08:53.760000 137915 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.1043431Z W1204 12:08:53.761000 137915 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T12:10:04.1044079Z PASSED [4.8816s] [ 1%] 2025-12-04T12:10:04.1045550Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_with_memory_plan_reuse <- test/inductor/test_cudagraph_trees.py W1204 12:08:55.684000 137915 site-packages/torch/_inductor/utils.py:1703] [0/0] Not enough SMs to use max_autotune_gemm mode 2025-12-04T12:10:04.1047056Z PASSED [2.3574s] [ 3%] 2025-12-04T12:10:04.1048027Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_item <- test/inductor/test_cudagraph_trees.py PASSED [0.2982s] [ 4%] 2025-12-04T12:10:04.1049758Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero <- test/inductor/test_cudagraph_trees.py PASSED [0.3673s] [ 6%] 2025-12-04T12:10:04.1051545Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero_backend <- test/inductor/test_cudagraph_trees.py PASSED [0.2706s] [ 7%] 2025-12-04T12:10:04.1053425Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero_graph_breaks <- test/inductor/test_cudagraph_trees.py PASSED [0.6562s] [ 9%] 2025-12-04T12:10:04.1055102Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_index_put <- test/inductor/test_cudagraph_trees.py PASSED [0.6926s] [ 10%] 2025-12-04T12:10:04.1056681Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_live_outputs_multiple_graphs <- test/inductor/test_cudagraph_trees.py PASSED [1.2030s] [ 12%] 2025-12-04T12:10:04.1058549Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_manager_per_device <- test/inductor/test_cudagraph_trees.py SKIPPED [0.0003s] (requires multiple cuda devices) [ 13%] 2025-12-04T12:10:04.1060230Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mark_step <- test/inductor/test_cudagraph_trees.py PASSED [0.6865s] [ 15%] 2025-12-04T12:10:04.1061730Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_meta_tensor <- test/inductor/test_cudagraph_trees.py PASSED [0.6807s] [ 16%] 2025-12-04T12:10:04.1063310Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_child_node <- test/inductor/test_cudagraph_trees.py PASSED [1.1076s] [ 18%] 2025-12-04T12:10:04.1064970Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_custom_module <- test/inductor/test_cudagraph_trees.py PASSED [0.8417s] [ 20%] 2025-12-04T12:10:04.1066678Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_custom_module_buffer <- test/inductor/test_cudagraph_trees.py PASSED [0.9259s] [ 21%] 2025-12-04T12:10:04.1068362Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_parent_node <- test/inductor/test_cudagraph_trees.py PASSED [1.1229s] [ 23%] 2025-12-04T12:10:04.1070103Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_builtin_module <- test/inductor/test_cudagraph_trees.py PASSED [0.6586s] [ 24%] 2025-12-04T12:10:04.1072037Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_builtin_module_buffers <- test/inductor/test_cudagraph_trees.py PASSED [0.9183s] [ 26%] 2025-12-04T12:10:04.1073932Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_param_inputs <- test/inductor/test_cudagraph_trees.py PASSED [0.4882s] [ 27%] 2025-12-04T12:10:04.1076956Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multinomial <- test/inductor/test_cudagraph_trees.py SKIPPED [0.0008s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/166682 for platform(s) linux. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 29%] 2025-12-04T12:10:04.1080157Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multiple_devices_msg_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py SKIPPED [0.0002s] (requires multiple cuda devices) [ 30%] 2025-12-04T12:10:04.1082242Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multiple_devices_msg_backend_inductor <- test/inductor/test_cudagraph_trees.py SKIPPED [0.0002s] (requires multiple cuda devices) [ 32%] 2025-12-04T12:10:04.1084129Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_multiple_insert_removal_caching <- test/inductor/test_cudagraph_trees.py PASSED [0.1935s] [ 33%] 2025-12-04T12:10:04.1085961Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.3167s] [ 35%] 2025-12-04T12:10:04.1088253Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_backend_inductor <- test/inductor/test_cudagraph_trees.py PASSED [0.5699s] [ 36%] 2025-12-04T12:10:04.1090471Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_only_once_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.3294s] [ 38%] 2025-12-04T12:10:04.1092508Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_only_once_backend_inductor <- test/inductor/test_cudagraph_trees.py PASSED [0.5917s] [ 40%] 2025-12-04T12:10:04.1094493Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.3362s] [ 41%] 2025-12-04T12:10:04.1096421Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_backend_inductor <- test/inductor/test_cudagraph_trees.py PASSED [0.5434s] [ 43%] 2025-12-04T12:10:04.1098439Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_config_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.3184s] [ 44%] 2025-12-04T12:10:04.1100429Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_config_backend_inductor <- test/inductor/test_cudagraph_trees.py PASSED [0.5296s] [ 46%] 2025-12-04T12:10:04.1102269Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_on_inp_backend_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.3362s] [ 47%] 2025-12-04T12:10:04.1103964Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_on_inp_backend_inductor <- test/inductor/test_cudagraph_trees.py PASSED [0.6080s] [ 49%] 2025-12-04T12:10:04.1105623Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_mutation_reinplaced <- test/inductor/test_cudagraph_trees.py PASSED [0.4245s] [ 50%] 2025-12-04T12:10:04.1107362Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_no_rerecord_with_mark_static_address <- test/inductor/test_cudagraph_trees.py PASSED [0.8538s] [ 52%] 2025-12-04T12:10:04.1109233Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_not_fallback_to_eager_if_have_not_recompiling_too_many_times <- test/inductor/test_cudagraph_trees.py PASSED [0.4892s] [ 53%] 2025-12-04T12:10:04.1110947Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_output_alias <- test/inductor/test_cudagraph_trees.py PASSED [0.2027s] [ 55%] 2025-12-04T12:10:04.1112528Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_peristed_output_livenes <- test/inductor/test_cudagraph_trees.py PASSED [0.3678s] [ 56%] 2025-12-04T12:10:04.1114186Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_remove_hooks_on_cached_tensors <- test/inductor/test_cudagraph_trees.py PASSED [0.4179s] [ 58%] 2025-12-04T12:10:04.1115971Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_rerecord_if_static_input_address_changed <- test/inductor/test_cudagraph_trees.py PASSED [0.6010s] [ 60%] 2025-12-04T12:10:04.1117626Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_rng_non_trees <- test/inductor/test_cudagraph_trees.py PASSED [0.3017s] [ 61%] 2025-12-04T12:10:04.1119113Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_rng_trees <- test/inductor/test_cudagraph_trees.py PASSED [0.2987s] [ 63%] 2025-12-04T12:10:04.1120596Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_run_simple <- test/inductor/test_cudagraph_trees.py PASSED [0.7839s] [ 64%] 2025-12-04T12:10:04.1122182Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_separate_recordings <- test/inductor/test_cudagraph_trees.py PASSED [0.7119s] [ 66%] 2025-12-04T12:10:04.1123829Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_side_stream_memory_allocation <- test/inductor/test_cudagraph_trees.py PASSED [0.2161s] [ 67%] 2025-12-04T12:10:04.1125437Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_single_stream_use <- test/inductor/test_cudagraph_trees.py PASSED [0.5781s] [ 69%] 2025-12-04T12:10:04.1126973Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_cpp_wrapper <- test/inductor/test_cudagraph_trees.py PASSED [1.9863s] [ 70%] 2025-12-04T12:10:04.1128566Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_cudagraph_unsafe_ops <- test/inductor/test_cudagraph_trees.py PASSED [0.4208s] [ 72%] 2025-12-04T12:10:04.1130253Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_if_dynamic_shape_limit_reached1 <- test/inductor/test_cudagraph_trees.py PASSED [1.1909s] [ 73%] 2025-12-04T12:10:04.1131996Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_if_dynamic_shape_limit_reached2 <- test/inductor/test_cudagraph_trees.py PASSED [11.6971s] [ 75%] 2025-12-04T12:10:04.1133617Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_skip_symbolic <- test/inductor/test_cudagraph_trees.py PASSED [0.4404s] [ 76%] 2025-12-04T12:10:04.1135126Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_sparsity <- test/inductor/test_cudagraph_trees.py PASSED [0.3092s] [ 78%] 2025-12-04T12:10:04.1136719Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_static_inputs_address_mutation_log <- test/inductor/test_cudagraph_trees.py PASSED [0.6565s] [ 80%] 2025-12-04T12:10:04.1138441Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_storage_access_error <- test/inductor/test_cudagraph_trees.py PASSED [0.2356s] [ 81%] 2025-12-04T12:10:04.1140104Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_tensor_constant_mutation <- test/inductor/test_cudagraph_trees.py PASSED [0.4644s] [ 83%] 2025-12-04T12:10:04.1141762Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_tensor_dies_between_checkpoint <- test/inductor/test_cudagraph_trees.py PASSED [0.2503s] [ 84%] 2025-12-04T12:10:04.1143447Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_tensor_no_longer_in_pool <- test/inductor/test_cudagraph_trees.py PASSED [0.2438s] [ 86%] 2025-12-04T12:10:04.1145127Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_input_no_cudagraphs <- test/inductor/test_cudagraph_trees.py PASSED [0.3406s] [ 87%] 2025-12-04T12:10:04.1146849Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_input_non_trees <- test/inductor/test_cudagraph_trees.py PASSED [0.3351s] [ 89%] 2025-12-04T12:10:04.1148571Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_input_trees <- test/inductor/test_cudagraph_trees.py PASSED [0.3378s] [ 90%] 2025-12-04T12:10:04.1150221Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unaligned_static_parameter <- test/inductor/test_cudagraph_trees.py PASSED [0.2582s] [ 92%] 2025-12-04T12:10:04.1151812Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_unstable_ptr <- test/inductor/test_cudagraph_trees.py PASSED [0.4186s] [ 93%] 2025-12-04T12:10:04.1153354Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_warmup_stream_sync <- test/inductor/test_cudagraph_trees.py PASSED [5.1688s] [ 95%] 2025-12-04T12:10:04.1154953Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_warn_on_pending_backward <- test/inductor/test_cudagraph_trees.py PASSED [0.4314s] [ 96%] 2025-12-04T12:10:04.1156669Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_warn_once_if_dynamic_shape_limit_reached <- test/inductor/test_cudagraph_trees.py PASSED [1.2808s] [ 98%] 2025-12-04T12:10:04.1158861Z inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_workspace_allocation_error <- test/inductor/test_cudagraph_trees.py [W1204 12:09:45.414867726 unwind.cpp:219] Warning: Unsupported unwinding pattern: Address not in range (function unwinderFor) 2025-12-04T12:10:04.1160344Z PASSED [16.5192s] [100%] 2025-12-04T12:10:04.1160529Z 2025-12-04T12:10:04.1161544Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-04ae990c731b58f8.xml - 2025-12-04T12:10:04.1162899Z =========== 61 passed, 4 skipped, 87 deselected in 72.22s (0:01:12) ============ 2025-12-04T12:10:04.1164177Z The following tests failed and then succeeded when run in a new process['test/inductor/test_cudagraph_trees_expandable_segments.py::CudaGraphTreeTests::test_graph_partition_user_defined_triton_kernel_reuse'] 2025-12-04T12:10:04.1165253Z 2025-12-04T12:10:04.1166048Z FINISHED PRINTING LOG FILE of inductor/test_cudagraph_trees_expandable_segments 1/1 (test/test-reports/inductor.test_cudagraph_trees_expandable_segments_1.1_4e86984065370f2d_.log) 2025-12-04T12:10:04.1166989Z 2025-12-04T12:10:04.1167469Z Finished inductor/test_cudagraph_trees_expandable_segments 1/1 ... [2025-12-04 12:10:04.032250][10608.153465588], took 3.35min 2025-12-04T12:10:04.1169178Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-cfe1d81d1ed43800.xml 2025-12-04T12:10:04.1736099Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-1783f83983666f0c.xml 2025-12-04T12:10:04.2064944Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-04ae990c731b58f8.xml 2025-12-04T12:10:04.2456914Z Running inductor/test_collective_autotuning 1/1 ... [2025-12-04 12:10:04.245497][10608.36671676] 2025-12-04T12:10:04.2457636Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:10:04.2461266Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_collective_autotuning.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:04.245872] 2025-12-04T12:10:19.2708621Z 2025-12-04T12:10:19.2709816Z inductor/test_collective_autotuning 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_collective_autotuning_1.1_e338f0430869aa69_.log 2025-12-04T12:10:19.2712275Z Running 2 items in this shard: test/inductor/test_collective_autotuning.py::TestCollectiveAutotuning2Ranks::test_equivalent_allreduce_strategies, test/inductor/test_collective_autotuning.py::TestCollectiveAutotuning4Ranks::test_vllm_style_allreduce 2025-12-04T12:10:19.2713641Z 2025-12-04T12:10:19.2714066Z Finished inductor/test_collective_autotuning 1/1 ... [2025-12-04 12:10:19.270646][10623.391867244], took 0.25min 2025-12-04T12:10:19.2968304Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_collective_autotuning/inductor.test_collective_autotuning-0f27caa637154718.xml 2025-12-04T12:10:19.3768317Z Running higher_order_ops/test_local_map 1/1 ... [2025-12-04 12:10:19.376531][10623.497750877] 2025-12-04T12:10:19.3768946Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:10:19.3772270Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'higher_order_ops/test_local_map.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:19.376948] 2025-12-04T12:10:33.6109261Z 2025-12-04T12:10:33.6110368Z higher_order_ops/test_local_map 1/1 was successful, full logs can be found in artifacts with path test/test-reports/higher_order_ops.test_local_map_1.1_356472a63c965b37_.log 2025-12-04T12:10:33.6115975Z Running 12 items in this shard: test/higher_order_ops/test_local_map.py::TestLocalMap::test_filtered_gradients, test/higher_order_ops/test_local_map.py::TestLocalMap::test_fx_annotations, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_dynamo_mismatch_placements, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_dynamo_reordered_inputs, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_with_local_shapes_dynamo_tracing, test/higher_order_ops/test_local_map.py::TestLocalMap::test_local_map_with_local_shapes_hop_tracing, test/higher_order_ops/test_local_map.py::TestLocalMap::test_none_gradients, test/higher_order_ops/test_local_map.py::TestLocalMap::test_none_placements, test/higher_order_ops/test_local_map.py::TestLocalMap::test_sac, test/higher_order_ops/test_local_map.py::TestLocalMap::test_sac_deferred, test/higher_order_ops/test_local_map.py::TestLocalMap::test_simple, test/higher_order_ops/test_local_map.py::TestLocalMap::test_symint_activations 2025-12-04T12:10:33.6120810Z 2025-12-04T12:10:33.6121199Z Finished higher_order_ops/test_local_map 1/1 ... [2025-12-04 12:10:33.610749][10637.73196763], took 0.24min 2025-12-04T12:10:33.6375562Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/higher_order_ops.test_local_map/higher_order_ops.test_local_map-9df8631d0f17132c.xml 2025-12-04T12:10:33.7186899Z Running test_dataloader 1/2 ... [2025-12-04 12:10:33.718391][10637.839610174] 2025-12-04T12:10:33.7187465Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:10:33.7191239Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_dataloader.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:10:33.718847] 2025-12-04T12:14:55.2168084Z 2025-12-04T12:14:55.2169043Z test_dataloader 1/2 was successful, full logs can be found in artifacts with path test/test-reports/test_dataloader_1.2_13cbd3bba7ddab60_.log 2025-12-04T12:14:55.2208216Z Running 96 items in this shard: test/test_dataloader.py::TestDatasetRandomSplit::test_incomplete_fractional_splits, test/test_dataloader.py::TestDatasetRandomSplit::test_slicing_of_subset_of_subset, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_are_mutually_exclusive, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_indexing_type, test/test_dataloader.py::TestTensorDataset::test_getitem, test/test_dataloader.py::TestTensorDataset::test_len, test/test_dataloader.py::TestTensorDataset::test_many_tensors, test/test_dataloader.py::TestStackDataset::test_getitem, test/test_dataloader.py::TestStackDataset::test_getitems, test/test_dataloader.py::TestStackDataset::test_getitems_raises_index_error, test/test_dataloader.py::TestStackDataset::test_getitems_value_error, test/test_dataloader.py::TestStackDataset::test_mixed, test/test_dataloader.py::TestStackDataset::test_single, test/test_dataloader.py::TestStackDataset::test_size_mismatch, test/test_dataloader.py::TestConcatDataset::test_concat_two_non_singletons, test/test_dataloader.py::TestConcatDataset::test_concat_two_non_singletons_with_empty, test/test_dataloader.py::TestConcatDataset::test_iterable_dataset_err, test/test_dataloader.py::TestDataLoader::test_builtin_collection_conversion, test/test_dataloader.py::TestDataLoader::test_default_collate_bad_numpy_types, test/test_dataloader.py::TestDataLoader::test_default_collate_mapping_keep_type, test/test_dataloader.py::TestDataLoader::test_default_collate_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoader::test_default_collate_sequence_keep_type, test/test_dataloader.py::TestDataLoader::test_default_collate_shared_tensor, test/test_dataloader.py::TestDataLoader::test_default_convert_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoader::test_duplicating_data_with_drop_last, test/test_dataloader.py::TestDataLoader::test_error, test/test_dataloader.py::TestDataLoader::test_error_in_init, test/test_dataloader.py::TestDataLoader::test_excessive_thread_creation_warning, test/test_dataloader.py::TestDataLoader::test_fd_limit_exceeded, test/test_dataloader.py::TestDataLoader::test_get_worker_info, test/test_dataloader.py::TestDataLoader::test_invalid_assign_after_init, test/test_dataloader.py::TestDataLoader::test_iterable_style_dataset, test/test_dataloader.py::TestDataLoader::test_iterabledataset_len, test/test_dataloader.py::TestDataLoader::test_len, test/test_dataloader.py::TestDataLoader::test_multi_epochs_reproducibility, test/test_dataloader.py::TestDataLoader::test_multiprocessing_contexts, test/test_dataloader.py::TestDataLoader::test_multiprocessing_iterdatapipe_with_dill, test/test_dataloader.py::TestDataLoader::test_no_segfault, test/test_dataloader.py::TestDataLoader::test_numpy, test/test_dataloader.py::TestDataLoader::test_partial_workers, test/test_dataloader.py::TestDataLoader::test_random_sampler, test/test_dataloader.py::TestDataLoader::test_random_sampler_len_with_replacement, test/test_dataloader.py::TestDataLoader::test_random_sampler_len_without_replacement, test/test_dataloader.py::TestDataLoader::test_sampler, test/test_dataloader.py::TestDataLoader::test_sampler_reproducibility, test/test_dataloader.py::TestDataLoader::test_seqential_batch_workers, test/test_dataloader.py::TestDataLoader::test_seqential_batch_workers_prefetch, test/test_dataloader.py::TestDataLoader::test_shuffle, test/test_dataloader.py::TestDataLoader::test_shuffle_batch, test/test_dataloader.py::TestDataLoader::test_worker_init_fn, test/test_dataloader.py::TestDictDataLoader::test_pin_memory_no_cuda, test/test_dataloader.py::TestDictDataLoader::test_pin_memory_with_only_device, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_batch_sampler, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_dataset_not_reset, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_bad_numpy_types, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_dtype, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_mapping_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_numpy_memmap, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_sequence_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_sequence_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_duplicating_data_with_drop_last, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_error, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_get_worker_info, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_invalid_ctor_args_combinations, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_iterable_style_dataset, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_iterabledataset_len, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_large_sampler_indices, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_iterdatapipe, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy_scalars, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_partial_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_segfault, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_seqential_batch_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_seqential_batch_workers_prefetch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_none, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_workers_prefetch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_pin_memory, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_timeout, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_init_fn, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_seed_reproducibility, test/test_dataloader.py::TestNamedTupleDataLoader::test_dataloader_with_namedtuple, test/test_dataloader.py::TestCustomPinFn::test_custom_batch_pin, test/test_dataloader.py::TestConvAfterFork::test_conv_after_fork, test/test_dataloader.py::TestOutOfOrderDataLoader::test_in_order_index_ds, test/test_dataloader.py::TestOutOfOrderDataLoader::test_out_of_order_index_ds, test/test_dataloader.py::TestOutOfOrderDataLoader::test_out_of_order_iterable_ds, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_fork_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_spawn_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_fork_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T12:14:55.2246304Z 2025-12-04T12:14:55.2246608Z Finished test_dataloader 1/2 ... [2025-12-04 12:14:55.215980][10899.33719674], took 4.36min 2025-12-04T12:14:55.2420669Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_dataloader/test_dataloader-94259eed0293b6e4.xml 2025-12-04T12:14:55.3272685Z Running test_decomp 4/17 ... [2025-12-04 12:14:55.326842][10899.448060085] 2025-12-04T12:14:55.3273229Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:14:55.3275608Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=4', '--num-shards=17', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:14:55.327293] 2025-12-04T12:27:51.2784556Z 2025-12-04T12:27:51.2785512Z test_decomp 4/17 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_4.17_574ee9916c2803b5_.log 2025-12-04T12:27:51.2988634Z Running 530 items in this shard: test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___getitem___cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rand___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rxor___cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rxor___cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive__batch_norm_with_update_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive__upsample_bilinear2d_aa_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_add_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmm_decomposed_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_alias_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_alias_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_any_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atanh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_and_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_left_shift_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_right_shift_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_right_shift_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bucketize_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cartesian_prod_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cfloat_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_physical_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cos_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cross_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cross_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_floor_rounding_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_floor_rounding_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_floor_rounding_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_trunc_rounding_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dot_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erf_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfinv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfinv_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp2_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_as_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_float8_e5m2, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fliplr_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flipud_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_power_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_power_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_power_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmin_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gradient_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_grid_sampler_2d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_half_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hash_tensor_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_i0_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_i0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_imag_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_add_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_fill_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_put_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isin_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isreal_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kthvalue_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kthvalue_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_le_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eigh_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eigvals_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_householder_product_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_ldl_solve_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lstsq_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_factor_ex_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_rank_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_rank_hermitian_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_pinv_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_qr_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorsolve_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vecdot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log10_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logit_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lt_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lt_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_unpack_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_fill_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_prod_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_reduction_with_dim_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_maximum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_median_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_binary_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_with_dim_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mul_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_multinomial_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nan_to_num_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmedian_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmedian_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanquantile_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_native_batch_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_avg_pool2d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv1d_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv1d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_dropout2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_dropout_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_embedding_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_glu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_nearest_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_l1_loss_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_leaky_relu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_linear_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_linear_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_pool1d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mish_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mse_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_circular_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu6_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_selu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_silu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softshrink_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_threshold_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_threshold_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_unfold_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_upsample_nearest_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_upsample_nearest_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_qr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_remainder_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_remainder_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_remainder_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_as_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_as_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize__cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_conj_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_mean_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_sum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_hann_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sin_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinc_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_scatter_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j0_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtr_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtri_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_mean_unbiased_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sub_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sub_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sub_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_svd_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_topk_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trunc_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unflatten_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_split_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vdot_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick__chunk_cat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_abs_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_addcdiv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_addmm_decomposed_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_amin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_arange_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_atanh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_xor_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_block_diag_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_bucketize_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_constant_pad_nd_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_deg2rad_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_dot_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_lerp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_logit_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_std_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_trace_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_transpose_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_cumsum_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_deg2rad_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_digamma_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_div_floor_rounding_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_dot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_erfinv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_erfinv_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_expand_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_expand_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_expand_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft2_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft2_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_flip_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_flip_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fmin_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_ge_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_ge_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_ge_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_geometric_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_hypot_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_igammac_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_index_select_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_isnan_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_lgamma_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_vector_norm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_log1p_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_log2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_log_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_log_softmax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logaddexp_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_logit_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_variadic_tensors_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_minimum_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_native_batch_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_new_full_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_embedding_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_embedding_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_embedding_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardshrink_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_leaky_relu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_mse_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu6_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_softshrink_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_norm_inf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_roll_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_rsqrt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_rsub_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_rsub_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_select_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_sgn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_multiple_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_stack_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_std_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_std_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_uniform_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_var_mean_unbiased_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_var_mean_unbiased_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_like_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_LSTM_train_mode_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_RNN_eval_mode_cuda_float32 2025-12-04T12:27:51.3189550Z 2025-12-04T12:27:51.3189868Z Finished test_decomp 4/17 ... [2025-12-04 12:27:51.278972][11675.400190036], took 12.93min 2025-12-04T12:27:51.3190944Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_decomp/test_decomp-f51ad4b3b0738d41.xml 2025-12-04T12:27:51.7941910Z Uploading artifacts took 0.40 seconds 2025-12-04T12:27:51.7946399Z Running test_decomp 9/17 ... [2025-12-04 12:27:51.794468][11675.915688084] 2025-12-04T12:27:51.7947064Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:27:51.7950951Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=9', '--num-shards=17', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:27:51.794876] 2025-12-04T12:41:03.1969760Z 2025-12-04T12:41:03.1970729Z test_decomp 9/17 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_9.17_c2558e85bd6204cf_.log 2025-12-04T12:41:03.2178934Z Running 542 items in this shard: test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rand___cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rand___cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmod___cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___ror___cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive___ror___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__segment_reduce_lengths_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__upsample_bilinear2d_aa_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_add_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_add_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_all_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan2_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atanh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atanh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_2d_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_block_diag_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bucketize_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cartesian_prod_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdist_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdist_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ceil_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ceil_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cross_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummax_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_embed_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diff_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dist_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_einsum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp2_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftn_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftshift_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfftn_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfftn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fill_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flipud_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_divide_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geometric_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_grid_sampler_3d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_half_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_histc_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_i0_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_mean_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_inner_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isfinite_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isneginf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isreal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_le_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lerp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_ex_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cross_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cross_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eig_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eigvalsh_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_inv_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lstsq_grad_oriented_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_factor_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_factor_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_multi_dot_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_solve_ex_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_svd_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorsolve_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vector_norm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log10_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_normal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_and_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_solve_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amax_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_cumprod_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_median_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_normalize_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_prod_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_var_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matmul_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matmul_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matmul_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_reduction_no_dim_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_maximum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_maximum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mean_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_median_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_binary_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_binary_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_movedim_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmean_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nextafter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_avg_pool1d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_celu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv2d_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_elu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_embedding_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gelu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_grid_sample_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_grid_sample_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardshrink_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardsigmoid_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardtanh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hinge_embedding_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_huber_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_bilinear_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_layer_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_margin_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pairwise_distance_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_silu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_threshold_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_quantile_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rand_like_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_like_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ravel_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_real_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_real_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_conj_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_conj_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_neg_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_decimals_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sigmoid_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_general_hamming_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signbit_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_with_dtype_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j0_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j0_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j1_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_y0_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i0e_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1e_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k0_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtri_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_ndtri_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k0_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k0_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_square_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tanh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tanh_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_topk_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_topk_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trunc_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_uniform_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vsplit_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick__chunk_cat_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick__upsample_bilinear2d_aa_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_addcdiv_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_addcdiv_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_addmm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_amin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_aminmax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_baddbmm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_or_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_right_shift_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_xor_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_block_diag_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_cat_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_max_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_max_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_max_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_min_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_complex_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_constant_pad_nd_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_zero__cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cumprod_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_deg2rad_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_diag_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_empty_strided_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_erf_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_erf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_erfc_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_expand_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_eye_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft2_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifftn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fill_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_fill_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_frexp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_ge_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_igammac_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_index_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_index_fill_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_isinf_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_isnan_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_item_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_lerp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_log1p_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_log2_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_log2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_log_normal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logit_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_lt_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_lt_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_masked_fill_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_maximum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_maximum_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_list_of_tensors_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_mv_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_native_layer_norm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_gelu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_logsigmoid_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool3d_grad_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_pad_constant_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_silu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_unfold_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_norm_inf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_normal_in_place_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_polar_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_repeat_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_neg_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_select_scatter_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_sign_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_signbit_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sinh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_sinh_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_slice_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_slice_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_softmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_special_entr_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_special_i0e_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1e_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_special_log_ndtr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_special_log_ndtr_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_special_ndtri_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_special_ndtri_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_special_xlog1py_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_stack_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_stack_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_std_unbiased_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_t_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_t_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_t_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_trace_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_trace_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_tril_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_triu_indices_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_trunc_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_unsafe_split_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_var_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_vdot_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_where_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_like_cuda_bfloat16, test/test_decomp.py::DecompOneOffTestsCUDA::test_contiguous_log_softmax_cuda, test/test_decomp.py::HasDecompTest::test_has_decomposition 2025-12-04T12:41:03.2380636Z 2025-12-04T12:41:03.2380949Z Finished test_decomp 9/17 ... [2025-12-04 12:41:03.197286][12467.318505935], took 13.19min 2025-12-04T12:41:03.2382109Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_decomp/test_decomp-6b78cf4133d2553c.xml 2025-12-04T12:41:03.3191782Z Running test_decomp 14/17 ... [2025-12-04 12:41:03.318973][12467.440193578] 2025-12-04T12:41:03.3192374Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:41:03.3195691Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=14', '--num-shards=17', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:41:03.319332] 2025-12-04T12:51:19.0388319Z 2025-12-04T12:51:19.0389425Z test_decomp 14/17 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_14.17_c45af0ac343f4cce_.log 2025-12-04T12:51:19.0590891Z Running 526 items in this shard: test/test_decomp.py::TestDecompCUDA::test_bernoulli_default_cuda, test/test_decomp.py::TestDecompCUDA::test_comprehensive_H_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___getitem___cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rand___cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmatmul___cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rsub___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__upsample_bilinear2d_aa_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmv_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_all_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_any_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argwhere_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_partial_views_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_scatter_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_scatter_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atanh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_xor_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_block_diag_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_byte_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cauchy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_max_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clone_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_combinations_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_contiguous_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_copysign_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_count_nonzero_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_count_nonzero_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cross_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumsum_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumulative_trapezoid_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_embed_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_floor_rounding_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_einsum_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_einsum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_strided_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_strided_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfc_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_as_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_as_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expm1_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftn_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfft2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fill_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_power_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geqrf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gradient_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gradient_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_half_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_igammac_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_fill_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_put_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_mean_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_inner_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isclose_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isclose_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isfinite_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isneginf_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isposinf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ldexp_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lerp_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lgamma_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lgamma_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_ex_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_eig_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_rank_hermitian_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_solve_triangular_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_svd_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vander_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_vander_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_normal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_not_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_not_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_or_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_xor_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mH_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_softmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_sum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_sum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_var_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matmul_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matrix_exp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_binary_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_pool2d_with_indices_backward_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_reduction_no_dim_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_reduction_with_dim_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mean_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mul_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmean_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_native_layer_norm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_bilinear_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose1d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose1d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_elu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_embedding_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gelu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_group_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_instance_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_linear_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_pool2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool3d_grad_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mish_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_normalize_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_rms_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_nuc_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pow_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rand_like_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ravel_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_real_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_remainder_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_as_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_as_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize_as__cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_conj_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsub_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_sum_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_short_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sigmoid_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_gaussian_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinc_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_airy_ai_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j0_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j0_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1e_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_laguerre_polynomial_l_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_i0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_spherical_bessel_j0_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_xlog1py_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_zeta_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_list_args_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_square_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stack_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_mean_unbiased_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_mean_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sum_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sum_to_size_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_along_dim_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_take_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tanh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tanh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensordot_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trace_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapz_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_indices_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_true_divide_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trunc_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_mean_unbiased_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_xlogy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_xlogy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_put_accumulate_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_acos_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_acosh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_acosh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_amax_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_aminmax_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_atan2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_and_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_not_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_or_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_cauchy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_ceil_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_max_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_clone_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_conj_physical_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_constant_pad_nd_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward__unsafe_masked_index_put_accumulate_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_clamp_min_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_nansum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_take_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_transpose_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cos_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_deg2rad_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_diag_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_div_trunc_rounding_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_dot_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_erfc_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_expand_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_eye_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_eye_cuda_float8_e5m2fnuz, test/test_decomp.py::TestDecompCUDA::test_quick_eye_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifftn_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_fmax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_frexp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_geometric_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_hypot_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_index_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_index_select_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_index_select_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_isinf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isneginf_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_isposinf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isposinf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_log1p_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_log2_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_log_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_log_normal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_log_softmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logsumexp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_masked_fill_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_variadic_tensors_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_minimum_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_native_batch_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_native_layer_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_strided_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_new_full_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_binary_cross_entropy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardsigmoid_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardtanh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool2d_grad_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool3d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu6_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_unfold_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_permute_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_rot90_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_rot90_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_rsqrt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_rsub_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_select_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sign_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_sign_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_signbit_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_sinc_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_sinh_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_sinh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_slice_scatter_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_softmax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_special_entr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_special_i0e_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_multiple_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_stack_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_stack_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_stack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_trace_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_trunc_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_unsafe_split_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_RNN_eval_mode_cuda_float64, test/test_decomp.py::DecompOneOffTestsCUDA::test_sdpa_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_decomp.py::HasDecompTest::test_conv1d_decomposition 2025-12-04T12:51:19.0790762Z 2025-12-04T12:51:19.0791064Z Finished test_decomp 14/17 ... [2025-12-04 12:51:19.039042][13083.160261961], took 10.26min 2025-12-04T12:51:19.0792147Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_decomp/test_decomp-91f7fdd24463f867.xml 2025-12-04T12:51:19.5978554Z Uploading artifacts took 0.45 seconds 2025-12-04T12:51:19.5982792Z Running test_ops_fwd_gradients 2/2 ... [2025-12-04 12:51:19.598091][13083.71931168] 2025-12-04T12:51:19.5983593Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:51:19.5987566Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops_fwd_gradients.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:51:19.598474] 2025-12-04T12:59:38.2230315Z 2025-12-04T12:59:38.2231452Z test_ops_fwd_gradients 2/2 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_fwd_gradients_2.2_d78504ac4b58b9c4_.log 2025-12-04T12:59:38.3059235Z Running 1579 items in this shard: test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_H_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_T_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___getitem___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rdiv___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rdiv___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rmatmul___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rmul___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rpow___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rsub___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad___rsub___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__batch_norm_with_update_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__chunk_cat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__chunk_cat_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__native_batch_norm_legit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__softmax_backward_data_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__unsafe_masked_index_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_abs_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_acos_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_acosh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_add_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addbmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addcdiv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addcmul_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addcmul_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addmm_decomposed_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addmv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_addr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_all_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_allclose_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_amax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_any_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_arange_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_argmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_argwhere_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_as_strided_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_as_strided_partial_views_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_as_strided_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_asin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_atanh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_atleast_1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_atleast_2d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_atleast_3d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_atleast_3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_baddbmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_baddbmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_bernoulli_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_bfloat16_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_block_diag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_bmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_broadcast_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_broadcast_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_bucketize_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_byte_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cartesian_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cdist_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cdouble_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ceil_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cfloat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_chalf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_char_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_char_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cholesky_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cholesky_inverse_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cholesky_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_chunk_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_chunk_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_combinations_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_complex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_conj_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_conj_physical_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_conj_physical_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_constant_pad_nd_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_contiguous_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_contiguous_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_corrcoef_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cos_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cos_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cosh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cosh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_count_nonzero_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_count_nonzero_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cummin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cumsum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cumulative_trapezoid_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_cumulative_trapezoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_deg2rad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diag_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diagflat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diagflat_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diagonal_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diagonal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diagonal_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_diff_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_digamma_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_dist_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_div_floor_rounding_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_div_no_rounding_mode_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_div_trunc_rounding_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_dot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_dot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_dstack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_dstack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_einsum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_einsum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_empty_permuted_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_empty_strided_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_eq_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_equal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_erf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_erfinv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_exp2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_exp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_expand_as_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_expand_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_expand_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_expand_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_expm1_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_fft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_fftn_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_hfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_hfft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_hfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ifft2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ifft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ifft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ifft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ifftshift_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ihfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_ihfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_irfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_irfftn_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_irfftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fft_rfftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fill_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_flatten_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_flatten_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_flip_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fliplr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_flipud_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_float_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_float_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_float_power_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_float_power_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_floor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_floor_divide_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fmin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_fmod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_frac_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_full_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ge_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_geometric_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_geqrf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_geqrf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_gradient_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_gradient_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_grid_sampler_2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_half_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_hash_tensor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_heaviside_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_hsplit_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_i0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_imag_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_index_add_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_index_add_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_index_select_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_isfinite_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_isnan_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_isreal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_isreal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_item_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_item_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_jiterator_binary_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_jiterator_binary_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_jiterator_unary_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_kron_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_kron_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ldexp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_le_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_lerp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_cholesky_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_cond_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_cross_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_cross_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_det_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_diagonal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_diagonal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_eig_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_eigvals_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_eigvalsh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_householder_product_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_inv_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_ldl_factor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_ldl_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_lstsq_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_lu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_matrix_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_matrix_power_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_matrix_rank_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_pinv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_pinv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_pinv_singular_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_slogdet_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_solve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_solve_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_solve_ex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_solve_triangular_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_tensorinv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_tensorsolve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_tensorsolve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_vander_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_vander_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linalg_vector_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linspace_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linspace_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linspace_tensor_overload_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_linspace_tensor_overload_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_log1p_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_log2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_log_normal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_log_softmax_with_dtype_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logaddexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logdet_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logical_and_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logical_not_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logical_or_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logical_or_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logical_xor_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_logspace_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_long_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_lu_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_lu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_lu_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_lu_solve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_lu_unpack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mH_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mT_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_amin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_argmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_cumprod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_cumsum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_fill_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_normalize_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_normalize_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_prod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_softmin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_masked_sum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_matrix_exp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_max_binary_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_maximum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_min_reduction_with_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mode_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_movedim_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mul_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nan_to_num_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nanmean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nanmean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nanmedian_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_narrow_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_narrow_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_native_batch_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_native_layer_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ne_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_neg_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_empty_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_empty_strided_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_full_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_full_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_ones_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_ones_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_zeros_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_new_zeros_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_batch_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_bilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_conv2d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_conv2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_dropout3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_embedding_bag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_glu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_grid_sample_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_group_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_hardshrink_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_hardswish_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_huber_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_instance_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_interpolate_area_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_kl_div_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_l1_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_layer_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_linear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_logsigmoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_max_pool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_max_pool3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_mish_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_normalize_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_normalize_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pad_circular_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pad_constant_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pad_replicate_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_relu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_rms_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_silu_complex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_silu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_softplus_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_softsign_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_threshold_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nonzero_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_nonzero_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_norm_fro_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_norm_inf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_norm_nuc_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_normal_number_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ones_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ormqr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_permute_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_permute_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_pinverse_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_pinverse_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_polar_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_positive_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_pow_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_pow_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_put_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_qr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_quantile_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_randn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_randn_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_ravel_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_real_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_reciprocal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_remainder_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_repeat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_repeat_interleave_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_resize__cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_resize_as__cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_resize_as__cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_resolve_conj_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_resolve_conj_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_roll_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_rot90_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_round_decimals_0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_round_decimals_neg_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_rsqrt_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_rsqrt_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_rsub_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_scalar_tensor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_scatter_add_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_scatter_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_scatter_reduce_amax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_scatter_reduce_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_searchsorted_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_select_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sgn_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_short_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_short_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sigmoid_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sign_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_signal_windows_cosine_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_signal_windows_gaussian_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_signal_windows_general_hamming_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_signal_windows_hamming_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_signal_windows_kaiser_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sin_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sinc_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_slice_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_slice_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sparse_mm_reduce_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sparse_sampled_addmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_bessel_j0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_erfcx_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_hermite_polynomial_h_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_hermite_polynomial_he_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_i1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_log_ndtr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_modified_bessel_i0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_modified_bessel_i1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_special_spherical_bessel_j0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_split_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_split_list_args_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_split_list_args_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_split_with_sizes_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sqrt_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_square_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_squeeze_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_squeeze_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_squeeze_multiple_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_squeeze_multiple_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_stack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_std_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_std_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_std_mean_unbiased_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sub_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sum_to_size_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_sum_to_size_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_svd_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_svd_lowrank_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_t_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_t_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_take_along_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_take_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_tanh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_tensor_split_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_tensor_split_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_tensordot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_tensordot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_tile_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_to_sparse_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_topk_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_trace_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_transpose_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_transpose_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_transpose_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_trapezoid_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_triangular_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_triangular_solve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_triu_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_trunc_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unbind_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unbind_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unflatten_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unfold_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unfold_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_uniform_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_uniform_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unique_consecutive_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unique_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unsafe_chunk_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unsafe_chunk_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_unsqueeze_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_var_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_var_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_var_mean_unbiased_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_var_unbiased_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_vdot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_view_as_complex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_view_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_view_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_view_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_vstack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_where_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_xlogy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_fn_fwgrad_bwgrad_zeros_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_H_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_T_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___getitem___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___getitem___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___rdiv___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___rdiv___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___rmod___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___rmul___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD___rsub___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__batch_norm_with_update_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__chunk_cat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__chunk_cat_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__native_batch_norm_legit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__softmax_backward_data_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__unsafe_masked_index_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD__upsample_bilinear2d_aa_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_abs_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_acos_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_acosh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_acosh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_add_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_addbmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_addcdiv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_addmv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_addmv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_addr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_all_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_all_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_allclose_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_amin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_aminmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_any_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_as_strided_partial_views_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_as_strided_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_asin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_asinh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atan2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atan_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atan_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atanh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atanh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atleast_1d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atleast_1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atleast_3d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_atleast_3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_baddbmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_bernoulli_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_bfloat16_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_block_diag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_bmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_bool_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_broadcast_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_broadcast_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_broadcast_to_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_broadcast_to_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_byte_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cartesian_prod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cartesian_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cauchy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ceil_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_chalf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_char_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_char_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cholesky_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cholesky_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cholesky_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_chunk_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_clamp_max_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_clamp_min_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_clone_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_column_stack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_combinations_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_conj_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_constant_pad_nd_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_constant_pad_nd_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_contiguous_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_contiguous_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_corrcoef_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cos_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cosh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_count_nonzero_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_count_nonzero_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cross_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cumprod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cumprod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cumsum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_cumsum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_deg2rad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_diag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_diag_embed_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_diagonal_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_diagonal_scatter_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_diagonal_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_diff_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_dist_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_div_floor_rounding_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_div_no_rounding_mode_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_div_no_rounding_mode_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_div_trunc_rounding_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_dot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_dsplit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_dstack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_einsum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_einsum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_empty_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_empty_like_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_empty_permuted_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_empty_permuted_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_empty_strided_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_eq_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_equal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_erf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_erfinv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_exp2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_exp2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_expand_as_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_expand_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_expand_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_expm1_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_expm1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_eye_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_eye_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_fft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_fftshift_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_hfft2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_hfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_hfft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_ifft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_ifftn_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_ifftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_ifftshift_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_ihfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_ihfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_irfft2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_irfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_irfftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_rfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fft_rfftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fill_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_flatten_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_flatten_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_flip_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_flip_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fliplr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_flipud_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_float_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_float_power_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_floor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_fmod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_frexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_full_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_full_like_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_gather_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_gather_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ge_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_geometric_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_geqrf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_geqrf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_gradient_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_gt_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_half_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_half_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_hash_tensor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_heaviside_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_hsplit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_i0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_igamma_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_imag_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_fill_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_fill_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_put_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_put_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_reduce_amin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_index_select_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_inner_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_int_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_isfinite_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_isin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_isinf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_isposinf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_isreal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_isreal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_istft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_item_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_item_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_jiterator_binary_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_jiterator_unary_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_kron_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ldexp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_le_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_lerp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_cholesky_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_cholesky_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_cond_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_cond_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_diagonal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_diagonal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_eig_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_eigh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_eigvals_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_inv_ex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_ldl_factor_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_lu_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_lu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_lu_factor_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_matrix_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_matrix_rank_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_multi_dot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_qr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_slogdet_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_solve_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_solve_ex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_solve_triangular_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_solve_triangular_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_svdvals_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_svdvals_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_tensorinv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_tensorsolve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_linalg_vander_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_log2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_log_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_log_normal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_log_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_log_softmax_with_dtype_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_log_softmax_with_dtype_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logaddexp2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logaddexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logcumsumexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logdet_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logdet_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logical_and_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logical_not_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logical_xor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logspace_tensor_overload_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logspace_tensor_overload_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_logsumexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_long_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_long_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_lt_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_lu_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_lu_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_lu_unpack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mH_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_argmin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_cumprod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_cumsum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_fill_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_log_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_logaddexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_logsumexp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_logsumexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_normalize_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_normalize_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_prod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_std_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_std_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_sum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_masked_var_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_matmul_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_matmul_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_max_reduction_no_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_median_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_meshgrid_variadic_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_min_reduction_no_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_movedim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mul_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nan_to_num_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nanmean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nanquantile_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nansum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nansum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_narrow_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_narrow_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_native_batch_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ne_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_neg_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_new_empty_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_new_empty_strided_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_new_full_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_new_full_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_new_ones_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nextafter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_avg_pool3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_bilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_channel_shuffle_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_conv2d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_conv2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_cross_entropy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_ctc_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_elu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_embedding_bag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_embedding_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_gelu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_glu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_grid_sample_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_group_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_hardshrink_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_hardswish_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_huber_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_interpolate_area_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_interpolate_linear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_kl_div_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_l1_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_layer_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_leaky_relu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_logsigmoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_max_pool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_max_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_max_unpool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_max_unpool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_mse_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_normalize_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pad_circular_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pad_constant_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pad_replicate_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pad_replicate_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pairwise_distance_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pdist_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_relu6_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_rms_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_silu_complex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_softmin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_softshrink_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_softsign_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_softsign_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_threshold_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nn_functional_upsample_nearest_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_nonzero_static_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_norm_fro_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_norm_inf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_norm_nuc_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_normal_in_place_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_normal_number_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ones_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_outer_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_outer_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_pca_lowrank_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_permute_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_permute_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_pinverse_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_pinverse_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_polygamma_polygamma_n_0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_polygamma_polygamma_n_2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_polygamma_polygamma_n_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_polygamma_polygamma_n_4_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_positive_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_pow_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_prod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_put_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_qr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_rand_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_randint_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_randint_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_randn_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ravel_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_ravel_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_reciprocal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_renorm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_repeat_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_reshape_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_reshape_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_resize__cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_resize__cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_resize_as__cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_resolve_conj_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_roll_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_roll_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_rot90_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_round_decimals_0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_round_decimals_neg_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_rsqrt_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_rsub_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_rsub_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_scalar_tensor_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_scalar_tensor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_scatter_add_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_scatter_add_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_searchsorted_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_select_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_select_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sgn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sigmoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_signal_windows_blackman_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_signal_windows_cosine_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_signal_windows_general_hamming_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_signal_windows_hamming_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_signbit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_slice_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_softmax_with_dtype_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sort_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sparse_mm_reduce_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sparse_sampled_addmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sparse_sampled_addmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_bessel_j0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_erfcx_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_hermite_polynomial_he_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_i1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_i1e_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_legendre_polynomial_p_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_modified_bessel_i0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_modified_bessel_k1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_ndtr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_xlog1py_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_special_zeta_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_split_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_split_list_args_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_split_with_sizes_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_split_with_sizes_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_split_with_sizes_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sqrt_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_square_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_squeeze_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_squeeze_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_squeeze_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_squeeze_multiple_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_stack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_stack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_std_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_std_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_std_mean_unbiased_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_std_unbiased_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_stft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sub_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sum_to_size_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_sum_to_size_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_svd_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_svd_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_svd_lowrank_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_svd_lowrank_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_t_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tan_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tanh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tensor_split_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tensordot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tensordot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tile_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_to_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_transpose_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_transpose_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_trapezoid_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_trapz_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_triangular_solve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_tril_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_true_divide_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_true_divide_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unbind_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unbind_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unbind_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unflatten_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unfold_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_uniform_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unique_consecutive_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unsafe_chunk_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unsafe_chunk_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unsafe_split_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unsqueeze_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unsqueeze_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_unsqueeze_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_var_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_var_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_var_mean_unbiased_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_vdot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_vdot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_view_as_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_view_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_view_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_zero__cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_forward_mode_AD_zeros_like_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___getitem___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___radd___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___rdiv___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___rmod___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___rmul___cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___rmul___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD___rpow___cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__batch_norm_with_update_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__chunk_cat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__chunk_cat_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__native_batch_norm_legit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__segment_reduce_offsets_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__softmax_backward_data_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD__upsample_bilinear2d_aa_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_acos_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_add_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_add_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addcdiv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addcdiv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addcmul_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addmv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addmv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_addr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_alias_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_alias_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_all_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_all_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_allclose_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_amin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_angle_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_any_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_arange_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_argwhere_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_argwhere_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_as_strided_partial_views_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_as_strided_partial_views_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_as_strided_scatter_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_asin_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_asin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_asinh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atan2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atan_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atan_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atanh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atleast_1d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atleast_1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_atleast_3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_baddbmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_bfloat16_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_bfloat16_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_block_diag_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_bool_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_broadcast_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_broadcast_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_broadcast_to_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cartesian_prod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cartesian_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cdist_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cdouble_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ceil_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cfloat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_chalf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cholesky_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cholesky_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cholesky_inverse_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cholesky_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_chunk_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_clamp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_clamp_max_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_clamp_min_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_clone_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_column_stack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_column_stack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_combinations_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_constant_pad_nd_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_constant_pad_nd_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_contiguous_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_corrcoef_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_corrcoef_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cos_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cos_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cosh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_count_nonzero_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cov_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cross_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cummax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cummin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cumprod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_cumsum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diag_embed_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diagflat_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diagonal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diagonal_scatter_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diff_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_diff_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_dist_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_div_floor_rounding_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_dot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_dstack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_dstack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_empty_like_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_empty_strided_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_empty_strided_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_eq_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_equal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_equal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_erf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_erfc_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_exp2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_exp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_expand_as_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_expand_as_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_expand_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_expand_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_exponential_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_fft2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_fft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_fftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_fftshift_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_hfft2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_hfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_hfft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_hfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_hfftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_ifft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_ifftshift_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_ifftshift_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_ihfft2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_ihfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_irfft2_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_irfft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_irfftn_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_irfftn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fft_rfft_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fill_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_flatten_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_flip_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fliplr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_flipud_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_float_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_float_power_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_float_power_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_floor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_floor_divide_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_fmod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_frexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_full_like_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_full_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_gather_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_gather_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_geometric_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_geqrf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_gradient_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_grid_sampler_2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_grid_sampler_3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_hsplit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_igamma_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_imag_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_index_add_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_index_add_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_index_reduce_amax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_index_reduce_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_index_select_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_inner_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_inner_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_int_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_int_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isclose_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isfinite_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isfinite_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isinf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isnan_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isneginf_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isreal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_isreal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_istft_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_item_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_binary_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_binary_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_unary_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_jiterator_unary_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_kron_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_kron_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_kthvalue_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ldexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_cholesky_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_cross_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_det_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eig_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eig_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eigh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eigh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eigvals_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eigvalsh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_eigvalsh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_inv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_inv_ex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_ldl_factor_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_ldl_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_lstsq_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_lu_factor_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_lu_solve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_matrix_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_matrix_rank_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_multi_dot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_pinv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_pinv_hermitian_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_pinv_hermitian_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_pinv_singular_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_qr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_qr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_solve_ex_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_solve_ex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_svdvals_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_svdvals_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_tensorsolve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_vander_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_vecdot_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linalg_vector_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linspace_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linspace_tensor_overload_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_linspace_tensor_overload_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_log10_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_log1p_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_log1p_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_log2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_log_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_log_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logaddexp2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logaddexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logdet_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logdet_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_and_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_not_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_not_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_or_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_or_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_xor_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logical_xor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_logit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_long_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_long_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_lt_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_lu_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_lu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_lu_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_lu_solve_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_lu_unpack_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mH_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mT_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_amin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_argmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_cumprod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_cumprod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_fill_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_fill_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_log_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_logaddexp_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_logsumexp_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_normalize_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_scatter_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_select_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_select_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_softmax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_std_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_sum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_masked_var_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_matmul_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_max_binary_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_max_reduction_no_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_max_reduction_with_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_median_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_meshgrid_list_of_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_meshgrid_variadic_tensors_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mode_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_movedim_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mul_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_multinomial_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mv_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_mv_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nansum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nansum_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_narrow_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_narrow_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_narrow_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_narrow_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_native_dropout_backward_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ne_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_neg_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_neg_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_new_empty_strided_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_new_empty_strided_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_new_full_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_new_zeros_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nextafter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_avg_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_bilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_celu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_conv1d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_conv1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_conv2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_cosine_similarity_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_cross_entropy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_dropout3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_embedding_bag_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_hardshrink_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_hardsigmoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_hardswish_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_hardtanh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_interpolate_area_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_l1_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_layer_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_linear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_local_response_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_max_pool1d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_max_pool3d_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_mish_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_mse_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_normalize_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_circular_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_circular_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_constant_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_reflect_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_reflect_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_replicate_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pad_replicate_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_rms_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_selu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_silu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_softmin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_softplus_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_softsign_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_tanhshrink_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_unfold_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nn_functional_upsample_nearest_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nonzero_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nonzero_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_nonzero_static_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_norm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_norm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_norm_fro_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_norm_inf_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_norm_nuc_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_norm_nuc_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_normal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_normal_in_place_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_normal_in_place_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ones_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ones_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ormqr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_ormqr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_outer_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_pca_lowrank_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_permute_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_pinverse_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_polygamma_polygamma_n_1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_polygamma_polygamma_n_2_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_polygamma_polygamma_n_4_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_pow_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_prod_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_prod_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_put_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_qr_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_rad2deg_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_rand_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_randn_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_randn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_randn_like_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_real_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_real_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_reciprocal_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_reciprocal_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_renorm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_repeat_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_resize_as__cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_round_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_round_decimals_neg_3_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_rsqrt_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_rsub_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_scalar_tensor_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_scalar_tensor_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_scatter_add_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_scatter_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_scatter_reduce_amax_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_scatter_reduce_amin_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_searchsorted_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_select_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_select_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_select_scatter_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sgn_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sigmoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_signal_windows_cosine_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_signal_windows_exponential_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_signal_windows_general_hamming_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_signal_windows_hann_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_signal_windows_kaiser_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_signbit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sinc_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sinh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sinh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_softmax_with_dtype_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sort_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sparse_mm_reduce_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sparse_sampled_addmm_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sparse_sampled_addmm_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_airy_ai_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_bessel_j1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_bessel_y0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_hermite_polynomial_he_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_i0e_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_i1e_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_laguerre_polynomial_l_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_log_ndtr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_modified_bessel_i0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_modified_bessel_k1_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_ndtr_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_spherical_bessel_j0_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_xlog1py_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_special_zeta_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_split_with_sizes_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_split_with_sizes_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sqrt_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_square_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_square_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_squeeze_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_squeeze_multiple_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_stack_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_std_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_std_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_std_mean_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_std_mean_unbiased_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_std_mean_unbiased_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sub_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sum_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sum_to_size_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_sum_to_size_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_t_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_t_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_take_along_dim_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_take_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_take_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_tanh_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_tanh_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_tensor_split_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_tensor_split_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_tile_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_to_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_to_sparse_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_trace_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_trapezoid_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_trapezoid_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_trapz_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_triangular_solve_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_tril_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_triu_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_triu_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_true_divide_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unbind_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unbind_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unbind_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unflatten_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_uniform_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_uniform_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unique_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unsqueeze_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unsqueeze_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_unsqueeze_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_var_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_var_mean_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_var_unbiased_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_var_unbiased_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_vdot_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_view_as_complex_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_view_as_real_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_view_copy_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_view_copy_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_view_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_view_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_vsplit_cuda_float64, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_where_cuda_complex128, test/test_ops_fwd_gradients.py::TestFwdGradientsCUDA::test_inplace_forward_mode_AD_zeros_like_cuda_float64 2025-12-04T12:59:38.3869791Z 2025-12-04T12:59:38.3870155Z Finished test_ops_fwd_gradients 2/2 ... [2025-12-04 12:59:38.225568][13582.346783232], took 8.31min 2025-12-04T12:59:38.3871410Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_ops_fwd_gradients/test_ops_fwd_gradients-dba31c3696d52244.xml 2025-12-04T12:59:38.3872523Z Running test_meta 3/5 ... [2025-12-04 12:59:38.373444][13582.494664076] 2025-12-04T12:59:38.3872996Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T12:59:38.3874323Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_meta.py', '--shard-id=3', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:59:38.373816] 2025-12-04T13:23:34.8168278Z 2025-12-04T13:23:34.8169151Z test_meta 3/5 was successful, full logs can be found in artifacts with path test/test-reports/test_meta_3.5_0884fc4739a02c0f_.log 2025-12-04T13:23:35.1615663Z Running 8136 items in this shard: test/test_meta.py::TestMetaConverter::test_complex_noncontiguous_bug, test/test_meta.py::TestMetaConverter::test_leaf, test/test_meta.py::TestMetaConverter::test_non_leaf_torture, test/test_meta.py::TestMetaConverter::test_view_dtype, test/test_meta.py::TestMetaConverter::test_view_of_non_leaf, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_cdist_forward_cuda, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_offsets_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_allclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geqrf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_grad_oriented_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_slogdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_multinomial_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanquantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_ctc_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_bag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_glu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_kl_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_smooth_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polar_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_kaiser_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_offsets_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_offsets_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_decomposed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_decomposed_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cauchy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_inverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igammac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_slogdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorsolve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nextafter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_grid_sample_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_local_response_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_selu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_silu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_quantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_renorm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_blackman_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensordot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensordot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__native_batch_norm_legit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_H_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rmatmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bincount_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cartesian_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_empty_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_float_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_isreal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_unary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_max_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_meshgrid_variadic_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_dropout3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multi_head_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_relu6_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_threshold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_norm_inf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sinc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triu_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unique_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_right_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_einsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exponential_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_le_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cond_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_median_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_multinomial_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_multinomial_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_embedding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_glu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_grid_sample_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_grid_sample_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_trilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_threshold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_nuc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_nuc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_number_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_zeta_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_svd_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_uint64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmatmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__softmax_backward_data_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rmatmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmm_decomposed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_conj_physical_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_contiguous_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_empty_permuted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_equal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_rfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lstsq_grad_oriented_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_slogdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_min_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_multinomial_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nextafter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_conv1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_kl_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_constant_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_silu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_positive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_slice_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_scaled_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tensordot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_topk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_left_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_det_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_slogdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_grid_sample_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_kl_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_kl_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_one_hot_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rrelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softplus_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softplus_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_in_place_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_in_place_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ormqr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ormqr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hann_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_kaiser_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_nuttall_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_embedding_bag_dense_backward_per_sample_weights_cuda, test/test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask6_cuda, test/test_meta.py::TestMetaCUDA::test_inplace_masked_fill_error_cuda, test/test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask2_cuda, test/test_meta.py::TestMetaCUDA::test_layer_norm_backward_output_mask7_cuda, test/test_meta.py::TestMetaCUDA::test_map_location_deserialize_cuda, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__native_batch_norm_legit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__segment_reduce_lengths_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__segment_reduce_lengths_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_householder_product_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_multi_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanquantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_bag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_bag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardswish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_prelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_smooth_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_number_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ormqr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_quantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_nuttall_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch__scaled_mm_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unravel_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_baddbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumulative_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gcd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geqrf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_inner_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_inner_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kron_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_householder_product_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_multinomial_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_alpha_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_celu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardswish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_area_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_kl_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_prelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rrelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_fro_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ormqr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ormqr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_neg_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_general_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_mm_reduce_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unravel_index_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unravel_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_where_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float32_bias_dtype2_cuda, test/test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float32_float16_cuda, test/test_meta.py::TestMetaCUDA::test_triangular_solve_out_cuda 2025-12-04T13:23:35.4965895Z 2025-12-04T13:23:35.4966234Z Finished test_meta 3/5 ... [2025-12-04 13:23:34.828592][15018.949807064], took 23.94min 2025-12-04T13:23:35.4967728Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_meta/test_meta-a3131b0e7fce7ef2.xml 2025-12-04T13:23:35.7188498Z Uploading artifacts took 0.57 seconds 2025-12-04T13:23:35.7193240Z Running test_ops_gradients 1/2 ... [2025-12-04 13:23:35.719115][15019.840335017] 2025-12-04T13:23:35.7194241Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T13:23:35.7198287Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops_gradients.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:23:35.719560] 2025-12-04T13:34:20.6438422Z 2025-12-04T13:34:20.6439524Z test_ops_gradients 1/2 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_gradients_1.2_48c762d3544cab24_.log 2025-12-04T13:34:20.7743460Z Running 2681 items in this shard: test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyMulCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpySplitCopyWithIntCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmod___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__batch_norm_with_update_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_abs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_all_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_angle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_asinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bfloat16_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_count_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_rfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flip_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isposinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logcumsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_unpack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mH_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_minimum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_msort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nanmean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_embedding_bag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_rms_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_rrelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softsign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_static_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_permute_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scalar_tensor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_bartlett_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sinc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sparse_sampled_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sparse_sampled_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_along_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsafe_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyMulScalarCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rsub___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_aminmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_asin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_auto_functionalize_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bucketize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_chalf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expm1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_frac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gather_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_geqrf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_invoke_quant_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_istft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_triangular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logcumsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_not_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_map_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_map_triple_nested_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_native_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_ctc_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_instance_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pca_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rand_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reciprocal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sparse_mm_reduce_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_j1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_multiple_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_transpose_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_while_loop_stack_output_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyMulCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpySplitCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpySplitCopyWithIntCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyViewCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmod___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_any_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_asin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bfloat16_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_block_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cond_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_conj_physical_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fmod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_full_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gather_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_invoke_quant_packed_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_invoke_quant_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isfinite_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_tensorinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_and_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_map_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_matrix_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nan_to_num_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_inf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rand_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_decimals_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_gaussian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_general_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signbit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sinc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_while_loop_stack_output_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyMulScalarCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyNMSCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyNonzeroCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpySplitCopyWithIntCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rsub___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_abs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_all_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cartesian_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_char_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_constant_pad_nd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagflat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_rfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flip_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eig_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_and_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_movedim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nanmedian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_native_dropout_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_fractional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_rms_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_threshold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_fro_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_inf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_in_place_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ones_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_permute_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reciprocal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_decimals_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_general_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_kaiser_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_multiple_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_t_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_uniform_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsafe_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_acosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_asin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_asinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atan2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_baddbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_block_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clamp_max_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_count_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_no_rounding_mode_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_erf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_eye_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flip_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fmod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_item_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_kron_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ldexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_slogdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log10_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_and_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_unpack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mT_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_max_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_movedim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nansum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_embedding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_silu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_static_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_nuc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ormqr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_positive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_quantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rad2deg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_interleave_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reshape_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resolve_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_round_decimals_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scalar_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sigmoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_general_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_square_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_multiple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_topk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_triu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_like_cuda_float64 2025-12-04T13:34:20.9012555Z 2025-12-04T13:34:20.9012920Z Finished test_ops_gradients 1/2 ... [2025-12-04 13:34:20.648089][15664.769306325], took 10.75min 2025-12-04T13:34:20.9014138Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_ops_gradients/test_ops_gradients-8f336b5d99061bf3.xml 2025-12-04T13:34:20.9015229Z Running test_linalg 1/1 ... [2025-12-04 13:34:20.821611][15664.942829719] 2025-12-04T13:34:20.9015713Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T13:34:20.9016890Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_linalg.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:34:20.822012] 2025-12-04T13:41:45.4532902Z 2025-12-04T13:41:45.4534230Z PRINTING LOG FILE of test_linalg 1/1 (test/test-reports/test_linalg_1.1_2a5b6f269d53327b_.log) 2025-12-04T13:41:45.4535644Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-77c9be2be0e8d8f0.xml 2025-12-04T13:41:45.4536907Z ============================= test session starts ============================== 2025-12-04T13:41:45.4538061Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.4538970Z cachedir: .pytest_cache 2025-12-04T13:41:45.4539962Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.4540894Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.4541332Z configfile: pytest.ini 2025-12-04T13:41:45.4542241Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.4543624Z collecting ... collected 1263 items 2025-12-04T13:41:45.4544178Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T13:41:45.5112259Z Running 1263 items in this shard: test/test_linalg.py::TestLinalgCUDA::test_1_sized_with_0_strided_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_1_sized_with_0_strided_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_128_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_32_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_128_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_32_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_large_shape_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_errors_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_True_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_baddbmm_overflow_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addr_bool_cuda_bool, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int8, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_uint8, test/test_linalg.py::TestLinalgCUDA::test_addr_type_promotion_cuda, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_nan_input_with_zero_beta_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_blas_empty_cuda, test/test_linalg.py::TestLinalgCUDA::test_blas_mv_large_input_cuda, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_blaslog_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_bmm_tunableop_rocm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_broadcast_batched_matmul_cuda, test/test_linalg.py::TestLinalgCUDA::test_broadcast_fused_matmul_cuda, test/test_linalg.py::TestLinalgCUDA::test_call_count_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_chain_matmul_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_backward_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ck_blas_library_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cross_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cross_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cross_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_cross_with_and_without_dim_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cross_with_and_without_dim_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_det_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_det_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_det_logdet_slogdet_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_det_logdet_slogdet_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_disable_tuning_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_dot_invalid_args_cuda, test/test_linalg.py::TestLinalgCUDA::test_dot_vs_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_dot_vs_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_dump_results_on_exit_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_check_magma_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_svd_illcondition_matrix_input_should_not_crash_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_svd_illcondition_matrix_input_should_not_crash_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvals_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_einsum_corner_cases_cuda, test/test_linalg.py::TestLinalgCUDA::test_einsum_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_einsum_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_einsum_error_cases_cuda, test/test_linalg.py::TestLinalgCUDA::test_einsum_output_layout_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_einsum_random_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_einsum_random_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_einsum_sublist_format_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_einsum_sublist_format_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_gemm_bias_offline_tunableop_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_gemm_bias_tunableop_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_householder_product_errors_and_warnings_cuda, test/test_linalg.py::TestLinalgCUDA::test_inner_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inner_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lapack_empty_cuda, test/test_linalg.py::TestLinalgCUDA::test_large_bmm_backward_cuda, test/test_linalg.py::TestLinalgCUDA::test_large_bmm_mm_backward_cuda, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_with_and_without_dim_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_with_and_without_dim_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_batch_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_batch_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_no_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_utils_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_utils_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_qr_autograd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linear_algebra_scalar_raises_cuda, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_basic_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_ortho_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_scipy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_torchscript_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_logaddexp_cpu_vs_cuda_complex_cuda, test/test_linalg.py::TestLinalgCUDA::test_lower_precision_accumulation_with_ref_path_cuda, test/test_linalg.py::TestLinalgCUDA::test_lstsq_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_unpack_check_input_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matmul_45724_cuda, test/test_linalg.py::TestLinalgCUDA::test_matmul_check_entries_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_empty_existing_file_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_offline_mgpu_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_offline_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_out_kernel_errors_with_autograd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_out_kernel_errors_with_autograd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_1d_Nd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_1d_Nd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_2d_Nd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_2d_Nd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_3d_Nd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_3d_Nd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_norm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_norm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_negative_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_negative_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_non_negative_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_non_negative_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_rtol_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_minimum_tuning_iteration_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_mm_bmm_non_memory_dense_cuda, test/test_linalg.py::TestLinalgCUDA::test_mm_conjtranspose_cuda, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_mm_empty_inputs_mixed_dtype_errors_cuda, test/test_linalg.py::TestLinalgCUDA::test_mm_submatrix_offline_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_multi_dot_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_multi_dot_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_multi_dot_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_bfloat16_and_half_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_norm_bfloat16_and_half_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_norm_complex_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_complex_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_complex_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_complexhalf_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_errors_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_extreme_values_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_fastpaths_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_fro_2_equivalence_old_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_fused_type_promotion_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_norm_fused_type_promotion_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_old_nan_propagation_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_axes_small_brute_force_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_exceptions_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_out_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_out_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_numeric_check_leak_tunableop_rocm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_numerical_check_accuracy_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_numerical_check_accuracy_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_numerical_check_python_binding_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_offline_tuning_append_to_existing_file_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ops_append_to_existing_file_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_ger_addr_legacy_tests_cuda, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_uint8, test/test_linalg.py::TestLinalgCUDA::test_pca_lowrank_cuda, test/test_linalg.py::TestLinalgCUDA::test_permute_matmul_cuda, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_preferred_blas_library_cuda, test/test_linalg.py::TestLinalgCUDA::test_preferred_linalg_library_cuda, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_qr_error_cases_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_renorm_cuda, test/test_linalg.py::TestLinalgCUDA::test_renorm_ps_cuda, test/test_linalg.py::TestLinalgCUDA::test_rotating_buffer_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_rowwise_scaled_gemm_numerics_tunableop_cuda_float8_e4m3fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_offline_tunableop_cuda_float8_e4m3fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_offline_tunableop_cuda_float8_e5m2fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_tunableop_cuda_float8_e4m3fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_tunableop_cuda_float8_e5m2fnuz, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_solve_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_strided_mm_bmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_strided_mm_bmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_lowrank_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_svd_lowrank_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_symeig_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_tensordot_cuda, test/test_linalg.py::TestLinalgCUDA::test_tensordot_out_kernel_errors_with_autograd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensordot_out_kernel_errors_with_autograd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tf32_offline_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tf32_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_large_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_bool, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int32, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int8, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_uint8, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_large_matrix_64bit_cuda, test/test_linalg.py::TestLinalgCUDA::test_validator_tunableop_rocm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vdot_invalid_args_cuda, test/test_linalg.py::TestLinalgCUDA::test_vdot_vs_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_vdot_vs_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_decom_unbacked_checks_cuda, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_dim_tuple_arg_cuda, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_extreme_values_cuda, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_float64 2025-12-04T13:41:45.5643842Z 2025-12-04T13:41:45.5644260Z test_linalg.py::TestLinalgCUDA::test_1_sized_with_0_strided_cuda_float32 PASSED [0.8366s] [ 0%] 2025-12-04T13:41:45.5645141Z test_linalg.py::TestLinalgCUDA::test_1_sized_with_0_strided_cuda_float64 PASSED [0.0085s] [ 0%] 2025-12-04T13:41:45.5646148Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_128_n_11008_cuda SKIPPED [0.0035s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5647396Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_128_n_4096_cuda SKIPPED [0.0029s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5648504Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_64_n_11008_cuda SKIPPED [0.0029s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5649670Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_64_n_4096_cuda SKIPPED [0.0029s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5650781Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_128_n_11008_cuda SKIPPED [0.0032s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5651898Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_128_n_4096_cuda SKIPPED [0.0028s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5652996Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_64_n_11008_cuda SKIPPED [0.0028s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5654114Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_64_n_4096_cuda SKIPPED [0.0029s] (CUDA is unsupported) [ 0%] 2025-12-04T13:41:45.5655270Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_128_cuda SKIPPED [0.0032s] (CUDA Backend is unsupported) [ 0%] 2025-12-04T13:41:45.5656471Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_32_cuda SKIPPED [0.0028s] (CUDA Backend is unsupported) [ 0%] 2025-12-04T13:41:45.5657648Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_48_cuda SKIPPED [0.0028s] (CUDA Backend is unsupported) [ 1%] 2025-12-04T13:41:45.5658904Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_64_cuda SKIPPED [0.0029s] (CUDA Backend is unsupported) [ 1%] 2025-12-04T13:41:45.5660097Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_128_cuda SKIPPED [0.0033s] (CUDA Backend is unsupported) [ 1%] 2025-12-04T13:41:45.5661348Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_32_cuda SKIPPED [0.0028s] (CUDA Backend is unsupported) [ 1%] 2025-12-04T13:41:45.5662518Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_48_cuda SKIPPED [0.0029s] (CUDA Backend is unsupported) [ 1%] 2025-12-04T13:41:45.5663699Z test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_64_n_64_cuda SKIPPED [0.0028s] (CUDA Backend is unsupported) [ 1%] 2025-12-04T13:41:45.5664793Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_32_n_48_cuda SKIPPED [0.0029s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5665779Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_32_n_64_cuda SKIPPED [0.0033s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5666762Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_64_n_48_cuda SKIPPED [0.0029s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5667722Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_64_n_64_cuda SKIPPED [0.0023s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5668704Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_32_n_48_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5669738Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_32_n_64_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5670703Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_64_n_48_cuda SKIPPED [0.0017s] (requires SM80 or later) [ 1%] 2025-12-04T13:41:45.5671684Z test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_64_n_64_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 2%] 2025-12-04T13:41:45.5672833Z test_linalg.py::TestLinalgCUDA::test__int8_mm_large_shape_cuda SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 2%] 2025-12-04T13:41:45.5673987Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_False_slice_False_cuda PASSED [0.0036s] [ 2%] 2025-12-04T13:41:45.5674975Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_False_slice_True_cuda PASSED [0.0029s] [ 2%] 2025-12-04T13:41:45.5675976Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_True_slice_False_cuda PASSED [3.3803s] [ 2%] 2025-12-04T13:41:45.5676946Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_True_slice_True_cuda PASSED [0.0051s] [ 2%] 2025-12-04T13:41:45.5677954Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_False_slice_False_cuda PASSED [0.0029s] [ 2%] 2025-12-04T13:41:45.5678938Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_False_slice_True_cuda PASSED [0.0026s] [ 2%] 2025-12-04T13:41:45.5679894Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_True_slice_False_cuda PASSED [0.0043s] [ 2%] 2025-12-04T13:41:45.5680860Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_True_slice_True_cuda PASSED [0.0046s] [ 2%] 2025-12-04T13:41:45.5681830Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_False_slice_False_cuda PASSED [0.0030s] [ 2%] 2025-12-04T13:41:45.5682796Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_False_slice_True_cuda PASSED [0.0028s] [ 2%] 2025-12-04T13:41:45.5683769Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_True_slice_False_cuda PASSED [0.0046s] [ 3%] 2025-12-04T13:41:45.5684736Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_True_slice_True_cuda PASSED [0.0045s] [ 3%] 2025-12-04T13:41:45.5685708Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_False_slice_False_cuda PASSED [0.0028s] [ 3%] 2025-12-04T13:41:45.5686674Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_False_slice_True_cuda PASSED [0.0030s] [ 3%] 2025-12-04T13:41:45.5687642Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_True_slice_False_cuda PASSED [0.0045s] [ 3%] 2025-12-04T13:41:45.5688783Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_True_slice_True_cuda PASSED [0.0045s] [ 3%] 2025-12-04T13:41:45.5689815Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_False_slice_False_cuda PASSED [0.0028s] [ 3%] 2025-12-04T13:41:45.5692023Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_False_slice_True_cuda PASSED [0.0028s] [ 3%] 2025-12-04T13:41:45.5692984Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_True_slice_False_cuda PASSED [0.0048s] [ 3%] 2025-12-04T13:41:45.5693954Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_True_slice_True_cuda PASSED [0.0044s] [ 3%] 2025-12-04T13:41:45.5694929Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_False_slice_False_cuda PASSED [0.0028s] [ 3%] 2025-12-04T13:41:45.5695907Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_False_slice_True_cuda PASSED [0.0027s] [ 3%] 2025-12-04T13:41:45.5696869Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_True_slice_False_cuda PASSED [0.0045s] [ 3%] 2025-12-04T13:41:45.5697831Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_True_slice_True_cuda PASSED [0.0045s] [ 4%] 2025-12-04T13:41:45.5698939Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_False_slice_False_cuda PASSED [0.0032s] [ 4%] 2025-12-04T13:41:45.5699905Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_False_slice_True_cuda PASSED [0.0027s] [ 4%] 2025-12-04T13:41:45.5700881Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_True_slice_False_cuda PASSED [0.0045s] [ 4%] 2025-12-04T13:41:45.5701846Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_True_slice_True_cuda PASSED [0.0045s] [ 4%] 2025-12-04T13:41:45.5702822Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_False_slice_False_cuda PASSED [0.0028s] [ 4%] 2025-12-04T13:41:45.5703792Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_False_slice_True_cuda PASSED [0.0028s] [ 4%] 2025-12-04T13:41:45.5704765Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_True_slice_False_cuda PASSED [0.0050s] [ 4%] 2025-12-04T13:41:45.5705789Z test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_True_slice_True_cuda PASSED [0.0045s] [ 4%] 2025-12-04T13:41:45.5707012Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 4%] 2025-12-04T13:41:45.5708530Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 4%] 2025-12-04T13:41:45.5709978Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 4%] 2025-12-04T13:41:45.5711438Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 4%] 2025-12-04T13:41:45.5712897Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5714350Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5715793Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5717246Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5718693Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5720171Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5721612Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5723037Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5724489Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5725950Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5727410Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5728921Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 5%] 2025-12-04T13:41:45.5730382Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5731822Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5733276Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5734784Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5736238Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5737720Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5739240Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5740679Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5742137Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5743610Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5745072Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5746539Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5747993Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 6%] 2025-12-04T13:41:45.5749491Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5750945Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5752392Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5753843Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5755291Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5756741Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5758221Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5759663Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0021s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5761132Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5762604Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5764063Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5765534Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5767018Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 7%] 2025-12-04T13:41:45.5768471Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5769921Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5771377Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5772824Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5774404Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5775859Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5777315Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5778890Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5780352Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5781814Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5783276Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5784731Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.5786187Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5787632Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5789259Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5790711Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5792157Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5793591Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5795110Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5796593Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5798108Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5799569Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5801012Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5802485Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0017s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5803940Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.5805501Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0017s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5806954Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5808393Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5809888Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5811340Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5812799Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5814263Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5815724Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5817189Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5818770Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5820234Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5821674Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.5823128Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5824625Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5826077Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5827555Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5828983Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5830439Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5831911Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5833372Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5834833Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5836269Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5837726Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5839209Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5840675Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.5842128Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5843555Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5845002Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5846451Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5847958Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5849419Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5850890Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5852355Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5853847Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5855310Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5856789Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5858314Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5859770Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 12%] 2025-12-04T13:41:45.5861231Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5862688Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5864129Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5865588Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5867063Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5868573Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5870045Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5871487Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5872949Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5874413Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0020s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5875881Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5877429Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 13%] 2025-12-04T13:41:45.5878895Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5880353Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5881800Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5883263Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5884756Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5886262Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5887871Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5889336Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5890803Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5892259Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5893719Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5895172Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5896626Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.5898200Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5899651Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5901108Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5902574Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5904043Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5905499Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5907007Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5908465Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5909920Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5911381Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5912830Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5914323Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5915775Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 15%] 2025-12-04T13:41:45.5917265Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5918718Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5920162Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5921624Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5923074Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5924524Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5925970Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5927441Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5928894Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5930343Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5931784Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5933212Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 16%] 2025-12-04T13:41:45.5934656Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5936204Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5937705Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5939239Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5940679Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5942130Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5943620Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5945068Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5946548Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5947978Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5949422Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5950855Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5952291Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 17%] 2025-12-04T13:41:45.5953731Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5955198Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5956836Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5959089Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5960558Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5962006Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5963457Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5964910Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5966364Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5967846Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5969282Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5970728Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.5972191Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5973691Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5975165Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5976639Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5978091Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5979620Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5981076Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5982521Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5983985Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5985436Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5986880Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5988582Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5990034Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0020s] (Only runs on cpu) [ 19%] 2025-12-04T13:41:45.5991507Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.5992978Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.5994443Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.5995903Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.5997397Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.5998869Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6000327Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6001783Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6003234Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6004718Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6006167Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6007664Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6009134Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 20%] 2025-12-04T13:41:45.6010590Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6012059Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6013527Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6014984Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6016442Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6017919Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6019448Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6020900Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6022362Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6023814Z test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 21%] 2025-12-04T13:41:45.6024887Z test_linalg.py::TestLinalgCUDA::test__int_mm_errors_cuda PASSED [0.0050s] [ 21%] 2025-12-04T13:41:45.6025816Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_False_use_transpose_b_False_cuda PASSED [0.0038s] [ 21%] 2025-12-04T13:41:45.6026962Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_False_use_transpose_b_True_cuda PASSED [0.0036s] [ 22%] 2025-12-04T13:41:45.6028052Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_True_use_transpose_b_False_cuda PASSED [0.0031s] [ 22%] 2025-12-04T13:41:45.6029129Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_16_use_transpose_a_True_use_transpose_b_True_cuda PASSED [0.0019s] [ 22%] 2025-12-04T13:41:45.6030218Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_False_use_transpose_b_False_cuda PASSED [0.0026s] [ 22%] 2025-12-04T13:41:45.6031318Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_False_use_transpose_b_True_cuda PASSED [0.0026s] [ 22%] 2025-12-04T13:41:45.6032405Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_True_use_transpose_b_False_cuda PASSED [0.0019s] [ 22%] 2025-12-04T13:41:45.6033504Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_True_use_transpose_b_True_cuda PASSED [0.0022s] [ 22%] 2025-12-04T13:41:45.6034604Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_False_use_transpose_b_False_cuda PASSED [0.0026s] [ 22%] 2025-12-04T13:41:45.6035731Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_False_use_transpose_b_True_cuda PASSED [0.0026s] [ 22%] 2025-12-04T13:41:45.6036824Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_True_use_transpose_b_False_cuda PASSED [0.0018s] [ 22%] 2025-12-04T13:41:45.6037896Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_True_use_transpose_b_True_cuda PASSED [0.0018s] [ 22%] 2025-12-04T13:41:45.6038987Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_False_use_transpose_b_False_cuda PASSED [0.0026s] [ 22%] 2025-12-04T13:41:45.6040082Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_False_use_transpose_b_True_cuda PASSED [0.0030s] [ 22%] 2025-12-04T13:41:45.6041171Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_True_use_transpose_b_False_cuda PASSED [0.0018s] [ 23%] 2025-12-04T13:41:45.6042246Z test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_True_use_transpose_b_True_cuda PASSED [0.0018s] [ 23%] 2025-12-04T13:41:45.6043158Z test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_bfloat16 PASSED [0.6193s] [ 23%] 2025-12-04T13:41:45.6043909Z test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_complex128 PASSED [0.6718s] [ 23%] 2025-12-04T13:41:45.6044663Z test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_complex64 PASSED [0.6643s] [ 23%] 2025-12-04T13:41:45.6045390Z test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float16 PASSED [0.7092s] [ 23%] 2025-12-04T13:41:45.6046107Z test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float32 PASSED [0.6073s] [ 23%] 2025-12-04T13:41:45.6046856Z test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float64 PASSED [0.6046s] [ 23%] 2025-12-04T13:41:45.6047645Z test_linalg.py::TestLinalgCUDA::test_addmm_baddbmm_overflow_cuda_float16 PASSED [0.0151s] [ 23%] 2025-12-04T13:41:45.6048444Z test_linalg.py::TestLinalgCUDA::test_addmm_cuda_bfloat16 PASSED [0.0899s] [ 23%] 2025-12-04T13:41:45.6049174Z test_linalg.py::TestLinalgCUDA::test_addmm_cuda_complex128 PASSED [0.1227s] [ 23%] 2025-12-04T13:41:45.6049914Z test_linalg.py::TestLinalgCUDA::test_addmm_cuda_complex64 PASSED [0.0939s] [ 23%] 2025-12-04T13:41:45.6050625Z test_linalg.py::TestLinalgCUDA::test_addmm_cuda_float32 PASSED [0.0849s] [ 23%] 2025-12-04T13:41:45.6073551Z test_linalg.py::TestLinalgCUDA::test_addmm_cuda_float64 PASSED [0.1036s] [ 24%] 2025-12-04T13:41:45.6074315Z test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_bfloat16 PASSED [0.1014s] [ 24%] 2025-12-04T13:41:45.6075088Z test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float16 PASSED [0.1018s] [ 24%] 2025-12-04T13:41:45.6075854Z test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float32 PASSED [0.0956s] [ 24%] 2025-12-04T13:41:45.6076622Z test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float64 PASSED [0.1128s] [ 24%] 2025-12-04T13:41:45.6077713Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_bfloat16 PASSED [0.0030s] [ 24%] 2025-12-04T13:41:45.6078966Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_float16 PASSED [0.0022s] [ 24%] 2025-12-04T13:41:45.6080203Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_float32 PASSED [0.0021s] [ 24%] 2025-12-04T13:41:45.6081452Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_bfloat16 PASSED [0.0021s] [ 24%] 2025-12-04T13:41:45.6082677Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_float16 PASSED [0.0021s] [ 24%] 2025-12-04T13:41:45.6083919Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_float32 PASSED [0.0021s] [ 24%] 2025-12-04T13:41:45.6085209Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_bfloat16 PASSED [0.0024s] [ 24%] 2025-12-04T13:41:45.6086500Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_float16 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6087919Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_float32 PASSED [0.0021s] [ 25%] 2025-12-04T13:41:45.6089165Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6090411Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_float16 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6091654Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_float32 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6092882Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_bfloat16 PASSED [0.0021s] [ 25%] 2025-12-04T13:41:45.6094121Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_float16 PASSED [0.0024s] [ 25%] 2025-12-04T13:41:45.6095360Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_float32 PASSED [0.0021s] [ 25%] 2025-12-04T13:41:45.6096594Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6097831Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_float16 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6099202Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_float32 PASSED [0.0021s] [ 25%] 2025-12-04T13:41:45.6100442Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 25%] 2025-12-04T13:41:45.6101676Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_float16 PASSED [0.0027s] [ 25%] 2025-12-04T13:41:45.6102910Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_float32 PASSED [0.0020s] [ 26%] 2025-12-04T13:41:45.6104134Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 26%] 2025-12-04T13:41:45.6105366Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_float16 PASSED [0.0020s] [ 26%] 2025-12-04T13:41:45.6106599Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_5_cuda_float32 PASSED [0.0021s] [ 26%] 2025-12-04T13:41:45.6107840Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 26%] 2025-12-04T13:41:45.6109106Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_float16 PASSED [0.0024s] [ 26%] 2025-12-04T13:41:45.6110348Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_1_0_cuda_float32 PASSED [0.0021s] [ 26%] 2025-12-04T13:41:45.6111575Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 26%] 2025-12-04T13:41:45.6112795Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_float16 PASSED [0.0021s] [ 26%] 2025-12-04T13:41:45.6114003Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_float32 PASSED [0.0021s] [ 26%] 2025-12-04T13:41:45.6115268Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_bfloat16 PASSED [0.0021s] [ 26%] 2025-12-04T13:41:45.6116498Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_float16 PASSED [0.0021s] [ 26%] 2025-12-04T13:41:45.6117911Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_float32 PASSED [0.0024s] [ 26%] 2025-12-04T13:41:45.6119138Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6120351Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_float16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6121575Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_float32 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6122809Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6124036Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_float16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6125243Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_float32 PASSED [0.0024s] [ 27%] 2025-12-04T13:41:45.6126468Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6127691Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_float16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6128909Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_float32 PASSED [0.0021s] [ 27%] 2025-12-04T13:41:45.6130161Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 27%] 2025-12-04T13:41:45.6131390Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_float16 PASSED [0.0021s] [ 27%] 2025-12-04T13:41:45.6132612Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_float32 PASSED [0.0024s] [ 27%] 2025-12-04T13:41:45.6133837Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6135045Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_float16 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6136268Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_float32 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6137499Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6138857Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_float16 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6140068Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_float32 PASSED [0.0021s] [ 28%] 2025-12-04T13:41:45.6141302Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_bfloat16 PASSED [0.0024s] [ 28%] 2025-12-04T13:41:45.6142535Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_float16 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6143756Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_float32 PASSED [0.0021s] [ 28%] 2025-12-04T13:41:45.6144991Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_bfloat16 PASSED [0.0021s] [ 28%] 2025-12-04T13:41:45.6146238Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_float16 PASSED [0.0020s] [ 28%] 2025-12-04T13:41:45.6147461Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_0_cuda_float32 PASSED [0.0021s] [ 28%] 2025-12-04T13:41:45.6148729Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_bfloat16 PASSED [0.0024s] [ 28%] 2025-12-04T13:41:45.6149955Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_float16 PASSED [0.0020s] [ 29%] 2025-12-04T13:41:45.6151161Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_float32 PASSED [0.0021s] [ 29%] 2025-12-04T13:41:45.6152390Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 29%] 2025-12-04T13:41:45.6153617Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_float16 PASSED [0.0020s] [ 29%] 2025-12-04T13:41:45.6154835Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_float32 PASSED [0.0021s] [ 29%] 2025-12-04T13:41:45.6156046Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_bfloat16 PASSED [0.0024s] [ 29%] 2025-12-04T13:41:45.6157268Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_float16 PASSED [0.0020s] [ 29%] 2025-12-04T13:41:45.6158490Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_float32 PASSED [0.0020s] [ 29%] 2025-12-04T13:41:45.6159754Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 29%] 2025-12-04T13:41:45.6160967Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_float16 PASSED [0.0021s] [ 29%] 2025-12-04T13:41:45.6162188Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_float32 PASSED [0.0021s] [ 29%] 2025-12-04T13:41:45.6163418Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_bfloat16 PASSED [0.0021s] [ 29%] 2025-12-04T13:41:45.6164645Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_float16 PASSED [0.0024s] [ 30%] 2025-12-04T13:41:45.6165849Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_float32 PASSED [0.0021s] [ 30%] 2025-12-04T13:41:45.6167078Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6168306Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_float16 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6169560Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_float32 PASSED [0.0021s] [ 30%] 2025-12-04T13:41:45.6170791Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6172000Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_float16 PASSED [0.0024s] [ 30%] 2025-12-04T13:41:45.6173218Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_float32 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6174442Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6175703Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_float16 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6176911Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_float32 PASSED [0.0021s] [ 30%] 2025-12-04T13:41:45.6178269Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 30%] 2025-12-04T13:41:45.6179496Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_float16 PASSED [0.0024s] [ 30%] 2025-12-04T13:41:45.6180710Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_float32 PASSED [0.0020s] [ 31%] 2025-12-04T13:41:45.6181911Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_bfloat16 PASSED [0.0021s] [ 31%] 2025-12-04T13:41:45.6183134Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_float16 PASSED [0.0021s] [ 31%] 2025-12-04T13:41:45.6184348Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_float32 PASSED [0.0021s] [ 31%] 2025-12-04T13:41:45.6185566Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_bfloat16 PASSED [0.0021s] [ 31%] 2025-12-04T13:41:45.6186768Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_float16 PASSED [0.0020s] [ 31%] 2025-12-04T13:41:45.6188140Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_float32 PASSED [0.0024s] [ 31%] 2025-12-04T13:41:45.6189354Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 31%] 2025-12-04T13:41:45.6190641Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_float16 PASSED [0.0020s] [ 31%] 2025-12-04T13:41:45.6191862Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_float32 PASSED [0.0021s] [ 31%] 2025-12-04T13:41:45.6193060Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 31%] 2025-12-04T13:41:45.6194276Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_float16 PASSED [0.0021s] [ 31%] 2025-12-04T13:41:45.6195488Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_float32 PASSED [0.0027s] [ 31%] 2025-12-04T13:41:45.6196703Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 32%] 2025-12-04T13:41:45.6197910Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_float16 PASSED [0.0021s] [ 32%] 2025-12-04T13:41:45.6199169Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_1_0_cuda_float32 PASSED [0.0021s] [ 32%] 2025-12-04T13:41:45.6200387Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_bfloat16 PASSED [0.0020s] [ 32%] 2025-12-04T13:41:45.6201604Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_float16 PASSED [0.0020s] [ 32%] 2025-12-04T13:41:45.6202802Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_float32 PASSED [0.0024s] [ 32%] 2025-12-04T13:41:45.6204016Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_bfloat16 PASSED [0.0020s] [ 32%] 2025-12-04T13:41:45.6205233Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_float16 PASSED [0.0021s] [ 32%] 2025-12-04T13:41:45.6206494Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_float32 PASSED [0.0021s] [ 32%] 2025-12-04T13:41:45.6207704Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_bfloat16 PASSED [0.0020s] [ 32%] 2025-12-04T13:41:45.6208960Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_float16 PASSED [0.0021s] [ 32%] 2025-12-04T13:41:45.6210170Z test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_float32 PASSED [0.0021s] [ 32%] 2025-12-04T13:41:45.6211171Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_bfloat16 PASSED [0.0932s] [ 33%] 2025-12-04T13:41:45.6211927Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float16 PASSED [0.0935s] [ 33%] 2025-12-04T13:41:45.6212686Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float32 PASSED [0.0886s] [ 33%] 2025-12-04T13:41:45.6213450Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float64 PASSED [0.1131s] [ 33%] 2025-12-04T13:41:45.6214519Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_bfloat16 SKIPPED [0.0017s] (test doesn't currently work on the CUDA stack) [ 33%] 2025-12-04T13:41:45.6215862Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float16 SKIPPED [0.0016s] (test doesn't currently work on the CUDA stack) [ 33%] 2025-12-04T13:41:45.6217263Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float32 SKIPPED [0.0003s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 33%] 2025-12-04T13:41:45.6218732Z test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float64 SKIPPED [0.0020s] (test doesn't currently work on the CUDA stack) [ 33%] 2025-12-04T13:41:45.6219800Z test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_complex128 PASSED [0.0288s] [ 33%] 2025-12-04T13:41:45.6220618Z test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_complex64 PASSED [0.0265s] [ 33%] 2025-12-04T13:41:45.6221405Z test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_float32 PASSED [0.0258s] [ 33%] 2025-12-04T13:41:45.6222177Z test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_float64 PASSED [0.0265s] [ 33%] 2025-12-04T13:41:45.6222927Z test_linalg.py::TestLinalgCUDA::test_addmv_cuda_bfloat16 PASSED [0.0265s] [ 33%] 2025-12-04T13:41:45.6223651Z test_linalg.py::TestLinalgCUDA::test_addmv_cuda_complex128 PASSED [0.0280s] [ 34%] 2025-12-04T13:41:45.6224390Z test_linalg.py::TestLinalgCUDA::test_addmv_cuda_complex64 PASSED [0.0271s] [ 34%] 2025-12-04T13:41:45.6225113Z test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float16 PASSED [0.0265s] [ 34%] 2025-12-04T13:41:45.6225819Z test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float32 PASSED [0.0248s] [ 34%] 2025-12-04T13:41:45.6226534Z test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float64 PASSED [0.0248s] [ 34%] 2025-12-04T13:41:45.6227408Z test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_bfloat16 PASSED [0.0171s] [ 34%] 2025-12-04T13:41:45.6228480Z test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_float32 PASSED [0.0165s] [ 34%] 2025-12-04T13:41:45.6229484Z test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_float64 PASSED [0.0167s] [ 34%] 2025-12-04T13:41:45.6230359Z test_linalg.py::TestLinalgCUDA::test_addr_bool_cuda_bool PASSED [0.0108s] [ 34%] 2025-12-04T13:41:45.6231159Z test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_bfloat16 PASSED [0.0118s] [ 34%] 2025-12-04T13:41:45.6232052Z test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_complex128 PASSED [0.0188s] [ 34%] 2025-12-04T13:41:45.6232932Z test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_complex64 PASSED [0.0166s] [ 34%] 2025-12-04T13:41:45.6233819Z test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float16 PASSED [0.0112s] [ 34%] 2025-12-04T13:41:45.6234694Z test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float32 PASSED [0.0107s] [ 35%] 2025-12-04T13:41:45.6235592Z test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float64 PASSED [0.0104s] [ 35%] 2025-12-04T13:41:45.6236395Z test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int16 PASSED [0.0086s] [ 35%] 2025-12-04T13:41:45.6237194Z test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int32 PASSED [0.0084s] [ 35%] 2025-12-04T13:41:45.6237960Z test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int64 PASSED [0.0086s] [ 35%] 2025-12-04T13:41:45.6238713Z test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int8 PASSED [0.0081s] [ 35%] 2025-12-04T13:41:45.6239475Z test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_uint8 PASSED [0.0084s] [ 35%] 2025-12-04T13:41:45.6240240Z test_linalg.py::TestLinalgCUDA::test_addr_type_promotion_cuda PASSED [0.3882s] [ 35%] 2025-12-04T13:41:45.6240995Z test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_bfloat16 PASSED [1.1892s] [ 35%] 2025-12-04T13:41:45.6241744Z test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_complex128 PASSED [1.3312s] [ 35%] 2025-12-04T13:41:45.6242508Z test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_complex64 PASSED [1.3103s] [ 35%] 2025-12-04T13:41:45.6243252Z test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float16 PASSED [1.2148s] [ 35%] 2025-12-04T13:41:45.6243991Z test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float32 PASSED [1.1723s] [ 36%] 2025-12-04T13:41:45.6244708Z test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float64 PASSED [1.1754s] [ 36%] 2025-12-04T13:41:45.6245575Z test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float16 PASSED [0.0023s] [ 36%] 2025-12-04T13:41:45.6246579Z test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float32 PASSED [0.0023s] [ 36%] 2025-12-04T13:41:45.6247565Z test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_float64 PASSED [0.0018s] [ 36%] 2025-12-04T13:41:45.6248594Z test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int16 PASSED [0.0018s] [ 36%] 2025-12-04T13:41:45.6249578Z test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int32 PASSED [0.0018s] [ 36%] 2025-12-04T13:41:45.6250559Z test_linalg.py::TestLinalgCUDA::test_baddbmm_input_dtypes_compatibility_cuda_int64 PASSED [0.0022s] [ 36%] 2025-12-04T13:41:45.6251512Z test_linalg.py::TestLinalgCUDA::test_baddbmm_nan_input_with_zero_beta_cuda_float32 PASSED [0.0057s] [ 36%] 2025-12-04T13:41:45.6252425Z test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_bfloat16 PASSED [0.0030s] [ 36%] 2025-12-04T13:41:45.6253302Z test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_complex128 PASSED [0.0031s] [ 36%] 2025-12-04T13:41:45.6254183Z test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_complex64 PASSED [0.0031s] [ 36%] 2025-12-04T13:41:45.6255038Z test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float16 PASSED [0.0030s] [ 36%] 2025-12-04T13:41:45.6255902Z test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float32 PASSED [0.0033s] [ 37%] 2025-12-04T13:41:45.6256814Z test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float64 PASSED [0.0029s] [ 37%] 2025-12-04T13:41:45.6257615Z test_linalg.py::TestLinalgCUDA::test_blas_empty_cuda PASSED [0.0117s] [ 37%] 2025-12-04T13:41:45.6258511Z test_linalg.py::TestLinalgCUDA::test_blas_mv_large_input_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 37%] 2025-12-04T13:41:45.6259380Z test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_bfloat16 PASSED [0.0032s] [ 37%] 2025-12-04T13:41:45.6260177Z test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_complex128 PASSED [0.0030s] [ 37%] 2025-12-04T13:41:45.6260959Z test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_complex64 PASSED [0.0034s] [ 37%] 2025-12-04T13:41:45.6261738Z test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float16 PASSED [0.0030s] [ 37%] 2025-12-04T13:41:45.6262505Z test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float32 PASSED [0.0030s] [ 37%] 2025-12-04T13:41:45.6263316Z test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float64 PASSED [0.0030s] [ 37%] 2025-12-04T13:41:45.6264341Z test_linalg.py::TestLinalgCUDA::test_blaslog_tunableop_cuda_float16 SKIPPED [0.0014s] (test doesn't currently work on the CUDA stack) [ 37%] 2025-12-04T13:41:45.6265373Z test_linalg.py::TestLinalgCUDA::test_bmm_cuda_bfloat16 PASSED [1.6136s] [ 37%] 2025-12-04T13:41:45.6266096Z test_linalg.py::TestLinalgCUDA::test_bmm_cuda_complex128 PASSED [1.8615s] [ 38%] 2025-12-04T13:41:45.6266820Z test_linalg.py::TestLinalgCUDA::test_bmm_cuda_complex64 PASSED [1.7576s] [ 38%] 2025-12-04T13:41:45.6267523Z test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float16 PASSED [1.7515s] [ 38%] 2025-12-04T13:41:45.6268237Z test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float32 PASSED [1.5862s] [ 38%] 2025-12-04T13:41:45.6268945Z test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float64 PASSED [1.6606s] [ 38%] 2025-12-04T13:41:45.6269930Z test_linalg.py::TestLinalgCUDA::test_bmm_tunableop_rocm_cuda_float32 SKIPPED [0.0018s] (test doesn't currently work on the CUDA stack) [ 38%] 2025-12-04T13:41:45.6270984Z test_linalg.py::TestLinalgCUDA::test_broadcast_batched_matmul_cuda PASSED [0.0278s] [ 38%] 2025-12-04T13:41:45.6271793Z test_linalg.py::TestLinalgCUDA::test_broadcast_fused_matmul_cuda PASSED [0.0040s] [ 38%] 2025-12-04T13:41:45.6272849Z test_linalg.py::TestLinalgCUDA::test_call_count_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 38%] 2025-12-04T13:41:45.6273856Z test_linalg.py::TestLinalgCUDA::test_chain_matmul_cuda_float64 PASSED [0.0032s] [ 38%] 2025-12-04T13:41:45.6274765Z test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.6275813Z test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.6276869Z test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.6277877Z test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.6279003Z test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.6280234Z test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_complex64 SKIPPED [0.0018s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.6281447Z test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.6282639Z test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.6283633Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_complex128 PASSED [0.1494s] [ 39%] 2025-12-04T13:41:45.6284418Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_complex64 PASSED [0.0096s] [ 39%] 2025-12-04T13:41:45.6285191Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_float32 PASSED [0.0086s] [ 39%] 2025-12-04T13:41:45.6285982Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_float64 PASSED [0.0072s] [ 39%] 2025-12-04T13:41:45.6286795Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_complex128 PASSED [0.0032s] [ 39%] 2025-12-04T13:41:45.6287651Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_complex64 PASSED [0.0027s] [ 39%] 2025-12-04T13:41:45.6288670Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_float32 PASSED [0.0026s] [ 39%] 2025-12-04T13:41:45.6289485Z test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_float64 PASSED [0.0027s] [ 39%] 2025-12-04T13:41:45.6290464Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6291588Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6292768Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_float32 SKIPPED [0.0018s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6293856Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6295114Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6296427Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6297714Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6299052Z test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6300273Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_backward_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6301517Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6302831Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6304108Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.6305388Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6306745Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6307984Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6309132Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_float32 SKIPPED [0.0018s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6310277Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6311640Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_complex128 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T13:41:45.6313201Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_complex64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T13:41:45.6314750Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T13:41:45.6316272Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_float64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T13:41:45.6317640Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6318739Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_complex64 SKIPPED [0.0018s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6319812Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6320867Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.6322065Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.6323417Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.6324719Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.6326101Z test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.6327321Z test_linalg.py::TestLinalgCUDA::test_ck_blas_library_cuda SKIPPED [0.0019s] (test doesn't currently work on the CUDA stack) [ 42%] 2025-12-04T13:41:45.6328496Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_128_n_11008_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6329679Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_128_n_4096_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6330863Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_64_n_11008_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6332032Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_1_k_64_n_4096_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6333213Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_128_n_11008_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6334392Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_128_n_4096_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6335570Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_64_n_11008_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6336729Z test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_64_n_4096_cuda SKIPPED [0.0014s] (CUDA is unsupported) [ 42%] 2025-12-04T13:41:45.6337880Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_32_n_48_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6339003Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_32_n_64_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6340054Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_64_n_48_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6341093Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_64_n_64_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6342136Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_32_n_48_cuda SKIPPED [0.0018s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6343179Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_32_n_64_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6344220Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_64_n_48_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6345256Z test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_64_n_64_cuda SKIPPED [0.0014s] (requires SM80 or later) [ 43%] 2025-12-04T13:41:45.6346322Z test_linalg.py::TestLinalgCUDA::test_cond_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.6347318Z test_linalg.py::TestLinalgCUDA::test_cond_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.6348296Z test_linalg.py::TestLinalgCUDA::test_cond_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.6349240Z test_linalg.py::TestLinalgCUDA::test_cond_cuda_float64 SKIPPED [0.0021s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.6350320Z test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.6354120Z test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.6355323Z test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.6356543Z test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.6357599Z test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_bfloat16 PASSED [0.4230s] [ 44%] 2025-12-04T13:41:45.6358610Z test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_complex128 PASSED [0.7496s] [ 44%] 2025-12-04T13:41:45.6359573Z test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_complex64 PASSED [0.5528s] [ 44%] 2025-12-04T13:41:45.6360519Z test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float16 PASSED [0.4192s] [ 44%] 2025-12-04T13:41:45.6361464Z test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float32 PASSED [0.4209s] [ 44%] 2025-12-04T13:41:45.6362419Z test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float64 PASSED [0.5436s] [ 44%] 2025-12-04T13:41:45.6363262Z test_linalg.py::TestLinalgCUDA::test_cross_cuda_complex64 PASSED [0.0028s] [ 44%] 2025-12-04T13:41:45.6363989Z test_linalg.py::TestLinalgCUDA::test_cross_cuda_float32 PASSED [0.0022s] [ 44%] 2025-12-04T13:41:45.6364696Z test_linalg.py::TestLinalgCUDA::test_cross_error_cuda PASSED [0.0027s] [ 44%] 2025-12-04T13:41:45.6365506Z test_linalg.py::TestLinalgCUDA::test_cross_with_and_without_dim_cuda_complex64 PASSED [0.0023s] [ 45%] 2025-12-04T13:41:45.6366411Z test_linalg.py::TestLinalgCUDA::test_cross_with_and_without_dim_cuda_float32 PASSED [0.0022s] [ 45%] 2025-12-04T13:41:45.6367353Z test_linalg.py::TestLinalgCUDA::test_det_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 45%] 2025-12-04T13:41:45.6368324Z test_linalg.py::TestLinalgCUDA::test_det_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 45%] 2025-12-04T13:41:45.6369376Z test_linalg.py::TestLinalgCUDA::test_det_logdet_slogdet_batched_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 45%] 2025-12-04T13:41:45.6370376Z test_linalg.py::TestLinalgCUDA::test_det_logdet_slogdet_cuda_float64 PASSED [2.7973s] [ 45%] 2025-12-04T13:41:45.6371235Z test_linalg.py::TestLinalgCUDA::test_disable_tuning_tunableop_cuda_float32 PASSED [0.0195s] [ 45%] 2025-12-04T13:41:45.6372050Z test_linalg.py::TestLinalgCUDA::test_dot_invalid_args_cuda PASSED [0.0030s] [ 45%] 2025-12-04T13:41:45.6372799Z test_linalg.py::TestLinalgCUDA::test_dot_vs_numpy_cuda_complex64 PASSED [0.0049s] [ 45%] 2025-12-04T13:41:45.6373565Z test_linalg.py::TestLinalgCUDA::test_dot_vs_numpy_cuda_float32 PASSED [0.0045s] [ 45%] 2025-12-04T13:41:45.6374417Z test_linalg.py::TestLinalgCUDA::test_dump_results_on_exit_tunableop_cuda_float32 PASSED [8.0786s] [ 45%] 2025-12-04T13:41:45.6375547Z test_linalg.py::TestLinalgCUDA::test_eig_check_magma_cuda_float32 SKIPPED [0.0017s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 45%] 2025-12-04T13:41:45.6376798Z test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_complex128 SKIPPED [0.0021s] (no MAGMA library detected) [ 46%] 2025-12-04T13:41:45.6377992Z test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 46%] 2025-12-04T13:41:45.6379235Z test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 46%] 2025-12-04T13:41:45.6380365Z test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 46%] 2025-12-04T13:41:45.6381472Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 ('RERUN', {'yellow': True}) [0.0290s] [ 46%] 2025-12-04T13:41:45.6382589Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 ('RERUN', {'yellow': True}) [0.3028s] [ 46%] 2025-12-04T13:41:45.6383696Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 FAILED [0.1887s] [ 46%] 2025-12-04T13:41:45.6384231Z 2025-12-04T13:41:45.6384384Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.6384979Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6385526Z Traceback (most recent call last): 2025-12-04T13:41:45.6386233Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6386990Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6388225Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6389129Z 2025-12-04T13:41:45.6389345Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6390214Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6390866Z 2025-12-04T13:41:45.6391134Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6391832Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6392384Z Traceback (most recent call last): 2025-12-04T13:41:45.6393049Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6393800Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6394889Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6395786Z 2025-12-04T13:41:45.6396002Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6396865Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6397510Z 2025-12-04T13:41:45.6397768Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6398323Z =================================== FAILURES =================================== 2025-12-04T13:41:45.6398895Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6399441Z Traceback (most recent call last): 2025-12-04T13:41:45.6400092Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6400850Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6401927Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6402813Z 2025-12-04T13:41:45.6403036Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6403891Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6404637Z 2025-12-04T13:41:45.6404902Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6405898Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-77c9be2be0e8d8f0.xml - 2025-12-04T13:41:45.6406779Z =========================== short test summary info ============================ 2025-12-04T13:41:45.6408315Z FAILED [0.1887s] test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6409683Z 2025-12-04T13:41:45.6409896Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6410828Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6411475Z 2025-12-04T13:41:45.6411789Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6412364Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.6412946Z ============= 1 failed, 265 passed, 319 skipped, 2 rerun in 44.54s ============= 2025-12-04T13:41:45.6413404Z Got exit code 1 2025-12-04T13:41:45.6413668Z Retrying single test... 2025-12-04T13:41:45.6414245Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-87fea4734ba7fa90.xml 2025-12-04T13:41:45.6414956Z ============================= test session starts ============================== 2025-12-04T13:41:45.6415620Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.6416210Z cachedir: .pytest_cache 2025-12-04T13:41:45.6416922Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.6417705Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.6418059Z configfile: pytest.ini 2025-12-04T13:41:45.6418917Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.6419886Z collecting ... collected 1263 items / 1262 deselected / 1 selected 2025-12-04T13:41:45.6420855Z stepcurrent: skipping 584 already run items. Running only test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6421706Z Running 1 items in this shard 2025-12-04T13:41:45.6421916Z 2025-12-04T13:41:45.6422825Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 [W1204 13:35:21.210562996 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.6423882Z 2025-12-04T13:41:45.6424011Z ('RERUN', {'yellow': True}) [15.3830s] [100%] 2025-12-04T13:41:45.6425181Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 [W1204 13:35:36.761549095 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.6426223Z 2025-12-04T13:41:45.6426361Z ('RERUN', {'yellow': True}) [0.1675s] [100%] 2025-12-04T13:41:45.6427508Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 [W1204 13:35:36.903326076 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.6428554Z 2025-12-04T13:41:45.6428650Z FAILED [0.1397s] [100%] 2025-12-04T13:41:45.6428834Z 2025-12-04T13:41:45.6428971Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.6429544Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6430076Z Traceback (most recent call last): 2025-12-04T13:41:45.6430739Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6431543Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6432631Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6434278Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.6435294Z C++ CapturedTraceback: 2025-12-04T13:41:45.6436827Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.6438911Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.6439868Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.6441198Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6442544Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.6443531Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6444390Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6445405Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6448554Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6451366Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6451962Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6454278Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6456716Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.6457703Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6460761Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6463391Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6464121Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.6464990Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.6465756Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6466617Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6467587Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6468405Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6469120Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6469915Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6470777Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6471480Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6472249Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6473040Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6473828Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6474615Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6475389Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6476175Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6476954Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6477882Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6478802Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6479879Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6480825Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6481743Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6482678Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6483603Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6484537Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6485452Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6486378Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6487186Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6487996Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6488753Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6489609Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6490490Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6491223Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6491960Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6492814Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6493732Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6494645Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6495563Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6496395Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6497213Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6498203Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6499196Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6500106Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6501013Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6501860Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6502648Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6503384Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6504078Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.6504737Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6505502Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6506422Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6507333Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6508247Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6509160Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6509930Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6510687Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6511612Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6512525Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6513439Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6514344Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6515109Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6515872Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6516780Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6517693Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6518638Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6519550Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6520399Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6521185Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6521919Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6522654Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6523495Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6524444Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6525245Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6526009Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6526930Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6527875Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6528785Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6529694Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6530551Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6531346Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6532080Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6532817Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6533677Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6534608Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6535531Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6536455Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6537383Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6538378Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6539150Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6539929Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6540866Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6541792Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6542712Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6543642Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6544507Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6545311Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6546050Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6546839Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6547693Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6548624Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6549545Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6550479Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6551409Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6552337Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6553176Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.6553946Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.6554673Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.6555357Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.6556177Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.6556993Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.6557867Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.6558566Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.6559243Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.6559859Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.6560288Z #134 _start from ??:0 2025-12-04T13:41:45.6560592Z #135 from ??:0 2025-12-04T13:41:45.6560837Z 2025-12-04T13:41:45.6560842Z 2025-12-04T13:41:45.6561059Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6561928Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6562576Z 2025-12-04T13:41:45.6562842Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6563480Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.6564877Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.6566228Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6566825Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6567381Z Traceback (most recent call last): 2025-12-04T13:41:45.6568048Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6568811Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6569885Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6571550Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.6572564Z C++ CapturedTraceback: 2025-12-04T13:41:45.6574069Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.6576038Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.6577007Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.6578366Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6579723Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.6580741Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6581635Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6582664Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6585853Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6588811Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6589406Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6591723Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6594168Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.6595151Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6598125Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6600714Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6601438Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.6602299Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.6603073Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6603940Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6604863Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6605744Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6606451Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6607237Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6608049Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6608751Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6609535Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6610314Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6611121Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6611961Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6612722Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6613521Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6614287Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6615207Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6616132Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6617050Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6617975Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6618970Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6619898Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6620819Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6621748Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6622673Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6623599Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6624399Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6625092Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6625876Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6626741Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6627550Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6628300Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6629045Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6629894Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6630820Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6631750Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6632685Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6633496Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6634272Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6635197Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6636129Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6637042Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6637965Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6638830Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6639656Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6640433Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6641145Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.6641871Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6642641Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6643574Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6644509Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6645446Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6646370Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6647159Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6647951Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6648873Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6649812Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6650747Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6651684Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6652459Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6653249Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6654189Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6655127Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6656046Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6656982Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6657860Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6658736Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6659477Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6660224Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6661090Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6662013Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6662840Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6663624Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6664559Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6665478Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6666411Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6667340Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6668248Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6669077Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6669836Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6670604Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6671515Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6672451Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6673401Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6674351Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6675286Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6676231Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6677027Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6677822Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6678757Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6679698Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6680638Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6681580Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6682457Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6683277Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6684043Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6684805Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6685667Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6686614Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6687557Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6688631Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6689583Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6690529Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6691425Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.6692166Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.6692904Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.6693609Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.6694404Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.6695219Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.6695994Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.6696755Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.6697430Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.6698086Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.6698592Z #134 _start from ??:0 2025-12-04T13:41:45.6698897Z #135 from ??:0 2025-12-04T13:41:45.6699178Z 2025-12-04T13:41:45.6699183Z 2025-12-04T13:41:45.6699399Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6700273Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6700930Z 2025-12-04T13:41:45.6701208Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6701847Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.6703227Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.6704582Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6705057Z =================================== FAILURES =================================== 2025-12-04T13:41:45.6705639Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6706178Z Traceback (most recent call last): 2025-12-04T13:41:45.6706850Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6707616Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6708700Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6710366Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.6711381Z C++ CapturedTraceback: 2025-12-04T13:41:45.6712904Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.6714843Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.6715801Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.6717106Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6718462Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.6719507Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6720360Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6721397Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6724632Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6727461Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6728107Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6730450Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6732881Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.6733881Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6736873Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6739558Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6740296Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.6741184Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.6742080Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6742953Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6743896Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6744705Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6745415Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6746204Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6747027Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6747723Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6748507Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6749347Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6750123Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6750913Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6751694Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6752475Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6753241Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6754178Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6755154Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6756118Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6757041Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6758003Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6758932Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6759864Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6760785Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6761715Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6762652Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6763465Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6764165Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6764949Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6765830Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6766630Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6767381Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6768128Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6768991Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6769915Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6770851Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6771777Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6772552Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6773336Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6774266Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6775193Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6776114Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6777047Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6777958Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6778844Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6779590Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6780309Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.6780988Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6781771Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6782691Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6783729Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6784694Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6785614Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6786435Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6787219Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6788296Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6789214Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6789637Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6790013Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6790293Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6790666Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6791073Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6791461Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6791868Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6792249Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6792598Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6792905Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6793211Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6793518Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6793926Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6794310Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6794566Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6794948Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6795354Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6795722Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6796140Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6796574Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6796938Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6797248Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6797548Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6797867Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6798284Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6798665Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6799136Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6799557Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6799989Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6800406Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6800670Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6801062Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6801475Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6801866Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6802284Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6802665Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6803039Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6803352Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6803664Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6803973Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6804385Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6804780Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6805197Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6805579Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6806006Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6806383Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6806690Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.6806998Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.6807269Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.6807566Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.6808016Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.6808362Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.6808655Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.6808992Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.6809273Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.6809473Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.6809576Z #134 _start from ??:0 2025-12-04T13:41:45.6809712Z #135 from ??:0 2025-12-04T13:41:45.6809719Z 2025-12-04T13:41:45.6809724Z 2025-12-04T13:41:45.6809945Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6810481Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6810487Z 2025-12-04T13:41:45.6810809Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6811033Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.6812105Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.6812328Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6812929Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-87fea4734ba7fa90.xml - 2025-12-04T13:41:45.6813102Z =========================== short test summary info ============================ 2025-12-04T13:41:45.6814336Z FAILED [0.1397s] test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6815120Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.6815234Z C++ CapturedTraceback: 2025-12-04T13:41:45.6816563Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.6817053Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.6817402Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.6818292Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6818698Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.6819154Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6819432Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6820057Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6822472Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6822802Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6822970Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6825027Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6825350Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.6825904Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6828238Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6828419Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6828838Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.6829167Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.6829482Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6829894Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6830274Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6830582Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6830841Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6831226Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6831520Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6831780Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6832168Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6832426Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6832799Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6833063Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6833436Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6833704Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6834073Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6834483Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6834864Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6835301Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6835683Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6836089Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6836460Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6836875Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6837243Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6837678Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6838090Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6838381Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6838678Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6839046Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6839394Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6839707Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6839999Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6840320Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6840723Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6841094Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6841509Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6841880Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6842150Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6842516Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6842922Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6843304Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6843704Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6844078Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6844437Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6844742Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6845045Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6845309Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.6845564Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6845944Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6846348Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6846731Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6847168Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6847537Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6847801Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6848169Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6848581Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6848949Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6849385Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6849795Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6850054Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6850420Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6850877Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6851247Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6851664Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6852034Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6852385Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6852703Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6852999Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6853314Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6853719Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6854090Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6854361Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6854731Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6855137Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6855523Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6855933Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6856316Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6856665Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6856968Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6857279Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6857587Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6858017Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6858485Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6858907Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6859341Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6859757Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6860151Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6860418Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6860796Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6861226Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6861636Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6862097Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6862478Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6862870Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6863194Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6863490Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6863796Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6864222Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6864604Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6865034Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6865415Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6865829Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6866224Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6866517Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.6866840Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.6867111Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.6867396Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.6867764Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.6868094Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.6868385Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.6868671Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.6868938Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.6869147Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.6869253Z #134 _start from ??:0 2025-12-04T13:41:45.6869378Z #135 from ??:0 2025-12-04T13:41:45.6869384Z 2025-12-04T13:41:45.6869389Z 2025-12-04T13:41:45.6869623Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6870149Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6870155Z 2025-12-04T13:41:45.6870437Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6870652Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.6870859Z ================= 1 failed, 1262 deselected, 2 rerun in 15.81s ================= 2025-12-04T13:41:45.6870975Z Got exit code 1 2025-12-04T13:41:45.6871087Z Retrying single test... 2025-12-04T13:41:45.6871499Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-cdf15fa2abf8389b.xml 2025-12-04T13:41:45.6871674Z ============================= test session starts ============================== 2025-12-04T13:41:45.6872031Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.6872155Z cachedir: .pytest_cache 2025-12-04T13:41:45.6872711Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.6872844Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.6873001Z configfile: pytest.ini 2025-12-04T13:41:45.6873599Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.6873863Z collecting ... collected 1263 items / 1262 deselected / 1 selected 2025-12-04T13:41:45.6874496Z stepcurrent: skipping 584 already run items. Running only test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6874616Z Running 1 items in this shard 2025-12-04T13:41:45.6874622Z 2025-12-04T13:41:45.6875558Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 [W1204 13:35:44.004640715 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.6875564Z 2025-12-04T13:41:45.6875702Z ('RERUN', {'yellow': True}) [15.6097s] [100%] 2025-12-04T13:41:45.6876758Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 [W1204 13:36:00.781358244 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.6876767Z 2025-12-04T13:41:45.6876900Z ('RERUN', {'yellow': True}) [0.1668s] [100%] 2025-12-04T13:41:45.6877807Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 [W1204 13:36:00.912456557 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.6877826Z 2025-12-04T13:41:45.6877928Z FAILED [0.1291s] [100%] 2025-12-04T13:41:45.6877933Z 2025-12-04T13:41:45.6878072Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.6878369Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6878494Z Traceback (most recent call last): 2025-12-04T13:41:45.6878937Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6879148Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6879909Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6880693Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.6898216Z C++ CapturedTraceback: 2025-12-04T13:41:45.6899679Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.6900190Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.6900662Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.6901489Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6901892Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.6902333Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6902619Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6903277Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6905755Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6906084Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6906267Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6908299Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6908605Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.6909151Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6911434Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6911631Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6912041Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.6912378Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.6912681Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6913099Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6913477Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6913771Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6914075Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6914452Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6914748Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6915013Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6915383Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6915648Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6916020Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6916303Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6916683Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6916994Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6917378Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6917818Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6918187Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6918598Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6918967Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6919374Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6919753Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6920160Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6920545Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6920951Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6921319Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6921620Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6921874Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6922255Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6922604Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6922912Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6923216Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6923521Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6923936Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6924305Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6924711Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6925086Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6925343Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6925716Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6926162Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6926529Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6926944Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6927312Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6927654Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6927968Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6928292Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6928569Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.6928860Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6929242Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6929692Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6930063Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6930466Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6930847Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6931103Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6931488Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6931899Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6932273Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6932689Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6933063Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6933326Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6933695Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6934101Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6934481Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6934883Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6935265Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6935619Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6935925Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6936227Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6936527Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6936932Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6937315Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6937571Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6937986Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6938482Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6938859Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6939277Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6939648Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6940007Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6940345Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6940644Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6940994Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6941410Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6941829Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6942242Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6942618Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6943136Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6943516Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6943783Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6944171Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6944582Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6944971Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6945381Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6945757Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6946119Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6946428Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6946739Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6947045Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6947458Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6947849Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6948257Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6948646Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6949054Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6949429Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6949733Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.6950043Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.6950349Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.6950642Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.6950992Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.6951327Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.6951617Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.6951885Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.6952159Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.6952388Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.6952502Z #134 _start from ??:0 2025-12-04T13:41:45.6952621Z #135 from ??:0 2025-12-04T13:41:45.6952660Z 2025-12-04T13:41:45.6952665Z 2025-12-04T13:41:45.6952884Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.6953418Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.6953454Z 2025-12-04T13:41:45.6953721Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.6953957Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.6954978Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.6955174Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6955474Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.6955599Z Traceback (most recent call last): 2025-12-04T13:41:45.6956031Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.6956238Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.6956998Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.6957776Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.6957888Z C++ CapturedTraceback: 2025-12-04T13:41:45.6959208Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.6959709Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.6960048Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.6960864Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6961268Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.6961719Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.6961993Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6962635Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6965052Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6965380Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.6965558Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6967611Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.6967939Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.6968484Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6970771Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.6970952Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.6971369Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.6971691Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.6971996Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6972412Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6972788Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6973081Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6973345Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6973718Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6974017Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6974271Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6974642Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6974904Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6975275Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6975543Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6975943Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6976201Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6976583Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6977117Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6977490Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6977905Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6978402Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6978862Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6979240Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6979645Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6980057Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6980458Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6980841Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6981131Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.6981387Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6981770Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6982121Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6982435Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6982731Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6983028Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6983442Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6983812Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6984212Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6984597Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6984856Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6985236Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6985646Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6986013Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6986426Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6986791Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6987149Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6987458Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6987875Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6988217Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.6988472Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6988855Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6989258Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6989629Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6990047Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6990458Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6990715Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6991135Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6991540Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6991956Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6992363Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6992733Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6993000Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6993371Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6993786Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6994159Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6994558Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6994938Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6995284Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6995593Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.6995884Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.6996185Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.6996601Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.6996973Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6997234Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.6997611Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6998015Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6998391Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6998795Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.6999163Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.6999521Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.6999827Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7000241Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7000549Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7000966Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7001353Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7001766Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7002147Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7002599Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7002979Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7003285Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7003665Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7004107Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7004490Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7004903Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7005289Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7005644Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7005952Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7006259Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7006564Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7006991Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7007369Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7007778Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7008164Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7008576Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7008951Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7009254Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7009560Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7009841Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7010124Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7010475Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7010813Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7011102Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7011419Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7011746Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7012000Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7012113Z #134 _start from ??:0 2025-12-04T13:41:45.7012233Z #135 from ??:0 2025-12-04T13:41:45.7012241Z 2025-12-04T13:41:45.7012246Z 2025-12-04T13:41:45.7012465Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7012997Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.7013002Z 2025-12-04T13:41:45.7013262Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7013499Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7014554Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7014795Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7014944Z =================================== FAILURES =================================== 2025-12-04T13:41:45.7015226Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 ________ 2025-12-04T13:41:45.7016070Z Traceback (most recent call last): 2025-12-04T13:41:45.7016504Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7016696Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7017467Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7018355Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7018477Z C++ CapturedTraceback: 2025-12-04T13:41:45.7019797Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7020296Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7020647Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7021457Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7021868Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7022310Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7022591Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7023207Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7025620Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7025954Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7026121Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7028150Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7028483Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7029077Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7031368Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7031590Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7031998Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7032318Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7032636Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7033045Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7033431Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7033722Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7033977Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7034360Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7034650Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7034916Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7035284Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7035542Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7035915Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7036173Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7036540Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7036798Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7037169Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7037584Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7037953Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7038365Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7038771Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7039175Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7039554Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7039961Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7040329Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7040744Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7041141Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7041475Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7041731Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7042100Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7042490Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7042794Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7043088Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7043396Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7043803Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7044182Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7044589Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7044959Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7045227Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7045595Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7046011Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7046374Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7046778Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7047161Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7047511Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7047823Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7048118Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7048383Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7048645Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7049016Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7049419Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7049802Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7050207Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7050618Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7050874Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7051242Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7051655Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7052031Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7052439Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7052837Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7053096Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7053505Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7053906Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7054329Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7054733Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7055099Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7055452Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7055753Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7056045Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7056353Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7056759Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7057139Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7057396Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7057765Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7058247Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7058742Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7059160Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7059533Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7059882Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7060195Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7060493Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7060803Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7061225Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7061607Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7062029Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7062406Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7062859Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7063247Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7063513Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7063898Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7064311Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7064686Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7065139Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7065548Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7065915Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7066222Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7066551Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7066864Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7067276Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7067664Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7068074Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7068449Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7068873Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7069245Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7069537Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7069853Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7070121Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7070418Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7070763Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7071087Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7071387Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7071655Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7071931Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7072125Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7072224Z #134 _start from ??:0 2025-12-04T13:41:45.7072354Z #135 from ??:0 2025-12-04T13:41:45.7072361Z 2025-12-04T13:41:45.7072366Z 2025-12-04T13:41:45.7072577Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7073095Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.7073116Z 2025-12-04T13:41:45.7073381Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7073601Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7074665Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7074857Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7075437Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-cdf15fa2abf8389b.xml - 2025-12-04T13:41:45.7075619Z =========================== short test summary info ============================ 2025-12-04T13:41:45.7076870Z FAILED [0.1291s] test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7077674Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7077788Z C++ CapturedTraceback: 2025-12-04T13:41:45.7079106Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7079625Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7079961Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7080775Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7081180Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7081624Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7081898Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7082508Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7084911Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7085204Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7085366Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7087387Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7087689Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7088445Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7090744Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7090918Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7091382Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7091745Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7092052Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7092517Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7092894Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7093193Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7093450Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7093822Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7094129Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7094386Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7094760Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7095026Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7095397Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7095661Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7096028Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7096279Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7096656Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7097065Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7097445Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7097851Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7098286Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7098702Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7099071Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7099488Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7099855Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7100259Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7100638Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7100966Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7101218Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7101594Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7101940Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7102252Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7102543Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7102876Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7103289Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7103711Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7104128Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7104523Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7104778Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7105154Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7105553Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7105932Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7106332Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7106705Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7107063Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7107364Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7107655Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7107930Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7108186Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7108565Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7108970Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7109337Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7109748Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7110116Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7110385Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7110747Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7111151Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7111531Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7111935Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7112438Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7112739Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7113100Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7113516Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7113888Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7114285Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7114662Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7115039Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7115355Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7115672Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7115974Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7116420Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7116787Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7117053Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7117422Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7117820Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7118203Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7118605Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7118972Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7119332Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7119631Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7119939Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7120241Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7120655Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7121048Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7121460Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7121848Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7122254Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7122629Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7122899Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7123271Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7123688Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7124061Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7124473Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7124892Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7125245Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7125555Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7125856Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7126158Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7126579Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7126982Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7127397Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7127813Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7128228Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7128644Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7128933Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7129241Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7129519Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7129800Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7130158Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7130483Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7130773Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7131049Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7131315Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7131509Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7131618Z #134 _start from ??:0 2025-12-04T13:41:45.7131742Z #135 from ??:0 2025-12-04T13:41:45.7131749Z 2025-12-04T13:41:45.7131754Z 2025-12-04T13:41:45.7131982Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7132503Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.7132510Z 2025-12-04T13:41:45.7132779Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7132972Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.7133176Z ================= 1 failed, 1262 deselected, 2 rerun in 16.03s ================= 2025-12-04T13:41:45.7133288Z Got exit code 1 2025-12-04T13:41:45.7133723Z FAILED CONSISTENTLY: test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32 2025-12-04T13:41:45.7134132Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T13:41:45.7134537Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-301657435283457f.xml 2025-12-04T13:41:45.7134698Z ============================= test session starts ============================== 2025-12-04T13:41:45.7135052Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.7135171Z cachedir: .pytest_cache 2025-12-04T13:41:45.7135695Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.7135862Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.7135965Z configfile: pytest.ini 2025-12-04T13:41:45.7136561Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.7136801Z collecting ... collected 1263 items / 585 deselected / 678 selected 2025-12-04T13:41:45.7136944Z stepcurrent: skipping 585 already run items. 2025-12-04T13:41:45.7137055Z Running 678 items in this shard 2025-12-04T13:41:45.7137070Z 2025-12-04T13:41:45.7137567Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 ('RERUN', {'yellow': True}) [0.0357s] [ 0%] 2025-12-04T13:41:45.7138083Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 ('RERUN', {'yellow': True}) [0.1393s] [ 0%] 2025-12-04T13:41:45.7138596Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 FAILED [0.1239s] [ 0%] 2025-12-04T13:41:45.7138605Z 2025-12-04T13:41:45.7138746Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.7139062Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7139196Z Traceback (most recent call last): 2025-12-04T13:41:45.7139623Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7139825Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7140582Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7140589Z 2025-12-04T13:41:45.7140799Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7141326Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7141333Z 2025-12-04T13:41:45.7141599Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7141890Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7142010Z Traceback (most recent call last): 2025-12-04T13:41:45.7142440Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7142637Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7143391Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7143397Z 2025-12-04T13:41:45.7143614Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7144125Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7144132Z 2025-12-04T13:41:45.7144394Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7144548Z =================================== FAILURES =================================== 2025-12-04T13:41:45.7144829Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7144957Z Traceback (most recent call last): 2025-12-04T13:41:45.7145383Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7145573Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7146335Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7146341Z 2025-12-04T13:41:45.7146554Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7147107Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7147114Z 2025-12-04T13:41:45.7147374Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7147946Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-301657435283457f.xml - 2025-12-04T13:41:45.7148120Z =========================== short test summary info ============================ 2025-12-04T13:41:45.7149383Z FAILED [0.1239s] test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7149390Z 2025-12-04T13:41:45.7149609Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7150143Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7150149Z 2025-12-04T13:41:45.7150418Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7150626Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.7150818Z ================== 1 failed, 585 deselected, 2 rerun in 0.42s ================== 2025-12-04T13:41:45.7150923Z Got exit code 1 2025-12-04T13:41:45.7151024Z Retrying single test... 2025-12-04T13:41:45.7151424Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-274ad688fc221da2.xml 2025-12-04T13:41:45.7151595Z ============================= test session starts ============================== 2025-12-04T13:41:45.7151941Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.7152057Z cachedir: .pytest_cache 2025-12-04T13:41:45.7152580Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.7152699Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.7152814Z configfile: pytest.ini 2025-12-04T13:41:45.7153405Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.7153631Z collecting ... collected 1263 items / 1262 deselected / 1 selected 2025-12-04T13:41:45.7154241Z stepcurrent: skipping 585 already run items. Running only test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7154353Z Running 1 items in this shard 2025-12-04T13:41:45.7154358Z 2025-12-04T13:41:45.7155283Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 [W1204 13:36:16.092081205 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7155291Z 2025-12-04T13:41:45.7155422Z ('RERUN', {'yellow': True}) [16.1387s] [100%] 2025-12-04T13:41:45.7156325Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 [W1204 13:36:33.401792436 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7156343Z 2025-12-04T13:41:45.7156472Z ('RERUN', {'yellow': True}) [0.1708s] [100%] 2025-12-04T13:41:45.7157368Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 [W1204 13:36:33.537302548 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7157374Z 2025-12-04T13:41:45.7157482Z FAILED [0.1335s] [100%] 2025-12-04T13:41:45.7157487Z 2025-12-04T13:41:45.7157628Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.7157919Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7158069Z Traceback (most recent call last): 2025-12-04T13:41:45.7158505Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7158707Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7159459Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7160229Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7160337Z C++ CapturedTraceback: 2025-12-04T13:41:45.7161711Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7162209Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7162589Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7163405Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7163806Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7164243Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7164530Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7165138Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7167564Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7167840Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7168019Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7170044Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7170347Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7170889Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7173179Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7173386Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7173789Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7174116Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7174449Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7174866Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7175276Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7175567Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7175862Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7176236Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7176524Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7176785Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7177151Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7177415Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7177783Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7178042Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7178492Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7178750Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7179128Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7179533Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7179899Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7180314Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7180684Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7181097Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7181462Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7181864Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7182238Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7182642Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7183007Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7183300Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7183557Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7183930Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7184313Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7184617Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7184918Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7185216Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7185621Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7185989Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7186421Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7186821Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7187080Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7187448Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7188095Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7188464Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7188872Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7189239Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7189588Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7189898Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7190191Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7190464Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7190720Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7191090Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7191502Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7191869Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7192276Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7192646Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7192903Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7193278Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7193687Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7194053Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7194463Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7194829Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7195091Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7195460Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7195864Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7196321Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7196726Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7197111Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7197461Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7197767Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7198072Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7198414Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7198833Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7199245Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7199504Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7199921Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7200325Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7200698Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7201111Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7201485Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7201842Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7202149Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7202447Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7202769Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7203183Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7203573Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7203988Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7204364Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7204795Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7205174Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7205453Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7205834Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7206243Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7206635Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7207041Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7207414Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7207785Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7208094Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7208433Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7208741Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7209153Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7209540Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7209947Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7210336Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7210774Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7211180Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7211487Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7211796Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7212098Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7212383Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7212731Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7213060Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7213351Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7213620Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7213897Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7214093Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7214200Z #134 _start from ??:0 2025-12-04T13:41:45.7214319Z #135 from ??:0 2025-12-04T13:41:45.7214325Z 2025-12-04T13:41:45.7214330Z 2025-12-04T13:41:45.7214544Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7215073Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7215079Z 2025-12-04T13:41:45.7215343Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7215574Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7216597Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7216794Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7217084Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7217209Z Traceback (most recent call last): 2025-12-04T13:41:45.7217637Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7217835Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7218660Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7219434Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7219544Z C++ CapturedTraceback: 2025-12-04T13:41:45.7220871Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7221396Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7221728Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7222541Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7222986Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7223467Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7223740Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7224377Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7226796Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7227080Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7227260Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7229294Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7229596Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7230144Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7232440Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7232618Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7233027Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7233356Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7233663Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7234113Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7234485Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7234783Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7235039Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7235409Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7235701Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7235953Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7236353Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7236650Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7237024Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7237329Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7237699Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7237953Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7238328Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7238732Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7239112Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7239513Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7239885Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7240298Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7240662Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7241065Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7241447Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7241849Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7242227Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7242514Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7242769Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7243144Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7243491Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7243802Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7244095Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7244395Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7244806Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7245173Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7245592Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7245993Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7246247Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7246622Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7247024Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7247390Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7247799Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7248194Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7248579Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7248886Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7249175Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7249481Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7249734Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7250110Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7250518Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7250885Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7251294Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7251668Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7251922Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7252302Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7252705Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7253082Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7253481Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7253848Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7254111Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7254479Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7254891Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7255257Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7255658Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7256029Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7256375Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7256680Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7256970Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7257271Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7257713Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7258080Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7258414Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7258792Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7259194Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7259566Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7259999Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7260404Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7260763Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7261061Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7261401Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7261704Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7262121Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7262509Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7262922Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7263307Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7263719Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7264091Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7264365Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7264738Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7265146Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7265534Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7265943Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7266329Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7266686Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7266995Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7267299Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7267601Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7268019Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7268394Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7268800Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7269187Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7269601Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7270014Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7270310Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7270615Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7270892Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7271177Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7271525Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7271890Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7272180Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7272487Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7272755Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7272978Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7273088Z #134 _start from ??:0 2025-12-04T13:41:45.7273207Z #135 from ??:0 2025-12-04T13:41:45.7273213Z 2025-12-04T13:41:45.7273218Z 2025-12-04T13:41:45.7273445Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7273964Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7273970Z 2025-12-04T13:41:45.7274231Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7274459Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7275481Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7275682Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7275827Z =================================== FAILURES =================================== 2025-12-04T13:41:45.7276110Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7276244Z Traceback (most recent call last): 2025-12-04T13:41:45.7276670Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7276862Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7277630Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7278397Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7278522Z C++ CapturedTraceback: 2025-12-04T13:41:45.7279841Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7280339Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7280671Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7281482Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7281926Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7282363Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7282647Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7283256Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7285741Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7286053Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7286213Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7288378Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7288675Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7289231Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7291503Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7291692Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7292097Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7292430Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7292736Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7293140Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7293522Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7293813Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7294069Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7294451Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7294740Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7295066Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7295435Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7295691Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7296071Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7296322Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7296697Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7296947Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7297356Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7297825Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7298255Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7298662Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7299090Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7299494Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7299870Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7300274Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7300643Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7301057Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7301421Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7301719Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7301977Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7302342Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7302698Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7303001Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7303300Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7303599Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7304005Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7304380Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7304784Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7305152Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7305417Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7305786Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7306196Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7306566Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7306968Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7307377Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7307727Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7308036Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7308327Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7308593Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7308858Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7309361Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7309779Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7310199Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7310609Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7311017Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7311273Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7311641Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7312054Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7312425Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7312839Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7313211Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7313465Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7313844Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7314246Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7314622Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7315025Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7315390Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7315747Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7316053Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7316340Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7316647Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7317048Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7317423Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7317673Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7318041Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7318452Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7318822Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7319266Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7319631Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7319980Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7320291Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7320587Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7320905Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7321346Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7321727Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7322183Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7322561Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7323000Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7323391Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7323652Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7324038Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7324452Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7324827Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7325250Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7325624Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7325992Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7326300Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7326593Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7326907Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7327322Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7327712Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7328128Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7328501Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7328923Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7329298Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7329587Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7329902Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7330171Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7330469Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7330821Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7331179Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7331475Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7331746Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7332021Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7332216Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7332315Z #134 _start from ??:0 2025-12-04T13:41:45.7332444Z #135 from ??:0 2025-12-04T13:41:45.7332450Z 2025-12-04T13:41:45.7332455Z 2025-12-04T13:41:45.7332674Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7333241Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7333259Z 2025-12-04T13:41:45.7333557Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7333787Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7334840Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7335039Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7335631Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-274ad688fc221da2.xml - 2025-12-04T13:41:45.7335808Z =========================== short test summary info ============================ 2025-12-04T13:41:45.7337032Z FAILED [0.1335s] test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7337814Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7337926Z C++ CapturedTraceback: 2025-12-04T13:41:45.7339335Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7339824Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7340169Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7340991Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7341395Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7341848Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7342118Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7342729Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7345147Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7345470Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7345636Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7347727Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7348030Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7348610Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7350903Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7351087Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7351508Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7351833Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7352137Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7352559Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7352932Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7353239Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7353500Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7353878Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7354187Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7354441Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7354816Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7355089Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7355458Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7355722Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7356091Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7356347Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7356737Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7357178Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7357562Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7357967Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7358338Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7358758Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7359130Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7359577Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7359983Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7360393Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7360802Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7361095Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7361349Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7361733Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7362081Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7362400Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7362694Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7362995Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7363413Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7363783Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7364196Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7364566Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7364821Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7365201Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7365610Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7365993Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7366400Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7366769Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7367129Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7367429Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7367721Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7367997Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7368252Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7368639Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7369078Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7369446Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7369865Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7370232Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7370499Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7370867Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7371298Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7371681Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7372118Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7372494Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7372782Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7373148Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7373564Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7373929Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7374330Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7374707Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7375056Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7375367Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7375661Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7375960Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7376374Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7376739Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7377003Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7377375Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7377773Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7378219Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7378627Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7379006Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7379352Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7379654Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7379964Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7380274Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7380690Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7381128Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7381540Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7381931Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7382341Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7382719Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7382995Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7383408Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7383831Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7384309Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7384726Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7385147Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7385506Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7385817Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7386125Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7386432Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7386862Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7387245Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7387654Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7388193Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7388605Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7388991Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7389282Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7389589Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7389872Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7390161Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7390526Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7390851Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7391142Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7391425Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7391693Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7391890Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7392002Z #134 _start from ??:0 2025-12-04T13:41:45.7392123Z #135 from ??:0 2025-12-04T13:41:45.7392129Z 2025-12-04T13:41:45.7392137Z 2025-12-04T13:41:45.7392365Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7392885Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7392950Z 2025-12-04T13:41:45.7393218Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7393411Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.7393617Z ================= 1 failed, 1262 deselected, 2 rerun in 16.57s ================= 2025-12-04T13:41:45.7393732Z Got exit code 1 2025-12-04T13:41:45.7393836Z Retrying single test... 2025-12-04T13:41:45.7394242Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-c003f5ff32bae5e6.xml 2025-12-04T13:41:45.7394414Z ============================= test session starts ============================== 2025-12-04T13:41:45.7394807Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.7394917Z cachedir: .pytest_cache 2025-12-04T13:41:45.7395489Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.7395614Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.7395771Z configfile: pytest.ini 2025-12-04T13:41:45.7396365Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.7396595Z collecting ... collected 1263 items / 1262 deselected / 1 selected 2025-12-04T13:41:45.7397212Z stepcurrent: skipping 585 already run items. Running only test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7397327Z Running 1 items in this shard 2025-12-04T13:41:45.7397332Z 2025-12-04T13:41:45.7398254Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 [W1204 13:36:41.722673734 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7398265Z 2025-12-04T13:41:45.7398394Z ('RERUN', {'yellow': True}) [15.3314s] [100%] 2025-12-04T13:41:45.7399299Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 [W1204 13:36:57.222899427 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7399308Z 2025-12-04T13:41:45.7399450Z ('RERUN', {'yellow': True}) [0.1684s] [100%] 2025-12-04T13:41:45.7400354Z test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 [W1204 13:36:57.353060614 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7400360Z 2025-12-04T13:41:45.7400466Z FAILED [0.1281s] [100%] 2025-12-04T13:41:45.7400471Z 2025-12-04T13:41:45.7400612Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.7400896Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7401032Z Traceback (most recent call last): 2025-12-04T13:41:45.7401466Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7401673Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7402431Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7403195Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7403313Z C++ CapturedTraceback: 2025-12-04T13:41:45.7404630Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7405159Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7405491Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7406308Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7406712Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7407182Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7407497Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7408110Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7410545Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7410826Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7411006Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7413029Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7413334Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7413878Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7416162Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7416356Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7416757Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7417088Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7417389Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7417797Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7418251Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7418579Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7418848Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7419217Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7419507Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7419773Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7420144Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7420473Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7420847Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7421134Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7421514Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7421812Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7422183Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7422598Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7422969Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7423386Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7423763Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7424171Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7424555Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7424962Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7425343Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7425746Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7426116Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7426420Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7426679Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7427060Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7427413Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7427719Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7428024Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7428328Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7428733Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7429112Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7429518Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7429900Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7430192Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7430564Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7430975Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7431346Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7431761Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7432131Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7432480Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7432824Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7433153Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7433430Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7433717Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7434086Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7434502Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7434874Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7435278Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7435654Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7435920Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7436302Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7436705Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7437074Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7437485Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7437853Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7438114Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7438483Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7438884Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7439265Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7439663Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7440047Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7440388Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7440687Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7440994Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7441294Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7441696Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7442108Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7442362Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7442744Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7443148Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7443514Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7443930Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7444435Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7444836Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7445171Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7445475Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7445794Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7446239Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7446618Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7447042Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7447421Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7447847Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7448226Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7448488Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7448876Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7449289Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7449676Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7450085Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7450458Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7450824Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7451136Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7451443Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7451749Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7452162Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7452551Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7452965Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7453340Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7453763Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7454139Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7454472Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7454783Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7455055Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7455352Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7455703Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7456036Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7456324Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7456624Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7456900Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7457126Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7457228Z #134 _start from ??:0 2025-12-04T13:41:45.7457357Z #135 from ??:0 2025-12-04T13:41:45.7457392Z 2025-12-04T13:41:45.7457397Z 2025-12-04T13:41:45.7457614Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7458217Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7458224Z 2025-12-04T13:41:45.7458494Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7458725Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7459761Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7459962Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7460265Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7460390Z Traceback (most recent call last): 2025-12-04T13:41:45.7460830Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7461033Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7461794Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7462570Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7462686Z C++ CapturedTraceback: 2025-12-04T13:41:45.7464014Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7464526Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7464860Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7465686Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7466095Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7466543Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7466853Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7467467Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7469917Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7470243Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7470421Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7472481Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7472785Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7473332Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7475721Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7475905Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7476308Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7476638Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7476942Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7477365Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7477738Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7478033Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7478299Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7478671Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7478969Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7479225Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7479598Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7479861Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7480277Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7480531Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7480910Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7481162Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7481539Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7481945Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7482314Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7482753Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7483156Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7483572Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7483969Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7484371Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7484753Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7485156Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7485538Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7485950Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7486215Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7486593Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7486941Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7487244Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7487546Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7487996Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7488414Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7488790Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7489194Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7489577Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7489832Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7490214Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7490616Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7490981Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7491398Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7491768Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7492125Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7492502Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7492790Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7493072Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7493327Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7493696Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7494110Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7494479Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7494942Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7495349Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7495607Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7496026Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7496428Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7496811Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7497217Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7497587Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7497854Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7498286Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7498704Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7499072Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7499475Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7499856Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7500202Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7500506Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7500811Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7501111Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7501529Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7501898Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7502155Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7502533Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7502938Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7503319Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7503723Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7504088Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7504481Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7504782Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7505088Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7505395Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7505810Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7506201Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7506620Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7507022Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7507477Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7507861Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7508166Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7508548Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7508953Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7509336Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7509752Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7510140Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7510494Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7510802Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7511107Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7511414Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7511827Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7512212Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7512623Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7513010Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7513422Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7513800Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7514098Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7514406Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7514685Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7514969Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7515315Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7515649Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7515943Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7516220Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7516522Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7516715Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7516825Z #134 _start from ??:0 2025-12-04T13:41:45.7516946Z #135 from ??:0 2025-12-04T13:41:45.7516952Z 2025-12-04T13:41:45.7516957Z 2025-12-04T13:41:45.7517171Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7517694Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7517701Z 2025-12-04T13:41:45.7517966Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7518242Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7519299Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7519495Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7519679Z =================================== FAILURES =================================== 2025-12-04T13:41:45.7519959Z ________ TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 ________ 2025-12-04T13:41:45.7520091Z Traceback (most recent call last): 2025-12-04T13:41:45.7520522Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2286, in test_eig_cuda_complex_eigenvectors 2025-12-04T13:41:45.7520713Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7521478Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7522244Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7522361Z C++ CapturedTraceback: 2025-12-04T13:41:45.7523673Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7524162Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7524505Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7525315Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7525725Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7526165Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7526438Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7527053Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7529463Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7529784Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7529954Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7532027Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7532351Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7532909Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7535217Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7535410Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7535815Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7536139Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7536458Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7536869Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7537255Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7537546Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7537802Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7538258Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7538553Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7538825Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7539197Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7539455Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7539842Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7540095Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7540461Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7540725Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7541095Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7541514Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7541918Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7542320Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7542703Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7543104Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7543484Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7543883Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7544278Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7544697Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7545097Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7545397Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7545681Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7546049Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7546407Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7546706Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7546997Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7547309Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7547715Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7548097Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7548498Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7548865Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7549131Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7549498Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7549915Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7550280Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7550684Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7551066Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7551412Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7551715Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7552017Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7552282Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7552547Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7552912Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7553316Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7553729Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7554131Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7554514Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7554767Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7555133Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7555553Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7555920Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7556364Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7556766Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7557023Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7557436Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7557835Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7558203Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7558613Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7558982Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7559344Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7559647Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7560037Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7560361Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7560769Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7561151Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7561402Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7561770Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7562183Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7562549Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7562972Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7563338Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7571711Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7572117Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7572422Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7572730Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7573161Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7573548Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7574112Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7574492Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7574907Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7575294Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7575562Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7575939Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7576361Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7576774Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7577227Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7577605Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7577998Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7578433Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7578734Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7579053Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7579474Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7579855Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7580280Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7580658Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7581082Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7581459Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7581746Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7582065Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7582336Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7582623Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7582988Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7583323Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7583624Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7583896Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7584164Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7584373Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7584474Z #134 _start from ??:0 2025-12-04T13:41:45.7584598Z #135 from ??:0 2025-12-04T13:41:45.7584615Z 2025-12-04T13:41:45.7584621Z 2025-12-04T13:41:45.7584842Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7585363Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7585368Z 2025-12-04T13:41:45.7585686Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7585914Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7586948Z /var/lib/jenkins/workspace/test/test_linalg.py:2286: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7587138Z vals_complex, vecs_complex = torch.linalg.eig(A_complex) 2025-12-04T13:41:45.7587866Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-c003f5ff32bae5e6.xml - 2025-12-04T13:41:45.7588049Z =========================== short test summary info ============================ 2025-12-04T13:41:45.7589457Z FAILED [0.1281s] test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 - RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7590245Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7590395Z C++ CapturedTraceback: 2025-12-04T13:41:45.7591715Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7592212Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7592550Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7593370Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7593776Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7594212Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7594492Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7595104Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7597527Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7597811Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7597985Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7599998Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7600329Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7600880Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7603194Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7603398Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7603805Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7604139Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7604469Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7604882Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7605250Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7605544Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7605813Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7606182Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7606482Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7606732Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7607102Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7607365Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7607730Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7607982Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7608359Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7608611Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7608989Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7609399Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7609767Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7610182Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7610548Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7610965Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7611334Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7611737Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7612115Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7612546Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7612925Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7613216Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7613468Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7613846Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7614193Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7614492Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7614820Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7615149Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7615566Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7615978Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7616376Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7616753Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7617006Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7617384Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7617785Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7618240Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7618664Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7619033Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7619382Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7619695Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7619985Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7620267Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7620520Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7620885Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7621306Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7621674Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7622090Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7622456Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7622711Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7623089Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7623490Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7623873Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7624276Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7624685Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7624948Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7625316Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7625718Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7626094Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7626495Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7626912Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7627288Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7627592Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7627925Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7628222Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7628632Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7628998Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7629248Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7629633Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7630035Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7630419Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7630812Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7631179Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7631539Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7631838Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7632135Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7632448Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7632863Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7633258Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7633670Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7634047Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7634462Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7634835Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7635104Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7635481Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7635893Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7636306Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7636716Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7637098Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7637454Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7637763Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7638066Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7638372Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7638808Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7639227Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7639640Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7640049Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7640461Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7640831Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7641129Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7641436Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7641713Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7641995Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7642349Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7642684Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7642976Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7643240Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7643513Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7643707Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7643813Z #134 _start from ??:0 2025-12-04T13:41:45.7643932Z #135 from ??:0 2025-12-04T13:41:45.7643939Z 2025-12-04T13:41:45.7643944Z 2025-12-04T13:41:45.7644162Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7644688Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7644696Z 2025-12-04T13:41:45.7644962Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7645149Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.7645351Z ================= 1 failed, 1262 deselected, 2 rerun in 15.75s ================= 2025-12-04T13:41:45.7645446Z Got exit code 1 2025-12-04T13:41:45.7645892Z FAILED CONSISTENTLY: test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64 2025-12-04T13:41:45.7646299Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T13:41:45.7646699Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-e9e5772b33a862c1.xml 2025-12-04T13:41:45.7646870Z ============================= test session starts ============================== 2025-12-04T13:41:45.7647222Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.7647372Z cachedir: .pytest_cache 2025-12-04T13:41:45.7647896Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.7648017Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.7648135Z configfile: pytest.ini 2025-12-04T13:41:45.7648728Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.7648965Z collecting ... collected 1263 items / 586 deselected / 677 selected 2025-12-04T13:41:45.7649111Z stepcurrent: skipping 586 already run items. 2025-12-04T13:41:45.7649221Z Running 677 items in this shard 2025-12-04T13:41:45.7649226Z 2025-12-04T13:41:45.7649791Z test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_complex128 SKIPPED [0.0029s] (no MAGMA library detected) [ 0%] 2025-12-04T13:41:45.7650335Z test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_complex64 SKIPPED [0.0015s] (no MAGMA library detected) [ 0%] 2025-12-04T13:41:45.7650845Z test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_float32 SKIPPED [0.0015s] (no MAGMA library detected) [ 0%] 2025-12-04T13:41:45.7651375Z test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_float64 SKIPPED [0.0016s] (no MAGMA library detected) [ 0%] 2025-12-04T13:41:45.7651826Z test_linalg.py::TestLinalgCUDA::test_eig_numpy_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 0%] 2025-12-04T13:41:45.7652265Z test_linalg.py::TestLinalgCUDA::test_eig_numpy_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 0%] 2025-12-04T13:41:45.7652566Z test_linalg.py::TestLinalgCUDA::test_eig_removed_error_cuda PASSED [0.0053s] [ 1%] 2025-12-04T13:41:45.7653024Z test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 1%] 2025-12-04T13:41:45.7653491Z test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 1%] 2025-12-04T13:41:45.7653935Z test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 1%] 2025-12-04T13:41:45.7654384Z test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_float64 SKIPPED [0.0017s] (no MAGMA library detected) [ 1%] 2025-12-04T13:41:45.7654809Z test_linalg.py::TestLinalgCUDA::test_eigh_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 1%] 2025-12-04T13:41:45.7655230Z test_linalg.py::TestLinalgCUDA::test_eigh_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 1%] 2025-12-04T13:41:45.7655647Z test_linalg.py::TestLinalgCUDA::test_eigh_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7656058Z test_linalg.py::TestLinalgCUDA::test_eigh_cuda_float64 SKIPPED [0.0017s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7656590Z test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7657113Z test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7657618Z test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7658194Z test_linalg.py::TestLinalgCUDA::test_eigh_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7658677Z test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_complex128 SKIPPED [0.0017s] (no MAGMA library detected) [ 2%] 2025-12-04T13:41:45.7659163Z test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 3%] 2025-12-04T13:41:45.7659625Z test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 3%] 2025-12-04T13:41:45.7660122Z test_linalg.py::TestLinalgCUDA::test_eigh_lower_uplo_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 3%] 2025-12-04T13:41:45.7660577Z test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_complex128 SKIPPED [0.0015s] (Only runs on cpu) [ 3%] 2025-12-04T13:41:45.7661013Z test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_complex64 SKIPPED [0.0017s] (Only runs on cpu) [ 3%] 2025-12-04T13:41:45.7661442Z test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_float32 SKIPPED [0.0014s] (Only runs on cpu) [ 3%] 2025-12-04T13:41:45.7661861Z test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_float64 SKIPPED [0.0014s] (Only runs on cpu) [ 3%] 2025-12-04T13:41:45.7662389Z test_linalg.py::TestLinalgCUDA::test_eigh_svd_illcondition_matrix_input_should_not_crash_cuda_float32 PASSED [0.9860s] [ 4%] 2025-12-04T13:41:45.7662899Z test_linalg.py::TestLinalgCUDA::test_eigh_svd_illcondition_matrix_input_should_not_crash_cuda_float64 PASSED [0.1430s] [ 4%] 2025-12-04T13:41:45.7663478Z test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_complex128 SKIPPED [0.0017s] (no MAGMA library detected) [ 4%] 2025-12-04T13:41:45.7664045Z test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_complex64 SKIPPED [0.0015s] (no MAGMA library detected) [ 4%] 2025-12-04T13:41:45.7664555Z test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 4%] 2025-12-04T13:41:45.7665064Z test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 4%] 2025-12-04T13:41:45.7665615Z test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7666147Z test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7666681Z test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7667205Z test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7667687Z test_linalg.py::TestLinalgCUDA::test_eigvals_numpy_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7668143Z test_linalg.py::TestLinalgCUDA::test_eigvals_numpy_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7668591Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_complex128 SKIPPED [0.0017s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.7669047Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7669479Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7669922Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7670469Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7671012Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7671550Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_float32 SKIPPED [0.0017s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7672079Z test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.7672400Z test_linalg.py::TestLinalgCUDA::test_einsum_corner_cases_cuda PASSED [0.0115s] [ 7%] 2025-12-04T13:41:45.7672703Z test_linalg.py::TestLinalgCUDA::test_einsum_cuda_complex128 PASSED [0.0525s] [ 7%] 2025-12-04T13:41:45.7673027Z test_linalg.py::TestLinalgCUDA::test_einsum_cuda_float64 PASSED [0.0437s] [ 7%] 2025-12-04T13:41:45.7673340Z test_linalg.py::TestLinalgCUDA::test_einsum_error_cases_cuda PASSED [0.0057s] [ 7%] 2025-12-04T13:41:45.7673693Z test_linalg.py::TestLinalgCUDA::test_einsum_output_layout_cuda_float32 PASSED [0.0028s] [ 7%] 2025-12-04T13:41:45.7674022Z test_linalg.py::TestLinalgCUDA::test_einsum_random_cuda_complex128 PASSED [4.3685s] [ 7%] 2025-12-04T13:41:45.7674344Z test_linalg.py::TestLinalgCUDA::test_einsum_random_cuda_float64 PASSED [3.8770s] [ 7%] 2025-12-04T13:41:45.7674715Z test_linalg.py::TestLinalgCUDA::test_einsum_sublist_format_cuda_complex128 PASSED [0.0202s] [ 8%] 2025-12-04T13:41:45.7675072Z test_linalg.py::TestLinalgCUDA::test_einsum_sublist_format_cuda_float64 PASSED [0.0168s] [ 8%] 2025-12-04T13:41:45.7675634Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_32_cuda SKIPPED [0.0016s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.7676191Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_35_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.7676729Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_36_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.7677288Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_40_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 8%] 2025-12-04T13:41:45.7677826Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_64_cuda SKIPPED [0.0019s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7678357Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_32_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7678885Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_35_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7679425Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_36_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7679952Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_40_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7680485Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_64_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7681011Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_32_cuda SKIPPED [0.0019s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.7681548Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_35_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7682072Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_36_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7682601Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_40_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7683136Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_64_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7683664Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_32_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7684195Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_35_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7684719Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_36_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.7685250Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_40_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7685791Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_64_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7686351Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_32_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7686890Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_35_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7687417Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_36_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7688078Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_40_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7688676Z test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_64_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 11%] 2025-12-04T13:41:45.7689340Z test_linalg.py::TestLinalgCUDA::test_gemm_bias_offline_tunableop_cuda_bfloat16 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 12%] 2025-12-04T13:41:45.7689706Z test_linalg.py::TestLinalgCUDA::test_gemm_bias_tunableop_cuda_bfloat16 PASSED [0.4608s] [ 12%] 2025-12-04T13:41:45.7690006Z test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_complex128 PASSED [0.0298s] [ 12%] 2025-12-04T13:41:45.7690341Z test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_complex64 PASSED [0.0283s] [ 12%] 2025-12-04T13:41:45.7690635Z test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_float32 PASSED [0.0257s] [ 12%] 2025-12-04T13:41:45.7690914Z test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_float64 PASSED [0.0253s] [ 12%] 2025-12-04T13:41:45.7691300Z test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_complex128 PASSED [0.0218s] [ 12%] 2025-12-04T13:41:45.7691667Z test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_complex64 PASSED [0.0184s] [ 13%] 2025-12-04T13:41:45.7692020Z test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_float32 PASSED [0.0159s] [ 13%] 2025-12-04T13:41:45.7692382Z test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_float64 PASSED [0.0155s] [ 13%] 2025-12-04T13:41:45.7692806Z test_linalg.py::TestLinalgCUDA::test_householder_product_errors_and_warnings_cuda PASSED [0.0040s] [ 13%] 2025-12-04T13:41:45.7693109Z test_linalg.py::TestLinalgCUDA::test_inner_cuda_complex64 PASSED [0.0198s] [ 13%] 2025-12-04T13:41:45.7693393Z test_linalg.py::TestLinalgCUDA::test_inner_cuda_float32 PASSED [0.0182s] [ 13%] 2025-12-04T13:41:45.7693774Z test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_complex128 PASSED [0.0066s] [ 14%] 2025-12-04T13:41:45.7694158Z test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_complex64 PASSED [0.0053s] [ 14%] 2025-12-04T13:41:45.7694524Z test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_float32 PASSED [0.0053s] [ 14%] 2025-12-04T13:41:45.7694892Z test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_float64 PASSED [0.0054s] [ 14%] 2025-12-04T13:41:45.7695259Z test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_complex128 PASSED [0.0021s] [ 14%] 2025-12-04T13:41:45.7695604Z test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_complex64 PASSED [0.0025s] [ 14%] 2025-12-04T13:41:45.7695941Z test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_float32 PASSED [0.0021s] [ 14%] 2025-12-04T13:41:45.7696272Z test_linalg.py::TestLinalgCUDA::test_inv_ex_info_device_cuda_float64 PASSED [0.0021s] [ 15%] 2025-12-04T13:41:45.7696602Z test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_complex128 PASSED [0.0035s] [ 15%] 2025-12-04T13:41:45.7696946Z test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_complex64 PASSED [0.0032s] [ 15%] 2025-12-04T13:41:45.7697273Z test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_float32 PASSED [0.0031s] [ 15%] 2025-12-04T13:41:45.7697609Z test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_float64 PASSED [0.0038s] [ 15%] 2025-12-04T13:41:45.7698244Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 ('RERUN', {'yellow': True}) [0.0375s] [ 15%] 2025-12-04T13:41:45.7698875Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 ('RERUN', {'yellow': True}) [0.1816s] [ 15%] 2025-12-04T13:41:45.7699366Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 FAILED [0.1410s] [ 15%] 2025-12-04T13:41:45.7699373Z 2025-12-04T13:41:45.7699511Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.7699837Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.7700600Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7700607Z 2025-12-04T13:41:45.7700877Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.7700882Z 2025-12-04T13:41:45.7701013Z Traceback (most recent call last): 2025-12-04T13:41:45.7701534Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.7701757Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.7702103Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.7702270Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.7702617Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.7702762Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.7703678Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver." 2025-12-04T13:41:45.7703684Z 2025-12-04T13:41:45.7703902Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7704491Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7704497Z 2025-12-04T13:41:45.7704779Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7705094Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.7705859Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7705865Z 2025-12-04T13:41:45.7706104Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.7706109Z 2025-12-04T13:41:45.7706228Z Traceback (most recent call last): 2025-12-04T13:41:45.7706721Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.7706931Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.7707253Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.7707425Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.7707760Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.7707916Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.7708816Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver." 2025-12-04T13:41:45.7708822Z 2025-12-04T13:41:45.7709043Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7709639Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7709644Z 2025-12-04T13:41:45.7709940Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7710090Z =================================== FAILURES =================================== 2025-12-04T13:41:45.7710408Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.7711166Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7711172Z 2025-12-04T13:41:45.7711410Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.7711416Z 2025-12-04T13:41:45.7711537Z Traceback (most recent call last): 2025-12-04T13:41:45.7712071Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.7712278Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.7712618Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.7712791Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.7713126Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.7713310Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.7714211Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver." 2025-12-04T13:41:45.7714217Z 2025-12-04T13:41:45.7714435Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7715024Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7715030Z 2025-12-04T13:41:45.7715292Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7715884Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-e9e5772b33a862c1.xml - 2025-12-04T13:41:45.7716054Z =========================== short test summary info ============================ 2025-12-04T13:41:45.7717521Z FAILED [0.1410s] test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 - AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver." 2025-12-04T13:41:45.7717528Z 2025-12-04T13:41:45.7717741Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7718329Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7718345Z 2025-12-04T13:41:45.7718608Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7718790Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.7719039Z ====== 1 failed, 36 passed, 70 skipped, 586 deselected, 2 rerun in 10.99s ====== 2025-12-04T13:41:45.7719138Z Got exit code 1 2025-12-04T13:41:45.7719243Z Retrying single test... 2025-12-04T13:41:45.7719653Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-67ba282cb3193875.xml 2025-12-04T13:41:45.7719818Z ============================= test session starts ============================== 2025-12-04T13:41:45.7720170Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.7720276Z cachedir: .pytest_cache 2025-12-04T13:41:45.7720805Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.7720933Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.7721039Z configfile: pytest.ini 2025-12-04T13:41:45.7721665Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.7721899Z collecting ... collected 1263 items / 1262 deselected / 1 selected 2025-12-04T13:41:45.7722574Z stepcurrent: skipping 692 already run items. Running only test/test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7722692Z Running 1 items in this shard 2025-12-04T13:41:45.7722698Z 2025-12-04T13:41:45.7723698Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 [W1204 13:37:25.599682026 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7723705Z 2025-12-04T13:41:45.7724260Z [W1204 13:37:25.000469806 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7724296Z 2025-12-04T13:41:45.7724428Z ('RERUN', {'yellow': True}) [16.7406s] [100%] 2025-12-04T13:41:45.7725418Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 [W1204 13:37:41.549184656 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7725456Z 2025-12-04T13:41:45.7725974Z [W1204 13:37:41.549816541 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7725979Z 2025-12-04T13:41:45.7726107Z ('RERUN', {'yellow': True}) [0.1841s] [100%] 2025-12-04T13:41:45.7727100Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 [W1204 13:37:41.698167999 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7727106Z 2025-12-04T13:41:45.7727623Z [W1204 13:37:41.698769009 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.7727630Z 2025-12-04T13:41:45.7727735Z FAILED [0.1466s] [100%] 2025-12-04T13:41:45.7727742Z 2025-12-04T13:41:45.7727878Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.7728192Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.7728957Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7729719Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7729837Z C++ CapturedTraceback: 2025-12-04T13:41:45.7731144Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7731631Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7731971Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7732770Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7733180Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7733616Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7733930Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7734536Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7736970Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7737288Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7737456Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7739553Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7739879Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7740433Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7742716Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7742907Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7743310Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7743628Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7743942Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7744349Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7744731Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7745023Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7745277Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7745656Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7745942Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7746202Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7746577Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7746825Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7747237Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7747490Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7747856Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7748114Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7748481Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7748888Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7749253Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7749682Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7750088Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7750495Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7750899Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7751300Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7751665Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7752074Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7752437Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7752733Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7752986Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7753351Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7753701Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7754002Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7754288Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7754594Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7754998Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7755368Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7755766Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7756136Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7756394Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7756761Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7757169Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7757530Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7757927Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7758303Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7758647Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7758993Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7759287Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7759549Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7759807Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7760173Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7760574Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7760951Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7761379Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7761787Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7762044Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7762409Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7762848Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7763215Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7763622Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7763988Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7764240Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7764613Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7765018Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7765381Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7765788Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7766151Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7766502Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7766801Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7767090Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7767395Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7767798Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7768167Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7768420Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7768784Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7769188Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7769555Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7769966Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7770330Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7770674Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7771017Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7771319Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7771619Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7772041Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7772414Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7772829Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7773233Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7773671Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7774055Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7774342Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7774722Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7775130Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7775500Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7775915Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7776291Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7776652Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7776963Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7777256Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7777570Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7777978Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7778421Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7778838Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7779213Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7779632Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7780009Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7780293Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7780611Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7780872Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7781164Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7781514Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7781833Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7782129Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7782395Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7782765Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7782966Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7783065Z #134 _start from ??:0 2025-12-04T13:41:45.7783190Z #135 from ??:0 2025-12-04T13:41:45.7783196Z 2025-12-04T13:41:45.7783201Z 2025-12-04T13:41:45.7783440Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.7783446Z 2025-12-04T13:41:45.7783563Z Traceback (most recent call last): 2025-12-04T13:41:45.7784063Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.7784269Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.7784617Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.7784787Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.7785149Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.7785304Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.7786239Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7787010Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7787116Z C++ CapturedTraceback: 2025-12-04T13:41:45.7788586Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7789079Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7789412Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7790224Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7790622Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7791061Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7791337Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7791948Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7794365Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7794642Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7794811Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7796904Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7797200Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7797739Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7800103Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7800334Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7800732Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7801058Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7801359Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7801772Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7802142Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7802435Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7802698Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7803070Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7803358Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7803621Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7803989Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7804251Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7804622Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7804875Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7805256Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7805507Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7805885Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7806293Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7806659Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7807068Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7807442Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7807851Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7808252Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7808651Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7809031Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7809436Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7809806Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7810105Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7810358Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7810885Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7811268Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7811567Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7811902Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7812200Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7812611Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7813000Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7813487Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7813867Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7814122Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7814499Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7814898Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7815267Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7815674Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7816038Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7816380Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7816691Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7816977Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7817255Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7817510Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7817876Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7818365Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7818731Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7819138Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7819502Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7819757Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7820134Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7820582Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7820949Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7821356Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7821717Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7821974Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7822339Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7822776Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7823181Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7823583Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7823989Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7824333Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7824630Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7824928Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7825224Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7825634Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7826004Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7826256Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7826627Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7827028Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7827392Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7827800Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7828165Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7828516Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7828813Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7829110Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7829421Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7829833Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7830218Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7830627Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7831001Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7831425Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7831795Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7832094Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7832468Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7832877Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7833259Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7833667Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7834037Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7834398Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7834731Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7835062Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7835369Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7835806Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7836190Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7836597Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7836976Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7837387Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7837761Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7838060Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7838364Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7838635Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7838917Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7839264Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7839597Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7839884Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7840151Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7840424Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7840614Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7840722Z #134 _start from ??:0 2025-12-04T13:41:45.7840837Z #135 from ??:0 2025-12-04T13:41:45.7840919Z " 2025-12-04T13:41:45.7840927Z 2025-12-04T13:41:45.7841153Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7841745Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7841751Z 2025-12-04T13:41:45.7842017Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7842244Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7843264Z /var/lib/jenkins/workspace/test/test_linalg.py:2965: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7843398Z V = torch.linalg.eig(A).eigenvectors 2025-12-04T13:41:45.7843744Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.7844502Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7845274Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7845383Z C++ CapturedTraceback: 2025-12-04T13:41:45.7846737Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7847268Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7847614Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7848453Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7848857Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7849298Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7849570Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7850192Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7852607Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7852892Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7853055Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7855074Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7855371Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7855910Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7858283Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7858496Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7858910Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7859228Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7859535Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7859940Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7860337Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7860641Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7860924Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7861291Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7861621Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7861870Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7862242Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7862494Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7862862Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7863123Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7863492Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7863754Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7864121Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7864527Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7864895Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7865295Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7865659Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7866072Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7866436Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7866848Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7867213Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7867614Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7867990Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7868274Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7868530Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7868894Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7869239Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7869581Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7869867Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7870171Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7870574Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7870941Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7871343Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7871705Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7871984Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7872396Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7872804Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7873178Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7873610Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7873976Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7874332Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7874631Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7874931Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7875199Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7875457Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7875828Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7876233Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7876608Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7877013Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7877381Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7877643Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7878015Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7878421Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7878798Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7879201Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7879576Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7879831Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7880196Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7880605Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7880973Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7881384Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7881784Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7882129Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7882441Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7882729Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7883031Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7883431Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7883829Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7884095Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7884496Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7884903Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7885308Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7885706Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7886080Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7886422Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7886724Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7887029Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7887338Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7887948Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7888334Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7888741Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7889123Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7889535Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7889922Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7890185Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7890565Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7890980Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7891355Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7891765Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7892146Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7892500Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7892813Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7893112Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7893418Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7893896Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7894274Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7894687Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7895061Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7895469Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7895853Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7896194Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7896541Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7896815Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7897094Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7897489Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7897810Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7898095Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7898471Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7898732Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7898934Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7899031Z #134 _start from ??:0 2025-12-04T13:41:45.7899147Z #135 from ??:0 2025-12-04T13:41:45.7899156Z 2025-12-04T13:41:45.7899162Z 2025-12-04T13:41:45.7899413Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.7899421Z 2025-12-04T13:41:45.7899539Z Traceback (most recent call last): 2025-12-04T13:41:45.7900031Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.7900240Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.7900545Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.7900719Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.7901050Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.7901198Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.7902106Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7902871Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7902988Z C++ CapturedTraceback: 2025-12-04T13:41:45.7904296Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7904796Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7905125Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7905968Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7906380Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7906813Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7907091Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7907697Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7910155Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7910474Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7910635Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7912671Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7912961Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7913511Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7915919Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7916109Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7916513Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7916833Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7917137Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7917542Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7917924Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7918213Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7918469Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7918841Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7919168Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7919429Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7919803Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7920057Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7920434Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7920689Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7921071Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7921356Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7921759Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7922173Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7922570Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7922978Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7923356Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7923761Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7924136Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7924544Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7924912Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7925319Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7925694Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7925988Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7926244Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7926610Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7926959Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7927265Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7927556Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7927870Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7928272Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7928648Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7929048Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7929415Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7929672Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7930042Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7930448Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7930845Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7931245Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7931621Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7931967Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7932272Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7932559Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7932821Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7933110Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7933512Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7933915Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7934289Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7934719Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7935090Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7935341Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7935709Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7936120Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7936483Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7936893Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7937260Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7937518Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7937890Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7938394Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7938769Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7939170Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7939532Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7939884Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7940184Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7940475Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7940779Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7941184Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7941557Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7941809Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7942178Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7942586Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7943000Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7943408Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7943772Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7944114Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7944424Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7944717Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7945058Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7945506Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7945884Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7946300Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7946708Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7947119Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7947501Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7947761Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7948144Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7948556Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7948935Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7949354Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7949729Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7950093Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7950402Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7950695Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7951011Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7951423Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7951804Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7952215Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7952593Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7953012Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7953383Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7953673Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.7953985Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.7954257Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.7954551Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.7954932Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.7955256Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.7955546Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.7955814Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.7956081Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.7956269Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.7956368Z #134 _start from ??:0 2025-12-04T13:41:45.7956496Z #135 from ??:0 2025-12-04T13:41:45.7956610Z " 2025-12-04T13:41:45.7956617Z 2025-12-04T13:41:45.7956831Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.7957465Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.7957472Z 2025-12-04T13:41:45.7957839Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.7958180Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.7959619Z /var/lib/jenkins/workspace/test/test_linalg.py:2965: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.7959745Z V = torch.linalg.eig(A).eigenvectors 2025-12-04T13:41:45.7959897Z =================================== FAILURES =================================== 2025-12-04T13:41:45.7960213Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.7960977Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.7961739Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.7961850Z C++ CapturedTraceback: 2025-12-04T13:41:45.7963176Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.7963660Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.7964002Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.7964808Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7965206Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.7965652Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.7965916Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7966530Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7968940Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7969268Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.7969434Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7971525Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.7971818Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.7972460Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7974733Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.7974910Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.7975316Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.7975634Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.7975941Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7976347Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7976718Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7977011Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7977267Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7977644Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7977930Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7978271Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7978646Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7978898Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7979264Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7979522Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7979886Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7980144Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7980511Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7980950Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7981324Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7981725Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7982096Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7982498Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7982865Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7983790Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7984198Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7984609Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7985025Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7985316Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.7985576Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7985942Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7986288Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7986598Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7986889Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7987199Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.7987603Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.7988149Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7988562Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7988934Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7989199Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7989565Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7989974Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7990355Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7990760Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7991141Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7991493Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7991793Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.7992092Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.7992352Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.7992607Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7992979Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7993445Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7993821Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7994222Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7994589Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7994852Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7995216Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7995667Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7996032Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7996477Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7996851Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7997142Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.7997505Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7997919Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7998286Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7998694Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.7999061Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.7999411Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.7999721Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8000014Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8000322Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8000723Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8001088Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8001347Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8001717Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8002128Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8002498Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8002902Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8003282Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8003625Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8003926Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8004224Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8004530Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8004954Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8005365Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8005777Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8006161Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8006577Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8006955Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8007216Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8007618Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8008034Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8008438Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8008856Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8009260Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8009613Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8009931Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8010225Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8010528Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8010953Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8011332Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8011745Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8012120Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8012530Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8012910Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8013199Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8013508Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8013774Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8014058Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8014414Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8014736Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8015024Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8015297Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8015561Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8015761Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8015859Z #134 _start from ??:0 2025-12-04T13:41:45.8015973Z #135 from ??:0 2025-12-04T13:41:45.8015980Z 2025-12-04T13:41:45.8015987Z 2025-12-04T13:41:45.8016240Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.8016282Z 2025-12-04T13:41:45.8016402Z Traceback (most recent call last): 2025-12-04T13:41:45.8016902Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.8017114Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.8017423Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.8017599Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.8017934Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.8018082Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.8019115Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8019915Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8020039Z C++ CapturedTraceback: 2025-12-04T13:41:45.8021355Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8021880Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8022214Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8023025Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8023436Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8023876Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8024160Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8024771Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8027182Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8027465Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8027632Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8029649Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8029941Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8030527Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8032798Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8033017Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8033421Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8033802Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8034116Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8034558Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8034942Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8035232Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8035502Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8035875Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8036165Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8036440Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8036813Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8037067Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8037456Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8037713Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8038091Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8038344Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8038717Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8039141Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8039514Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8039931Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8040304Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8040703Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8041079Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8041479Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8041848Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8042260Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8042658Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8042957Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8043210Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8043572Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8043932Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8044232Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8044531Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8044861Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8045293Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8045677Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8046080Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8046488Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8046742Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8047109Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8047518Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8047883Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8048286Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8048663Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8049008Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8049322Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8049614Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8049880Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8050142Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8050509Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8050924Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8051293Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8051698Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8052076Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8052332Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8052708Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8053109Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8053474Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8053884Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8054250Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8054535Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8054910Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8055315Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8055687Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8056089Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8056454Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8056839Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8057172Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8057475Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8057775Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8058290Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8058675Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8058932Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8059299Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8059716Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8060085Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8060502Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8060871Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8061259Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8061652Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8061948Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8062259Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8062675Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8063053Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8063474Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8063853Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8064272Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8064647Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8064908Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8065294Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8065704Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8066076Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8066499Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8066917Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8067283Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8067589Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8067883Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8068195Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8068609Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8069027Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8069468Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8069847Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8070267Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8070672Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8070971Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8071278Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8071550Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8071842Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8072191Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8072516Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8072812Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8073085Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8073357Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8073549Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8073650Z #134 _start from ??:0 2025-12-04T13:41:45.8073777Z #135 from ??:0 2025-12-04T13:41:45.8073861Z " 2025-12-04T13:41:45.8073867Z 2025-12-04T13:41:45.8074081Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.8074688Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8074695Z 2025-12-04T13:41:45.8074963Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.8075195Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.8076217Z /var/lib/jenkins/workspace/test/test_linalg.py:2965: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.8076344Z V = torch.linalg.eig(A).eigenvectors 2025-12-04T13:41:45.8076929Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-67ba282cb3193875.xml - 2025-12-04T13:41:45.8077101Z =========================== short test summary info ============================ 2025-12-04T13:41:45.8078561Z FAILED [0.1466s] test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 - AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8079358Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8079480Z C++ CapturedTraceback: 2025-12-04T13:41:45.8080788Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8081274Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8081660Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8082492Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8082934Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8083368Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8083641Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8084265Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8086679Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8086970Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8087133Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8089357Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8089646Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8090198Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8092477Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8092663Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8093132Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8093452Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8093763Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8094167Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8094546Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8094837Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8095132Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8095511Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8095850Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8096103Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8096518Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8096775Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8097154Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8097408Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8097772Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8098037Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8098489Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8098904Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8099276Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8099680Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8100054Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8100458Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8100835Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8101244Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8101614Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8102029Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8102397Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8102682Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8102947Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8103318Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8103673Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8103979Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8104269Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8104615Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8105023Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8105401Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8105804Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8106172Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8106437Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8106807Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8107246Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8107647Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8108057Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8108463Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8108808Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8109111Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8109409Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8109678Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8109946Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8110315Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8110722Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8111093Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8111500Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8111876Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8112133Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8112500Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8112915Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8113282Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8113693Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8114061Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8114316Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8114698Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8115101Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8115465Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8115873Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8116243Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8116633Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8116934Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8117224Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8117532Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8117937Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8118310Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8118565Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8118962Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8119406Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8119781Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8120179Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8120589Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8120933Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8121246Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8121541Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8121846Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8122269Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8122651Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8123069Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8123449Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8123854Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8124246Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8124504Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8124886Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8125298Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8125678Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8126098Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8126472Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8126821Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8127140Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8127433Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8127743Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8128154Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8128526Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8128990Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8129362Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8129778Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8130150Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8130438Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8130753Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8131048Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8131374Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8131726Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8132048Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8132377Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8132645Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8132910Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8133114Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8133212Z #134 _start from ??:0 2025-12-04T13:41:45.8133345Z #135 from ??:0 2025-12-04T13:41:45.8133427Z " 2025-12-04T13:41:45.8133432Z 2025-12-04T13:41:45.8133651Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.8134253Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8134262Z 2025-12-04T13:41:45.8134529Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.8134708Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.8134917Z ================= 1 failed, 1262 deselected, 2 rerun in 17.19s ================= 2025-12-04T13:41:45.8135011Z Got exit code 1 2025-12-04T13:41:45.8135126Z Retrying single test... 2025-12-04T13:41:45.8135528Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-f88fa3b886672d5c.xml 2025-12-04T13:41:45.8135687Z ============================= test session starts ============================== 2025-12-04T13:41:45.8136048Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.8136154Z cachedir: .pytest_cache 2025-12-04T13:41:45.8136678Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.8136808Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.8136914Z configfile: pytest.ini 2025-12-04T13:41:45.8137516Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.8137740Z collecting ... collected 1263 items / 1262 deselected / 1 selected 2025-12-04T13:41:45.8138498Z stepcurrent: skipping 692 already run items. Running only test/test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8138627Z Running 1 items in this shard 2025-12-04T13:41:45.8138632Z 2025-12-04T13:41:45.8139634Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 [W1204 13:37:50.709909419 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.8139678Z 2025-12-04T13:41:45.8140211Z [W1204 13:37:50.109084083 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.8140218Z 2025-12-04T13:41:45.8140349Z ('RERUN', {'yellow': True}) [16.7372s] [100%] 2025-12-04T13:41:45.8141347Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 [W1204 13:38:06.639929721 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.8141353Z 2025-12-04T13:41:45.8141869Z [W1204 13:38:06.640577171 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.8141874Z 2025-12-04T13:41:45.8142032Z ('RERUN', {'yellow': True}) [0.1750s] [100%] 2025-12-04T13:41:45.8143072Z test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 [W1204 13:38:06.777608250 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.8143079Z 2025-12-04T13:41:45.8143594Z [W1204 13:38:06.778217431 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T13:41:45.8143628Z 2025-12-04T13:41:45.8143735Z FAILED [0.1354s] [100%] 2025-12-04T13:41:45.8143740Z 2025-12-04T13:41:45.8143877Z ==================================== RERUNS ==================================== 2025-12-04T13:41:45.8144203Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.8144962Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8145729Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8145846Z C++ CapturedTraceback: 2025-12-04T13:41:45.8147155Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8147653Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8147985Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8148794Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8149207Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8149647Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8149933Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8150538Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8152959Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8153277Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8153460Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8155520Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8155809Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8156404Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8158721Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8158907Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8159308Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8159641Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8159944Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8160351Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8160731Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8161020Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8161285Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8161654Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8161940Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8162201Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8162575Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8162824Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8163202Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8163452Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8163825Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8164080Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8164447Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8164861Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8165229Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8165672Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8166039Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8166441Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8166815Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8167216Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8167587Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8168018Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8168481Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8168787Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8169041Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8169450Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8169810Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8170112Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8170416Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8170712Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8171118Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8171496Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8171897Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8172275Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8172527Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8172894Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8173303Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8173671Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8174085Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8174456Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8174802Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8175114Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8175405Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8175670Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8175930Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8176304Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8176718Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8177089Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8177525Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8177900Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8178220Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8178601Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8179001Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8179366Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8179809Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8180179Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8180470Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8180837Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8181273Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8181647Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8182048Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8182414Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8182768Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8183069Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8183373Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8183674Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8184077Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8184450Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8184702Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8185080Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8185482Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8185849Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8186259Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8186623Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8186973Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8187271Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8187566Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8188031Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8188446Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8188826Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8189245Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8189675Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8190091Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8190471Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8190730Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8191113Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8191522Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8191945Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8192394Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8192770Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8193129Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8193472Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8193767Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8194075Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8194485Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8194867Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8195275Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8195652Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8196066Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8196443Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8196744Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8197045Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8197308Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8197595Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8197945Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8198271Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8198558Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8198827Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8199099Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8199290Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8199387Z #134 _start from ??:0 2025-12-04T13:41:45.8199511Z #135 from ??:0 2025-12-04T13:41:45.8199517Z 2025-12-04T13:41:45.8199522Z 2025-12-04T13:41:45.8199766Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.8199771Z 2025-12-04T13:41:45.8199895Z Traceback (most recent call last): 2025-12-04T13:41:45.8200381Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.8200587Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.8200936Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.8201111Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.8201452Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.8201601Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.8202502Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8203274Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8203410Z C++ CapturedTraceback: 2025-12-04T13:41:45.8204765Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8205280Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8205611Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8206430Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8206832Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8207277Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8207545Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8208153Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8210573Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8210852Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8211025Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8213043Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8213337Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8213881Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8216215Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8216394Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8216803Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8217169Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8217471Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8217932Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8218421Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8218767Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8219025Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8219394Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8219691Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8219946Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8220318Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8220583Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8220953Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8221221Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8221593Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8221848Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8222224Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8222629Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8223003Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8223405Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8223774Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8224188Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8224554Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8224969Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8225333Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8225736Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8226116Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8226406Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8226692Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8227066Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8227415Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8227725Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8228010Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8228307Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8228716Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8229111Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8229557Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8229927Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8230182Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8230594Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8230995Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8231360Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8231768Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8232135Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8232490Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8232788Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8233078Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8233349Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8233601Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8233979Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8234380Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8234744Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8235155Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8235527Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8235789Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8236157Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8236558Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8236931Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8237331Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8237696Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8237955Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8246824Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8247425Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8247817Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8248231Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8248616Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8248966Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8249273Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8249634Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8249950Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8250414Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8250791Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8251095Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8251480Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8251889Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8252272Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8252681Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8253052Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8253413Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8253722Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8254040Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8254350Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8254770Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8255158Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8255574Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8255957Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8256379Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8256758Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8257038Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8257419Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8257835Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8258306Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8258721Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8259113Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8259474Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8259825Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8260140Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8260444Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8260869Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8261248Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8261658Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8262078Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8262521Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8262905Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8263238Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8263547Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8263829Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8264114Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8264494Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8264829Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8265124Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8265442Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8265733Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8266027Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8266163Z #134 _start from ??:0 2025-12-04T13:41:45.8266293Z #135 from ??:0 2025-12-04T13:41:45.8266419Z " 2025-12-04T13:41:45.8266427Z 2025-12-04T13:41:45.8270819Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.8271430Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8271438Z 2025-12-04T13:41:45.8271714Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.8271939Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.8272969Z /var/lib/jenkins/workspace/test/test_linalg.py:2965: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.8273107Z V = torch.linalg.eig(A).eigenvectors 2025-12-04T13:41:45.8273425Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.8274186Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8274955Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8275065Z C++ CapturedTraceback: 2025-12-04T13:41:45.8276416Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8276943Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8277284Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8278099Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8278516Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8278995Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8279299Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8279924Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8282369Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8282657Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8282824Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8284849Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8285139Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8285684Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8288147Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8288329Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8288741Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8289060Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8289372Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8289778Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8290225Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8290518Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8290775Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8291154Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8291438Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8291691Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8292066Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8292364Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8292787Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8293053Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8293417Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8293713Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8294079Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8294483Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8294857Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8295261Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8295634Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8296036Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8296402Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8296814Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8297179Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8297588Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8297952Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8298306Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8298565Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8298935Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8299298Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8299599Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8299888Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8300193Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8300593Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8300959Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8301376Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8301746Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8302045Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8302412Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8302817Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8303188Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8303589Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8303963Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8304338Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8304643Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8304973Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8305240Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8305521Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8305896Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8306298Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8306673Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8307074Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8307442Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8307706Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8308072Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8308488Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8308855Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8309258Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8309635Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8309889Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8310271Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8310674Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8311043Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8311455Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8311824Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8312319Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8312783Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8313096Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8313531Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8314155Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8314634Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8314899Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8315271Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8315686Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8316053Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8316457Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8316873Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8317217Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8317562Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8317858Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8318200Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8318623Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8319002Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8319413Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8319804Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8320216Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8320605Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8320872Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8321250Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8321670Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8322044Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8322465Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8322837Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8323195Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8323519Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8323815Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8324123Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8324540Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8324916Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8325331Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8325706Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8326118Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8326503Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8326821Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8327133Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8327404Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8327686Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8328047Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8328367Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8328673Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8328970Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8329263Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8329468Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8329565Z #134 _start from ??:0 2025-12-04T13:41:45.8329715Z #135 from ??:0 2025-12-04T13:41:45.8329722Z 2025-12-04T13:41:45.8329728Z 2025-12-04T13:41:45.8329988Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.8329994Z 2025-12-04T13:41:45.8330114Z Traceback (most recent call last): 2025-12-04T13:41:45.8330610Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.8330819Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.8331128Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.8331312Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.8331648Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.8331807Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.8332713Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8333480Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8333598Z C++ CapturedTraceback: 2025-12-04T13:41:45.8334921Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8335423Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8335760Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8336569Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8336978Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8337414Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8337696Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8338382Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8340842Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8341121Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8341331Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8343394Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8343715Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8344271Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8346546Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8346740Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8347145Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8347475Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8347777Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8348183Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8348566Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8348858Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8349129Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8349503Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8349796Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8350061Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8350432Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8350683Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8351059Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8351315Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8351697Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8351982Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8352351Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8352772Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8353142Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8353555Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8353921Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8354356Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8354770Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8355177Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8355586Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8355987Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8356351Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8356653Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8356908Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8357278Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8357634Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8357938Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8358239Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8358542Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8358943Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8359325Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8359727Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8360103Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8360360Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8360727Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8361139Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8361507Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8361919Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8362287Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8362630Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8362938Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8363227Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8363490Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8363786Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8364152Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8364572Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8364941Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8365346Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8365724Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8366009Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8366387Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8366818Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8367184Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8367630Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8367999Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8368253Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8368632Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8369033Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8369412Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8369823Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8370186Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8370544Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8370845Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8371144Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8371446Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8371851Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8372238Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8372495Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8372877Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8373280Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8373648Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8374063Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8374432Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8374778Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8375094Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8375390Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8375830Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8376242Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8376626Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8377047Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8377420Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8377838Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8378320Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8378587Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8379007Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8379415Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8379834Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8380245Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8380621Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8380984Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8381289Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8381585Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8381903Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8382312Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8382698Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8383107Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8383485Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8383899Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8384276Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8384578Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8384887Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8385151Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8385446Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8385798Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8386122Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8386417Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8386685Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8386958Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8387150Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8387251Z #134 _start from ??:0 2025-12-04T13:41:45.8387422Z #135 from ??:0 2025-12-04T13:41:45.8387506Z " 2025-12-04T13:41:45.8387511Z 2025-12-04T13:41:45.8387880Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.8388492Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8388498Z 2025-12-04T13:41:45.8388767Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.8389006Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.8390035Z /var/lib/jenkins/workspace/test/test_linalg.py:2965: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.8390235Z V = torch.linalg.eig(A).eigenvectors 2025-12-04T13:41:45.8390427Z =================================== FAILURES =================================== 2025-12-04T13:41:45.8390752Z _ TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 _ 2025-12-04T13:41:45.8391522Z RuntimeError: Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8392334Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8392444Z C++ CapturedTraceback: 2025-12-04T13:41:45.8393779Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8394274Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8394621Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8395429Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8395843Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8396281Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8396560Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8397183Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8399586Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8399873Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8400038Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8402070Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8402398Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8402957Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8405292Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8405508Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8405912Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8406229Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8406542Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8406947Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8407331Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8407624Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8407883Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8408266Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8408554Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8408812Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8409191Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8409439Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8409819Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8410075Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8410443Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8410714Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8411082Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8411502Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8411871Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8412273Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8412650Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8413055Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8413424Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8413863Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8414230Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8414645Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8415010Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8415299Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8415561Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8415957Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8416314Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8416645Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8416939Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8417281Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8417683Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8418066Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8418542Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8418911Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8419180Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8419550Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8419952Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8420330Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8420736Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8421115Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8421463Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8421769Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8422075Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8422340Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8422605Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8422971Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8423381Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8423756Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8424157Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8424532Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8424790Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8425156Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8425607Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8425974Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8426379Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8426755Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8427005Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8427380Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8427780Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8428174Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8428622Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8428990Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8429389Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8429691Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8429980Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8430289Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8430693Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8431070Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8431327Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8431695Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8432107Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8432479Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8432881Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8433256Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8433602Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8433915Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8434209Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8434516Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8434937Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8435314Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8435736Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8436112Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8436523Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8436905Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8437171Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8437582Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8437994Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8438372Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8438792Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8439165Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8439520Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8439869Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8440161Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8440504Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8440914Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8441317Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8441731Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8442106Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8442519Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8442894Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8443184Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8443501Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8443770Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8444057Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8444411Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8444732Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8445023Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8445290Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8445555Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8445754Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8445851Z #134 _start from ??:0 2025-12-04T13:41:45.8445975Z #135 from ??:0 2025-12-04T13:41:45.8445989Z 2025-12-04T13:41:45.8445994Z 2025-12-04T13:41:45.8446236Z During handling of the above exception, another exception occurred: 2025-12-04T13:41:45.8446243Z 2025-12-04T13:41:45.8446361Z Traceback (most recent call last): 2025-12-04T13:41:45.8446853Z File "/var/lib/jenkins/workspace/test/test_linalg.py", line 2964, in test_invariance_error_spectral_decompositions 2025-12-04T13:41:45.8447065Z with self.assertRaisesRegex(RuntimeError, "ill-defined"): 2025-12-04T13:41:45.8447372Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 239, in __exit__ 2025-12-04T13:41:45.8447555Z self._raiseFailure('"{}" does not match "{}"'.format( 2025-12-04T13:41:45.8447888Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 163, in _raiseFailure 2025-12-04T13:41:45.8448044Z raise self.test_case.failureException(msg) 2025-12-04T13:41:45.8448941Z AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8449739Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8449858Z C++ CapturedTraceback: 2025-12-04T13:41:45.8451167Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8451694Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8452031Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8452878Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8453308Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8453743Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8454020Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8454628Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8457043Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8457323Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8457498Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8459583Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8459892Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8460436Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8462710Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8462933Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8463337Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8463675Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8463979Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8464386Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8464769Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8465057Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8465352Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8465753Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8466043Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8466304Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8466702Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8466960Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8467325Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8467577Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8467950Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8468203Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8468571Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8468987Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8469353Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8469766Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8470134Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8470534Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8470912Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8471312Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8471694Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8472097Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8472465Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8472763Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8473013Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8473389Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8473736Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8474035Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8474338Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8474665Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8475067Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8475447Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8475845Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8476219Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8476472Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8476866Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8477279Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8477686Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8478101Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8478497Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8478840Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8479151Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8479439Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8479711Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8479967Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8480338Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8480751Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8481215Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8481631Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8482012Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8482263Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8482641Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8483047Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8483412Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8483826Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8484190Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8484453Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8484820Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8485217Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8485592Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8485996Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8486374Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8486762Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8487061Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8487358Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8487656Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8488209Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8488594Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8488849Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8489288Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8489735Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8490106Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8490549Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8490916Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8491268Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8491566Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8491863Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8492177Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8492590Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8492980Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8493394Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8493770Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8494187Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8494562Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8494826Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8495211Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8495621Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8496002Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8496415Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8496788Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8497151Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8497462Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8497766Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8498073Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8498566Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8499002Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8499413Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8499803Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8500210Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8500586Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8500882Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8501219Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8501486Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8501811Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8502163Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8502525Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8502818Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8503093Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8503370Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8503566Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8503665Z #134 _start from ??:0 2025-12-04T13:41:45.8503794Z #135 from ??:0 2025-12-04T13:41:45.8503885Z " 2025-12-04T13:41:45.8503890Z 2025-12-04T13:41:45.8504116Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.8504719Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8504726Z 2025-12-04T13:41:45.8504996Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.8505229Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T13:41:45.8506248Z /var/lib/jenkins/workspace/test/test_linalg.py:2965: UserWarning: Unsupported unwinding pattern: Address not in range (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/profiler/unwind/unwind.cpp:219.) 2025-12-04T13:41:45.8506387Z V = torch.linalg.eig(A).eigenvectors 2025-12-04T13:41:45.8506975Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-f88fa3b886672d5c.xml - 2025-12-04T13:41:45.8507150Z =========================== short test summary info ============================ 2025-12-04T13:41:45.8508607Z FAILED [0.1354s] test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 - AssertionError: "ill-defined" does not match "Calling torch.linalg.eig with MAGMA requires compiling PyTorch with MAGMA. Either transfer the tensor to the CPU before calling torch.linalg.eig or use cuSolver. 2025-12-04T13:41:45.8509391Z Exception raised from apply_magma_eig at /var/lib/jenkins/workspace/aten/src/ATen/native/cuda/linalg/BatchLinearAlgebra.cpp:2024 (most recent call first): 2025-12-04T13:41:45.8509506Z C++ CapturedTraceback: 2025-12-04T13:41:45.8510830Z #4 std::_Function_handler, std::allocator > > const> (), c10::SetStackTraceFetcher(std::function, std::allocator > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 2025-12-04T13:41:45.8511324Z #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) from ??:0 2025-12-04T13:41:45.8511696Z #6 c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) from ??:0 2025-12-04T13:41:45.8512501Z #7 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [clone .constprop.0] from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8512910Z #8 at::native::lazy_linalg::linalg_eig_magma(at::Tensor&, at::Tensor&, at::Tensor&, at::Tensor const&, bool) from ??:0 2025-12-04T13:41:45.8513346Z #9 at::native::linalg_eig_out_info(at::Tensor const&, at::Tensor&, at::Tensor&, at::Tensor&, bool) from BatchLinearAlgebra.cpp:0 2025-12-04T13:41:45.8513625Z #10 at::native::linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8514290Z #11 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out(at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8516705Z #12 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&, at::Tensor&, at::Tensor&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_linalg_eig_out>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&, at::Tensor&, at::Tensor&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor&, at::Tensor&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8519586Z #13 at::_ops::linalg_eig_out::call(at::Tensor const&, at::Tensor&, at::Tensor&) from ??:0 2025-12-04T13:41:45.8520175Z #14 at::native::linalg_eig(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8522508Z #15 c10::impl::wrap_kernel_functor_unboxed_ (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA__linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from RegisterCUDA_0.cpp:0 2025-12-04T13:41:45.8524962Z #16 at::_ops::linalg_eig::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 2025-12-04T13:41:45.8525943Z #17 torch::autograd::VariableType::(anonymous namespace)::linalg_eig(c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8528960Z #18 c10::impl::wrap_kernel_functor_unboxed_ (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::linalg_eig>, std::tuple, c10::guts::typelist::typelist >, std::tuple (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&) from VariableType_2.cpp:0 2025-12-04T13:41:45.8532609Z #19 at::_ops::linalg_eig::call(at::Tensor const&) from ??:0 2025-12-04T13:41:45.8533326Z #20 torch::autograd::THPVariable_linalg_eig(_object*, _object*, _object*) from python_linalg_functions.cpp:0 2025-12-04T13:41:45.8534198Z #21 cfunction_call from /usr/local/src/conda/python-3.10.14/Objects/methodobject.c:543 2025-12-04T13:41:45.8534966Z #22 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8535822Z #23 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8536740Z #24 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8537546Z #25 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8538417Z #26 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8539197Z #27 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8540003Z #28 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8540696Z #29 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8541471Z #30 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8542234Z #31 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8543004Z #32 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8544127Z #33 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8545748Z #34 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8546518Z #35 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8547291Z #36 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8548257Z #37 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8549176Z #38 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8550081Z #39 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8551000Z #40 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8551923Z #41 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8552832Z #42 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8553758Z #43 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8554676Z #44 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8555601Z #45 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8556581Z #46 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8557675Z #47 PyVectorcall_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:267 2025-12-04T13:41:45.8558549Z #48 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8559326Z #49 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8560179Z #50 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8560974Z #51 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8561710Z #52 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8562448Z #53 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8563352Z #54 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8564329Z #55 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8565250Z #56 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8566171Z #57 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8566948Z #58 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8567716Z #59 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8568702Z #60 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8569629Z #61 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8570542Z #62 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8571470Z #63 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8572336Z #64 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8573140Z #65 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8573908Z #66 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8574622Z #67 _PyObject_Call from /usr/local/src/conda/python-3.10.14/Objects/call.c:305 2025-12-04T13:41:45.8575327Z #68 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8576103Z #69 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8577047Z #70 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8577976Z #71 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8578983Z #72 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8579894Z #73 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8580671Z #74 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8581445Z #75 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8582377Z #76 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8583290Z #77 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8584220Z #78 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8585137Z #79 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8585901Z #80 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8586661Z #81 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8587584Z #82 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8588677Z #83 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8589600Z #84 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8590516Z #85 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8591378Z #86 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8592173Z #87 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8592907Z #88 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8593648Z #89 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8594503Z #90 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8595420Z #91 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8596188Z #92 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8597039Z #93 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8597964Z #94 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8598885Z #95 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8599800Z #96 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8600723Z #97 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8601584Z #98 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8602416Z #99 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8603152Z #100 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8603952Z #101 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8604823Z #102 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8605791Z #103 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8606730Z #104 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8607672Z #105 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8608616Z #106 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8609546Z #107 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8610336Z #108 do_call_core from /usr/local/src/conda/python-3.10.14/Python/ceval.c:5945 2025-12-04T13:41:45.8611129Z #109 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8612061Z #110 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8612985Z #111 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8613922Z #112 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8614858Z #113 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8615735Z #114 _PyObject_FastCallDictTstate from /usr/local/src/conda/python-3.10.14/Objects/call.c:153 2025-12-04T13:41:45.8616531Z #115 _PyObject_Call_Prepend from /usr/local/src/conda/python-3.10.14/Objects/call.c:431 2025-12-04T13:41:45.8617284Z #116 slot_tp_call from /usr/local/src/conda/python-3.10.14/Objects/typeobject.c:7494 2025-12-04T13:41:45.8618040Z #117 _PyObject_MakeTpCall from /usr/local/src/conda/python-3.10.14/Objects/call.c:215 2025-12-04T13:41:45.8618970Z #118 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:112 2025-12-04T13:41:45.8619913Z #119 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8620849Z #120 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8621787Z #121 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8622843Z #122 _PyObject_VectorcallTstate from /usr/local/src/conda/python-3.10.14/Include/cpython/abstract.h:114 2025-12-04T13:41:45.8623775Z #123 _PyEval_EvalFrame from /usr/local/src/conda/python-3.10.14/Include/internal/pycore_ceval.h:46 2025-12-04T13:41:45.8624599Z #124 PyEval_EvalCode from /usr/local/src/conda/python-3.10.14/Python/ceval.c:1134 2025-12-04T13:41:45.8625344Z #125 run_eval_code_obj from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1291 2025-12-04T13:41:45.8626112Z #126 run_mod from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1312 2025-12-04T13:41:45.8626807Z #127 pyrun_file from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:1208 2025-12-04T13:41:45.8627590Z #128 _PyRun_SimpleFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:456 2025-12-04T13:41:45.8628404Z #129 _PyRun_AnyFileObject from /usr/local/src/conda/python-3.10.14/Python/pythonrun.c:90 2025-12-04T13:41:45.8629162Z #130 pymain_run_file_obj from /usr/local/src/conda/python-3.10.14/Modules/main.c:357 2025-12-04T13:41:45.8629865Z #131 Py_BytesMain from /usr/local/src/conda/python-3.10.14/Modules/main.c:1090 2025-12-04T13:41:45.8630549Z #132 __libc_start_call_main from ./csu/../sysdeps/nptl/libc_start_call_main.h:58 2025-12-04T13:41:45.8631175Z #133 __libc_start_main_impl from ./csu/../csu/libc-start.c:392 2025-12-04T13:41:45.8631606Z #134 _start from ??:0 2025-12-04T13:41:45.8631925Z #135 from ??:0 2025-12-04T13:41:45.8632240Z " 2025-12-04T13:41:45.8632368Z 2025-12-04T13:41:45.8632587Z To execute this test, run the following from the base repo dir: 2025-12-04T13:41:45.8633640Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_linalg.py TestLinalgCUDA.test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8634366Z 2025-12-04T13:41:45.8634638Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T13:41:45.8635214Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:41:45.8635744Z ================= 1 failed, 1262 deselected, 2 rerun in 17.17s ================= 2025-12-04T13:41:45.8636194Z Got exit code 1 2025-12-04T13:41:45.8636867Z FAILED CONSISTENTLY: test/test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128 2025-12-04T13:41:45.8637912Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T13:41:45.8638876Z Test results will be stored in test-reports/python-pytest/test_linalg/test_linalg-83ecadeff448aeea.xml 2025-12-04T13:41:45.8639595Z ============================= test session starts ============================== 2025-12-04T13:41:45.8640259Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:41:45.8640856Z cachedir: .pytest_cache 2025-12-04T13:41:45.8641567Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:41:45.8642347Z rootdir: /var/lib/jenkins/workspace 2025-12-04T13:41:45.8642687Z configfile: pytest.ini 2025-12-04T13:41:45.8643463Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T13:41:45.8644423Z collecting ... collected 1263 items / 693 deselected / 570 selected 2025-12-04T13:41:45.8644927Z stepcurrent: skipping 693 already run items. 2025-12-04T13:41:45.8645318Z Running 570 items in this shard 2025-12-04T13:41:45.8645539Z 2025-12-04T13:41:45.8645854Z test_linalg.py::TestLinalgCUDA::test_inverse_cuda_complex128 PASSED [1.2806s] [ 0%] 2025-12-04T13:41:45.8646621Z test_linalg.py::TestLinalgCUDA::test_inverse_cuda_complex64 PASSED [0.1345s] [ 0%] 2025-12-04T13:41:45.8647354Z test_linalg.py::TestLinalgCUDA::test_inverse_cuda_float32 PASSED [0.1219s] [ 0%] 2025-12-04T13:41:45.8648082Z test_linalg.py::TestLinalgCUDA::test_inverse_cuda_float64 PASSED [0.1370s] [ 0%] 2025-12-04T13:41:45.8648857Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_complex128 PASSED [0.0061s] [ 0%] 2025-12-04T13:41:45.8649673Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_complex64 PASSED [0.0046s] [ 1%] 2025-12-04T13:41:45.8650474Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_float32 PASSED [0.0043s] [ 1%] 2025-12-04T13:41:45.8651257Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_float64 PASSED [0.0044s] [ 1%] 2025-12-04T13:41:45.8652149Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_complex128 PASSED [1.6498s] [ 1%] 2025-12-04T13:41:45.8653018Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_complex64 PASSED [0.1945s] [ 1%] 2025-12-04T13:41:45.8653882Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_float32 PASSED [0.1513s] [ 1%] 2025-12-04T13:41:45.8654730Z test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_float64 PASSED [0.6123s] [ 2%] 2025-12-04T13:41:45.8655873Z test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_complex128 SKIPPED [0.0018s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 2%] 2025-12-04T13:41:45.8657329Z test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_complex64 SKIPPED [0.0019s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 2%] 2025-12-04T13:41:45.8658858Z test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 2%] 2025-12-04T13:41:45.8660257Z test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_float64 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 2%] 2025-12-04T13:41:45.8661352Z test_linalg.py::TestLinalgCUDA::test_kron_cuda_complex128 PASSED [0.0208s] [ 2%] 2025-12-04T13:41:45.8662064Z test_linalg.py::TestLinalgCUDA::test_kron_cuda_complex64 PASSED [0.0204s] [ 3%] 2025-12-04T13:41:45.8662777Z test_linalg.py::TestLinalgCUDA::test_kron_cuda_float32 PASSED [0.0189s] [ 3%] 2025-12-04T13:41:45.8663486Z test_linalg.py::TestLinalgCUDA::test_kron_cuda_float64 PASSED [0.0183s] [ 3%] 2025-12-04T13:41:45.8664231Z test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_complex128 PASSED [0.0033s] [ 3%] 2025-12-04T13:41:45.8664997Z test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_complex64 PASSED [0.0031s] [ 3%] 2025-12-04T13:41:45.8665754Z test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_float32 PASSED [0.0031s] [ 4%] 2025-12-04T13:41:45.8666503Z test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_float64 PASSED [0.0034s] [ 4%] 2025-12-04T13:41:45.8667341Z test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_complex128 PASSED [0.0025s] [ 4%] 2025-12-04T13:41:45.8668242Z test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_complex64 PASSED [0.0021s] [ 4%] 2025-12-04T13:41:45.8669135Z test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_float32 PASSED [0.0021s] [ 4%] 2025-12-04T13:41:45.8670013Z test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_float64 PASSED [0.0021s] [ 4%] 2025-12-04T13:41:45.8670941Z test_linalg.py::TestLinalgCUDA::test_lapack_empty_cuda SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.8671926Z test_linalg.py::TestLinalgCUDA::test_large_bmm_backward_cuda SKIPPED [0.2030s] (Insufficient cuda memory) [ 5%] 2025-12-04T13:41:45.8672954Z test_linalg.py::TestLinalgCUDA::test_large_bmm_mm_backward_cuda SKIPPED [0.1326s] (Insufficient cuda memory) [ 5%] 2025-12-04T13:41:45.8674001Z test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_complex128 SKIPPED [0.0017s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.8675047Z test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_complex64 SKIPPED [0.0016s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.8676083Z test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 5%] 2025-12-04T13:41:45.8677100Z test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.8678130Z test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_complex128 SKIPPED [0.0016s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.8679157Z test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.8680178Z test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.8681218Z test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_float64 SKIPPED [0.0017s] (no MAGMA library detected) [ 6%] 2025-12-04T13:41:45.8682114Z test_linalg.py::TestLinalgCUDA::test_linalg_cross_cuda_complex64 PASSED [0.0044s] [ 7%] 2025-12-04T13:41:45.8682885Z test_linalg.py::TestLinalgCUDA::test_linalg_cross_cuda_float32 PASSED [0.0026s] [ 7%] 2025-12-04T13:41:45.8683763Z test_linalg.py::TestLinalgCUDA::test_linalg_cross_with_and_without_dim_cuda_complex64 PASSED [0.0025s] [ 7%] 2025-12-04T13:41:45.8684737Z test_linalg.py::TestLinalgCUDA::test_linalg_cross_with_and_without_dim_cuda_float32 PASSED [0.0026s] [ 7%] 2025-12-04T13:41:45.8685862Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 7%] 2025-12-04T13:41:45.8687158Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 7%] 2025-12-04T13:41:45.8688684Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 8%] 2025-12-04T13:41:45.8689928Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 8%] 2025-12-04T13:41:45.8691136Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 8%] 2025-12-04T13:41:45.8692209Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_complex64 SKIPPED [0.0017s] (no MAGMA library detected) [ 8%] 2025-12-04T13:41:45.8693272Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 8%] 2025-12-04T13:41:45.8694322Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 8%] 2025-12-04T13:41:45.8695448Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 9%] 2025-12-04T13:41:45.8696637Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 9%] 2025-12-04T13:41:45.8697805Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 9%] 2025-12-04T13:41:45.8699057Z test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 9%] 2025-12-04T13:41:45.8700167Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_complex128 SKIPPED [0.0014s] (Only runs on cpu) [ 9%] 2025-12-04T13:41:45.8701207Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_complex64 SKIPPED [0.0015s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.8702234Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_float32 SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.8703249Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_float64 SKIPPED [0.0014s] (Only runs on cpu) [ 10%] 2025-12-04T13:41:45.8704186Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_complex128 PASSED [1.3327s] [ 10%] 2025-12-04T13:41:45.8705013Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_complex64 PASSED [1.1443s] [ 10%] 2025-12-04T13:41:45.8705824Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_float32 PASSED [1.0343s] [ 10%] 2025-12-04T13:41:45.8706632Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_float64 PASSED [1.4182s] [ 11%] 2025-12-04T13:41:45.8708463Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_complex128 [W1204 13:38:24.843324892 Context.cpp:415] Warning: torch.backends.cuda.preferred_linalg_library is an experimental feature. If you see any error or unexpected behavior when this flag is set please file an issue on GitHub. (function operator()) 2025-12-04T13:41:45.8710068Z PASSED [0.0791s] [ 11%] 2025-12-04T13:41:45.8710579Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_complex64 PASSED [0.0653s] [ 11%] 2025-12-04T13:41:45.8711434Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_float32 PASSED [0.0575s] [ 11%] 2025-12-04T13:41:45.8712232Z test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_float64 PASSED [0.0614s] [ 11%] 2025-12-04T13:41:45.8713365Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_complex128 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 11%] 2025-12-04T13:41:45.8714847Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_complex64 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 12%] 2025-12-04T13:41:45.8716347Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 12%] 2025-12-04T13:41:45.8717827Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_analytic_cuda_float64 SKIPPED [0.0018s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 12%] 2025-12-04T13:41:45.8719128Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_batch_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 12%] 2025-12-04T13:41:45.8720293Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_batch_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 12%] 2025-12-04T13:41:45.8721501Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 12%] 2025-12-04T13:41:45.8722765Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 13%] 2025-12-04T13:41:45.8724015Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 13%] 2025-12-04T13:41:45.8725244Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 13%] 2025-12-04T13:41:45.8726516Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 13%] 2025-12-04T13:41:45.8727821Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 13%] 2025-12-04T13:41:45.8729105Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 14%] 2025-12-04T13:41:45.8730366Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 14%] 2025-12-04T13:41:45.8731566Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_no_warnings_cuda_complex64 SKIPPED [0.0014s] (Only runs on cpu) [ 14%] 2025-12-04T13:41:45.8732791Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 14%] 2025-12-04T13:41:45.8734104Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_complex64 SKIPPED [0.0018s] (no MAGMA library detected) [ 14%] 2025-12-04T13:41:45.8735380Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 14%] 2025-12-04T13:41:45.8736654Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 15%] 2025-12-04T13:41:45.8737878Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_utils_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 15%] 2025-12-04T13:41:45.8739117Z test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_utils_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 15%] 2025-12-04T13:41:45.8740096Z test_linalg.py::TestLinalgCUDA::test_linalg_qr_autograd_cuda_float32 PASSED [0.0066s] [ 15%] 2025-12-04T13:41:45.8741042Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_complex128 PASSED [0.0255s] [ 15%] 2025-12-04T13:41:45.8742126Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_complex64 PASSED [0.0244s] [ 15%] 2025-12-04T13:41:45.8743174Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_float32 PASSED [0.0206s] [ 16%] 2025-12-04T13:41:45.8744181Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_broadcasting_cuda_float64 PASSED [0.0206s] [ 16%] 2025-12-04T13:41:45.8745149Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_complex128 PASSED [6.2282s] [ 16%] 2025-12-04T13:41:45.8746063Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_complex64 PASSED [3.7891s] [ 16%] 2025-12-04T13:41:45.8746960Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_float32 PASSED [0.8383s] [ 16%] 2025-12-04T13:41:45.8747862Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_float64 PASSED [1.0022s] [ 17%] 2025-12-04T13:41:45.8749104Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_complex128 SKIPPED [0.0017s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 17%] 2025-12-04T13:41:45.8750674Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_complex64 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 17%] 2025-12-04T13:41:45.8752170Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 17%] 2025-12-04T13:41:45.8753662Z test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_float64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 17%] 2025-12-04T13:41:45.8754821Z test_linalg.py::TestLinalgCUDA::test_linear_algebra_scalar_raises_cuda PASSED [0.0028s] [ 17%] 2025-12-04T13:41:45.8755779Z test_linalg.py::TestLinalgCUDA::test_lobpcg_basic_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 18%] 2025-12-04T13:41:45.8756695Z test_linalg.py::TestLinalgCUDA::test_lobpcg_ortho_cuda_float64 PASSED [18.0851s] [ 18%] 2025-12-04T13:41:45.8757567Z test_linalg.py::TestLinalgCUDA::test_lobpcg_scipy_cuda_float64 SKIPPED [0.0018s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.8758553Z test_linalg.py::TestLinalgCUDA::test_lobpcg_torchscript_cuda_float64 SKIPPED [0.0016s] (Only runs on cpu) [ 18%] 2025-12-04T13:41:45.8759510Z test_linalg.py::TestLinalgCUDA::test_logaddexp_cpu_vs_cuda_complex_cuda PASSED [0.0064s] [ 18%] 2025-12-04T13:41:45.8760442Z test_linalg.py::TestLinalgCUDA::test_lower_precision_accumulation_with_ref_path_cuda PASSED [0.1488s] [ 18%] 2025-12-04T13:41:45.8761318Z test_linalg.py::TestLinalgCUDA::test_lstsq_removed_error_cuda PASSED [0.0021s] [ 19%] 2025-12-04T13:41:45.8762192Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_complex128 PASSED [0.0151s] [ 19%] 2025-12-04T13:41:45.8763160Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_complex64 PASSED [0.0075s] [ 19%] 2025-12-04T13:41:45.8764102Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_float32 PASSED [0.0060s] [ 19%] 2025-12-04T13:41:45.8765028Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_float64 PASSED [0.0074s] [ 19%] 2025-12-04T13:41:45.8765918Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_complex128 PASSED [0.0281s] [ 20%] 2025-12-04T13:41:45.8766747Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_complex64 PASSED [0.0135s] [ 20%] 2025-12-04T13:41:45.8767569Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_float32 PASSED [0.0125s] [ 20%] 2025-12-04T13:41:45.8768357Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_float64 PASSED [0.0146s] [ 20%] 2025-12-04T13:41:45.8769524Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_complex128 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 20%] 2025-12-04T13:41:45.8771029Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_complex64 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 20%] 2025-12-04T13:41:45.8772556Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_float32 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 21%] 2025-12-04T13:41:45.8774020Z test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_float64 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 21%] 2025-12-04T13:41:45.8775137Z test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_complex128 PASSED [0.0243s] [ 21%] 2025-12-04T13:41:45.8775898Z test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_complex64 PASSED [0.0100s] [ 21%] 2025-12-04T13:41:45.8776673Z test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_float32 PASSED [0.0089s] [ 21%] 2025-12-04T13:41:45.8777395Z test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_float64 PASSED [0.0113s] [ 21%] 2025-12-04T13:41:45.8778453Z test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 22%] 2025-12-04T13:41:45.8779632Z test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 22%] 2025-12-04T13:41:45.8780822Z test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 22%] 2025-12-04T13:41:45.8781954Z test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 22%] 2025-12-04T13:41:45.8783083Z test_linalg.py::TestLinalgCUDA::test_lu_unpack_check_input_cuda_float64 SKIPPED [0.0019s] (no MAGMA library detected) [ 22%] 2025-12-04T13:41:45.8784002Z test_linalg.py::TestLinalgCUDA::test_matmul_45724_cuda PASSED [0.5509s] [ 22%] 2025-12-04T13:41:45.8784833Z test_linalg.py::TestLinalgCUDA::test_matmul_check_entries_tunableop_cuda_float16 PASSED [0.0135s] [ 23%] 2025-12-04T13:41:45.8786027Z test_linalg.py::TestLinalgCUDA::test_matmul_empty_existing_file_tunableop_cuda_float32 SKIPPED [0.0016s] (test doesn't currently work on the CUDA stack) [ 23%] 2025-12-04T13:41:45.8787273Z test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_bfloat16 SKIPPED [0.1741s] (Insufficient cuda:0 memory) [ 23%] 2025-12-04T13:41:45.8788512Z test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_float16 SKIPPED [0.1261s] (Insufficient cuda:0 memory) [ 23%] 2025-12-04T13:41:45.8789539Z test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_float32 SKIPPED [0.1232s] (Insufficient cuda:0 memory) [ 23%] 2025-12-04T13:41:45.8790645Z test_linalg.py::TestLinalgCUDA::test_matmul_offline_mgpu_tunableop_cuda_float32 SKIPPED [0.0003s] (Requires at least 2 GPUs) [ 24%] 2025-12-04T13:41:45.8791929Z test_linalg.py::TestLinalgCUDA::test_matmul_offline_tunableop_cuda_float16 SKIPPED [0.0017s] (test doesn't currently work on the CUDA stack) [ 24%] 2025-12-04T13:41:45.8793125Z test_linalg.py::TestLinalgCUDA::test_matmul_out_kernel_errors_with_autograd_cuda_complex64 PASSED [0.0029s] [ 24%] 2025-12-04T13:41:45.8794157Z test_linalg.py::TestLinalgCUDA::test_matmul_out_kernel_errors_with_autograd_cuda_float32 PASSED [0.0023s] [ 24%] 2025-12-04T13:41:45.8796117Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_1d_Nd_cuda_complex64 [W1204 13:38:56.128182004 Context.cpp:525] Warning: torch.backends.cuda.preferred_blas_library is an experimental feature. If you see any error or unexpected behavior when this flag is set please file an issue on GitHub. (function operator()) 2025-12-04T13:41:45.8797753Z PASSED [1.0184s] [ 24%] 2025-12-04T13:41:45.8798332Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_1d_Nd_cuda_float32 PASSED [0.9604s] [ 24%] 2025-12-04T13:41:45.8799282Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_2d_Nd_cuda_complex64 PASSED [3.9484s] [ 25%] 2025-12-04T13:41:45.8800216Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_2d_Nd_cuda_float32 PASSED [3.6640s] [ 25%] 2025-12-04T13:41:45.8801226Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_3d_Nd_cuda_complex64 PASSED [4.5770s] [ 25%] 2025-12-04T13:41:45.8802172Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_3d_Nd_cuda_float32 PASSED [4.2477s] [ 25%] 2025-12-04T13:41:45.8803361Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float16 SKIPPED [0.0018s] (test doesn't currently work on the CUDA stack) [ 25%] 2025-12-04T13:41:45.8804772Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float32 SKIPPED [0.0016s] (test doesn't currently work on the CUDA stack) [ 25%] 2025-12-04T13:41:45.8806197Z test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_tunableop_cuda_float64 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 26%] 2025-12-04T13:41:45.8807611Z test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 26%] 2025-12-04T13:41:45.8808974Z test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 26%] 2025-12-04T13:41:45.8810276Z test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_float32 SKIPPED [0.0018s] (no MAGMA library detected) [ 26%] 2025-12-04T13:41:45.8811592Z test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 26%] 2025-12-04T13:41:45.8812614Z test_linalg.py::TestLinalgCUDA::test_matrix_norm_cuda_float32 PASSED [0.0031s] [ 27%] 2025-12-04T13:41:45.8813378Z test_linalg.py::TestLinalgCUDA::test_matrix_norm_cuda_float64 PASSED [0.0025s] [ 27%] 2025-12-04T13:41:45.8814332Z test_linalg.py::TestLinalgCUDA::test_matrix_power_negative_cuda_complex128 PASSED [0.0443s] [ 27%] 2025-12-04T13:41:45.8815208Z test_linalg.py::TestLinalgCUDA::test_matrix_power_negative_cuda_float64 PASSED [0.0263s] [ 27%] 2025-12-04T13:41:45.8816101Z test_linalg.py::TestLinalgCUDA::test_matrix_power_non_negative_cuda_complex128 PASSED [0.0143s] [ 27%] 2025-12-04T13:41:45.8817012Z test_linalg.py::TestLinalgCUDA::test_matrix_power_non_negative_cuda_float64 PASSED [0.0119s] [ 27%] 2025-12-04T13:41:45.8818004Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 28%] 2025-12-04T13:41:45.8819212Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 28%] 2025-12-04T13:41:45.8820302Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 28%] 2025-12-04T13:41:45.8821375Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 28%] 2025-12-04T13:41:45.8822468Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_rtol_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 28%] 2025-12-04T13:41:45.8823590Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 28%] 2025-12-04T13:41:45.8824720Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 29%] 2025-12-04T13:41:45.8825827Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 29%] 2025-12-04T13:41:45.8826901Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 29%] 2025-12-04T13:41:45.8827976Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 29%] 2025-12-04T13:41:45.8829041Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 29%] 2025-12-04T13:41:45.8830094Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 30%] 2025-12-04T13:41:45.8831116Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 30%] 2025-12-04T13:41:45.8832243Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 30%] 2025-12-04T13:41:45.8833363Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 30%] 2025-12-04T13:41:45.8834462Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 30%] 2025-12-04T13:41:45.8835538Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_empty_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 30%] 2025-12-04T13:41:45.8836727Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 31%] 2025-12-04T13:41:45.8838041Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 31%] 2025-12-04T13:41:45.8839337Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 31%] 2025-12-04T13:41:45.8840620Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 31%] 2025-12-04T13:41:45.8841643Z test_linalg.py::TestLinalgCUDA::test_matrix_rank_removed_error_cuda PASSED [0.0018s] [ 31%] 2025-12-04T13:41:45.8842554Z test_linalg.py::TestLinalgCUDA::test_minimum_tuning_iteration_tunableop_cuda_float16 PASSED [0.0901s] [ 31%] 2025-12-04T13:41:45.8843457Z test_linalg.py::TestLinalgCUDA::test_mm_bmm_non_memory_dense_cuda PASSED [0.0053s] [ 32%] 2025-12-04T13:41:45.8844212Z test_linalg.py::TestLinalgCUDA::test_mm_conjtranspose_cuda PASSED [0.0034s] [ 32%] 2025-12-04T13:41:45.8845219Z test_linalg.py::TestLinalgCUDA::test_mm_cuda_complex128 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 32%] 2025-12-04T13:41:45.8846472Z test_linalg.py::TestLinalgCUDA::test_mm_cuda_complex64 SKIPPED [0.0015s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 32%] 2025-12-04T13:41:45.8847701Z test_linalg.py::TestLinalgCUDA::test_mm_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 32%] 2025-12-04T13:41:45.8848918Z test_linalg.py::TestLinalgCUDA::test_mm_cuda_float64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 32%] 2025-12-04T13:41:45.8849965Z test_linalg.py::TestLinalgCUDA::test_mm_empty_inputs_mixed_dtype_errors_cuda PASSED [0.0021s] [ 33%] 2025-12-04T13:41:45.8851123Z test_linalg.py::TestLinalgCUDA::test_mm_submatrix_offline_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 33%] 2025-12-04T13:41:45.8852207Z test_linalg.py::TestLinalgCUDA::test_multi_dot_cuda_complex128 PASSED [0.0107s] [ 33%] 2025-12-04T13:41:45.8852974Z test_linalg.py::TestLinalgCUDA::test_multi_dot_cuda_float64 PASSED [0.0089s] [ 33%] 2025-12-04T13:41:45.8853751Z test_linalg.py::TestLinalgCUDA::test_multi_dot_errors_cuda_float32 PASSED [0.0042s] [ 33%] 2025-12-04T13:41:45.8854594Z test_linalg.py::TestLinalgCUDA::test_norm_bfloat16_and_half_cuda_bfloat16 PASSED [0.0215s] [ 34%] 2025-12-04T13:41:45.8855456Z test_linalg.py::TestLinalgCUDA::test_norm_bfloat16_and_half_cuda_float16 PASSED [0.0214s] [ 34%] 2025-12-04T13:41:45.8856424Z test_linalg.py::TestLinalgCUDA::test_norm_complex_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 34%] 2025-12-04T13:41:45.8857500Z test_linalg.py::TestLinalgCUDA::test_norm_complex_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 34%] 2025-12-04T13:41:45.8858603Z test_linalg.py::TestLinalgCUDA::test_norm_complex_old_cuda SKIPPED [0.0014s] (no MAGMA library detected) [ 34%] 2025-12-04T13:41:45.8859573Z test_linalg.py::TestLinalgCUDA::test_norm_complexhalf_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 34%] 2025-12-04T13:41:45.8860412Z test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_bfloat16 PASSED [0.0829s] [ 35%] 2025-12-04T13:41:45.8861229Z test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_complex128 PASSED [0.2850s] [ 35%] 2025-12-04T13:41:45.8862007Z test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_complex64 PASSED [0.1925s] [ 35%] 2025-12-04T13:41:45.8862765Z test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float16 PASSED [0.0811s] [ 35%] 2025-12-04T13:41:45.8863507Z test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float32 PASSED [0.1316s] [ 35%] 2025-12-04T13:41:45.8864255Z test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_float64 PASSED [0.0799s] [ 35%] 2025-12-04T13:41:45.8865009Z test_linalg.py::TestLinalgCUDA::test_norm_errors_cuda_float32 PASSED [0.0083s] [ 36%] 2025-12-04T13:41:45.8865759Z test_linalg.py::TestLinalgCUDA::test_norm_errors_cuda_float64 PASSED [0.0065s] [ 36%] 2025-12-04T13:41:45.8866683Z test_linalg.py::TestLinalgCUDA::test_norm_extreme_values_cuda SKIPPED [0.0014s] (no MAGMA library detected) [ 36%] 2025-12-04T13:41:45.8867610Z test_linalg.py::TestLinalgCUDA::test_norm_fastpaths_cuda PASSED [0.0036s] [ 36%] 2025-12-04T13:41:45.8868420Z test_linalg.py::TestLinalgCUDA::test_norm_fro_2_equivalence_old_cuda_float32 PASSED [0.1322s] [ 36%] 2025-12-04T13:41:45.8869341Z test_linalg.py::TestLinalgCUDA::test_norm_fused_type_promotion_cuda_bfloat16 PASSED [0.0300s] [ 37%] 2025-12-04T13:41:45.8870238Z test_linalg.py::TestLinalgCUDA::test_norm_fused_type_promotion_cuda_float16 PASSED [0.0093s] [ 37%] 2025-12-04T13:41:45.8871203Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_cuda_float32 SKIPPED [0.0016s] (no MAGMA library detected) [ 37%] 2025-12-04T13:41:45.8872231Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_cuda_float64 SKIPPED [0.0015s] (no MAGMA library detected) [ 37%] 2025-12-04T13:41:45.8873485Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_complex128 SKIPPED [0.0014s] (Numpy changed handling of degenerate inputs in 2.3.0) [ 37%] 2025-12-04T13:41:45.8874974Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_complex64 SKIPPED [0.0014s] (Numpy changed handling of degenerate inputs in 2.3.0) [ 37%] 2025-12-04T13:41:45.8876434Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_float32 SKIPPED [0.0014s] (Numpy changed handling of degenerate inputs in 2.3.0) [ 38%] 2025-12-04T13:41:45.8877890Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_float64 SKIPPED [0.0018s] (Numpy changed handling of degenerate inputs in 2.3.0) [ 38%] 2025-12-04T13:41:45.8879284Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.8880614Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.8881926Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.8883232Z test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 38%] 2025-12-04T13:41:45.8884346Z test_linalg.py::TestLinalgCUDA::test_norm_old_cuda SKIPPED [0.0014s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.8885204Z test_linalg.py::TestLinalgCUDA::test_norm_old_nan_propagation_cuda PASSED [0.0092s] [ 39%] 2025-12-04T13:41:45.8885991Z test_linalg.py::TestLinalgCUDA::test_norm_vector_cuda_float32 PASSED [0.1106s] [ 39%] 2025-12-04T13:41:45.8886749Z test_linalg.py::TestLinalgCUDA::test_norm_vector_cuda_float64 PASSED [0.6208s] [ 39%] 2025-12-04T13:41:45.8887922Z test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_complex128 SKIPPED [0.0017s] (no MAGMA library detected) [ 39%] 2025-12-04T13:41:45.8889174Z test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_complex64 SKIPPED [0.0016s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.8890410Z test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_float32 SKIPPED [0.0015s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.8891692Z test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.8892908Z test_linalg.py::TestLinalgCUDA::test_nuclear_norm_axes_small_brute_force_old_cuda SKIPPED [0.0019s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.8894069Z test_linalg.py::TestLinalgCUDA::test_nuclear_norm_exceptions_old_cuda SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.8895173Z test_linalg.py::TestLinalgCUDA::test_nuclear_norm_out_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 40%] 2025-12-04T13:41:45.8896294Z test_linalg.py::TestLinalgCUDA::test_nuclear_norm_out_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 41%] 2025-12-04T13:41:45.8897591Z test_linalg.py::TestLinalgCUDA::test_numeric_check_leak_tunableop_rocm_cuda_bfloat16 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 41%] 2025-12-04T13:41:45.8899094Z test_linalg.py::TestLinalgCUDA::test_numerical_check_accuracy_tunableop_cuda_float16 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 41%] 2025-12-04T13:41:45.8900583Z test_linalg.py::TestLinalgCUDA::test_numerical_check_accuracy_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 41%] 2025-12-04T13:41:45.8902058Z test_linalg.py::TestLinalgCUDA::test_numerical_check_python_binding_tunableop_cuda_float16 SKIPPED [0.0019s] (test doesn't currently work on the CUDA stack) [ 41%] 2025-12-04T13:41:45.8903591Z test_linalg.py::TestLinalgCUDA::test_offline_tuning_append_to_existing_file_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 41%] 2025-12-04T13:41:45.8904943Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.8906102Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.8907242Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.8908373Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.8909708Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_many_batches_cuda_float64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 42%] 2025-12-04T13:41:45.8911065Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_complex128 SKIPPED [0.0019s] (no MAGMA library detected) [ 42%] 2025-12-04T13:41:45.8912286Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.8913482Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.8914667Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.8915787Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.8916862Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 43%] 2025-12-04T13:41:45.8917919Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.8918968Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_float64 SKIPPED [0.0018s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.8920053Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.8921185Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.8922336Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.8923446Z test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 44%] 2025-12-04T13:41:45.8924703Z test_linalg.py::TestLinalgCUDA::test_ops_append_to_existing_file_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 45%] 2025-12-04T13:41:45.8925810Z test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_complex128 PASSED [0.1582s] [ 45%] 2025-12-04T13:41:45.8926552Z test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_complex64 PASSED [0.1565s] [ 45%] 2025-12-04T13:41:45.8927310Z test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_float32 PASSED [0.1415s] [ 45%] 2025-12-04T13:41:45.8928013Z test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_float64 PASSED [0.1409s] [ 45%] 2025-12-04T13:41:45.8928866Z test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_complex128 PASSED [0.0045s] [ 45%] 2025-12-04T13:41:45.8929786Z test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_complex64 PASSED [0.0031s] [ 46%] 2025-12-04T13:41:45.8930714Z test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_float32 PASSED [0.0029s] [ 46%] 2025-12-04T13:41:45.8931610Z test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_float64 PASSED [0.0033s] [ 46%] 2025-12-04T13:41:45.8932420Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_bfloat16 PASSED [0.0083s] [ 46%] 2025-12-04T13:41:45.8933140Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_bool PASSED [0.0050s] [ 46%] 2025-12-04T13:41:45.8933858Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_complex128 PASSED [0.0099s] [ 47%] 2025-12-04T13:41:45.8934599Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_complex64 PASSED [0.0088s] [ 47%] 2025-12-04T13:41:45.8935328Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_float16 PASSED [0.0076s] [ 47%] 2025-12-04T13:41:45.8936037Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_float32 PASSED [0.0078s] [ 47%] 2025-12-04T13:41:45.8936743Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_float64 PASSED [0.0075s] [ 47%] 2025-12-04T13:41:45.8937454Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_int16 PASSED [0.0051s] [ 47%] 2025-12-04T13:41:45.8938225Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_int32 PASSED [0.0053s] [ 48%] 2025-12-04T13:41:45.8938929Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_int64 PASSED [0.0054s] [ 48%] 2025-12-04T13:41:45.8939639Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_int8 PASSED [0.0050s] [ 48%] 2025-12-04T13:41:45.8940349Z test_linalg.py::TestLinalgCUDA::test_outer_cuda_uint8 PASSED [0.0054s] [ 48%] 2025-12-04T13:41:45.8941123Z test_linalg.py::TestLinalgCUDA::test_outer_ger_addr_legacy_tests_cuda PASSED [0.0029s] [ 48%] 2025-12-04T13:41:45.8942006Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_bfloat16 PASSED [0.0020s] [ 48%] 2025-12-04T13:41:45.8942943Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_bool PASSED [0.0020s] [ 49%] 2025-12-04T13:41:45.8943890Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_complex128 PASSED [0.0021s] [ 49%] 2025-12-04T13:41:45.8944857Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_complex64 PASSED [0.0020s] [ 49%] 2025-12-04T13:41:45.8945795Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float16 PASSED [0.0021s] [ 49%] 2025-12-04T13:41:45.8946731Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float32 PASSED [0.0024s] [ 49%] 2025-12-04T13:41:45.8947662Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float64 PASSED [0.0020s] [ 50%] 2025-12-04T13:41:45.8948571Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int16 PASSED [0.0020s] [ 50%] 2025-12-04T13:41:45.8949533Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int32 PASSED [0.0020s] [ 50%] 2025-12-04T13:41:45.8950450Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int64 PASSED [0.0020s] [ 50%] 2025-12-04T13:41:45.8951357Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int8 PASSED [0.0022s] [ 50%] 2025-12-04T13:41:45.8952259Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_uint8 PASSED [0.0024s] [ 50%] 2025-12-04T13:41:45.8953171Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_bfloat16 PASSED [0.0020s] [ 51%] 2025-12-04T13:41:45.8954179Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_bool PASSED [0.0020s] [ 51%] 2025-12-04T13:41:45.8955123Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_complex128 PASSED [0.0020s] [ 51%] 2025-12-04T13:41:45.8956037Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_complex64 PASSED [0.0020s] [ 51%] 2025-12-04T13:41:45.8956975Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float16 PASSED [0.0021s] [ 51%] 2025-12-04T13:41:45.8957872Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float32 PASSED [0.0024s] [ 51%] 2025-12-04T13:41:45.8958806Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float64 PASSED [0.0020s] [ 52%] 2025-12-04T13:41:45.8959681Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int16 PASSED [0.0020s] [ 52%] 2025-12-04T13:41:45.8960555Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int32 PASSED [0.0020s] [ 52%] 2025-12-04T13:41:45.8961427Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int64 PASSED [0.0020s] [ 52%] 2025-12-04T13:41:45.8962303Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int8 PASSED [0.0020s] [ 52%] 2025-12-04T13:41:45.8963168Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_uint8 PASSED [0.0021s] [ 52%] 2025-12-04T13:41:45.8964097Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_bfloat16 PASSED [0.0024s] [ 53%] 2025-12-04T13:41:45.8965046Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_bool PASSED [0.0020s] [ 53%] 2025-12-04T13:41:45.8965991Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_complex128 PASSED [0.0020s] [ 53%] 2025-12-04T13:41:45.8966981Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_complex64 PASSED [0.0020s] [ 53%] 2025-12-04T13:41:45.8967945Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float16 PASSED [0.0021s] [ 53%] 2025-12-04T13:41:45.8968904Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float32 PASSED [0.0020s] [ 54%] 2025-12-04T13:41:45.8969851Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float64 PASSED [0.0024s] [ 54%] 2025-12-04T13:41:45.8970800Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int16 PASSED [0.0020s] [ 54%] 2025-12-04T13:41:45.8971730Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int32 PASSED [0.0020s] [ 54%] 2025-12-04T13:41:45.8972659Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int64 PASSED [0.0020s] [ 54%] 2025-12-04T13:41:45.8973573Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int8 PASSED [0.0020s] [ 54%] 2025-12-04T13:41:45.8974496Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_uint8 PASSED [0.0020s] [ 55%] 2025-12-04T13:41:45.8975438Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_bfloat16 PASSED [0.0024s] [ 55%] 2025-12-04T13:41:45.8976369Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_bool PASSED [0.0020s] [ 55%] 2025-12-04T13:41:45.8977303Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_complex128 PASSED [0.0020s] [ 55%] 2025-12-04T13:41:45.8978354Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_complex64 PASSED [0.0020s] [ 55%] 2025-12-04T13:41:45.8979359Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float16 PASSED [0.0021s] [ 55%] 2025-12-04T13:41:45.8980318Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float32 PASSED [0.0020s] [ 56%] 2025-12-04T13:41:45.8981248Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float64 PASSED [0.0020s] [ 56%] 2025-12-04T13:41:45.8982182Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int16 PASSED [0.0024s] [ 56%] 2025-12-04T13:41:45.8983101Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int32 PASSED [0.0020s] [ 56%] 2025-12-04T13:41:45.8984073Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int64 PASSED [0.0020s] [ 56%] 2025-12-04T13:41:45.8984979Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int8 PASSED [0.0020s] [ 57%] 2025-12-04T13:41:45.8985929Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_uint8 PASSED [0.0020s] [ 57%] 2025-12-04T13:41:45.8986866Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_bfloat16 PASSED [0.0021s] [ 57%] 2025-12-04T13:41:45.8987950Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_bool PASSED [0.0025s] [ 57%] 2025-12-04T13:41:45.8988886Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_complex128 PASSED [0.0021s] [ 57%] 2025-12-04T13:41:45.8989839Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_complex64 PASSED [0.0021s] [ 57%] 2025-12-04T13:41:45.8990774Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float16 PASSED [0.0021s] [ 58%] 2025-12-04T13:41:45.8998482Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float32 PASSED [0.0021s] [ 58%] 2025-12-04T13:41:45.8999436Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float64 PASSED [0.0021s] [ 58%] 2025-12-04T13:41:45.9000360Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int16 PASSED [0.0021s] [ 58%] 2025-12-04T13:41:45.9001266Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int32 PASSED [0.0027s] [ 58%] 2025-12-04T13:41:45.9002162Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int64 PASSED [0.0021s] [ 58%] 2025-12-04T13:41:45.9003066Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int8 PASSED [0.0021s] [ 59%] 2025-12-04T13:41:45.9003962Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_uint8 PASSED [0.0021s] [ 59%] 2025-12-04T13:41:45.9004865Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_bfloat16 PASSED [0.0020s] [ 59%] 2025-12-04T13:41:45.9005784Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_bool PASSED [0.0020s] [ 59%] 2025-12-04T13:41:45.9006714Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_complex128 PASSED [0.0024s] [ 59%] 2025-12-04T13:41:45.9007667Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_complex64 PASSED [0.0020s] [ 60%] 2025-12-04T13:41:45.9008591Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float16 PASSED [0.0021s] [ 60%] 2025-12-04T13:41:45.9009512Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float32 PASSED [0.0020s] [ 60%] 2025-12-04T13:41:45.9010427Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float64 PASSED [0.0020s] [ 60%] 2025-12-04T13:41:45.9011331Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int16 PASSED [0.0020s] [ 60%] 2025-12-04T13:41:45.9012221Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int32 PASSED [0.0024s] [ 60%] 2025-12-04T13:41:45.9013117Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int64 PASSED [0.0020s] [ 61%] 2025-12-04T13:41:45.9014019Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int8 PASSED [0.0020s] [ 61%] 2025-12-04T13:41:45.9015047Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_uint8 PASSED [0.0020s] [ 61%] 2025-12-04T13:41:45.9015955Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_bfloat16 PASSED [0.0020s] [ 61%] 2025-12-04T13:41:45.9016869Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_bool PASSED [0.0020s] [ 61%] 2025-12-04T13:41:45.9017796Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_complex128 PASSED [0.0020s] [ 61%] 2025-12-04T13:41:45.9018864Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_complex64 PASSED [0.0024s] [ 62%] 2025-12-04T13:41:45.9019782Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float16 PASSED [0.0021s] [ 62%] 2025-12-04T13:41:45.9020756Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float32 PASSED [0.0020s] [ 62%] 2025-12-04T13:41:45.9021190Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float64 PASSED [0.0020s] [ 62%] 2025-12-04T13:41:45.9021581Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int16 PASSED [0.0020s] [ 62%] 2025-12-04T13:41:45.9022000Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int32 PASSED [0.0020s] [ 62%] 2025-12-04T13:41:45.9022382Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int64 PASSED [0.0024s] [ 63%] 2025-12-04T13:41:45.9022755Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int8 PASSED [0.0020s] [ 63%] 2025-12-04T13:41:45.9023130Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_uint8 PASSED [0.0020s] [ 63%] 2025-12-04T13:41:45.9023525Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_bfloat16 PASSED [0.0020s] [ 63%] 2025-12-04T13:41:45.9023896Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_bool PASSED [0.0020s] [ 63%] 2025-12-04T13:41:45.9024306Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_complex128 PASSED [0.0020s] [ 64%] 2025-12-04T13:41:45.9024693Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_complex64 PASSED [0.0024s] [ 64%] 2025-12-04T13:41:45.9025072Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float16 PASSED [0.0021s] [ 64%] 2025-12-04T13:41:45.9025451Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float32 PASSED [0.0020s] [ 64%] 2025-12-04T13:41:45.9025822Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_float64 PASSED [0.0021s] [ 64%] 2025-12-04T13:41:45.9026193Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int16 PASSED [0.0020s] [ 64%] 2025-12-04T13:41:45.9026581Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int32 PASSED [0.0021s] [ 65%] 2025-12-04T13:41:45.9026947Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int64 PASSED [0.0021s] [ 65%] 2025-12-04T13:41:45.9027327Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int8 PASSED [0.0024s] [ 65%] 2025-12-04T13:41:45.9027700Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_uint8 PASSED [0.0020s] [ 65%] 2025-12-04T13:41:45.9028087Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_bfloat16 PASSED [0.0021s] [ 65%] 2025-12-04T13:41:45.9028461Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_bool PASSED [0.0020s] [ 65%] 2025-12-04T13:41:45.9028854Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_complex128 PASSED [0.0020s] [ 66%] 2025-12-04T13:41:45.9029254Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_complex64 PASSED [0.0020s] [ 66%] 2025-12-04T13:41:45.9029630Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float16 PASSED [0.0025s] [ 66%] 2025-12-04T13:41:45.9030009Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float32 PASSED [0.0020s] [ 66%] 2025-12-04T13:41:45.9030428Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float64 PASSED [0.0020s] [ 66%] 2025-12-04T13:41:45.9030794Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int16 PASSED [0.0020s] [ 67%] 2025-12-04T13:41:45.9031177Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int32 PASSED [0.0020s] [ 67%] 2025-12-04T13:41:45.9031545Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int64 PASSED [0.0020s] [ 67%] 2025-12-04T13:41:45.9031914Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int8 PASSED [0.0024s] [ 67%] 2025-12-04T13:41:45.9032293Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_uint8 PASSED [0.0020s] [ 67%] 2025-12-04T13:41:45.9032706Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_bfloat16 PASSED [0.0020s] [ 67%] 2025-12-04T13:41:45.9033088Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_bool PASSED [0.0020s] [ 68%] 2025-12-04T13:41:45.9033511Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_complex128 PASSED [0.0020s] [ 68%] 2025-12-04T13:41:45.9033897Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_complex64 PASSED [0.0020s] [ 68%] 2025-12-04T13:41:45.9034318Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float16 PASSED [0.0021s] [ 68%] 2025-12-04T13:41:45.9034693Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float32 PASSED [0.0024s] [ 68%] 2025-12-04T13:41:45.9035065Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float64 PASSED [0.0020s] [ 68%] 2025-12-04T13:41:45.9035446Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int16 PASSED [0.0020s] [ 69%] 2025-12-04T13:41:45.9035814Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int32 PASSED [0.0020s] [ 69%] 2025-12-04T13:41:45.9036197Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int64 PASSED [0.0020s] [ 69%] 2025-12-04T13:41:45.9036564Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int8 PASSED [0.0020s] [ 69%] 2025-12-04T13:41:45.9036932Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_uint8 PASSED [0.0024s] [ 69%] 2025-12-04T13:41:45.9037322Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_bfloat16 PASSED [0.0020s] [ 70%] 2025-12-04T13:41:45.9037680Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_bool PASSED [0.0020s] [ 70%] 2025-12-04T13:41:45.9038082Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_complex128 PASSED [0.0020s] [ 70%] 2025-12-04T13:41:45.9038462Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_complex64 PASSED [0.0020s] [ 70%] 2025-12-04T13:41:45.9038838Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float16 PASSED [0.0021s] [ 70%] 2025-12-04T13:41:45.9039222Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float32 PASSED [0.0020s] [ 70%] 2025-12-04T13:41:45.9039594Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float64 PASSED [0.0024s] [ 71%] 2025-12-04T13:41:45.9039973Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int16 PASSED [0.0021s] [ 71%] 2025-12-04T13:41:45.9040336Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int32 PASSED [0.0020s] [ 71%] 2025-12-04T13:41:45.9040699Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int64 PASSED [0.0020s] [ 71%] 2025-12-04T13:41:45.9041071Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int8 PASSED [0.0020s] [ 71%] 2025-12-04T13:41:45.9041434Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_uint8 PASSED [0.0020s] [ 71%] 2025-12-04T13:41:45.9041829Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_bfloat16 PASSED [0.0027s] [ 72%] 2025-12-04T13:41:45.9042195Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_bool PASSED [0.0020s] [ 72%] 2025-12-04T13:41:45.9042623Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_complex128 PASSED [0.0020s] [ 72%] 2025-12-04T13:41:45.9043020Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_complex64 PASSED [0.0020s] [ 72%] 2025-12-04T13:41:45.9043398Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float16 PASSED [0.0021s] [ 72%] 2025-12-04T13:41:45.9043772Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float32 PASSED [0.0020s] [ 72%] 2025-12-04T13:41:45.9044159Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float64 PASSED [0.0024s] [ 73%] 2025-12-04T13:41:45.9044527Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int16 PASSED [0.0020s] [ 73%] 2025-12-04T13:41:45.9044939Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int32 PASSED [0.0020s] [ 73%] 2025-12-04T13:41:45.9045342Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int64 PASSED [0.0020s] [ 73%] 2025-12-04T13:41:45.9045709Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int8 PASSED [0.0020s] [ 73%] 2025-12-04T13:41:45.9046121Z test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_uint8 PASSED [0.0020s] [ 74%] 2025-12-04T13:41:45.9046408Z test_linalg.py::TestLinalgCUDA::test_pca_lowrank_cuda PASSED [42.1638s] [ 74%] 2025-12-04T13:41:45.9046713Z test_linalg.py::TestLinalgCUDA::test_permute_matmul_cuda PASSED [0.0024s] [ 74%] 2025-12-04T13:41:45.9047145Z test_linalg.py::TestLinalgCUDA::test_pinv_cuda_complex128 SKIPPED [0.0015s] (no MAGMA library detected) [ 74%] 2025-12-04T13:41:45.9047569Z test_linalg.py::TestLinalgCUDA::test_pinv_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 74%] 2025-12-04T13:41:45.9047994Z test_linalg.py::TestLinalgCUDA::test_pinv_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 74%] 2025-12-04T13:41:45.9048402Z test_linalg.py::TestLinalgCUDA::test_pinv_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 75%] 2025-12-04T13:41:45.9048945Z test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_complex128 SKIPPED [0.0018s] (no MAGMA library detected) [ 75%] 2025-12-04T13:41:45.9049468Z test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 75%] 2025-12-04T13:41:45.9049974Z test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 75%] 2025-12-04T13:41:45.9050494Z test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 75%] 2025-12-04T13:41:45.9050945Z test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 75%] 2025-12-04T13:41:45.9051407Z test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 76%] 2025-12-04T13:41:45.9051843Z test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 76%] 2025-12-04T13:41:45.9052275Z test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 76%] 2025-12-04T13:41:45.9052616Z test_linalg.py::TestLinalgCUDA::test_preferred_blas_library_cuda PASSED [0.0106s] [ 76%] 2025-12-04T13:41:45.9053091Z test_linalg.py::TestLinalgCUDA::test_preferred_linalg_library_cuda SKIPPED [0.0015s] (no MAGMA library detected) [ 76%] 2025-12-04T13:41:45.9053419Z test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_complex128 PASSED [0.0054s] [ 77%] 2025-12-04T13:41:45.9053728Z test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_complex64 PASSED [0.0049s] [ 77%] 2025-12-04T13:41:45.9054025Z test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_float32 PASSED [0.0044s] [ 77%] 2025-12-04T13:41:45.9054339Z test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_float64 PASSED [0.0049s] [ 77%] 2025-12-04T13:41:45.9054625Z test_linalg.py::TestLinalgCUDA::test_qr_cuda_complex128 PASSED [0.0981s] [ 77%] 2025-12-04T13:41:45.9054959Z test_linalg.py::TestLinalgCUDA::test_qr_cuda_complex64 PASSED [0.0889s] [ 77%] 2025-12-04T13:41:45.9055250Z test_linalg.py::TestLinalgCUDA::test_qr_cuda_float32 PASSED [0.0819s] [ 78%] 2025-12-04T13:41:45.9055529Z test_linalg.py::TestLinalgCUDA::test_qr_cuda_float64 PASSED [0.0798s] [ 78%] 2025-12-04T13:41:45.9055860Z test_linalg.py::TestLinalgCUDA::test_qr_error_cases_cuda_float32 PASSED [0.0022s] [ 78%] 2025-12-04T13:41:45.9056176Z test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_complex128 PASSED [0.0100s] [ 78%] 2025-12-04T13:41:45.9056485Z test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_complex64 PASSED [0.0099s] [ 78%] 2025-12-04T13:41:45.9056796Z test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_float32 PASSED [0.0086s] [ 78%] 2025-12-04T13:41:45.9057127Z test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_float64 PASSED [0.0087s] [ 79%] 2025-12-04T13:41:45.9057506Z test_linalg.py::TestLinalgCUDA::test_renorm_cuda PASSED [0.0042s] [ 79%] 2025-12-04T13:41:45.9057875Z test_linalg.py::TestLinalgCUDA::test_renorm_ps_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 79%] 2025-12-04T13:41:45.9058343Z test_linalg.py::TestLinalgCUDA::test_rotating_buffer_tunableop_cuda_float32 PASSED [0.0018s] [ 79%] 2025-12-04T13:41:45.9059118Z test_linalg.py::TestLinalgCUDA::test_rowwise_scaled_gemm_numerics_tunableop_cuda_float8_e4m3fnuz SKIPPED [0.0020s] (test doesn't currently work on the CUDA stack) [ 79%] 2025-12-04T13:41:45.9059834Z test_linalg.py::TestLinalgCUDA::test_scaled_gemm_offline_tunableop_cuda_float8_e4m3fnuz SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 80%] 2025-12-04T13:41:45.9060503Z test_linalg.py::TestLinalgCUDA::test_scaled_gemm_offline_tunableop_cuda_float8_e5m2fnuz SKIPPED [0.0014s] (test doesn't currently work on the CUDA stack) [ 80%] 2025-12-04T13:41:45.9061123Z test_linalg.py::TestLinalgCUDA::test_scaled_gemm_tunableop_cuda_float8_e4m3fnuz SKIPPED [0.0014s] (test doesn't currently work on the CUDA stack) [ 80%] 2025-12-04T13:41:45.9061740Z test_linalg.py::TestLinalgCUDA::test_scaled_gemm_tunableop_cuda_float8_e5m2fnuz SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 80%] 2025-12-04T13:41:45.9062203Z test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 80%] 2025-12-04T13:41:45.9062643Z test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 80%] 2025-12-04T13:41:45.9063084Z test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 81%] 2025-12-04T13:41:45.9063512Z test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 81%] 2025-12-04T13:41:45.9064060Z test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 81%] 2025-12-04T13:41:45.9064643Z test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 81%] 2025-12-04T13:41:45.9065267Z test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 81%] 2025-12-04T13:41:45.9065806Z test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 81%] 2025-12-04T13:41:45.9066207Z test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_complex128 PASSED [0.0160s] [ 82%] 2025-12-04T13:41:45.9066600Z test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_complex64 PASSED [0.0071s] [ 82%] 2025-12-04T13:41:45.9066998Z test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_float32 PASSED [0.0064s] [ 82%] 2025-12-04T13:41:45.9067383Z test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_float64 PASSED [0.0080s] [ 82%] 2025-12-04T13:41:45.9067831Z test_linalg.py::TestLinalgCUDA::test_solve_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 82%] 2025-12-04T13:41:45.9068299Z test_linalg.py::TestLinalgCUDA::test_solve_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 82%] 2025-12-04T13:41:45.9068716Z test_linalg.py::TestLinalgCUDA::test_solve_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 83%] 2025-12-04T13:41:45.9069140Z test_linalg.py::TestLinalgCUDA::test_solve_cuda_float64 SKIPPED [0.0019s] (no MAGMA library detected) [ 83%] 2025-12-04T13:41:45.9069450Z test_linalg.py::TestLinalgCUDA::test_solve_removed_error_cuda PASSED [0.0017s] [ 83%] 2025-12-04T13:41:45.9069779Z test_linalg.py::TestLinalgCUDA::test_strided_mm_bmm_cuda_float32 PASSED [0.0026s] [ 83%] 2025-12-04T13:41:45.9070098Z test_linalg.py::TestLinalgCUDA::test_strided_mm_bmm_cuda_float64 PASSED [0.0025s] [ 83%] 2025-12-04T13:41:45.9070420Z test_linalg.py::TestLinalgCUDA::test_svd_cuda_complex128 PASSED [2.9699s] [ 84%] 2025-12-04T13:41:45.9070719Z test_linalg.py::TestLinalgCUDA::test_svd_cuda_complex64 PASSED [1.7142s] [ 84%] 2025-12-04T13:41:45.9071032Z test_linalg.py::TestLinalgCUDA::test_svd_cuda_float32 PASSED [1.5063s] [ 84%] 2025-12-04T13:41:45.9071312Z test_linalg.py::TestLinalgCUDA::test_svd_cuda_float64 PASSED [1.8079s] [ 84%] 2025-12-04T13:41:45.9071960Z test_linalg.py::TestLinalgCUDA::test_svd_lowrank_cuda_complex128 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 84%] 2025-12-04T13:41:45.9072270Z test_linalg.py::TestLinalgCUDA::test_svd_lowrank_cuda_float64 PASSED [80.4909s] [ 84%] 2025-12-04T13:41:45.9072654Z test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_complex128 PASSED [0.3197s] [ 85%] 2025-12-04T13:41:45.9073021Z test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_complex64 PASSED [0.1707s] [ 85%] 2025-12-04T13:41:45.9073379Z test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_float32 PASSED [0.1564s] [ 85%] 2025-12-04T13:41:45.9073742Z test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_float64 PASSED [0.2405s] [ 85%] 2025-12-04T13:41:45.9074058Z test_linalg.py::TestLinalgCUDA::test_symeig_removed_error_cuda PASSED [0.0022s] [ 85%] 2025-12-04T13:41:45.9074352Z test_linalg.py::TestLinalgCUDA::test_tensordot_cuda PASSED [0.0070s] [ 85%] 2025-12-04T13:41:45.9074820Z test_linalg.py::TestLinalgCUDA::test_tensordot_out_kernel_errors_with_autograd_cuda_complex64 PASSED [0.0032s] [ 86%] 2025-12-04T13:41:45.9075274Z test_linalg.py::TestLinalgCUDA::test_tensordot_out_kernel_errors_with_autograd_cuda_float32 PASSED [0.0026s] [ 86%] 2025-12-04T13:41:45.9075744Z test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 86%] 2025-12-04T13:41:45.9076201Z test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 86%] 2025-12-04T13:41:45.9076653Z test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 86%] 2025-12-04T13:41:45.9077091Z test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 87%] 2025-12-04T13:41:45.9077585Z test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 87%] 2025-12-04T13:41:45.9078075Z test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 87%] 2025-12-04T13:41:45.9078542Z test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 87%] 2025-12-04T13:41:45.9079023Z test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_float64 SKIPPED [0.0015s] (no MAGMA library detected) [ 87%] 2025-12-04T13:41:45.9079581Z test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_complex128 SKIPPED [0.0017s] (no MAGMA library detected) [ 87%] 2025-12-04T13:41:45.9080132Z test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 88%] 2025-12-04T13:41:45.9080715Z test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 88%] 2025-12-04T13:41:45.9081249Z test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 88%] 2025-12-04T13:41:45.9081796Z test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 88%] 2025-12-04T13:41:45.9082323Z test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_complex64 SKIPPED [0.0017s] (no MAGMA library detected) [ 88%] 2025-12-04T13:41:45.9082840Z test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 88%] 2025-12-04T13:41:45.9083395Z test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 89%] 2025-12-04T13:41:45.9083894Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 89%] 2025-12-04T13:41:45.9084375Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 89%] 2025-12-04T13:41:45.9084859Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 89%] 2025-12-04T13:41:45.9085307Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_float64 SKIPPED [0.0017s] (no MAGMA library detected) [ 89%] 2025-12-04T13:41:45.9085815Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 90%] 2025-12-04T13:41:45.9086314Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 90%] 2025-12-04T13:41:45.9086808Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 90%] 2025-12-04T13:41:45.9087288Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 90%] 2025-12-04T13:41:45.9088085Z test_linalg.py::TestLinalgCUDA::test_tensorsolve_errors_and_warnings_cuda_float32 SKIPPED [0.0018s] (no MAGMA library detected) [ 90%] 2025-12-04T13:41:45.9088695Z test_linalg.py::TestLinalgCUDA::test_tf32_offline_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 90%] 2025-12-04T13:41:45.9089238Z test_linalg.py::TestLinalgCUDA::test_tf32_tunableop_cuda_float32 SKIPPED [0.0015s] (test doesn't currently work on the CUDA stack) [ 91%] 2025-12-04T13:41:45.9089853Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 91%] 2025-12-04T13:41:45.9090452Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 91%] 2025-12-04T13:41:45.9091038Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 91%] 2025-12-04T13:41:45.9091633Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_broadcasting_cuda_float64 SKIPPED [0.0018s] (no MAGMA library detected) [ 91%] 2025-12-04T13:41:45.9092172Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 91%] 2025-12-04T13:41:45.9092708Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 92%] 2025-12-04T13:41:45.9093223Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_float32 SKIPPED [0.0014s] (no MAGMA library detected) [ 92%] 2025-12-04T13:41:45.9093737Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 92%] 2025-12-04T13:41:45.9094489Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_complex128 SKIPPED [0.0018s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 92%] 2025-12-04T13:41:45.9095287Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_complex64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 92%] 2025-12-04T13:41:45.9096012Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_float32 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 92%] 2025-12-04T13:41:45.9096719Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_float64 SKIPPED [0.0014s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 93%] 2025-12-04T13:41:45.9097231Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_complex128 SKIPPED [0.0014s] (no MAGMA library detected) [ 93%] 2025-12-04T13:41:45.9097761Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_complex64 SKIPPED [0.0014s] (no MAGMA library detected) [ 93%] 2025-12-04T13:41:45.9098355Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_float32 SKIPPED [0.0018s] (no MAGMA library detected) [ 93%] 2025-12-04T13:41:45.9098840Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 93%] 2025-12-04T13:41:45.9099268Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_large_cuda_float32 PASSED [6.9079s] [ 94%] 2025-12-04T13:41:45.9099884Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_complex128 SKIPPED [0.0018s] (no MAGMA library detected) [ 94%] 2025-12-04T13:41:45.9100480Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_complex64 SKIPPED [0.0015s] (no MAGMA library detected) [ 94%] 2025-12-04T13:41:45.9101071Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_float32 SKIPPED [0.0019s] (no MAGMA library detected) [ 94%] 2025-12-04T13:41:45.9101665Z test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_float64 SKIPPED [0.0014s] (no MAGMA library detected) [ 94%] 2025-12-04T13:41:45.9102050Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_bfloat16 PASSED [0.0034s] [ 94%] 2025-12-04T13:41:45.9102432Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_bool PASSED [0.0025s] [ 95%] 2025-12-04T13:41:45.9102822Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_complex128 PASSED [0.0035s] [ 95%] 2025-12-04T13:41:45.9103207Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_complex64 PASSED [0.0042s] [ 95%] 2025-12-04T13:41:45.9103588Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float16 PASSED [0.0033s] [ 95%] 2025-12-04T13:41:45.9103960Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float32 PASSED [0.0033s] [ 95%] 2025-12-04T13:41:45.9104343Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float64 PASSED [0.0033s] [ 95%] 2025-12-04T13:41:45.9104708Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int16 PASSED [0.0025s] [ 96%] 2025-12-04T13:41:45.9105071Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int32 PASSED [0.0025s] [ 96%] 2025-12-04T13:41:45.9105444Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int64 PASSED [0.0025s] [ 96%] 2025-12-04T13:41:45.9105804Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int8 PASSED [0.0028s] [ 96%] 2025-12-04T13:41:45.9106165Z test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_uint8 PASSED [0.0025s] [ 96%] 2025-12-04T13:41:45.9106659Z test_linalg.py::TestLinalgCUDA::test_triu_tril_large_matrix_64bit_cuda SKIPPED [0.1564s] (Insufficient cuda:0 memory) [ 97%] 2025-12-04T13:41:45.9107259Z test_linalg.py::TestLinalgCUDA::test_validator_tunableop_rocm_cuda_float32 SKIPPED [0.0017s] (test doesn't currently work on the CUDA stack) [ 97%] 2025-12-04T13:41:45.9107574Z test_linalg.py::TestLinalgCUDA::test_vdot_invalid_args_cuda PASSED [0.0035s] [ 97%] 2025-12-04T13:41:45.9107934Z test_linalg.py::TestLinalgCUDA::test_vdot_vs_numpy_cuda_complex64 PASSED [0.0058s] [ 97%] 2025-12-04T13:41:45.9108247Z test_linalg.py::TestLinalgCUDA::test_vdot_vs_numpy_cuda_float32 PASSED [0.0053s] [ 97%] 2025-12-04T13:41:45.9108569Z test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_bfloat16 PASSED [1.0891s] [ 97%] 2025-12-04T13:41:45.9108888Z test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_complex128 PASSED [0.6539s] [ 98%] 2025-12-04T13:41:45.9109208Z test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_complex64 PASSED [1.1065s] [ 98%] 2025-12-04T13:41:45.9109516Z test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float16 PASSED [1.0847s] [ 98%] 2025-12-04T13:41:45.9109825Z test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float32 PASSED [1.0724s] [ 98%] 2025-12-04T13:41:45.9110172Z test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float64 PASSED [0.6359s] [ 98%] 2025-12-04T13:41:45.9110573Z test_linalg.py::TestLinalgCUDA::test_vector_norm_decom_unbacked_checks_cuda PASSED [0.5527s] [ 98%] 2025-12-04T13:41:45.9110921Z test_linalg.py::TestLinalgCUDA::test_vector_norm_dim_tuple_arg_cuda PASSED [0.0031s] [ 99%] 2025-12-04T13:41:45.9111258Z test_linalg.py::TestLinalgCUDA::test_vector_norm_extreme_values_cuda PASSED [0.0308s] [ 99%] 2025-12-04T13:41:45.9111713Z test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_complex128 PASSED [0.0187s] [ 99%] 2025-12-04T13:41:45.9112137Z test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_complex64 PASSED [0.0190s] [ 99%] 2025-12-04T13:41:45.9112541Z test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_float32 PASSED [0.0168s] [ 99%] 2025-12-04T13:41:45.9112951Z test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_float64 PASSED [0.0159s] [100%] 2025-12-04T13:41:45.9112959Z 2025-12-04T13:41:45.9113554Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-83ecadeff448aeea.xml - 2025-12-04T13:41:45.9113792Z ========= 335 passed, 235 skipped, 693 deselected in 209.40s (0:03:29) ========= 2025-12-04T13:41:45.9115247Z The following tests failed consistently: ['test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32', 'test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float64', 'test/test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128'] 2025-12-04T13:41:45.9115256Z 2025-12-04T13:41:45.9115643Z FINISHED PRINTING LOG FILE of test_linalg 1/1 (test/test-reports/test_linalg_1.1_2a5b6f269d53327b_.log) 2025-12-04T13:41:45.9115648Z 2025-12-04T13:41:45.9115927Z Finished test_linalg 1/1 ... [2025-12-04 13:41:45.462618][16109.583831455], took 7.41min 2025-12-04T13:41:45.9116568Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-77c9be2be0e8d8f0.xml 2025-12-04T13:41:45.9117246Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-87fea4734ba7fa90.xml 2025-12-04T13:41:45.9117870Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-cdf15fa2abf8389b.xml 2025-12-04T13:41:45.9118492Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-301657435283457f.xml 2025-12-04T13:41:45.9119128Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-274ad688fc221da2.xml 2025-12-04T13:41:45.9119750Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-c003f5ff32bae5e6.xml 2025-12-04T13:41:45.9120384Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-e9e5772b33a862c1.xml 2025-12-04T13:41:45.9121016Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-67ba282cb3193875.xml 2025-12-04T13:41:45.9121672Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-f88fa3b886672d5c.xml 2025-12-04T13:41:45.9122314Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_linalg/test_linalg-83ecadeff448aeea.xml 2025-12-04T13:41:46.5558701Z Uploading logs for 57119749259 to S3 2025-12-04T13:41:46.7946123Z Uploading artifacts took 0.85 seconds 2025-12-04T13:41:46.7946563Z test_linalg 1/1 failed! 2025-12-04T13:41:46.7950633Z Running test_ops 4/11 ... [2025-12-04 13:41:46.794902][16110.916120297] 2025-12-04T13:41:46.7951132Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T13:41:46.7956010Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=4', '--num-shards=11', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:41:46.795354] 2025-12-04T14:01:11.7874550Z 2025-12-04T14:01:11.7875456Z test_ops 4/11 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_4.11_cc5a88e4afdf55b7_.log 2025-12-04T14:01:11.9150779Z Running 3070 items in this shard: test/test_ops.py::TestSelfKwarg::test_self_kwargs, test/test_ops.py::TestCommonCUDA::test_compare_cpu__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing___getitem___cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_repeat_interleave_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rand___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__unsafe_masked_index_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_arange_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expand_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_grid_sampler_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_long_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lu_unpack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_cosine_similarity_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_linear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_permute_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resize_as__cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sparse_sampled_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tensordot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_sparse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triangular_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_errors_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ge_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_item_cuda, test/test_ops.py::TestCommonCUDA::test_errors_lt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_neg_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nansum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resize__cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resize_as__cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_reduce_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_slice_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_airy_ai_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_list_args_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_inner_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_median_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_addbmm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_aminmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_cosine_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_out___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rmatmul___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_bool_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flip_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ne_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_permute_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tril_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__softmax_backward_data_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_baddbmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cholesky_inverse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clone_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cov_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cummax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_full_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_put_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_factor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_multi_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_multinomial_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cosine_similarity_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_grid_sample_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest-exact_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_local_response_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_silu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_gaussian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tril_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zeros_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float8_e5m2, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float8_e4m3fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e5m2fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_angle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_interleave_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_real_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hash_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hash_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vdot_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_physical_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_equal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_ne_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_positive_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sinc_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flip_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isnan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_qr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resolve_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_true_divide_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zero__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zeros_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_short_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_imag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_permute_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_trace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_half_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_unary_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_constant_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_reflect_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softsign_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_inf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_qr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ravel_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_renorm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tile_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_sparse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmul___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_abs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_partial_views_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diag_embed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isreal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nextafter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i0e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_var_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_as_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cholesky_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diff_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_singular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nansum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardsigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardswish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nonzero_static_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ormqr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_reshape_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize_as__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_searchsorted_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_gaussian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_hamming_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_hermite_polynomial_he_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_k1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_multiple_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tensor_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_to_sparse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trunc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unflatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsafe_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_as_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_and_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__unsafe_masked_index_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_floor_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isfinite_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_short_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_y0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestTagsCUDA::test_tags_H_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rsub___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_not_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_stft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_true_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unbind_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_not_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cauchy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cdouble_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cfloat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_permuted_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isclose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cond_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logaddexp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_maximum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_median_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_minimum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanmean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_batch_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_fro_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_nuc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i0e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sqrt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unique_consecutive_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zero__cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_div_trunc_rounding_cuda_float32 2025-12-04T14:01:12.0390177Z 2025-12-04T14:01:12.0390500Z Finished test_ops 4/11 ... [2025-12-04 14:01:11.791872][17275.913085636], took 19.42min 2025-12-04T14:01:12.0391559Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_ops/test_ops-2fce3d3e4cb230c2.xml 2025-12-04T14:01:12.0392539Z Running test_ops 9/11 ... [2025-12-04 14:01:11.979065][17276.100283085] 2025-12-04T14:01:12.0393029Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T14:01:12.0394200Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=9', '--num-shards=11', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:01:11.979479] 2025-12-04T14:21:25.6569031Z 2025-12-04T14:21:25.6569940Z test_ops 9/11 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_9.11_a89556c3277781cc_.log 2025-12-04T14:21:25.7848274Z Running 3067 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_partial_views_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_logaddexp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rand_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_round_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_offsets_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_acos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_byte_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_householder_product_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mH_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_matrix_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_without_cudnn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_ctc_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_reflect_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polar_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rand_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unique_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_errors_multinomial_cuda, test/test_ops.py::TestCommonCUDA::test_errors_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_errors_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_where_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hash_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__unsafe_masked_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_airy_ai_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_argsort_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagflat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_put_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_no_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmatmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_msort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_rms_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_angle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch__scaled_mm_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_out_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___getitem___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_half_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_combinations_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_corrcoef_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_le_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_matrix_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_dropout_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ne_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_reflect_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_quantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randn_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sparse_mm_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_where_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float8_e4m3fnuz, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_permuted_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_return_by_ref_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tile_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_like_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_double_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_all_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reshape_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__unsafe_masked_index_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmm_decomposed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_all_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_angle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_byte_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_corrcoef_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_half_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isfinite_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_unary_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_rank_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log10_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_long_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mT_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_static_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ones_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pinverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tensor_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trapz_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_H_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___getitem___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rdiv___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rpow___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_physical_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_equal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logaddexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_renorm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_roll_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sgn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_true_divide_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_abs_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_asin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expm1_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flipud_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eig_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_matrix_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pinverse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rand_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rot90_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scalar_tensor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_with_sizes_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_squeeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_transpose_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rdiv___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addcdiv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_alias_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_contiguous_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_equal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_hfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fliplr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_not_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rad2deg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_transpose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addcmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_all_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cov_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagflat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hash_tensor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_igammac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_kthvalue_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cholesky_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_slogdet_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_tensorsolve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_matrix_exp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanmedian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_dropout_backward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_logsigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nonzero_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_qr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ravel_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_real_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_repeat_interleave_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_roll_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_short_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_hamming_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_take_along_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_topk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unique_consecutive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_mean_unbiased_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_long_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_uniform_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_blackman_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_fake_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hash_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unbind_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex128, test/test_ops.py::TestTagsCUDA::test_tags___rmatmul___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___ror___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_half_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_abs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clone_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fliplr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_frexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_movedim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_std_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_transpose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_triu_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_unfold_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_abs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clamp_min_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cummax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagflat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_double_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_hfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flipud_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gcd_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isneginf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanquantile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_gelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_selu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pinverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polar_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randn_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_select_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_short_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_stft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_ops.py::TestTagsCUDA::test_tags_trapz_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsafe_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsqueeze_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zeros_like_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_add_cuda_float32 2025-12-04T14:21:25.9093955Z 2025-12-04T14:21:25.9094264Z Finished test_ops 9/11 ... [2025-12-04 14:21:25.661120][18489.782336502], took 20.23min 2025-12-04T14:21:25.9095496Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_ops/test_ops-b0df9953dc6aab8d.xml 2025-12-04T14:21:27.1173513Z Uploading artifacts took 1.28 seconds 2025-12-04T14:21:27.1177554Z Running test_sparse_csr 2/2 ... [2025-12-04 14:21:27.117549][18491.238768575] 2025-12-04T14:21:27.1178082Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T14:21:27.1182030Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_sparse_csr.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:21:27.117984] 2025-12-04T14:34:54.3215268Z 2025-12-04T14:34:54.3216488Z PRINTING LOG FILE of test_sparse_csr 2/2 (test/test-reports/test_sparse_csr_2.2_9a8f6f98dd0804ce_.log) 2025-12-04T14:34:54.3217511Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-bc7545635763bbdc.xml 2025-12-04T14:34:54.3218734Z ============================= test session starts ============================== 2025-12-04T14:34:54.3219530Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.3220375Z cachedir: .pytest_cache 2025-12-04T14:34:54.3221150Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.3221978Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.3222335Z configfile: pytest.ini 2025-12-04T14:34:54.3223551Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.3225004Z collecting ... collected 4932 items 2025-12-04T14:34:54.3225505Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T14:34:54.4445379Z Running 2401 items in this shard: test/test_sparse_csr.py::TestSparseCSRCUDA::test_add_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_add_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_all_sparse_csr_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_all_sparse_csr_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_errors_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_0_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_0_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_25_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_25_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_25_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_25_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_25_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_25_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_1_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_0_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_0_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_25_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_25_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_25_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_0_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_0_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_1_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_25_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_25_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_0_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_0_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_1_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_25_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_1_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_0_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_25_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_25_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_0_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_0_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_11x9_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_11x9_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_11x9_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_3x3_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_5x7_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_5x7_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_abs_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_angle_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_angle_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_asin_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_asinh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_atan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_atanh_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_ceil_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_erf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_erfinv_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_expm1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_isnan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_isneginf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_neg_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_positive_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_rad2deg_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sgn_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sgn_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_signbit_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sin_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sin_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sinh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_trunc_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_baddbmm_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_baddbmm_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_baddbmm_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_False_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_True_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_True_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_False_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_True_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_True_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_True_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int32_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int32_noncontiguous_False_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int32_noncontiguous_False_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_False_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_False_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_False_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_True_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int64_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int64_noncontiguous_False_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int64_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int32_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int32_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int32_noncontiguous_True_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_True_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_False_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_True_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int32_noncontiguous_False_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int32_noncontiguous_False_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int32_noncontiguous_True_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int64_noncontiguous_False_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int64_noncontiguous_True_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int64_noncontiguous_True_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_bmm_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_bmm_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSC_SparseBSC_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSC_SparseCSR_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSR_SparseBSR_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSR_SparseCSC_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSC_SparseCSC_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSR_SparseBSC_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSR_SparseCSC_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSR_SparseCSR_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_coo_to_csr_convert_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_matvec_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_matvec_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_matvec_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_storage_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_stride_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_to_block_csr_blocksize_2_cuda_float64_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_csr_to_block_csr_blocksize_4_cuda_float64_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSC_NonBatched_Hybrid_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSC_NonBatched_NonHybrid_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSR_Batched_NonHybrid_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSR_NonBatched_Hybrid_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSR_NonBatched_NonHybrid_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseCSC_NonBatched_NonHybrid_cuda, test/test_sparse_csr.py::TestSparseCSRCUDA::test_direct_coo_csr_conversion_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_direct_coo_csr_conversion_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_direct_coo_csr_conversion_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_matmul_device_mismatch_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mm_errors_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_autograd_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_autograd_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_errors_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_errors_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_zero_sized_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_zero_sized_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_zero_sized_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_add_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_add_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_add_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_frac_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_frac_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_frac_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_nn_functional_relu_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sign_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sign_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sin_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sin_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sin_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_ceil_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_ceil_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_ceil_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_expm1_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_expm1_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isneginf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isneginf_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isneginf_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_nn_functional_relu_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_nn_functional_relu_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_nn_functional_relu_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_mm_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_mm_reduce_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_mm_reduce_sum_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_to_sparse_compressed_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_to_sparse_compressed_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_triangular_solve_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_frac_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_frac_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_frac_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isinf_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isinf_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isinf_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isneginf_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isneginf_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isneginf_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isposinf_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isposinf_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isposinf_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_nn_functional_relu_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_round_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_round_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_round_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sign_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sign_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sign_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_signbit_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_signbit_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_signbit_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_int64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_complex64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_int16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_complex128, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_int8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_bool, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_complex32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_float16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_float32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_int32, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_uint8, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_trunc_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_trunc_cuda_float64, test/test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_trunc_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_frac_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_frac_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_log1p_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_log1p_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_log1p_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_nn_functional_relu_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_nn_functional_relu_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_nn_functional_relu_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sign_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sign_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sign_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_zeros_like_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_zeros_like_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_zeros_like_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_abs_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_ceil_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_ceil_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_deg2rad_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_deg2rad_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_deg2rad_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_frac_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_frac_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_mean_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_mean_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_randn_like_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_randn_like_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_trunc_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_trunc_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erfinv_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_expm1_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_expm1_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_expm1_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_frac_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_mean_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_mean_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_mean_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_round_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_round_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_round_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_signbit_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_signbit_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_signbit_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asinh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_deg2rad_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_deg2rad_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_deg2rad_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_frac_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_frac_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_mean_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_mean_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_complex32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseBSC_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseBSR_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseCSC_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseCSR_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_invalid_input_SparseBSC_target_sparse_compressed_tensor_no_size_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_invalid_input_SparseBSR_target_sparse_compressed_tensor_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_invalid_input_SparseBSR_target_sparse_compressed_tensor_no_size_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_layout_SparseBSR_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_layout_SparseCSR_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_pickle_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_pickle_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_pickle_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_print_SparseBSR_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_print_SparseCSR_cuda, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int32_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_complex128, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_bool, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_uint8, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_bfloat16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_float64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_complex64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int16, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int32, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int64, test/test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_16_int64_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_16_int64_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_32_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_32_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_32_int64_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_64_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_64_int64_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_16_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_2_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_2x3_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_softmax_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_softmax_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16_out_dtype_unspecified_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_unspecified_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_unspecified_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_unspecified_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16_out_dtype_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16x32_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16x32_out_dtype_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16_out_dtype_unspecified_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16_out_dtype_unspecified_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16x32_out_dtype_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16x32_out_dtype_unspecified_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_32_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_32_out_dtype_unspecified_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16_out_dtype_unspecified_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16x32_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16x32_out_dtype_unspecified_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_sampled_addmm_block_size_64_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_sampled_addmm_block_size_64_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_scaled_dot_product_attention_block_size_32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_scaled_dot_product_attention_block_size_64_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_scatter_mm_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_int32_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_int32_cuda_float32, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_unspecified_cuda_float16, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op_bsr_dense_addmm_out_dtype_int32_cuda_int8, test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op_bsr_dense_addmm_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.5630676Z 2025-12-04T14:34:54.5631103Z test_sparse_csr.py::TestSparseCSRCUDA::test_add_SparseCSC_cuda_float32 PASSED [0.1654s] [ 0%] 2025-12-04T14:34:54.5632000Z test_sparse_csr.py::TestSparseCSRCUDA::test_add_SparseCSC_cuda_float64 PASSED [0.1566s] [ 0%] 2025-12-04T14:34:54.5632933Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_all_sparse_csr_SparseCSC_cuda_float64 PASSED [0.4679s] [ 0%] 2025-12-04T14:34:54.5633947Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_all_sparse_csr_SparseCSR_cuda_float64 PASSED [0.0490s] [ 0%] 2025-12-04T14:34:54.5635061Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_complex128 SKIPPED [0.0019s] (Only runs on cpu) [ 0%] 2025-12-04T14:34:54.5636278Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_complex64 SKIPPED [0.0016s] (Only runs on cpu) [ 0%] 2025-12-04T14:34:54.5638891Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_float32 SKIPPED [0.0018s] (Only runs on cpu) [ 0%] 2025-12-04T14:34:54.5640089Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSC_cuda_float64 SKIPPED [0.0015s] (Only runs on cpu) [ 0%] 2025-12-04T14:34:54.5641273Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_dense_result_SparseCSR_cuda_float32 SKIPPED [0.0014s] (Only runs on cpu) [ 0%] 2025-12-04T14:34:54.5642471Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_errors_cuda_float32 PASSED [0.0045s] [ 0%] 2025-12-04T14:34:54.5643416Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_0_cuda_float16 PASSED [0.0043s] [ 0%] 2025-12-04T14:34:54.5644482Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_0_cuda_float64 PASSED [0.0038s] [ 0%] 2025-12-04T14:34:54.5645560Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_1_cuda_complex128 PASSED [0.0038s] [ 0%] 2025-12-04T14:34:54.5646648Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_25_cuda_complex64 PASSED [0.0038s] [ 0%] 2025-12-04T14:34:54.5647839Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_25_cuda_float32 PASSED [0.0039s] [ 0%] 2025-12-04T14:34:54.5648971Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_0_m_25_cuda_float64 PASSED [0.0037s] [ 0%] 2025-12-04T14:34:54.5650058Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_complex128 PASSED [0.0039s] [ 0%] 2025-12-04T14:34:54.5651146Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_complex64 PASSED [0.0038s] [ 0%] 2025-12-04T14:34:54.5652210Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_float16 PASSED [0.0038s] [ 0%] 2025-12-04T14:34:54.5653273Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_float32 PASSED [0.0040s] [ 0%] 2025-12-04T14:34:54.5654331Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_0_cuda_float64 PASSED [0.0039s] [ 0%] 2025-12-04T14:34:54.5655414Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_complex128 PASSED [0.0091s] [ 0%] 2025-12-04T14:34:54.5656490Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_complex64 PASSED [0.0061s] [ 0%] 2025-12-04T14:34:54.5657565Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_float16 PASSED [0.0045s] [ 0%] 2025-12-04T14:34:54.5658742Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_float32 PASSED [0.0047s] [ 1%] 2025-12-04T14:34:54.5659812Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_1_cuda_float64 PASSED [0.0045s] [ 1%] 2025-12-04T14:34:54.5660887Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_25_cuda_complex128 PASSED [0.0047s] [ 1%] 2025-12-04T14:34:54.5661986Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_25_cuda_complex64 PASSED [0.0047s] [ 1%] 2025-12-04T14:34:54.5663076Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_10_m_25_cuda_float32 PASSED [0.0045s] [ 1%] 2025-12-04T14:34:54.5664163Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_complex128 PASSED [0.0039s] [ 1%] 2025-12-04T14:34:54.5665233Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_complex64 PASSED [0.0042s] [ 1%] 2025-12-04T14:34:54.5666304Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_float32 PASSED [0.0039s] [ 1%] 2025-12-04T14:34:54.5667367Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_0_cuda_float64 PASSED [0.0037s] [ 1%] 2025-12-04T14:34:54.5668443Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_1_cuda_complex64 PASSED [0.0045s] [ 1%] 2025-12-04T14:34:54.5669508Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_1_cuda_float16 PASSED [0.0043s] [ 1%] 2025-12-04T14:34:54.5670572Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_1_cuda_float32 PASSED [0.0043s] [ 1%] 2025-12-04T14:34:54.5671694Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_complex128 PASSED [0.0051s] [ 1%] 2025-12-04T14:34:54.5672783Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_complex64 PASSED [0.0046s] [ 1%] 2025-12-04T14:34:54.5673846Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_float16 PASSED [0.0045s] [ 1%] 2025-12-04T14:34:54.5674918Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_float32 PASSED [0.0044s] [ 1%] 2025-12-04T14:34:54.5675973Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_0_n_1_m_25_cuda_float64 PASSED [0.0045s] [ 1%] 2025-12-04T14:34:54.5677119Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_0_cuda_complex64 PASSED [0.0037s] [ 1%] 2025-12-04T14:34:54.5678192Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_0_cuda_float32 PASSED [0.0036s] [ 1%] 2025-12-04T14:34:54.5679296Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_0_cuda_float64 PASSED [0.0040s] [ 1%] 2025-12-04T14:34:54.5680378Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_complex128 PASSED [0.0039s] [ 1%] 2025-12-04T14:34:54.5681464Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_complex64 PASSED [0.0038s] [ 1%] 2025-12-04T14:34:54.5682527Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_float16 PASSED [0.0038s] [ 1%] 2025-12-04T14:34:54.5683594Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_1_cuda_float64 PASSED [0.0038s] [ 1%] 2025-12-04T14:34:54.5684680Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_25_cuda_complex128 PASSED [0.0039s] [ 2%] 2025-12-04T14:34:54.5685777Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_25_cuda_complex64 PASSED [0.0049s] [ 2%] 2025-12-04T14:34:54.5686840Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_0_m_25_cuda_float32 PASSED [0.0039s] [ 2%] 2025-12-04T14:34:54.5688101Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_0_cuda_complex64 PASSED [0.0041s] [ 2%] 2025-12-04T14:34:54.5689180Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_0_cuda_float16 PASSED [0.0039s] [ 2%] 2025-12-04T14:34:54.5690246Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_0_cuda_float32 PASSED [0.0039s] [ 2%] 2025-12-04T14:34:54.5691302Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_1_cuda_complex64 PASSED [0.0053s] [ 2%] 2025-12-04T14:34:54.5692383Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_1_cuda_float16 PASSED [0.0050s] [ 2%] 2025-12-04T14:34:54.5693455Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_1_cuda_float32 PASSED [0.0054s] [ 2%] 2025-12-04T14:34:54.5694539Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_25_cuda_complex64 PASSED [0.0053s] [ 2%] 2025-12-04T14:34:54.5695630Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_10_m_25_cuda_float16 PASSED [0.0051s] [ 2%] 2025-12-04T14:34:54.5696714Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_0_cuda_float16 PASSED [0.0040s] [ 2%] 2025-12-04T14:34:54.5697781Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_0_cuda_float32 PASSED [0.0038s] [ 2%] 2025-12-04T14:34:54.5698910Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_0_cuda_float64 PASSED [0.0039s] [ 2%] 2025-12-04T14:34:54.5699989Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_1_cuda_complex128 PASSED [0.0054s] [ 2%] 2025-12-04T14:34:54.5701065Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_1_cuda_float32 PASSED [0.0048s] [ 2%] 2025-12-04T14:34:54.5702185Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_1_n_1_m_25_cuda_float16 PASSED [0.0050s] [ 2%] 2025-12-04T14:34:54.5703274Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_complex128 PASSED [0.0037s] [ 2%] 2025-12-04T14:34:54.5704359Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_complex64 PASSED [0.0036s] [ 2%] 2025-12-04T14:34:54.5705423Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_float16 PASSED [0.0036s] [ 2%] 2025-12-04T14:34:54.5706540Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_0_cuda_float32 PASSED [0.0040s] [ 2%] 2025-12-04T14:34:54.5707661Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_1_cuda_complex64 PASSED [0.0038s] [ 2%] 2025-12-04T14:34:54.5708772Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_1_cuda_float64 PASSED [0.0038s] [ 2%] 2025-12-04T14:34:54.5709859Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_complex128 PASSED [0.0039s] [ 2%] 2025-12-04T14:34:54.5710949Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_complex64 PASSED [0.0038s] [ 3%] 2025-12-04T14:34:54.5712013Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_float16 PASSED [0.0038s] [ 3%] 2025-12-04T14:34:54.5713083Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_0_m_25_cuda_float32 PASSED [0.0038s] [ 3%] 2025-12-04T14:34:54.5714164Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_0_cuda_complex64 PASSED [0.0044s] [ 3%] 2025-12-04T14:34:54.5715256Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_1_cuda_complex128 PASSED [0.0053s] [ 3%] 2025-12-04T14:34:54.5716333Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_1_cuda_float16 PASSED [0.0050s] [ 3%] 2025-12-04T14:34:54.5717402Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_25_cuda_float16 PASSED [0.0052s] [ 3%] 2025-12-04T14:34:54.5718476Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_10_m_25_cuda_float32 PASSED [0.0050s] [ 3%] 2025-12-04T14:34:54.5719553Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_0_cuda_complex128 PASSED [0.0040s] [ 3%] 2025-12-04T14:34:54.5720611Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_0_cuda_float32 PASSED [0.0042s] [ 3%] 2025-12-04T14:34:54.5721685Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_1_cuda_complex128 PASSED [0.0051s] [ 3%] 2025-12-04T14:34:54.5722769Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_1_cuda_complex64 PASSED [0.0050s] [ 3%] 2025-12-04T14:34:54.5723844Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmm_sizes_all_sparse_csr_k_8_n_1_m_1_cuda_float16 PASSED [0.0049s] [ 3%] 2025-12-04T14:34:54.5724905Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_11x9_cuda_complex128 SKIPPED [0.0015s] (Only runs on cpu) [ 3%] 2025-12-04T14:34:54.5725988Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_11x9_cuda_complex64 SKIPPED [0.0015s] (Only runs on cpu) [ 3%] 2025-12-04T14:34:54.5727051Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_11x9_cuda_float32 SKIPPED [0.0015s] (Only runs on cpu) [ 3%] 2025-12-04T14:34:54.5728095Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_3x3_cuda_float64 SKIPPED [0.0019s] (Only runs on cpu) [ 3%] 2025-12-04T14:34:54.5729144Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_5x7_cuda_complex64 SKIPPED [0.0014s] (Only runs on cpu) [ 3%] 2025-12-04T14:34:54.5730207Z test_sparse_csr.py::TestSparseCSRCUDA::test_addmv_shape_5x7_cuda_float32 SKIPPED [0.0015s] (Only runs on cpu) [ 3%] 2025-12-04T14:34:54.5731272Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_abs_cuda_float64 PASSED [0.3105s] [ 3%] 2025-12-04T14:34:54.5732628Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_angle_cuda_complex128 SKIPPED [0.0030s] (Skipped! Unary op angle not supported with CSR input and autograd) [ 3%] 2025-12-04T14:34:54.5734280Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_angle_cuda_float64 SKIPPED [0.0029s] (Skipped! Unary op angle not supported with CSR input and autograd) [ 3%] 2025-12-04T14:34:54.5735923Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_asin_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op asin not supported with CSR input and autograd) [ 3%] 2025-12-04T14:34:54.5737651Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_asinh_cuda_float64 SKIPPED [0.0028s] (Skipped! Unary op asinh not supported with CSR input and autograd) [ 3%] 2025-12-04T14:34:54.5739425Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_atan_cuda_complex128 SKIPPED [0.0026s] (Skipped! Unary op atan not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5741095Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_atanh_cuda_complex128 SKIPPED [0.0026s] (Skipped! Unary op atanh not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5742728Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_ceil_cuda_float64 SKIPPED [0.0028s] (Skipped! Unary op ceil not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5744332Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_erf_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op erf not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5745962Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_erfinv_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op erfinv not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5747635Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_expm1_cuda_complex128 SKIPPED [0.0029s] (Skipped! Unary op expm1 not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5749306Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_isnan_cuda_complex128 SKIPPED [0.0026s] (Skipped! Unary op isnan not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5750984Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_isneginf_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op isneginf not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5752349Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_neg_cuda_float64 PASSED [0.2296s] [ 4%] 2025-12-04T14:34:54.5753402Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_positive_cuda_complex128 PASSED [0.0091s] [ 4%] 2025-12-04T14:34:54.5754478Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_positive_cuda_float64 PASSED [0.2287s] [ 4%] 2025-12-04T14:34:54.5755513Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_rad2deg_cuda_float64 PASSED [0.0092s] [ 4%] 2025-12-04T14:34:54.5756850Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sgn_cuda_complex128 SKIPPED [0.0027s] (Skipped! Unary op sgn not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5758460Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sgn_cuda_float64 SKIPPED [0.0029s] (Skipped! Unary op sgn not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5760106Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_signbit_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op signbit not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5761778Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sin_cuda_complex128 SKIPPED [0.0026s] (Skipped! Unary op sin not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5763417Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sin_cuda_float64 SKIPPED [0.0029s] (Skipped! Unary op sin not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5765028Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_sinh_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op sinh not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5766662Z test_sparse_csr.py::TestSparseCSRCUDA::test_autograd_sparse_csr_unary_trunc_cuda_float64 SKIPPED [0.0026s] (Skipped! Unary op trunc not supported with CSR input and autograd) [ 4%] 2025-12-04T14:34:54.5767948Z test_sparse_csr.py::TestSparseCSRCUDA::test_baddbmm_cuda_complex128 PASSED [0.0212s] [ 4%] 2025-12-04T14:34:54.5768790Z test_sparse_csr.py::TestSparseCSRCUDA::test_baddbmm_cuda_float32 PASSED [0.0172s] [ 4%] 2025-12-04T14:34:54.5769597Z test_sparse_csr.py::TestSparseCSRCUDA::test_baddbmm_cuda_float64 PASSED [0.0171s] [ 4%] 2025-12-04T14:34:54.5770636Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_False_cuda_float32 PASSED [0.9482s] [ 4%] 2025-12-04T14:34:54.5771842Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_False_cuda_float64 PASSED [0.0892s] [ 4%] 2025-12-04T14:34:54.5773042Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_True_cuda_complex128 PASSED [0.0994s] [ 5%] 2025-12-04T14:34:54.5774257Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_True_cuda_complex64 PASSED [0.0996s] [ 5%] 2025-12-04T14:34:54.5775455Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int32_noncontiguous_True_cuda_float16 PASSED [0.1045s] [ 5%] 2025-12-04T14:34:54.5776648Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_False_cuda_float32 PASSED [0.1046s] [ 5%] 2025-12-04T14:34:54.5777843Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_False_cuda_float64 PASSED [0.0938s] [ 5%] 2025-12-04T14:34:54.5779104Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_True_cuda_complex128 PASSED [0.1036s] [ 5%] 2025-12-04T14:34:54.5780309Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_True_cuda_float16 PASSED [0.1034s] [ 5%] 2025-12-04T14:34:54.5781500Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_2_int64_noncontiguous_True_cuda_float32 PASSED [0.1082s] [ 5%] 2025-12-04T14:34:54.5782705Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int32_noncontiguous_False_cuda_complex128 PASSED [0.0960s] [ 5%] 2025-12-04T14:34:54.5783907Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int32_noncontiguous_False_cuda_float16 PASSED [0.1020s] [ 5%] 2025-12-04T14:34:54.5785105Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int32_noncontiguous_False_cuda_float64 PASSED [0.0900s] [ 5%] 2025-12-04T14:34:54.5786318Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_False_cuda_complex128 PASSED [0.0997s] [ 5%] 2025-12-04T14:34:54.5787534Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_False_cuda_complex64 PASSED [0.1002s] [ 5%] 2025-12-04T14:34:54.5788877Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_False_cuda_float16 PASSED [0.1021s] [ 5%] 2025-12-04T14:34:54.5790087Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_complex128 PASSED [0.1046s] [ 5%] 2025-12-04T14:34:54.5791307Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_complex64 PASSED [0.1040s] [ 5%] 2025-12-04T14:34:54.5792508Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_float16 PASSED [0.1030s] [ 5%] 2025-12-04T14:34:54.5793754Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_float32 PASSED [0.1082s] [ 5%] 2025-12-04T14:34:54.5794939Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmm_block_size_3_int64_noncontiguous_True_cuda_float64 PASSED [0.0979s] [ 5%] 2025-12-04T14:34:54.5796144Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_False_cuda_complex128 PASSED [0.0142s] [ 5%] 2025-12-04T14:34:54.5797367Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_False_cuda_complex64 PASSED [0.0140s] [ 5%] 2025-12-04T14:34:54.5798634Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_False_cuda_float32 PASSED [0.0135s] [ 5%] 2025-12-04T14:34:54.5799873Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int32_noncontiguous_True_cuda_float32 PASSED [0.0107s] [ 5%] 2025-12-04T14:34:54.5801126Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int64_noncontiguous_False_cuda_complex128 PASSED [0.0135s] [ 5%] 2025-12-04T14:34:54.5802331Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int64_noncontiguous_False_cuda_float64 PASSED [0.0138s] [ 6%] 2025-12-04T14:34:54.5803516Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_2_int64_noncontiguous_True_cuda_complex64 PASSED [0.0111s] [ 6%] 2025-12-04T14:34:54.5804723Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int32_noncontiguous_False_cuda_complex128 PASSED [0.0157s] [ 6%] 2025-12-04T14:34:54.5805924Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int32_noncontiguous_False_cuda_float32 PASSED [0.0153s] [ 6%] 2025-12-04T14:34:54.5807131Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int32_noncontiguous_True_cuda_complex128 PASSED [0.0109s] [ 6%] 2025-12-04T14:34:54.5808348Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_False_cuda_complex128 PASSED [0.0151s] [ 6%] 2025-12-04T14:34:54.5809549Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_False_cuda_float32 PASSED [0.0147s] [ 6%] 2025-12-04T14:34:54.5810740Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_True_cuda_complex64 PASSED [0.0117s] [ 6%] 2025-12-04T14:34:54.5811926Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_addmv_block_size_3_int64_noncontiguous_True_cuda_float32 PASSED [0.0104s] [ 6%] 2025-12-04T14:34:54.5813181Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_False_cuda_complex128 PASSED [0.0947s] [ 6%] 2025-12-04T14:34:54.5814503Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_False_cuda_complex64 PASSED [0.0808s] [ 6%] 2025-12-04T14:34:54.5815811Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_False_cuda_float32 PASSED [0.0783s] [ 6%] 2025-12-04T14:34:54.5817118Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int32_noncontiguous_True_cuda_complex64 PASSED [0.0743s] [ 6%] 2025-12-04T14:34:54.5818503Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_complex128 PASSED [0.0752s] [ 6%] 2025-12-04T14:34:54.5819831Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_complex64 PASSED [0.0695s] [ 6%] 2025-12-04T14:34:54.5821144Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_float32 PASSED [0.0655s] [ 6%] 2025-12-04T14:34:54.5822456Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_False_cuda_float64 PASSED [0.0661s] [ 6%] 2025-12-04T14:34:54.5823770Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_True_cuda_complex128 PASSED [0.0655s] [ 6%] 2025-12-04T14:34:54.5825124Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_2_int64_noncontiguous_True_cuda_complex64 PASSED [0.0654s] [ 6%] 2025-12-04T14:34:54.5826455Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int32_noncontiguous_False_cuda_complex128 PASSED [0.0675s] [ 6%] 2025-12-04T14:34:54.5827786Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int32_noncontiguous_False_cuda_complex64 PASSED [0.0679s] [ 6%] 2025-12-04T14:34:54.5829107Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int32_noncontiguous_True_cuda_complex64 PASSED [0.0643s] [ 6%] 2025-12-04T14:34:54.5830491Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int64_noncontiguous_False_cuda_float32 PASSED [0.0672s] [ 6%] 2025-12-04T14:34:54.5831827Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int64_noncontiguous_True_cuda_complex128 PASSED [0.0654s] [ 6%] 2025-12-04T14:34:54.5833146Z test_sparse_csr.py::TestSparseCSRCUDA::test_block_triangular_solve_block_size_3_int64_noncontiguous_True_cuda_float64 PASSED [0.0640s] [ 7%] 2025-12-04T14:34:54.5834183Z test_sparse_csr.py::TestSparseCSRCUDA::test_bmm_cuda_complex64 PASSED [0.0184s] [ 7%] 2025-12-04T14:34:54.5834957Z test_sparse_csr.py::TestSparseCSRCUDA::test_bmm_cuda_float32 PASSED [0.0168s] [ 7%] 2025-12-04T14:34:54.5835956Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSC_SparseBSC_cuda PASSED [0.0072s] [ 7%] 2025-12-04T14:34:54.5837202Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSC_SparseCSR_cuda PASSED [0.0047s] [ 7%] 2025-12-04T14:34:54.5838444Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSR_SparseBSR_cuda PASSED [0.0062s] [ 7%] 2025-12-04T14:34:54.5839680Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseBSR_SparseCSC_cuda PASSED [0.0048s] [ 7%] 2025-12-04T14:34:54.5840911Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSC_SparseCSC_cuda PASSED [0.0060s] [ 7%] 2025-12-04T14:34:54.5842150Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSR_SparseBSC_cuda PASSED [0.0039s] [ 7%] 2025-12-04T14:34:54.5843388Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSR_SparseCSC_cuda PASSED [0.0069s] [ 7%] 2025-12-04T14:34:54.5844629Z test_sparse_csr.py::TestSparseCSRCUDA::test_compressed_layout_conversions_coverage_SparseCSR_SparseCSR_cuda PASSED [0.0052s] [ 7%] 2025-12-04T14:34:54.5845721Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_complex128 PASSED [0.0073s] [ 7%] 2025-12-04T14:34:54.5846641Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_complex64 PASSED [0.0072s] [ 7%] 2025-12-04T14:34:54.5847535Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_float32 PASSED [0.0073s] [ 7%] 2025-12-04T14:34:54.5848424Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_float64 PASSED [0.0069s] [ 7%] 2025-12-04T14:34:54.5849321Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int16 PASSED [0.0059s] [ 7%] 2025-12-04T14:34:54.5850184Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int32 PASSED [0.0058s] [ 7%] 2025-12-04T14:34:54.5851062Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int64 PASSED [0.0058s] [ 7%] 2025-12-04T14:34:54.5851929Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_csr_conversion_cuda_int8 PASSED [0.0058s] [ 7%] 2025-12-04T14:34:54.5852806Z test_sparse_csr.py::TestSparseCSRCUDA::test_coo_to_csr_convert_cuda_float64 PASSED [0.0100s] [ 7%] 2025-12-04T14:34:54.5853692Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_bfloat16 PASSED [0.0065s] [ 7%] 2025-12-04T14:34:54.5854637Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_float16 PASSED [0.0064s] [ 7%] 2025-12-04T14:34:54.5855532Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_float64 PASSED [0.0064s] [ 7%] 2025-12-04T14:34:54.5856414Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_int16 PASSED [0.0053s] [ 7%] 2025-12-04T14:34:54.5857280Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_coo_conversion_cuda_int8 PASSED [0.0053s] [ 8%] 2025-12-04T14:34:54.5858192Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_matvec_cuda_complex64 PASSED [0.0365s] [ 8%] 2025-12-04T14:34:54.5859136Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_matvec_cuda_float32 PASSED [0.0349s] [ 8%] 2025-12-04T14:34:54.5860076Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_matvec_cuda_float64 PASSED [0.0345s] [ 8%] 2025-12-04T14:34:54.5861487Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_storage_cuda PASSED [0.0031s] [ 8%] 2025-12-04T14:34:54.5862520Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_stride_cuda PASSED [0.0030s] [ 8%] 2025-12-04T14:34:54.5865833Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_to_block_csr_blocksize_2_cuda_float64_int32 PASSED [0.0170s] [ 8%] 2025-12-04T14:34:54.5866868Z test_sparse_csr.py::TestSparseCSRCUDA::test_csr_to_block_csr_blocksize_4_cuda_float64_int64 PASSED [0.0260s] [ 8%] 2025-12-04T14:34:54.5868000Z test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSC_NonBatched_Hybrid_cuda PASSED [0.0281s] [ 8%] 2025-12-04T14:34:54.5869254Z test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSC_NonBatched_NonHybrid_cuda PASSED [0.0196s] [ 8%] 2025-12-04T14:34:54.5870511Z test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSR_Batched_NonHybrid_cuda PASSED [0.2665s] [ 8%] 2025-12-04T14:34:54.5871743Z test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSR_NonBatched_Hybrid_cuda PASSED [0.0273s] [ 8%] 2025-12-04T14:34:54.5872984Z test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseBSR_NonBatched_NonHybrid_cuda PASSED [0.0194s] [ 8%] 2025-12-04T14:34:54.5874246Z test_sparse_csr.py::TestSparseCSRCUDA::test_dense_to_from_sparse_compressed_SparseCSC_NonBatched_NonHybrid_cuda PASSED [0.0073s] [ 8%] 2025-12-04T14:34:54.5875372Z test_sparse_csr.py::TestSparseCSRCUDA::test_direct_coo_csr_conversion_cuda_complex128 PASSED [0.0074s] [ 8%] 2025-12-04T14:34:54.5876340Z test_sparse_csr.py::TestSparseCSRCUDA::test_direct_coo_csr_conversion_cuda_int16 PASSED [0.0058s] [ 8%] 2025-12-04T14:34:54.5877267Z test_sparse_csr.py::TestSparseCSRCUDA::test_direct_coo_csr_conversion_cuda_uint8 PASSED [0.0063s] [ 8%] 2025-12-04T14:34:54.5878173Z test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_float16 PASSED [0.0046s] [ 8%] 2025-12-04T14:34:54.5879039Z test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_int16 PASSED [0.0041s] [ 8%] 2025-12-04T14:34:54.5879899Z test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_int32 PASSED [0.0041s] [ 8%] 2025-12-04T14:34:54.5880738Z test_sparse_csr.py::TestSparseCSRCUDA::test_exercise_detach_cuda_int8 PASSED [0.0041s] [ 8%] 2025-12-04T14:34:54.5881632Z test_sparse_csr.py::TestSparseCSRCUDA::test_matmul_device_mismatch_cuda_float64 PASSED [0.0044s] [ 8%] 2025-12-04T14:34:54.5882512Z test_sparse_csr.py::TestSparseCSRCUDA::test_mm_errors_cuda_float32 PASSED [0.0028s] [ 8%] 2025-12-04T14:34:54.5883301Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_cuda_float32 PASSED [0.2693s] [ 8%] 2025-12-04T14:34:54.5884218Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_bool PASSED [0.6608s] [ 9%] 2025-12-04T14:34:54.5885340Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_complex128 PASSED [1.3910s] [ 9%] 2025-12-04T14:34:54.5886486Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_complex64 PASSED [1.3898s] [ 9%] 2025-12-04T14:34:54.5887672Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_float16 PASSED [1.1000s] [ 9%] 2025-12-04T14:34:54.5888934Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_float32 PASSED [1.2191s] [ 9%] 2025-12-04T14:34:54.5890048Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_float64 PASSED [1.2138s] [ 9%] 2025-12-04T14:34:54.5891145Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_int32 PASSED [0.8575s] [ 9%] 2025-12-04T14:34:54.5892233Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_int64 PASSED [0.8615s] [ 9%] 2025-12-04T14:34:54.5893430Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_int8 PASSED [0.8623s] [ 9%] 2025-12-04T14:34:54.5894518Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSC_cuda_uint8 PASSED [0.8644s] [ 9%] 2025-12-04T14:34:54.5895677Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_complex128 PASSED [1.3884s] [ 9%] 2025-12-04T14:34:54.5896818Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_complex64 PASSED [1.3858s] [ 9%] 2025-12-04T14:34:54.5897926Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_float32 PASSED [1.2195s] [ 9%] 2025-12-04T14:34:54.5899111Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_float64 PASSED [1.2084s] [ 9%] 2025-12-04T14:34:54.5900215Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseBSR_cuda_int8 PASSED [0.8584s] [ 9%] 2025-12-04T14:34:54.5901330Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_bfloat16 PASSED [1.1961s] [ 9%] 2025-12-04T14:34:54.5902415Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_bool PASSED [0.6558s] [ 9%] 2025-12-04T14:34:54.5903539Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_complex128 PASSED [1.3622s] [ 9%] 2025-12-04T14:34:54.5904684Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_complex64 PASSED [1.3600s] [ 9%] 2025-12-04T14:34:54.5905810Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_float16 PASSED [1.0786s] [ 9%] 2025-12-04T14:34:54.5906906Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_float64 PASSED [1.1886s] [ 9%] 2025-12-04T14:34:54.5908008Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_int16 PASSED [0.8403s] [ 9%] 2025-12-04T14:34:54.5909098Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_int64 PASSED [0.8404s] [ 9%] 2025-12-04T14:34:54.5910183Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSC_cuda_int8 PASSED [0.8429s] [ 9%] 2025-12-04T14:34:54.5911273Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_bfloat16 PASSED [1.1901s] [ 10%] 2025-12-04T14:34:54.5912374Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_bool PASSED [0.6533s] [ 10%] 2025-12-04T14:34:54.5913479Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_complex64 PASSED [1.3545s] [ 10%] 2025-12-04T14:34:54.5914587Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_int16 PASSED [0.8396s] [ 10%] 2025-12-04T14:34:54.5915666Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_int64 PASSED [0.8401s] [ 10%] 2025-12-04T14:34:54.5916750Z test_sparse_csr.py::TestSparseCSRCUDA::test_mul_scalar_enable_hybrid_False_SparseCSR_cuda_int8 PASSED [0.8408s] [ 10%] 2025-12-04T14:34:54.5917822Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSC_cuda_bool PASSED [0.0313s] [ 10%] 2025-12-04T14:34:54.5918954Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSC_cuda_float32 PASSED [0.0296s] [ 10%] 2025-12-04T14:34:54.5920017Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSR_cuda_bool PASSED [0.0268s] [ 10%] 2025-12-04T14:34:54.5921091Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseBSR_cuda_float32 PASSED [0.0270s] [ 10%] 2025-12-04T14:34:54.5922162Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseCSR_cuda_bool PASSED [0.0579s] [ 10%] 2025-12-04T14:34:54.5923279Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_as_sparse_compressed_SparseCSR_cuda_float32 PASSED [0.0583s] [ 10%] 2025-12-04T14:34:54.5924244Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_bfloat16 PASSED [0.0052s] [ 10%] 2025-12-04T14:34:54.5925061Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_bool PASSED [0.0050s] [ 10%] 2025-12-04T14:34:54.5925840Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_float16 PASSED [0.0050s] [ 10%] 2025-12-04T14:34:54.5926609Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_int8 PASSED [0.0049s] [ 10%] 2025-12-04T14:34:54.5927380Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_cuda_uint8 PASSED [0.0048s] [ 10%] 2025-12-04T14:34:54.5928199Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_bfloat16 PASSED [0.0045s] [ 10%] 2025-12-04T14:34:54.5929067Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_complex64 PASSED [0.0035s] [ 10%] 2025-12-04T14:34:54.5929917Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_float16 PASSED [0.0036s] [ 10%] 2025-12-04T14:34:54.5930771Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_float32 PASSED [0.0035s] [ 10%] 2025-12-04T14:34:54.5931613Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_float64 PASSED [0.0035s] [ 10%] 2025-12-04T14:34:54.5932459Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_int16 PASSED [0.0035s] [ 10%] 2025-12-04T14:34:54.5933281Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_int32 PASSED [0.0035s] [ 10%] 2025-12-04T14:34:54.5934115Z test_sparse_csr.py::TestSparseCSRCUDA::test_resize_errors_cuda_int8 PASSED [0.0040s] [ 11%] 2025-12-04T14:34:54.5935022Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_autograd_cuda_complex128 PASSED [0.0075s] [ 11%] 2025-12-04T14:34:54.5935983Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_autograd_cuda_float64 PASSED [0.0065s] [ 11%] 2025-12-04T14:34:54.5936887Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_cuda_complex128 PASSED [1.9680s] [ 11%] 2025-12-04T14:34:54.5937778Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_cuda_complex64 PASSED [1.9691s] [ 11%] 2025-12-04T14:34:54.5938722Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_cuda_float64 PASSED [1.8963s] [ 11%] 2025-12-04T14:34:54.5939617Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_errors_cuda_complex128 PASSED [0.0055s] [ 11%] 2025-12-04T14:34:54.5940553Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_errors_cuda_float64 PASSED [0.0038s] [ 11%] 2025-12-04T14:34:54.5941930Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_zero_sized_cuda_complex128 SKIPPED [0.0018s] (Causes CUDA memory exception, see https://github.com/pytorch/pytorch/issues/72177) [ 11%] 2025-12-04T14:34:54.5943707Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_zero_sized_cuda_float32 SKIPPED [0.0017s] (Causes CUDA memory exception, see https://github.com/pytorch/pytorch/issues/72177) [ 11%] 2025-12-04T14:34:54.5945464Z test_sparse_csr.py::TestSparseCSRCUDA::test_sampled_addmm_zero_sized_cuda_float64 SKIPPED [0.0016s] (Causes CUDA memory exception, see https://github.com/pytorch/pytorch/issues/72177) [ 11%] 2025-12-04T14:34:54.5946799Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_float16 PASSED [0.0121s] [ 11%] 2025-12-04T14:34:54.5947793Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_float32 PASSED [0.0121s] [ 11%] 2025-12-04T14:34:54.5948734Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_float64 PASSED [0.0115s] [ 11%] 2025-12-04T14:34:54.5949665Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int16 PASSED [0.0108s] [ 11%] 2025-12-04T14:34:54.5950577Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int32 PASSED [0.0107s] [ 11%] 2025-12-04T14:34:54.5951500Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int64 PASSED [0.0107s] [ 11%] 2025-12-04T14:34:54.5952455Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int32_cuda_int8 PASSED [0.0107s] [ 11%] 2025-12-04T14:34:54.5953425Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_bfloat16 PASSED [0.0111s] [ 11%] 2025-12-04T14:34:54.5954381Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_bool PASSED [0.0114s] [ 11%] 2025-12-04T14:34:54.5955330Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_complex128 PASSED [0.0113s] [ 11%] 2025-12-04T14:34:54.5956296Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_complex64 PASSED [0.0114s] [ 11%] 2025-12-04T14:34:54.5957234Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_float32 PASSED [0.0111s] [ 11%] 2025-12-04T14:34:54.5958170Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSC_int64_cuda_float64 PASSED [0.0111s] [ 11%] 2025-12-04T14:34:54.5959113Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_bfloat16 PASSED [0.0098s] [ 12%] 2025-12-04T14:34:54.5960052Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_bool PASSED [0.0096s] [ 12%] 2025-12-04T14:34:54.5960981Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_complex64 PASSED [0.0099s] [ 12%] 2025-12-04T14:34:54.5961933Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_float16 PASSED [0.0097s] [ 12%] 2025-12-04T14:34:54.5962870Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_float32 PASSED [0.0097s] [ 12%] 2025-12-04T14:34:54.5964030Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_float64 PASSED [0.0099s] [ 12%] 2025-12-04T14:34:54.5964962Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_int8 PASSED [0.0090s] [ 12%] 2025-12-04T14:34:54.5965880Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int32_cuda_uint8 PASSED [0.0096s] [ 12%] 2025-12-04T14:34:54.5966819Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_bfloat16 PASSED [0.0095s] [ 12%] 2025-12-04T14:34:54.5967758Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_bool PASSED [0.0088s] [ 12%] 2025-12-04T14:34:54.5968692Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_complex128 PASSED [0.0097s] [ 12%] 2025-12-04T14:34:54.5969664Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_complex64 PASSED [0.0095s] [ 12%] 2025-12-04T14:34:54.5970611Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_float32 PASSED [0.0093s] [ 12%] 2025-12-04T14:34:54.5971542Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_int16 PASSED [0.0086s] [ 12%] 2025-12-04T14:34:54.5972452Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseBSR_int64_cuda_uint8 PASSED [0.0093s] [ 12%] 2025-12-04T14:34:54.5973373Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int16 PASSED [0.0158s] [ 12%] 2025-12-04T14:34:54.5974294Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int32 PASSED [0.0159s] [ 12%] 2025-12-04T14:34:54.5975220Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int64 PASSED [0.0158s] [ 12%] 2025-12-04T14:34:54.5976129Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int32_cuda_int8 PASSED [0.0158s] [ 12%] 2025-12-04T14:34:54.5977151Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_bfloat16 PASSED [0.0161s] [ 12%] 2025-12-04T14:34:54.5978088Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_bool PASSED [0.0160s] [ 12%] 2025-12-04T14:34:54.5979102Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_complex128 PASSED [0.0163s] [ 12%] 2025-12-04T14:34:54.5980071Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_complex64 PASSED [0.0163s] [ 12%] 2025-12-04T14:34:54.5981026Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_float32 PASSED [0.0160s] [ 12%] 2025-12-04T14:34:54.5982012Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_float64 PASSED [0.0160s] [ 13%] 2025-12-04T14:34:54.5982962Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_int16 PASSED [0.0154s] [ 13%] 2025-12-04T14:34:54.5983920Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSC_int64_cuda_int32 PASSED [0.0160s] [ 13%] 2025-12-04T14:34:54.5984864Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_bfloat16 PASSED [0.0131s] [ 13%] 2025-12-04T14:34:54.5985799Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_bool PASSED [0.0124s] [ 13%] 2025-12-04T14:34:54.5986732Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_complex128 PASSED [0.0134s] [ 13%] 2025-12-04T14:34:54.5987694Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_complex64 PASSED [0.0135s] [ 13%] 2025-12-04T14:34:54.5988783Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_float16 PASSED [0.0131s] [ 13%] 2025-12-04T14:34:54.5989726Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_float32 PASSED [0.0129s] [ 13%] 2025-12-04T14:34:54.5990649Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_int16 PASSED [0.0128s] [ 13%] 2025-12-04T14:34:54.5991576Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_int32 PASSED [0.0122s] [ 13%] 2025-12-04T14:34:54.5992502Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_int8 PASSED [0.0123s] [ 13%] 2025-12-04T14:34:54.5993428Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int32_cuda_uint8 PASSED [0.0123s] [ 13%] 2025-12-04T14:34:54.5994351Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_bfloat16 PASSED [0.0127s] [ 13%] 2025-12-04T14:34:54.5995316Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_complex128 PASSED [0.0128s] [ 13%] 2025-12-04T14:34:54.5996283Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_complex64 PASSED [0.0133s] [ 13%] 2025-12-04T14:34:54.5997228Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_float16 PASSED [0.0126s] [ 13%] 2025-12-04T14:34:54.5998161Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_int32 PASSED [0.0118s] [ 13%] 2025-12-04T14:34:54.5999086Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_int64 PASSED [0.0117s] [ 13%] 2025-12-04T14:34:54.6000003Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_int8 PASSED [0.0117s] [ 13%] 2025-12-04T14:34:54.6000909Z test_sparse_csr.py::TestSparseCSRCUDA::test_select_SparseCSR_int64_cuda_uint8 PASSED [0.0118s] [ 13%] 2025-12-04T14:34:54.6001802Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_add_cuda_complex128 PASSED [0.4126s] [ 13%] 2025-12-04T14:34:54.6002660Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_add_cuda_complex64 PASSED [0.4102s] [ 13%] 2025-12-04T14:34:54.6003503Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_add_cuda_float32 PASSED [0.3940s] [ 13%] 2025-12-04T14:34:54.6004377Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_complex128 PASSED [0.0087s] [ 14%] 2025-12-04T14:34:54.6005303Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_complex64 PASSED [0.0084s] [ 14%] 2025-12-04T14:34:54.6006298Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_float64 PASSED [0.0079s] [ 14%] 2025-12-04T14:34:54.6007203Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_int16 PASSED [0.0073s] [ 14%] 2025-12-04T14:34:54.6008079Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csc_to_dense_cuda_uint8 PASSED [0.0066s] [ 14%] 2025-12-04T14:34:54.6008988Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_float16 PASSED [0.0038s] [ 14%] 2025-12-04T14:34:54.6009909Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_float64 PASSED [0.0037s] [ 14%] 2025-12-04T14:34:54.6010869Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_int16 PASSED [0.0035s] [ 14%] 2025-12-04T14:34:54.6011819Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_int32 PASSED [0.0035s] [ 14%] 2025-12-04T14:34:54.6012776Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_from_dense_cuda_uint8 PASSED [0.0035s] [ 14%] 2025-12-04T14:34:54.6013698Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_bfloat16 PASSED [0.0083s] [ 14%] 2025-12-04T14:34:54.6014608Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_complex128 PASSED [0.0080s] [ 14%] 2025-12-04T14:34:54.6015534Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_complex64 PASSED [0.0080s] [ 14%] 2025-12-04T14:34:54.6016449Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_float16 PASSED [0.0077s] [ 14%] 2025-12-04T14:34:54.6017344Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_int16 PASSED [0.0065s] [ 14%] 2025-12-04T14:34:54.6018339Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_to_dense_cuda_int64 PASSED [0.0064s] [ 14%] 2025-12-04T14:34:54.6019301Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_bfloat16 PASSED [0.2608s] [ 14%] 2025-12-04T14:34:54.6020320Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_complex128 PASSED [0.0075s] [ 14%] 2025-12-04T14:34:54.6021333Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_complex32 PASSED [0.2297s] [ 14%] 2025-12-04T14:34:54.6022350Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_complex64 PASSED [0.0074s] [ 14%] 2025-12-04T14:34:54.6023346Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_int16 PASSED [0.2277s] [ 14%] 2025-12-04T14:34:54.6024316Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_int8 PASSED [0.0075s] [ 14%] 2025-12-04T14:34:54.6025274Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_abs_cuda_uint8 PASSED [0.2306s] [ 14%] 2025-12-04T14:34:54.6026457Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_bool SKIPPED [0.0054s] (Skipped! Inplace variant not supported!) [ 14%] 2025-12-04T14:34:54.6027884Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_complex128 SKIPPED [0.0050s] (Skipped! Inplace variant not supported!) [ 15%] 2025-12-04T14:34:54.6029331Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_complex32 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 15%] 2025-12-04T14:34:54.6030759Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_float32 SKIPPED [0.0048s] (Skipped! Inplace variant not supported!) [ 15%] 2025-12-04T14:34:54.6032147Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_angle_cuda_uint8 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 15%] 2025-12-04T14:34:54.6033357Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_bfloat16 PASSED [0.0075s] [ 15%] 2025-12-04T14:34:54.6034390Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_complex128 PASSED [0.2294s] [ 15%] 2025-12-04T14:34:54.6035420Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_complex32 PASSED [0.0094s] [ 15%] 2025-12-04T14:34:54.6036472Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_complex64 PASSED [0.2270s] [ 15%] 2025-12-04T14:34:54.6037487Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_float32 PASSED [0.0078s] [ 15%] 2025-12-04T14:34:54.6038494Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_float64 PASSED [0.2299s] [ 15%] 2025-12-04T14:34:54.6039483Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asin_cuda_int16 PASSED [0.0074s] [ 15%] 2025-12-04T14:34:54.6040453Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_bool PASSED [0.2303s] [ 15%] 2025-12-04T14:34:54.6041536Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_complex128 PASSED [0.0078s] [ 15%] 2025-12-04T14:34:54.6042572Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_complex32 PASSED [0.2294s] [ 15%] 2025-12-04T14:34:54.6043622Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_complex64 PASSED [0.0079s] [ 15%] 2025-12-04T14:34:54.6044634Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_int16 PASSED [0.2286s] [ 15%] 2025-12-04T14:34:54.6045626Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_int64 PASSED [0.0074s] [ 15%] 2025-12-04T14:34:54.6046620Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_asinh_cuda_uint8 PASSED [0.2287s] [ 15%] 2025-12-04T14:34:54.6047587Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_bool PASSED [0.0073s] [ 15%] 2025-12-04T14:34:54.6048589Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_complex128 PASSED [0.2302s] [ 15%] 2025-12-04T14:34:54.6049626Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_complex64 PASSED [0.0099s] [ 15%] 2025-12-04T14:34:54.6050641Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_float16 PASSED [0.2303s] [ 15%] 2025-12-04T14:34:54.6051628Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_float32 PASSED [0.0077s] [ 15%] 2025-12-04T14:34:54.6052624Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_float64 PASSED [0.2327s] [ 15%] 2025-12-04T14:34:54.6053614Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_int16 PASSED [0.0074s] [ 16%] 2025-12-04T14:34:54.6054591Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atan_cuda_int64 PASSED [0.2341s] [ 16%] 2025-12-04T14:34:54.6055567Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_bool PASSED [0.0074s] [ 16%] 2025-12-04T14:34:54.6056579Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_complex128 PASSED [0.2350s] [ 16%] 2025-12-04T14:34:54.6057614Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_complex32 PASSED [0.0100s] [ 16%] 2025-12-04T14:34:54.6058735Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_complex64 PASSED [0.2321s] [ 16%] 2025-12-04T14:34:54.6059745Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_float16 PASSED [0.0078s] [ 16%] 2025-12-04T14:34:54.6060755Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_float32 PASSED [0.2303s] [ 16%] 2025-12-04T14:34:54.6061768Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_float64 PASSED [0.0078s] [ 16%] 2025-12-04T14:34:54.6062771Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_int64 PASSED [0.2287s] [ 16%] 2025-12-04T14:34:54.6063746Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_atanh_cuda_int8 PASSED [0.0074s] [ 16%] 2025-12-04T14:34:54.6064745Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_bfloat16 PASSED [0.2275s] [ 16%] 2025-12-04T14:34:54.6065794Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_float16 PASSED [0.0078s] [ 16%] 2025-12-04T14:34:54.6066787Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_int64 PASSED [0.2301s] [ 16%] 2025-12-04T14:34:54.6067748Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_int8 PASSED [0.0076s] [ 16%] 2025-12-04T14:34:54.6068730Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_ceil_cuda_uint8 PASSED [0.2292s] [ 16%] 2025-12-04T14:34:54.6069756Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_bool PASSED [0.0075s] [ 16%] 2025-12-04T14:34:54.6070919Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_complex128 PASSED [0.2296s] [ 16%] 2025-12-04T14:34:54.6072030Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_complex64 PASSED [0.0079s] [ 16%] 2025-12-04T14:34:54.6073186Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_float16 PASSED [0.2295s] [ 16%] 2025-12-04T14:34:54.6074283Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_float64 PASSED [0.0077s] [ 16%] 2025-12-04T14:34:54.6075366Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_int32 PASSED [0.2277s] [ 16%] 2025-12-04T14:34:54.6076419Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_conj_physical_cuda_uint8 PASSED [0.0075s] [ 16%] 2025-12-04T14:34:54.6077467Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_float32 PASSED [0.2283s] [ 16%] 2025-12-04T14:34:54.6078486Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_int32 PASSED [0.0075s] [ 17%] 2025-12-04T14:34:54.6079497Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_int64 PASSED [0.2275s] [ 17%] 2025-12-04T14:34:54.6080503Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_deg2rad_cuda_uint8 PASSED [0.0074s] [ 17%] 2025-12-04T14:34:54.6081512Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_float16 PASSED [0.2288s] [ 17%] 2025-12-04T14:34:54.6082508Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_float64 PASSED [0.0078s] [ 17%] 2025-12-04T14:34:54.6083486Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_int64 PASSED [0.2285s] [ 17%] 2025-12-04T14:34:54.6084437Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erf_cuda_int8 PASSED [0.0074s] [ 17%] 2025-12-04T14:34:54.6109108Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_bfloat16 PASSED [0.2292s] [ 17%] 2025-12-04T14:34:54.6110280Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_bool PASSED [0.0074s] [ 17%] 2025-12-04T14:34:54.6111290Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_float32 PASSED [0.2298s] [ 17%] 2025-12-04T14:34:54.6112296Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_erfinv_cuda_uint8 PASSED [0.0073s] [ 17%] 2025-12-04T14:34:54.6113271Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_bool PASSED [0.2326s] [ 17%] 2025-12-04T14:34:54.6114282Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_complex128 PASSED [0.0079s] [ 17%] 2025-12-04T14:34:54.6115302Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_float16 PASSED [0.2291s] [ 17%] 2025-12-04T14:34:54.6116308Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_int32 PASSED [0.0074s] [ 17%] 2025-12-04T14:34:54.6117279Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_int8 PASSED [0.2298s] [ 17%] 2025-12-04T14:34:54.6118255Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_expm1_cuda_uint8 PASSED [0.0073s] [ 17%] 2025-12-04T14:34:54.6119352Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_int16 PASSED [0.2350s] [ 17%] 2025-12-04T14:34:54.6120319Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_int32 PASSED [0.0076s] [ 17%] 2025-12-04T14:34:54.6121276Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_int64 PASSED [0.2315s] [ 17%] 2025-12-04T14:34:54.6122248Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_floor_cuda_uint8 PASSED [0.0075s] [ 17%] 2025-12-04T14:34:54.6123235Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_frac_cuda_float16 PASSED [0.2279s] [ 17%] 2025-12-04T14:34:54.6124334Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_frac_cuda_float32 PASSED [0.0078s] [ 17%] 2025-12-04T14:34:54.6125318Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_frac_cuda_float64 PASSED [0.2278s] [ 17%] 2025-12-04T14:34:54.6126577Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_bfloat16 SKIPPED [0.0054s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6128002Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_complex32 SKIPPED [0.0050s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6129426Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_complex64 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6130827Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_float16 SKIPPED [0.0048s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6132228Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_float32 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6133625Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_float64 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6135017Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_int32 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6136397Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isinf_cuda_int64 SKIPPED [0.0052s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6137776Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_bfloat16 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6139285Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_bool SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6140689Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_complex128 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6142101Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_float32 SKIPPED [0.0052s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6143476Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_int16 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6144852Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isnan_cuda_int64 SKIPPED [0.0052s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6146251Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_bfloat16 SKIPPED [0.0048s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6147682Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_bool SKIPPED [0.0052s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6149085Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_float16 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6150556Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_float32 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6151972Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isneginf_cuda_float64 SKIPPED [0.0048s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6153406Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_bfloat16 SKIPPED [0.0052s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6154844Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_float16 SKIPPED [0.0048s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6156322Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_int32 SKIPPED [0.0056s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6157767Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_isposinf_cuda_uint8 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 18%] 2025-12-04T14:34:54.6158980Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_bfloat16 PASSED [0.2342s] [ 18%] 2025-12-04T14:34:54.6159977Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_bool PASSED [0.0073s] [ 19%] 2025-12-04T14:34:54.6160975Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_complex128 PASSED [0.2304s] [ 19%] 2025-12-04T14:34:54.6162000Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_float64 PASSED [0.0077s] [ 19%] 2025-12-04T14:34:54.6163003Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_int16 PASSED [0.2280s] [ 19%] 2025-12-04T14:34:54.6163994Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_log1p_cuda_int64 PASSED [0.0072s] [ 19%] 2025-12-04T14:34:54.6164976Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_bfloat16 PASSED [0.2297s] [ 19%] 2025-12-04T14:34:54.6165978Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_complex128 PASSED [0.0093s] [ 19%] 2025-12-04T14:34:54.6166978Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_float16 PASSED [0.2290s] [ 19%] 2025-12-04T14:34:54.6167962Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_float32 PASSED [0.0076s] [ 19%] 2025-12-04T14:34:54.6168934Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_float64 PASSED [0.2308s] [ 19%] 2025-12-04T14:34:54.6169907Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_neg_cuda_int32 PASSED [0.0075s] [ 19%] 2025-12-04T14:34:54.6171164Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_nn_functional_relu_cuda_bfloat16 SKIPPED [0.0050s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6172672Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_complex128 SKIPPED [0.0052s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6174126Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_complex64 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6175557Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_int16 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6176971Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_int32 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6178449Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_int8 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6179861Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_positive_cuda_uint8 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 19%] 2025-12-04T14:34:54.6181132Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_bfloat16 PASSED [0.2347s] [ 19%] 2025-12-04T14:34:54.6182157Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_float16 PASSED [0.0077s] [ 19%] 2025-12-04T14:34:54.6183176Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_float32 PASSED [0.2315s] [ 19%] 2025-12-04T14:34:54.6184186Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_rad2deg_cuda_int32 PASSED [0.0074s] [ 19%] 2025-12-04T14:34:54.6185210Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_float16 PASSED [0.2325s] [ 19%] 2025-12-04T14:34:54.6186259Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_float32 PASSED [0.0078s] [ 19%] 2025-12-04T14:34:54.6187283Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_int16 PASSED [0.2382s] [ 20%] 2025-12-04T14:34:54.6188441Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_int32 PASSED [0.0075s] [ 20%] 2025-12-04T14:34:54.6189415Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_round_cuda_uint8 PASSED [0.2323s] [ 20%] 2025-12-04T14:34:54.6190382Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_bool PASSED [0.0075s] [ 20%] 2025-12-04T14:34:54.6191354Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_complex64 PASSED [0.2319s] [ 20%] 2025-12-04T14:34:54.6192337Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_float16 PASSED [0.0078s] [ 20%] 2025-12-04T14:34:54.6193320Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_float32 PASSED [0.2295s] [ 20%] 2025-12-04T14:34:54.6194291Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int16 PASSED [0.0075s] [ 20%] 2025-12-04T14:34:54.6195247Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int32 PASSED [0.2294s] [ 20%] 2025-12-04T14:34:54.6196195Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int64 PASSED [0.0075s] [ 20%] 2025-12-04T14:34:54.6197145Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sgn_cuda_int8 PASSED [0.2290s] [ 20%] 2025-12-04T14:34:54.6198117Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sign_cuda_float16 PASSED [0.0078s] [ 20%] 2025-12-04T14:34:54.6199093Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sign_cuda_int8 PASSED [0.2287s] [ 20%] 2025-12-04T14:34:54.6200270Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_bool SKIPPED [0.0054s] (Skipped! Inplace variant not supported!) [ 20%] 2025-12-04T14:34:54.6201676Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_float32 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 20%] 2025-12-04T14:34:54.6203088Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_int16 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 20%] 2025-12-04T14:34:54.6204487Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_int64 SKIPPED [0.0049s] (Skipped! Inplace variant not supported!) [ 20%] 2025-12-04T14:34:54.6205885Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_signbit_cuda_uint8 SKIPPED [0.0051s] (Skipped! Inplace variant not supported!) [ 20%] 2025-12-04T14:34:54.6207078Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sin_cuda_complex64 PASSED [0.0094s] [ 20%] 2025-12-04T14:34:54.6208083Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sin_cuda_float16 PASSED [0.2338s] [ 20%] 2025-12-04T14:34:54.6209065Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sin_cuda_int32 PASSED [0.0074s] [ 20%] 2025-12-04T14:34:54.6210112Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_bool PASSED [0.2327s] [ 20%] 2025-12-04T14:34:54.6211083Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_int16 PASSED [0.0073s] [ 20%] 2025-12-04T14:34:54.6212067Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_int32 PASSED [0.2277s] [ 20%] 2025-12-04T14:34:54.6213043Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sinh_cuda_int8 PASSED [0.0073s] [ 21%] 2025-12-04T14:34:54.6214017Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_bool PASSED [0.2280s] [ 21%] 2025-12-04T14:34:54.6215058Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_complex128 PASSED [0.0095s] [ 21%] 2025-12-04T14:34:54.6216126Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_complex32 PASSED [0.2339s] [ 21%] 2025-12-04T14:34:54.6217200Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_complex64 PASSED [0.0097s] [ 21%] 2025-12-04T14:34:54.6218266Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_float32 PASSED [0.2313s] [ 21%] 2025-12-04T14:34:54.6219276Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_float64 PASSED [0.0078s] [ 21%] 2025-12-04T14:34:54.6220273Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_int16 PASSED [0.2282s] [ 21%] 2025-12-04T14:34:54.6221260Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_sqrt_cuda_int32 PASSED [0.0073s] [ 21%] 2025-12-04T14:34:54.6222237Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_float16 PASSED [0.2296s] [ 21%] 2025-12-04T14:34:54.6223228Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_int32 PASSED [0.0074s] [ 21%] 2025-12-04T14:34:54.6224201Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_int64 PASSED [0.2290s] [ 21%] 2025-12-04T14:34:54.6225178Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tan_cuda_uint8 PASSED [0.0073s] [ 21%] 2025-12-04T14:34:54.6226153Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_bfloat16 PASSED [0.2322s] [ 21%] 2025-12-04T14:34:54.6227169Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_complex32 PASSED [0.0079s] [ 21%] 2025-12-04T14:34:54.6228193Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_complex64 PASSED [0.2434s] [ 21%] 2025-12-04T14:34:54.6229202Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_float32 PASSED [0.0078s] [ 21%] 2025-12-04T14:34:54.6230184Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int16 PASSED [0.2308s] [ 21%] 2025-12-04T14:34:54.6231169Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int32 PASSED [0.0073s] [ 21%] 2025-12-04T14:34:54.6232154Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int64 PASSED [0.2299s] [ 21%] 2025-12-04T14:34:54.6233133Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_tanh_cuda_int8 PASSED [0.0073s] [ 21%] 2025-12-04T14:34:54.6234115Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_bfloat16 PASSED [0.2334s] [ 21%] 2025-12-04T14:34:54.6235131Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_float64 PASSED [0.0079s] [ 21%] 2025-12-04T14:34:54.6236134Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_int64 PASSED [0.2310s] [ 21%] 2025-12-04T14:34:54.6237132Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_inplace_trunc_cuda_uint8 PASSED [0.0075s] [ 22%] 2025-12-04T14:34:54.6238112Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_complex128 PASSED [0.2359s] [ 22%] 2025-12-04T14:34:54.6239077Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_complex32 PASSED [0.0121s] [ 22%] 2025-12-04T14:34:54.6240070Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_int64 PASSED [0.2322s] [ 22%] 2025-12-04T14:34:54.6241002Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_abs_cuda_int8 PASSED [0.0119s] [ 22%] 2025-12-04T14:34:54.6241924Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_bool PASSED [0.2367s] [ 22%] 2025-12-04T14:34:54.6242895Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_complex32 PASSED [0.0137s] [ 22%] 2025-12-04T14:34:54.6243873Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_float32 PASSED [0.2362s] [ 22%] 2025-12-04T14:34:54.6244871Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_angle_cuda_int64 PASSED [0.0122s] [ 22%] 2025-12-04T14:34:54.6245850Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_bfloat16 PASSED [0.2363s] [ 22%] 2025-12-04T14:34:54.6246838Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_bool PASSED [0.0107s] [ 22%] 2025-12-04T14:34:54.6247817Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_complex128 PASSED [0.2391s] [ 22%] 2025-12-04T14:34:54.6248792Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_complex32 PASSED [0.0124s] [ 22%] 2025-12-04T14:34:54.6249770Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_complex64 PASSED [0.2352s] [ 22%] 2025-12-04T14:34:54.6250734Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_float64 PASSED [0.0121s] [ 22%] 2025-12-04T14:34:54.6251685Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_int32 PASSED [0.2337s] [ 22%] 2025-12-04T14:34:54.6252617Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asin_cuda_int8 PASSED [0.0121s] [ 22%] 2025-12-04T14:34:54.6253558Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_bool PASSED [0.2350s] [ 22%] 2025-12-04T14:34:54.6254526Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_complex64 PASSED [0.0122s] [ 22%] 2025-12-04T14:34:54.6255500Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_float16 PASSED [0.2341s] [ 22%] 2025-12-04T14:34:54.6256450Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_float64 PASSED [0.0122s] [ 22%] 2025-12-04T14:34:54.6257406Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_asinh_cuda_int64 PASSED [0.2340s] [ 22%] 2025-12-04T14:34:54.6258425Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_bfloat16 PASSED [0.0122s] [ 22%] 2025-12-04T14:34:54.6259420Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_complex128 PASSED [0.2344s] [ 22%] 2025-12-04T14:34:54.6260393Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_complex32 PASSED [0.0142s] [ 23%] 2025-12-04T14:34:54.6261358Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_int16 PASSED [0.2333s] [ 23%] 2025-12-04T14:34:54.6262306Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atan_cuda_uint8 PASSED [0.0121s] [ 23%] 2025-12-04T14:34:54.6263250Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_bool PASSED [0.2377s] [ 23%] 2025-12-04T14:34:54.6264181Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int16 PASSED [0.0120s] [ 23%] 2025-12-04T14:34:54.6265132Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int32 PASSED [0.2383s] [ 23%] 2025-12-04T14:34:54.6266079Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int64 PASSED [0.0120s] [ 23%] 2025-12-04T14:34:54.6267031Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_atanh_cuda_int8 PASSED [0.2341s] [ 23%] 2025-12-04T14:34:54.6267970Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_ceil_cuda_bfloat16 PASSED [0.0120s] [ 23%] 2025-12-04T14:34:54.6268972Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_ceil_cuda_int64 PASSED [0.2330s] [ 23%] 2025-12-04T14:34:54.6269911Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_ceil_cuda_int8 PASSED [0.0119s] [ 23%] 2025-12-04T14:34:54.6270914Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_bfloat16 PASSED [0.2339s] [ 23%] 2025-12-04T14:34:54.6271932Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_bool PASSED [0.0117s] [ 23%] 2025-12-04T14:34:54.6272964Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_float64 PASSED [0.2391s] [ 23%] 2025-12-04T14:34:54.6274035Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_int32 PASSED [0.0120s] [ 23%] 2025-12-04T14:34:54.6275087Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_conj_physical_cuda_uint8 PASSED [0.2454s] [ 23%] 2025-12-04T14:34:54.6276136Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_bfloat16 PASSED [0.0121s] [ 23%] 2025-12-04T14:34:54.6277119Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_bool PASSED [0.2381s] [ 23%] 2025-12-04T14:34:54.6278090Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_float16 PASSED [0.0120s] [ 23%] 2025-12-04T14:34:54.6279063Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_float32 PASSED [0.2374s] [ 23%] 2025-12-04T14:34:54.6280042Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_float64 PASSED [0.0121s] [ 23%] 2025-12-04T14:34:54.6281021Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int16 PASSED [0.2358s] [ 23%] 2025-12-04T14:34:54.6281984Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int32 PASSED [0.0121s] [ 23%] 2025-12-04T14:34:54.6282933Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int64 PASSED [0.2352s] [ 23%] 2025-12-04T14:34:54.6283894Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_int8 PASSED [0.0122s] [ 24%] 2025-12-04T14:34:54.6284852Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_deg2rad_cuda_uint8 PASSED [0.2346s] [ 24%] 2025-12-04T14:34:54.6285811Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_bfloat16 PASSED [0.0121s] [ 24%] 2025-12-04T14:34:54.6286754Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_float32 PASSED [0.2409s] [ 24%] 2025-12-04T14:34:54.6287697Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_int32 PASSED [0.0121s] [ 24%] 2025-12-04T14:34:54.6288780Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erf_cuda_int64 PASSED [0.2382s] [ 24%] 2025-12-04T14:34:54.6289744Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_bfloat16 PASSED [0.0121s] [ 24%] 2025-12-04T14:34:54.6290703Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_bool PASSED [0.2354s] [ 24%] 2025-12-04T14:34:54.6291667Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_float32 PASSED [0.0121s] [ 24%] 2025-12-04T14:34:54.6292762Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_int16 PASSED [0.2404s] [ 24%] 2025-12-04T14:34:54.6293830Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_int32 PASSED [0.0121s] [ 24%] 2025-12-04T14:34:54.6294775Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_int64 PASSED [0.2382s] [ 24%] 2025-12-04T14:34:54.6295735Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_erfinv_cuda_uint8 PASSED [0.0108s] [ 24%] 2025-12-04T14:34:54.6296687Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_expm1_cuda_int8 PASSED [0.2358s] [ 24%] 2025-12-04T14:34:54.6297635Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_expm1_cuda_uint8 PASSED [0.0122s] [ 24%] 2025-12-04T14:34:54.6298741Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_float16 PASSED [0.2370s] [ 24%] 2025-12-04T14:34:54.6299704Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_float32 PASSED [0.0120s] [ 24%] 2025-12-04T14:34:54.6300660Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_int64 PASSED [0.2355s] [ 24%] 2025-12-04T14:34:54.6301599Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_int8 PASSED [0.0120s] [ 24%] 2025-12-04T14:34:54.6302532Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_floor_cuda_uint8 PASSED [0.2342s] [ 24%] 2025-12-04T14:34:54.6303553Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_bfloat16 PASSED [0.0120s] [ 24%] 2025-12-04T14:34:54.6304556Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_float16 PASSED [0.2351s] [ 24%] 2025-12-04T14:34:54.6305547Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_float32 PASSED [0.0120s] [ 24%] 2025-12-04T14:34:54.6306505Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_frac_cuda_float64 PASSED [0.2331s] [ 24%] 2025-12-04T14:34:54.6307593Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_bool SKIPPED [0.0054s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6308837Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_complex128 SKIPPED [0.0050s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6310110Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_complex64 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6311359Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_float16 SKIPPED [0.0049s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6312598Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_float32 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6313842Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_float64 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6315073Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_int32 SKIPPED [0.0049s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6316285Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isinf_cuda_int64 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6317518Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_bfloat16 SKIPPED [0.0049s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6318755Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_bool SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6319999Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_complex128 SKIPPED [0.0049s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6321256Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_float16 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6322507Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_float32 SKIPPED [0.0049s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6323751Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_float64 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6324983Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_int16 SKIPPED [0.0048s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6326199Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_int32 SKIPPED [0.0051s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6327418Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_int64 SKIPPED [0.0048s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6328677Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isnan_cuda_uint8 SKIPPED [0.0052s] (Skipped! Out not supported) [ 25%] 2025-12-04T14:34:54.6329790Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isneginf_cuda_bfloat16 PASSED [0.2371s] [ 25%] 2025-12-04T14:34:54.6330764Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isneginf_cuda_int8 PASSED [0.0118s] [ 25%] 2025-12-04T14:34:54.6331724Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isneginf_cuda_uint8 PASSED [0.2346s] [ 25%] 2025-12-04T14:34:54.6332707Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_bfloat16 PASSED [0.0118s] [ 25%] 2025-12-04T14:34:54.6333743Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_float16 PASSED [0.2369s] [ 25%] 2025-12-04T14:34:54.6334760Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_float32 PASSED [0.0117s] [ 25%] 2025-12-04T14:34:54.6335777Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_float64 PASSED [0.2341s] [ 26%] 2025-12-04T14:34:54.6336754Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_int64 PASSED [0.0118s] [ 26%] 2025-12-04T14:34:54.6337716Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_isposinf_cuda_int8 PASSED [0.2370s] [ 26%] 2025-12-04T14:34:54.6338743Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_bool PASSED [0.0121s] [ 26%] 2025-12-04T14:34:54.6339702Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_complex128 PASSED [0.2354s] [ 26%] 2025-12-04T14:34:54.6340687Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_complex64 PASSED [0.0121s] [ 26%] 2025-12-04T14:34:54.6341660Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_float16 PASSED [0.2380s] [ 26%] 2025-12-04T14:34:54.6342598Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_int32 PASSED [0.0121s] [ 26%] 2025-12-04T14:34:54.6343535Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_int64 PASSED [0.2360s] [ 26%] 2025-12-04T14:34:54.6344472Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_int8 PASSED [0.0121s] [ 26%] 2025-12-04T14:34:54.6345401Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_log1p_cuda_uint8 PASSED [0.2350s] [ 26%] 2025-12-04T14:34:54.6346335Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_bfloat16 PASSED [0.0120s] [ 26%] 2025-12-04T14:34:54.6347270Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_float16 PASSED [0.2390s] [ 26%] 2025-12-04T14:34:54.6348205Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_float32 PASSED [0.0120s] [ 26%] 2025-12-04T14:34:54.6349138Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_int16 PASSED [0.2371s] [ 26%] 2025-12-04T14:34:54.6350054Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_int64 PASSED [0.0119s] [ 26%] 2025-12-04T14:34:54.6350966Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_int8 PASSED [0.2359s] [ 26%] 2025-12-04T14:34:54.6351881Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_neg_cuda_uint8 PASSED [0.0119s] [ 26%] 2025-12-04T14:34:54.6353021Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_nn_functional_relu_cuda_float16 SKIPPED [0.0050s] (Skipped! Out not supported) [ 26%] 2025-12-04T14:34:54.6354378Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_nn_functional_relu_cuda_float64 SKIPPED [0.0051s] (Skipped! Out not supported) [ 26%] 2025-12-04T14:34:54.6355728Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_nn_functional_relu_cuda_uint8 SKIPPED [0.0049s] (Skipped! Out not supported) [ 26%] 2025-12-04T14:34:54.6357043Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_bfloat16 SKIPPED [0.0051s] (Skipped! Out not supported) [ 26%] 2025-12-04T14:34:54.6358383Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_complex128 SKIPPED [0.0048s] (Skipped! Out not supported) [ 26%] 2025-12-04T14:34:54.6359683Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_complex32 SKIPPED [0.0049s] (Skipped! Out not supported) [ 26%] 2025-12-04T14:34:54.6360970Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_float64 SKIPPED [0.0051s] (Skipped! Out not supported) [ 27%] 2025-12-04T14:34:54.6362225Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_positive_cuda_int32 SKIPPED [0.0048s] (Skipped! Out not supported) [ 27%] 2025-12-04T14:34:54.6363411Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_bfloat16 PASSED [0.2367s] [ 27%] 2025-12-04T14:34:54.6364393Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_float16 PASSED [0.0120s] [ 27%] 2025-12-04T14:34:54.6365401Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_float32 PASSED [0.2389s] [ 27%] 2025-12-04T14:34:54.6366364Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_int16 PASSED [0.0121s] [ 27%] 2025-12-04T14:34:54.6367315Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_rad2deg_cuda_int64 PASSED [0.2378s] [ 27%] 2025-12-04T14:34:54.6368262Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_bfloat16 PASSED [0.0120s] [ 27%] 2025-12-04T14:34:54.6369219Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_float16 PASSED [0.2371s] [ 27%] 2025-12-04T14:34:54.6370173Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_float32 PASSED [0.0120s] [ 27%] 2025-12-04T14:34:54.6371118Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_int16 PASSED [0.2356s] [ 27%] 2025-12-04T14:34:54.6372041Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_int64 PASSED [0.0119s] [ 27%] 2025-12-04T14:34:54.6372977Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_round_cuda_int8 PASSED [0.2350s] [ 27%] 2025-12-04T14:34:54.6373902Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_bool PASSED [0.0118s] [ 27%] 2025-12-04T14:34:54.6374835Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_complex32 PASSED [0.2399s] [ 27%] 2025-12-04T14:34:54.6375783Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_complex64 PASSED [0.0121s] [ 27%] 2025-12-04T14:34:54.6376737Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_float32 PASSED [0.2374s] [ 27%] 2025-12-04T14:34:54.6377684Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_int16 PASSED [0.0118s] [ 27%] 2025-12-04T14:34:54.6378685Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_int32 PASSED [0.2362s] [ 27%] 2025-12-04T14:34:54.6379610Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_int64 PASSED [0.0118s] [ 27%] 2025-12-04T14:34:54.6380544Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sgn_cuda_uint8 PASSED [0.2359s] [ 27%] 2025-12-04T14:34:54.6381487Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_float16 PASSED [0.0121s] [ 27%] 2025-12-04T14:34:54.6382426Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_float32 PASSED [0.2391s] [ 27%] 2025-12-04T14:34:54.6383374Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_float64 PASSED [0.0119s] [ 27%] 2025-12-04T14:34:54.6384324Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_int16 PASSED [0.2347s] [ 28%] 2025-12-04T14:34:54.6385261Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sign_cuda_int32 PASSED [0.0120s] [ 28%] 2025-12-04T14:34:54.6386218Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_bfloat16 PASSED [0.2357s] [ 28%] 2025-12-04T14:34:54.6387242Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_bool PASSED [0.0118s] [ 28%] 2025-12-04T14:34:54.6388369Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_float16 PASSED [0.2356s] [ 28%] 2025-12-04T14:34:54.6389339Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_int8 PASSED [0.0120s] [ 28%] 2025-12-04T14:34:54.6390294Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_signbit_cuda_uint8 PASSED [0.2367s] [ 28%] 2025-12-04T14:34:54.6391257Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_bfloat16 PASSED [0.0120s] [ 28%] 2025-12-04T14:34:54.6392262Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_bool PASSED [0.2357s] [ 28%] 2025-12-04T14:34:54.6393269Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_complex128 PASSED [0.0139s] [ 28%] 2025-12-04T14:34:54.6394275Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_complex32 PASSED [0.2417s] [ 28%] 2025-12-04T14:34:54.6395234Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_float16 PASSED [0.0121s] [ 28%] 2025-12-04T14:34:54.6396184Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_float32 PASSED [0.2368s] [ 28%] 2025-12-04T14:34:54.6397132Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_float64 PASSED [0.0120s] [ 28%] 2025-12-04T14:34:54.6398062Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_int32 PASSED [0.2386s] [ 28%] 2025-12-04T14:34:54.6399003Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sin_cuda_int64 PASSED [0.0120s] [ 28%] 2025-12-04T14:34:54.6399952Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_bfloat16 PASSED [0.2353s] [ 28%] 2025-12-04T14:34:54.6400900Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_bool PASSED [0.0120s] [ 28%] 2025-12-04T14:34:54.6401854Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_complex128 PASSED [0.2376s] [ 28%] 2025-12-04T14:34:54.6402840Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_complex64 PASSED [0.0139s] [ 28%] 2025-12-04T14:34:54.6403800Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_float16 PASSED [0.2344s] [ 28%] 2025-12-04T14:34:54.6404748Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sinh_cuda_int16 PASSED [0.0122s] [ 28%] 2025-12-04T14:34:54.6405703Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_complex128 PASSED [0.2368s] [ 28%] 2025-12-04T14:34:54.6406691Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_complex32 PASSED [0.0121s] [ 28%] 2025-12-04T14:34:54.6407678Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_complex64 PASSED [0.2396s] [ 29%] 2025-12-04T14:34:54.6408652Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_float16 PASSED [0.0120s] [ 29%] 2025-12-04T14:34:54.6409593Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_float64 PASSED [0.2347s] [ 29%] 2025-12-04T14:34:54.6410541Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_int32 PASSED [0.0121s] [ 29%] 2025-12-04T14:34:54.6411488Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_sqrt_cuda_uint8 PASSED [0.2436s] [ 29%] 2025-12-04T14:34:54.6412413Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_bool PASSED [0.0122s] [ 29%] 2025-12-04T14:34:54.6413374Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_complex128 PASSED [0.2399s] [ 29%] 2025-12-04T14:34:54.6414360Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_complex32 PASSED [0.0121s] [ 29%] 2025-12-04T14:34:54.6415335Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_complex64 PASSED [0.2366s] [ 29%] 2025-12-04T14:34:54.6416330Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_float32 PASSED [0.0120s] [ 29%] 2025-12-04T14:34:54.6417276Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_int64 PASSED [0.2350s] [ 29%] 2025-12-04T14:34:54.6418295Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tan_cuda_uint8 PASSED [0.0121s] [ 29%] 2025-12-04T14:34:54.6419236Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_bool PASSED [0.2330s] [ 29%] 2025-12-04T14:34:54.6420188Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_complex128 PASSED [0.0121s] [ 29%] 2025-12-04T14:34:54.6421219Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_complex64 PASSED [0.2359s] [ 29%] 2025-12-04T14:34:54.6422223Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_float16 PASSED [0.0120s] [ 29%] 2025-12-04T14:34:54.6423211Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_float64 PASSED [0.2333s] [ 29%] 2025-12-04T14:34:54.6424149Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_int64 PASSED [0.0120s] [ 29%] 2025-12-04T14:34:54.6425092Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_tanh_cuda_uint8 PASSED [0.2332s] [ 29%] 2025-12-04T14:34:54.6426055Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_bfloat16 PASSED [0.0120s] [ 29%] 2025-12-04T14:34:54.6427024Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_float32 PASSED [0.2351s] [ 29%] 2025-12-04T14:34:54.6427973Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_float64 PASSED [0.0120s] [ 29%] 2025-12-04T14:34:54.6428936Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_int16 PASSED [0.2498s] [ 29%] 2025-12-04T14:34:54.6429883Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_int64 PASSED [0.0119s] [ 29%] 2025-12-04T14:34:54.6430832Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_csr_unary_out_trunc_cuda_int8 PASSED [0.2440s] [ 30%] 2025-12-04T14:34:54.6431715Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_mm_cuda_complex128 PASSED [0.0128s] [ 30%] 2025-12-04T14:34:54.6432673Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_mm_reduce_cuda_float64 SKIPPED [0.0017s] (Only runs on cpu) [ 30%] 2025-12-04T14:34:54.6433749Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_mm_reduce_sum_cuda_float64 SKIPPED [0.0015s] (Only runs on cpu) [ 30%] 2025-12-04T14:34:54.6434844Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_to_sparse_compressed_SparseBSC_cuda_float64 PASSED [0.0113s] [ 30%] 2025-12-04T14:34:54.6435935Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_to_sparse_compressed_SparseCSR_cuda_float64 PASSED [0.0220s] [ 30%] 2025-12-04T14:34:54.6436963Z test_sparse_csr.py::TestSparseCSRCUDA::test_sparse_triangular_solve_cuda_float32 PASSED [0.1245s] [ 30%] 2025-12-04T14:34:54.6437819Z test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_bool PASSED [0.0297s] [ 30%] 2025-12-04T14:34:54.6438560Z test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_float32 PASSED [0.0371s] [ 30%] 2025-12-04T14:34:54.6439315Z test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_int16 PASSED [0.0291s] [ 30%] 2025-12-04T14:34:54.6440059Z test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_int64 PASSED [0.0288s] [ 30%] 2025-12-04T14:34:54.6440798Z test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_int8 PASSED [0.0293s] [ 30%] 2025-12-04T14:34:54.6441526Z test_sparse_csr.py::TestSparseCSRCUDA::test_sum_cuda_uint8 PASSED [0.0291s] [ 30%] 2025-12-04T14:34:54.6442380Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_complex128 PASSED [1.4081s] [ 30%] 2025-12-04T14:34:54.6443338Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_complex64 PASSED [1.4025s] [ 30%] 2025-12-04T14:34:54.6444279Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_float16 PASSED [1.3693s] [ 30%] 2025-12-04T14:34:54.6445219Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSC_cuda_int64 PASSED [1.2670s] [ 30%] 2025-12-04T14:34:54.6446143Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_bfloat16 PASSED [1.4452s] [ 30%] 2025-12-04T14:34:54.6447092Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_complex64 PASSED [1.4777s] [ 30%] 2025-12-04T14:34:54.6448024Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_float16 PASSED [1.4510s] [ 30%] 2025-12-04T14:34:54.6448927Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_int16 PASSED [1.3501s] [ 30%] 2025-12-04T14:34:54.6449862Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_int32 PASSED [1.3468s] [ 30%] 2025-12-04T14:34:54.6451162Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_int64 PASSED [1.3493s] [ 30%] 2025-12-04T14:34:54.6452093Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseBSR_cuda_uint8 PASSED [1.3466s] [ 30%] 2025-12-04T14:34:54.6453027Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_bfloat16 PASSED [0.8247s] [ 31%] 2025-12-04T14:34:54.6453946Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_bool PASSED [0.7670s] [ 31%] 2025-12-04T14:34:54.6454877Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_complex128 PASSED [0.8377s] [ 31%] 2025-12-04T14:34:54.6455822Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_complex64 PASSED [0.8371s] [ 31%] 2025-12-04T14:34:54.6456757Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_float16 PASSED [0.8223s] [ 31%] 2025-12-04T14:34:54.6457683Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_float32 PASSED [0.8206s] [ 31%] 2025-12-04T14:34:54.6458681Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_float64 PASSED [0.8194s] [ 31%] 2025-12-04T14:34:54.6459588Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_int16 PASSED [0.7688s] [ 31%] 2025-12-04T14:34:54.6460491Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_int64 PASSED [0.7689s] [ 31%] 2025-12-04T14:34:54.6461392Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSC_cuda_uint8 PASSED [0.7694s] [ 31%] 2025-12-04T14:34:54.6462326Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_complex128 PASSED [0.8951s] [ 31%] 2025-12-04T14:34:54.6463261Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_float32 PASSED [0.8804s] [ 31%] 2025-12-04T14:34:54.6464186Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_float64 PASSED [0.8798s] [ 31%] 2025-12-04T14:34:54.6465101Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_int16 PASSED [0.8290s] [ 31%] 2025-12-04T14:34:54.6465989Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_int8 PASSED [0.8278s] [ 31%] 2025-12-04T14:34:54.6466888Z test_sparse_csr.py::TestSparseCSRCUDA::test_transpose_SparseCSR_cuda_uint8 PASSED [0.8271s] [ 31%] 2025-12-04T14:34:54.6467871Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_bool PASSED [0.0047s] [ 31%] 2025-12-04T14:34:54.6468962Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_complex128 PASSED [0.0048s] [ 31%] 2025-12-04T14:34:54.6470073Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_complex32 PASSED [0.0047s] [ 31%] 2025-12-04T14:34:54.6471183Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_complex64 PASSED [0.0051s] [ 31%] 2025-12-04T14:34:54.6472286Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_float16 PASSED [0.0045s] [ 31%] 2025-12-04T14:34:54.6473379Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_float32 PASSED [0.0045s] [ 31%] 2025-12-04T14:34:54.6474453Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_float64 PASSED [0.0045s] [ 31%] 2025-12-04T14:34:54.6475574Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_int32 PASSED [0.0041s] [ 31%] 2025-12-04T14:34:54.6476637Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_abs_cuda_int64 PASSED [0.0042s] [ 32%] 2025-12-04T14:34:54.6477746Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_complex128 PASSED [0.0062s] [ 32%] 2025-12-04T14:34:54.6478864Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_complex32 PASSED [0.0052s] [ 32%] 2025-12-04T14:34:54.6480026Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_complex64 PASSED [0.0060s] [ 32%] 2025-12-04T14:34:54.6481177Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_float64 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6482306Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_int64 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6483379Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_angle_cuda_int8 PASSED [0.0050s] [ 32%] 2025-12-04T14:34:54.6484488Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_complex32 PASSED [0.0046s] [ 32%] 2025-12-04T14:34:54.6485607Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_complex64 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6486726Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_float32 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6487992Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_float64 PASSED [0.0048s] [ 32%] 2025-12-04T14:34:54.6489089Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_int32 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6490165Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asin_cuda_int8 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6491259Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_bfloat16 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6492372Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_complex32 PASSED [0.0050s] [ 32%] 2025-12-04T14:34:54.6493506Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_complex64 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6494627Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_float32 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6495741Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_float64 PASSED [0.0044s] [ 32%] 2025-12-04T14:34:54.6496813Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_asinh_cuda_int8 PASSED [0.0049s] [ 32%] 2025-12-04T14:34:54.6497868Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_bool PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6499021Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_complex32 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6500110Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_float32 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6501174Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int16 PASSED [0.0049s] [ 32%] 2025-12-04T14:34:54.6502230Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int32 PASSED [0.0045s] [ 32%] 2025-12-04T14:34:54.6503290Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int64 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6504348Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atan_cuda_int8 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6505462Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_bool PASSED [0.0049s] [ 33%] 2025-12-04T14:34:54.6506547Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_complex32 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6507658Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_complex64 PASSED [0.0046s] [ 33%] 2025-12-04T14:34:54.6508757Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_float32 PASSED [0.0044s] [ 33%] 2025-12-04T14:34:54.6509841Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_float64 PASSED [0.0049s] [ 33%] 2025-12-04T14:34:54.6511015Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_int32 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6512090Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_atanh_cuda_int64 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6513212Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_bfloat16 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6514296Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_float16 PASSED [0.0049s] [ 33%] 2025-12-04T14:34:54.6515380Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_float32 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6516460Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_float64 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6517528Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_ceil_cuda_uint8 PASSED [0.0041s] [ 33%] 2025-12-04T14:34:54.6518626Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_bool PASSED [0.0043s] [ 33%] 2025-12-04T14:34:54.6519797Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_complex128 PASSED [0.0046s] [ 33%] 2025-12-04T14:34:54.6520988Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_complex32 PASSED [0.0061s] [ 33%] 2025-12-04T14:34:54.6522167Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_complex64 PASSED [0.0046s] [ 33%] 2025-12-04T14:34:54.6523340Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_float16 PASSED [0.0046s] [ 33%] 2025-12-04T14:34:54.6524498Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_float64 PASSED [0.0042s] [ 33%] 2025-12-04T14:34:54.6525653Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_int16 PASSED [0.0039s] [ 33%] 2025-12-04T14:34:54.6526795Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_int32 PASSED [0.0039s] [ 33%] 2025-12-04T14:34:54.6527935Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_conj_physical_cuda_uint8 PASSED [0.0043s] [ 33%] 2025-12-04T14:34:54.6529050Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_float16 PASSED [0.0045s] [ 33%] 2025-12-04T14:34:54.6530150Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_float32 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6531235Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_int8 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6532319Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_deg2rad_cuda_uint8 PASSED [0.0051s] [ 34%] 2025-12-04T14:34:54.6533404Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_bfloat16 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6534479Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_bool PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6535599Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_float64 PASSED [0.0044s] [ 34%] 2025-12-04T14:34:54.6536667Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_int16 PASSED [0.0049s] [ 34%] 2025-12-04T14:34:54.6537713Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_int8 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6538837Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erf_cuda_uint8 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6539927Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_bfloat16 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6541174Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_bool PASSED [0.0049s] [ 34%] 2025-12-04T14:34:54.6542267Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_float16 PASSED [0.0051s] [ 34%] 2025-12-04T14:34:54.6543402Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_int32 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6544494Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_int8 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6545583Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_erfinv_cuda_uint8 PASSED [0.0049s] [ 34%] 2025-12-04T14:34:54.6546681Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_bfloat16 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6547801Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_complex128 PASSED [0.0046s] [ 34%] 2025-12-04T14:34:54.6548932Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_complex64 PASSED [0.0046s] [ 34%] 2025-12-04T14:34:54.6550051Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_float32 PASSED [0.0049s] [ 34%] 2025-12-04T14:34:54.6551141Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_float64 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6552232Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_expm1_cuda_int64 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6553325Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_bfloat16 PASSED [0.0045s] [ 34%] 2025-12-04T14:34:54.6554432Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_float16 PASSED [0.0049s] [ 34%] 2025-12-04T14:34:54.6555525Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_float32 PASSED [0.0044s] [ 34%] 2025-12-04T14:34:54.6559356Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_float64 PASSED [0.0045s] [ 35%] 2025-12-04T14:34:54.6560464Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_int64 PASSED [0.0041s] [ 35%] 2025-12-04T14:34:54.6561535Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_floor_cuda_int8 PASSED [0.0046s] [ 35%] 2025-12-04T14:34:54.6562619Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_frac_cuda_bfloat16 PASSED [0.0045s] [ 35%] 2025-12-04T14:34:54.6563720Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_frac_cuda_float16 PASSED [0.0045s] [ 35%] 2025-12-04T14:34:54.6564814Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_frac_cuda_float64 PASSED [0.0045s] [ 35%] 2025-12-04T14:34:54.6566137Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isinf_cuda_float32 PASSED [0.0047s] [ 35%] 2025-12-04T14:34:54.6567280Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isinf_cuda_int64 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6568348Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isinf_cuda_int8 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6569445Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_bfloat16 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6570543Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_bool PASSED [0.0046s] [ 35%] 2025-12-04T14:34:54.6571638Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_float16 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6572723Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_int16 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6573884Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_int64 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6575011Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isnan_cuda_uint8 PASSED [0.0046s] [ 35%] 2025-12-04T14:34:54.6576148Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isneginf_cuda_int16 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6577249Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isneginf_cuda_int32 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6578439Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isneginf_cuda_uint8 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6579555Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isposinf_cuda_int16 PASSED [0.0046s] [ 35%] 2025-12-04T14:34:54.6580668Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isposinf_cuda_int32 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6581773Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_isposinf_cuda_int8 PASSED [0.0042s] [ 35%] 2025-12-04T14:34:54.6582875Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_bool PASSED [0.0045s] [ 35%] 2025-12-04T14:34:54.6583986Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_complex128 PASSED [0.0050s] [ 35%] 2025-12-04T14:34:54.6585102Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_int32 PASSED [0.0045s] [ 35%] 2025-12-04T14:34:54.6586172Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_int64 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6587251Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_log1p_cuda_int8 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6588486Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_complex32 PASSED [0.0066s] [ 36%] 2025-12-04T14:34:54.6589602Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_complex64 PASSED [0.0059s] [ 36%] 2025-12-04T14:34:54.6590792Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_float16 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6591881Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_float64 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6592950Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_int16 PASSED [0.0046s] [ 36%] 2025-12-04T14:34:54.6594010Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_int32 PASSED [0.0041s] [ 36%] 2025-12-04T14:34:54.6595062Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_neg_cuda_uint8 PASSED [0.0041s] [ 36%] 2025-12-04T14:34:54.6596201Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_nn_functional_relu_cuda_int16 PASSED [0.0042s] [ 36%] 2025-12-04T14:34:54.6597399Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_complex128 PASSED [0.0047s] [ 36%] 2025-12-04T14:34:54.6598555Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_float16 PASSED [0.0042s] [ 36%] 2025-12-04T14:34:54.6599682Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_float32 PASSED [0.0042s] [ 36%] 2025-12-04T14:34:54.6600811Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_positive_cuda_int16 PASSED [0.0039s] [ 36%] 2025-12-04T14:34:54.6601941Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_bfloat16 PASSED [0.0049s] [ 36%] 2025-12-04T14:34:54.6603067Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_float16 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6604227Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_float32 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6605388Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_float64 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6606547Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_int32 PASSED [0.0050s] [ 36%] 2025-12-04T14:34:54.6607654Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_int64 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6608744Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_int8 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6609848Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_rad2deg_cuda_uint8 PASSED [0.0045s] [ 36%] 2025-12-04T14:34:54.6610953Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_round_cuda_float16 PASSED [0.0049s] [ 36%] 2025-12-04T14:34:54.6612047Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_round_cuda_int16 PASSED [0.0042s] [ 36%] 2025-12-04T14:34:54.6613121Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_round_cuda_int64 PASSED [0.0041s] [ 37%] 2025-12-04T14:34:54.6614211Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_bfloat16 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6615285Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_bool PASSED [0.0046s] [ 37%] 2025-12-04T14:34:54.6616354Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_float64 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6617415Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_int16 PASSED [0.0042s] [ 37%] 2025-12-04T14:34:54.6618538Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_int64 PASSED [0.0041s] [ 37%] 2025-12-04T14:34:54.6619604Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sgn_cuda_uint8 PASSED [0.0046s] [ 37%] 2025-12-04T14:34:54.6620764Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sign_cuda_float32 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6621851Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sign_cuda_float64 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6622939Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sign_cuda_int16 PASSED [0.0041s] [ 37%] 2025-12-04T14:34:54.6624028Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_signbit_cuda_bool PASSED [0.0046s] [ 37%] 2025-12-04T14:34:54.6625136Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_signbit_cuda_int16 PASSED [0.0042s] [ 37%] 2025-12-04T14:34:54.6626226Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_signbit_cuda_uint8 PASSED [0.0041s] [ 37%] 2025-12-04T14:34:54.6627327Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_bfloat16 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6628404Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_bool PASSED [0.0049s] [ 37%] 2025-12-04T14:34:54.6629489Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_complex128 PASSED [0.0046s] [ 37%] 2025-12-04T14:34:54.6630573Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_float16 PASSED [0.0044s] [ 37%] 2025-12-04T14:34:54.6631655Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_float32 PASSED [0.0044s] [ 37%] 2025-12-04T14:34:54.6632732Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_int16 PASSED [0.0049s] [ 37%] 2025-12-04T14:34:54.6633795Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_int64 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6634884Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sin_cuda_uint8 PASSED [0.0045s] [ 37%] 2025-12-04T14:34:54.6636000Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_bfloat16 PASSED [0.0044s] [ 37%] 2025-12-04T14:34:54.6637257Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_bool PASSED [0.0049s] [ 37%] 2025-12-04T14:34:54.6638354Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_complex128 PASSED [0.0046s] [ 37%] 2025-12-04T14:34:54.6639464Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_complex64 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6640569Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_float32 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6641653Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sinh_cuda_uint8 PASSED [0.0051s] [ 38%] 2025-12-04T14:34:54.6642752Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_complex64 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6643838Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_int16 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6644910Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_int32 PASSED [0.0044s] [ 38%] 2025-12-04T14:34:54.6645976Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_sqrt_cuda_int8 PASSED [0.0049s] [ 38%] 2025-12-04T14:34:54.6647063Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_complex128 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6648134Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_int32 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6649191Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_int8 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6650250Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tan_cuda_uint8 PASSED [0.0049s] [ 38%] 2025-12-04T14:34:54.6651356Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_bool PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6652438Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_complex32 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6653543Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_float16 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6654638Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_float32 PASSED [0.0049s] [ 38%] 2025-12-04T14:34:54.6655726Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_float64 PASSED [0.0044s] [ 38%] 2025-12-04T14:34:54.6656793Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_int32 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6657873Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_tanh_cuda_uint8 PASSED [0.0045s] [ 38%] 2025-12-04T14:34:54.6659037Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_trunc_cuda_bfloat16 PASSED [0.0049s] [ 38%] 2025-12-04T14:34:54.6660153Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_trunc_cuda_float64 PASSED [0.0044s] [ 38%] 2025-12-04T14:34:54.6661238Z test_sparse_csr.py::TestSparseCSRCUDA::test_zero_to_zero_correspondence_unary_trunc_cuda_uint8 PASSED [0.0042s] [ 38%] 2025-12-04T14:34:54.6662257Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_bool PASSED [0.0438s] [ 38%] 2025-12-04T14:34:54.6663212Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_float32 PASSED [0.0546s] [ 38%] 2025-12-04T14:34:54.6664168Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_int8 PASSED [0.0435s] [ 38%] 2025-12-04T14:34:54.6665132Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSC_cuda_uint8 PASSED [0.0444s] [ 39%] 2025-12-04T14:34:54.6666104Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_bool PASSED [0.0433s] [ 39%] 2025-12-04T14:34:54.6667086Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_float32 PASSED [0.0541s] [ 39%] 2025-12-04T14:34:54.6668057Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_float64 PASSED [0.0544s] [ 39%] 2025-12-04T14:34:54.6668996Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_int16 PASSED [0.0435s] [ 39%] 2025-12-04T14:34:54.6669929Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_int32 PASSED [0.0440s] [ 39%] 2025-12-04T14:34:54.6670864Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseBSR_cuda_int64 PASSED [0.0432s] [ 39%] 2025-12-04T14:34:54.6671811Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSC_cuda_bfloat16 PASSED [0.0520s] [ 39%] 2025-12-04T14:34:54.6672780Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSC_cuda_float64 PASSED [0.0517s] [ 39%] 2025-12-04T14:34:54.6673733Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSC_cuda_int16 PASSED [0.0411s] [ 39%] 2025-12-04T14:34:54.6674677Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_bool PASSED [0.0417s] [ 39%] 2025-12-04T14:34:54.6675610Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_float16 PASSED [0.0518s] [ 39%] 2025-12-04T14:34:54.6676571Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_float64 PASSED [0.0517s] [ 39%] 2025-12-04T14:34:54.6677520Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_int16 PASSED [0.0411s] [ 39%] 2025-12-04T14:34:54.6678460Z test_sparse_csr.py::TestSparseCompressedCUDA::test_clone_SparseCSR_cuda_int64 PASSED [0.0411s] [ 39%] 2025-12-04T14:34:54.6679467Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_bfloat16 PASSED [0.2298s] [ 39%] 2025-12-04T14:34:54.6680535Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_bool PASSED [0.2177s] [ 39%] 2025-12-04T14:34:54.6681655Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_complex128 PASSED [0.2124s] [ 39%] 2025-12-04T14:34:54.6682744Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_float32 PASSED [0.2146s] [ 39%] 2025-12-04T14:34:54.6683806Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_float64 PASSED [0.2157s] [ 39%] 2025-12-04T14:34:54.6684872Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_int16 PASSED [0.2161s] [ 39%] 2025-12-04T14:34:54.6685934Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_int32 PASSED [0.2159s] [ 39%] 2025-12-04T14:34:54.6686985Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_abs_cuda_int64 PASSED [0.2167s] [ 39%] 2025-12-04T14:34:54.6688237Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_complex128 PASSED [0.2130s] [ 39%] 2025-12-04T14:34:54.6689373Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_complex32 PASSED [0.2222s] [ 40%] 2025-12-04T14:34:54.6690489Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_complex64 PASSED [0.2178s] [ 40%] 2025-12-04T14:34:54.6691598Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_float64 PASSED [0.2130s] [ 40%] 2025-12-04T14:34:54.6692678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_int32 PASSED [0.2158s] [ 40%] 2025-12-04T14:34:54.6693752Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_angle_cuda_uint8 PASSED [0.2167s] [ 40%] 2025-12-04T14:34:54.6694817Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_bool PASSED [0.2157s] [ 40%] 2025-12-04T14:34:54.6696006Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_complex64 PASSED [0.2132s] [ 40%] 2025-12-04T14:34:54.6697097Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_float16 PASSED [0.2140s] [ 40%] 2025-12-04T14:34:54.6698299Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_float32 PASSED [0.2081s] [ 40%] 2025-12-04T14:34:54.6699393Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_float64 PASSED [0.2119s] [ 40%] 2025-12-04T14:34:54.6700474Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int16 PASSED [0.2136s] [ 40%] 2025-12-04T14:34:54.6701546Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int32 PASSED [0.2202s] [ 40%] 2025-12-04T14:34:54.6702610Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int64 PASSED [0.2087s] [ 40%] 2025-12-04T14:34:54.6703669Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asin_cuda_int8 PASSED [0.2242s] [ 40%] 2025-12-04T14:34:54.6704767Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_complex32 PASSED [0.2222s] [ 40%] 2025-12-04T14:34:54.6705877Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_float32 PASSED [0.2248s] [ 40%] 2025-12-04T14:34:54.6706970Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_float64 PASSED [0.2201s] [ 40%] 2025-12-04T14:34:54.6708045Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_int16 PASSED [0.2119s] [ 40%] 2025-12-04T14:34:54.6709123Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_int32 PASSED [0.2098s] [ 40%] 2025-12-04T14:34:54.6710206Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_asinh_cuda_uint8 PASSED [0.2139s] [ 40%] 2025-12-04T14:34:54.6711313Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_complex128 PASSED [0.2110s] [ 40%] 2025-12-04T14:34:54.6712467Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_complex64 PASSED [0.2146s] [ 40%] 2025-12-04T14:34:54.6713577Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_float32 PASSED [0.2178s] [ 40%] 2025-12-04T14:34:54.6714656Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_int64 PASSED [0.2149s] [ 40%] 2025-12-04T14:34:54.6715729Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atan_cuda_uint8 PASSED [0.2134s] [ 41%] 2025-12-04T14:34:54.6716803Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_bfloat16 PASSED [0.2101s] [ 41%] 2025-12-04T14:34:54.6717893Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_bool PASSED [0.2171s] [ 41%] 2025-12-04T14:34:54.6718988Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_float64 PASSED [0.2139s] [ 41%] 2025-12-04T14:34:54.6720073Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_int64 PASSED [0.2146s] [ 41%] 2025-12-04T14:34:54.6721146Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_atanh_cuda_int8 PASSED [0.2156s] [ 41%] 2025-12-04T14:34:54.6722228Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_float16 PASSED [0.2136s] [ 41%] 2025-12-04T14:34:54.6723310Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_float64 PASSED [0.2291s] [ 41%] 2025-12-04T14:34:54.6724376Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_int16 PASSED [0.2177s] [ 41%] 2025-12-04T14:34:54.6725437Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_int64 PASSED [0.2354s] [ 41%] 2025-12-04T14:34:54.6726531Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_int8 PASSED [0.2245s] [ 41%] 2025-12-04T14:34:54.6727623Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_ceil_cuda_uint8 PASSED [0.2134s] [ 41%] 2025-12-04T14:34:54.6728771Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_bool PASSED [0.2152s] [ 41%] 2025-12-04T14:34:54.6729952Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_complex64 PASSED [0.2139s] [ 41%] 2025-12-04T14:34:54.6731142Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_float16 PASSED [0.2146s] [ 41%] 2025-12-04T14:34:54.6732305Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_int16 PASSED [0.2103s] [ 41%] 2025-12-04T14:34:54.6733451Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_int32 PASSED [0.2284s] [ 41%] 2025-12-04T14:34:54.6734609Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_conj_physical_cuda_int64 PASSED [0.2192s] [ 41%] 2025-12-04T14:34:54.6735748Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_float16 PASSED [0.2196s] [ 41%] 2025-12-04T14:34:54.6736863Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_float64 PASSED [0.2148s] [ 41%] 2025-12-04T14:34:54.6737953Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_int16 PASSED [0.2137s] [ 41%] 2025-12-04T14:34:54.6739139Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_int32 PASSED [0.2110s] [ 41%] 2025-12-04T14:34:54.6740234Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_int8 PASSED [0.2138s] [ 41%] 2025-12-04T14:34:54.6741322Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_deg2rad_cuda_uint8 PASSED [0.2185s] [ 41%] 2025-12-04T14:34:54.6742615Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_bfloat16 PASSED [0.2142s] [ 42%] 2025-12-04T14:34:54.6743754Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_float16 PASSED [0.2143s] [ 42%] 2025-12-04T14:34:54.6744828Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_int64 PASSED [0.2146s] [ 42%] 2025-12-04T14:34:54.6745885Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erf_cuda_uint8 PASSED [0.2135s] [ 42%] 2025-12-04T14:34:54.6746939Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_bool PASSED [0.2183s] [ 42%] 2025-12-04T14:34:54.6748034Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_float32 PASSED [0.2150s] [ 42%] 2025-12-04T14:34:54.6749134Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int16 PASSED [0.2131s] [ 42%] 2025-12-04T14:34:54.6750222Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int32 PASSED [0.2153s] [ 42%] 2025-12-04T14:34:54.6751294Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int64 PASSED [0.2168s] [ 42%] 2025-12-04T14:34:54.6752377Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_int8 PASSED [0.2176s] [ 42%] 2025-12-04T14:34:54.6753458Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_erfinv_cuda_uint8 PASSED [0.2230s] [ 42%] 2025-12-04T14:34:54.6754543Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_bool PASSED [0.2266s] [ 42%] 2025-12-04T14:34:54.6755628Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_complex128 PASSED [0.2246s] [ 42%] 2025-12-04T14:34:54.6756750Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_complex64 PASSED [0.2286s] [ 42%] 2025-12-04T14:34:54.6757901Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_float16 PASSED [0.2168s] [ 42%] 2025-12-04T14:34:54.6759025Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_float32 PASSED [0.2183s] [ 42%] 2025-12-04T14:34:54.6759523Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_int16 PASSED [0.2182s] [ 42%] 2025-12-04T14:34:54.6759989Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_int32 PASSED [0.2199s] [ 42%] 2025-12-04T14:34:54.6760459Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_expm1_cuda_int64 PASSED [0.2194s] [ 42%] 2025-12-04T14:34:54.6760929Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_float16 PASSED [0.2187s] [ 42%] 2025-12-04T14:34:54.6761402Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_int16 PASSED [0.2222s] [ 42%] 2025-12-04T14:34:54.6761864Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_int32 PASSED [0.2199s] [ 42%] 2025-12-04T14:34:54.6762326Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_floor_cuda_int8 PASSED [0.2184s] [ 42%] 2025-12-04T14:34:54.6762808Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_frac_cuda_float16 PASSED [0.2122s] [ 42%] 2025-12-04T14:34:54.6763278Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_frac_cuda_float32 PASSED [0.2166s] [ 43%] 2025-12-04T14:34:54.6763781Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_complex128 PASSED [0.2196s] [ 43%] 2025-12-04T14:34:54.6764253Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_float16 PASSED [0.2198s] [ 43%] 2025-12-04T14:34:54.6764725Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_float32 PASSED [0.2170s] [ 43%] 2025-12-04T14:34:54.6765211Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_float64 PASSED [0.2155s] [ 43%] 2025-12-04T14:34:54.6765678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isinf_cuda_int32 PASSED [0.2201s] [ 43%] 2025-12-04T14:34:54.6766187Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_bool PASSED [0.2173s] [ 43%] 2025-12-04T14:34:54.6766678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_complex128 PASSED [0.2283s] [ 43%] 2025-12-04T14:34:54.6767160Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_complex64 PASSED [0.2184s] [ 43%] 2025-12-04T14:34:54.6767649Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_float32 PASSED [0.2136s] [ 43%] 2025-12-04T14:34:54.6768108Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isnan_cuda_int32 PASSED [0.2173s] [ 43%] 2025-12-04T14:34:54.6768625Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_bfloat16 PASSED [0.2167s] [ 43%] 2025-12-04T14:34:54.6769107Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_bool PASSED [0.2190s] [ 43%] 2025-12-04T14:34:54.6769601Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_float16 PASSED [0.2243s] [ 43%] 2025-12-04T14:34:54.6770091Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_int16 PASSED [0.2196s] [ 43%] 2025-12-04T14:34:54.6770564Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isneginf_cuda_int8 PASSED [0.2232s] [ 43%] 2025-12-04T14:34:54.6771049Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_bool PASSED [0.2244s] [ 43%] 2025-12-04T14:34:54.6771538Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_float16 PASSED [0.2227s] [ 43%] 2025-12-04T14:34:54.6772026Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_float32 PASSED [0.2231s] [ 43%] 2025-12-04T14:34:54.6772580Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_int16 PASSED [0.2140s] [ 43%] 2025-12-04T14:34:54.6773090Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_int32 PASSED [0.2171s] [ 43%] 2025-12-04T14:34:54.6773579Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_isposinf_cuda_uint8 PASSED [0.2177s] [ 43%] 2025-12-04T14:34:54.6774070Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_log1p_cuda_complex128 PASSED [0.2159s] [ 43%] 2025-12-04T14:34:54.6774544Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_log1p_cuda_float16 PASSED [0.2199s] [ 43%] 2025-12-04T14:34:54.6775024Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_log1p_cuda_float32 PASSED [0.2182s] [ 44%] 2025-12-04T14:34:54.6775856Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_bfloat16 SKIPPED [0.0030s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6776693Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_float32 SKIPPED [0.0029s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6777508Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_float64 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6778410Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_int16 SKIPPED [0.0028s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6779214Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_int32 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6780019Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_int64 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6780864Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amax_cuda_uint8 SKIPPED [0.0029s] (masked.amax does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6781672Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_float16 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6782492Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_float32 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6783300Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_float64 SKIPPED [0.0029s] (masked.amin does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6784114Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_int8 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6784919Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_amin_cuda_uint8 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6785757Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_complex128 SKIPPED [0.0029s] (masked.mean does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6786597Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_complex64 SKIPPED [0.0026s] (masked.mean does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6787410Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_float16 SKIPPED [0.0026s] (masked.mean does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6788472Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_float32 SKIPPED [0.0029s] (masked.mean does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6789327Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_mean_cuda_float64 SKIPPED [0.0026s] (masked.mean does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6790160Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_bfloat16 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6791000Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_complex128 SKIPPED [0.0029s] (masked.prod does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6791809Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_float32 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6792630Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_int64 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6793428Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_prod_cuda_int8 SKIPPED [0.0029s] (masked.prod does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6794226Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_bool SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_bsc layout) [ 44%] 2025-12-04T14:34:54.6795046Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_complex128 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_bsc layout) [ 45%] 2025-12-04T14:34:54.6795862Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_float64 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_bsc layout) [ 45%] 2025-12-04T14:34:54.6796705Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_int64 SKIPPED [0.0029s] (masked.sum does not support input with torch.sparse_bsc layout) [ 45%] 2025-12-04T14:34:54.6797491Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_masked_sum_cuda_uint8 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_bsc layout) [ 45%] 2025-12-04T14:34:54.6797987Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_complex128 PASSED [0.0142s] [ 45%] 2025-12-04T14:34:54.6798467Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_complex32 PASSED [0.2443s] [ 45%] 2025-12-04T14:34:54.6798953Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_complex64 PASSED [0.0143s] [ 45%] 2025-12-04T14:34:54.6799424Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_float16 PASSED [0.0131s] [ 45%] 2025-12-04T14:34:54.6799884Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_int16 PASSED [0.2310s] [ 45%] 2025-12-04T14:34:54.6800347Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_int32 PASSED [0.0124s] [ 45%] 2025-12-04T14:34:54.6800799Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_int8 PASSED [0.0119s] [ 45%] 2025-12-04T14:34:54.6801263Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_mul_cuda_uint8 PASSED [0.2281s] [ 45%] 2025-12-04T14:34:54.6801746Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_complex128 PASSED [0.2214s] [ 45%] 2025-12-04T14:34:54.6802226Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_complex32 PASSED [0.2318s] [ 45%] 2025-12-04T14:34:54.6802801Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_complex64 PASSED [0.2237s] [ 45%] 2025-12-04T14:34:54.6803269Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_float16 PASSED [0.2299s] [ 45%] 2025-12-04T14:34:54.6803777Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_float32 PASSED [0.2387s] [ 45%] 2025-12-04T14:34:54.6804244Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_float64 PASSED [0.2346s] [ 45%] 2025-12-04T14:34:54.6804698Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_int16 PASSED [0.2242s] [ 45%] 2025-12-04T14:34:54.6805162Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_neg_cuda_int32 PASSED [0.2220s] [ 45%] 2025-12-04T14:34:54.6805708Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_nn_functional_relu_cuda_float16 PASSED [0.2188s] [ 45%] 2025-12-04T14:34:54.6806266Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_nn_functional_relu_cuda_float64 PASSED [0.2208s] [ 45%] 2025-12-04T14:34:54.6806804Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_nn_functional_relu_cuda_int32 PASSED [0.2228s] [ 45%] 2025-12-04T14:34:54.6807307Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_bfloat16 PASSED [0.2212s] [ 45%] 2025-12-04T14:34:54.6807839Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_complex128 PASSED [0.2194s] [ 46%] 2025-12-04T14:34:54.6808346Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_complex32 PASSED [0.2190s] [ 46%] 2025-12-04T14:34:54.6808837Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_int16 PASSED [0.2208s] [ 46%] 2025-12-04T14:34:54.6809317Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_int32 PASSED [0.2213s] [ 46%] 2025-12-04T14:34:54.6809805Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_int8 PASSED [0.2238s] [ 46%] 2025-12-04T14:34:54.6810336Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_positive_cuda_uint8 PASSED [0.2212s] [ 46%] 2025-12-04T14:34:54.6810810Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_bool PASSED [0.2190s] [ 46%] 2025-12-04T14:34:54.6811306Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_float16 PASSED [0.2189s] [ 46%] 2025-12-04T14:34:54.6811792Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_float32 PASSED [0.2197s] [ 46%] 2025-12-04T14:34:54.6812274Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_float64 PASSED [0.2196s] [ 46%] 2025-12-04T14:34:54.6812758Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_int16 PASSED [0.2274s] [ 46%] 2025-12-04T14:34:54.6813240Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_rad2deg_cuda_uint8 PASSED [0.2303s] [ 46%] 2025-12-04T14:34:54.6813761Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_bfloat16 PASSED [0.0158s] [ 46%] 2025-12-04T14:34:54.6814278Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_complex32 PASSED [0.2297s] [ 46%] 2025-12-04T14:34:54.6814782Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_float16 PASSED [0.0122s] [ 46%] 2025-12-04T14:34:54.6815293Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_randn_like_cuda_float64 PASSED [0.2253s] [ 46%] 2025-12-04T14:34:54.6815775Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_bfloat16 PASSED [0.2195s] [ 46%] 2025-12-04T14:34:54.6816263Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_float16 PASSED [0.2234s] [ 46%] 2025-12-04T14:34:54.6816801Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_float64 PASSED [0.2207s] [ 46%] 2025-12-04T14:34:54.6817300Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_int32 PASSED [0.2302s] [ 46%] 2025-12-04T14:34:54.6817778Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_int64 PASSED [0.2281s] [ 46%] 2025-12-04T14:34:54.6818319Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_round_cuda_int8 PASSED [0.2297s] [ 46%] 2025-12-04T14:34:54.6818806Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_bfloat16 PASSED [0.2226s] [ 46%] 2025-12-04T14:34:54.6819288Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_complex128 PASSED [0.2280s] [ 46%] 2025-12-04T14:34:54.6819761Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_complex32 PASSED [0.2234s] [ 47%] 2025-12-04T14:34:54.6820245Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_float16 PASSED [0.2189s] [ 47%] 2025-12-04T14:34:54.6820709Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_float32 PASSED [0.2200s] [ 47%] 2025-12-04T14:34:54.6821178Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sgn_cuda_int32 PASSED [0.2216s] [ 47%] 2025-12-04T14:34:54.6821634Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sign_cuda_bool PASSED [0.2240s] [ 47%] 2025-12-04T14:34:54.6822092Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sign_cuda_int32 PASSED [0.2217s] [ 47%] 2025-12-04T14:34:54.6822556Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sign_cuda_int8 PASSED [0.2200s] [ 47%] 2025-12-04T14:34:54.6823047Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_bfloat16 PASSED [0.2314s] [ 47%] 2025-12-04T14:34:54.6823539Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_bool PASSED [0.2202s] [ 47%] 2025-12-04T14:34:54.6824067Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_float16 PASSED [0.2191s] [ 47%] 2025-12-04T14:34:54.6824541Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_int32 PASSED [0.2187s] [ 47%] 2025-12-04T14:34:54.6825019Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_int8 PASSED [0.2297s] [ 47%] 2025-12-04T14:34:54.6825491Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_signbit_cuda_uint8 PASSED [0.2239s] [ 47%] 2025-12-04T14:34:54.6825980Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_complex32 PASSED [0.2250s] [ 47%] 2025-12-04T14:34:54.6826451Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_complex64 PASSED [0.2200s] [ 47%] 2025-12-04T14:34:54.6826921Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_float16 PASSED [0.2205s] [ 47%] 2025-12-04T14:34:54.6827401Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_float32 PASSED [0.2230s] [ 47%] 2025-12-04T14:34:54.6827856Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_int16 PASSED [0.2202s] [ 47%] 2025-12-04T14:34:54.6828322Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_int64 PASSED [0.2206s] [ 47%] 2025-12-04T14:34:54.6828769Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_int8 PASSED [0.2190s] [ 47%] 2025-12-04T14:34:54.6829225Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sin_cuda_uint8 PASSED [0.2202s] [ 47%] 2025-12-04T14:34:54.6829708Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_bfloat16 PASSED [0.2206s] [ 47%] 2025-12-04T14:34:54.6830194Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_bool PASSED [0.2214s] [ 47%] 2025-12-04T14:34:54.6830716Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_complex64 PASSED [0.2178s] [ 47%] 2025-12-04T14:34:54.6831220Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_float32 PASSED [0.2203s] [ 48%] 2025-12-04T14:34:54.6831680Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_int16 PASSED [0.2228s] [ 48%] 2025-12-04T14:34:54.6832140Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_int32 PASSED [0.2223s] [ 48%] 2025-12-04T14:34:54.6832588Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_int8 PASSED [0.2197s] [ 48%] 2025-12-04T14:34:54.6833057Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sinh_cuda_uint8 PASSED [0.2212s] [ 48%] 2025-12-04T14:34:54.6833534Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_complex32 PASSED [0.2198s] [ 48%] 2025-12-04T14:34:54.6834016Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_complex64 PASSED [0.2241s] [ 48%] 2025-12-04T14:34:54.6834492Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_float64 PASSED [0.2236s] [ 48%] 2025-12-04T14:34:54.6834947Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_int32 PASSED [0.2198s] [ 48%] 2025-12-04T14:34:54.6835408Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_int8 PASSED [0.2204s] [ 48%] 2025-12-04T14:34:54.6835867Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sqrt_cuda_uint8 PASSED [0.2262s] [ 48%] 2025-12-04T14:34:54.6836311Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_bool PASSED [0.2308s] [ 48%] 2025-12-04T14:34:54.6836782Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_float16 PASSED [0.2332s] [ 48%] 2025-12-04T14:34:54.6837246Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_float32 PASSED [0.2313s] [ 48%] 2025-12-04T14:34:54.6837748Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_sum_cuda_int16 PASSED [0.2309s] [ 48%] 2025-12-04T14:34:54.6838217Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_bfloat16 PASSED [0.2261s] [ 48%] 2025-12-04T14:34:54.6838664Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_bool PASSED [0.2234s] [ 48%] 2025-12-04T14:34:54.6839142Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_complex32 PASSED [0.2195s] [ 48%] 2025-12-04T14:34:54.6839590Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_int16 PASSED [0.2215s] [ 48%] 2025-12-04T14:34:54.6840045Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_int32 PASSED [0.2218s] [ 48%] 2025-12-04T14:34:54.6840495Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_int8 PASSED [0.2229s] [ 48%] 2025-12-04T14:34:54.6840945Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tan_cuda_uint8 PASSED [0.2212s] [ 48%] 2025-12-04T14:34:54.6841407Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_bool PASSED [0.2216s] [ 48%] 2025-12-04T14:34:54.6841870Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_float32 PASSED [0.2233s] [ 48%] 2025-12-04T14:34:54.6842329Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_int16 PASSED [0.2304s] [ 49%] 2025-12-04T14:34:54.6842783Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_tanh_cuda_uint8 PASSED [0.2355s] [ 49%] 2025-12-04T14:34:54.6843581Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_bfloat16 SKIPPED [0.0030s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6844464Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_complex128 SKIPPED [0.0029s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6845292Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_complex64 SKIPPED [0.0026s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6846086Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_float32 SKIPPED [0.0028s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6846868Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_float64 SKIPPED [0.0026s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6847634Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_int16 SKIPPED [0.0026s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6848422Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_to_sparse_cuda_uint8 SKIPPED [0.0029s] (to_sparse does not support input with torch.sparse_bsc layout) [ 49%] 2025-12-04T14:34:54.6848900Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_bfloat16 PASSED [0.2284s] [ 49%] 2025-12-04T14:34:54.6849384Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_float32 PASSED [0.2215s] [ 49%] 2025-12-04T14:34:54.6849850Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_int16 PASSED [0.2215s] [ 49%] 2025-12-04T14:34:54.6850320Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_int64 PASSED [0.2185s] [ 49%] 2025-12-04T14:34:54.6850785Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_trunc_cuda_uint8 PASSED [0.2209s] [ 49%] 2025-12-04T14:34:54.6851302Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_zeros_like_cuda_complex32 PASSED [0.0107s] [ 49%] 2025-12-04T14:34:54.6851845Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_zeros_like_cuda_float32 PASSED [0.2308s] [ 49%] 2025-12-04T14:34:54.6852339Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSC_zeros_like_cuda_float64 PASSED [0.0105s] [ 49%] 2025-12-04T14:34:54.6852820Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_abs_cuda_bfloat16 PASSED [0.2265s] [ 49%] 2025-12-04T14:34:54.6853299Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_abs_cuda_complex128 PASSED [0.2226s] [ 49%] 2025-12-04T14:34:54.6853786Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_complex32 PASSED [0.2183s] [ 49%] 2025-12-04T14:34:54.6854283Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_complex64 PASSED [0.2203s] [ 49%] 2025-12-04T14:34:54.6854760Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_float64 PASSED [0.2189s] [ 49%] 2025-12-04T14:34:54.6855237Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_angle_cuda_int64 PASSED [0.2220s] [ 49%] 2025-12-04T14:34:54.6855695Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_bool PASSED [0.2197s] [ 49%] 2025-12-04T14:34:54.6856175Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_complex64 PASSED [0.2186s] [ 50%] 2025-12-04T14:34:54.6856637Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_int16 PASSED [0.2200s] [ 50%] 2025-12-04T14:34:54.6857095Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_int32 PASSED [0.2219s] [ 50%] 2025-12-04T14:34:54.6857551Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_int64 PASSED [0.2204s] [ 50%] 2025-12-04T14:34:54.6858089Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asin_cuda_uint8 PASSED [0.2203s] [ 50%] 2025-12-04T14:34:54.6858659Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_complex128 PASSED [0.2193s] [ 50%] 2025-12-04T14:34:54.6859192Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_complex32 PASSED [0.2179s] [ 50%] 2025-12-04T14:34:54.6859678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_complex64 PASSED [0.2195s] [ 50%] 2025-12-04T14:34:54.6860152Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_float16 PASSED [0.2225s] [ 50%] 2025-12-04T14:34:54.6860633Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_float32 PASSED [0.2211s] [ 50%] 2025-12-04T14:34:54.6861095Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_int16 PASSED [0.2202s] [ 50%] 2025-12-04T14:34:54.6861569Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_int32 PASSED [0.2194s] [ 50%] 2025-12-04T14:34:54.6862032Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_asinh_cuda_uint8 PASSED [0.2167s] [ 50%] 2025-12-04T14:34:54.6862503Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_bfloat16 PASSED [0.2205s] [ 50%] 2025-12-04T14:34:54.6862998Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_complex128 PASSED [0.2194s] [ 50%] 2025-12-04T14:34:54.6863476Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_complex32 PASSED [0.2207s] [ 50%] 2025-12-04T14:34:54.6863957Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_float16 PASSED [0.2188s] [ 50%] 2025-12-04T14:34:54.6864426Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_float64 PASSED [0.2139s] [ 50%] 2025-12-04T14:34:54.6864888Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_int32 PASSED [0.2131s] [ 50%] 2025-12-04T14:34:54.6865387Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_int64 PASSED [0.2138s] [ 50%] 2025-12-04T14:34:54.6865843Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_int8 PASSED [0.2174s] [ 50%] 2025-12-04T14:34:54.6866310Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atan_cuda_uint8 PASSED [0.2261s] [ 50%] 2025-12-04T14:34:54.6866797Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_complex32 PASSED [0.2217s] [ 50%] 2025-12-04T14:34:54.6867277Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_complex64 PASSED [0.2196s] [ 50%] 2025-12-04T14:34:54.6867759Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_float16 PASSED [0.2167s] [ 51%] 2025-12-04T14:34:54.6868220Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_int32 PASSED [0.2182s] [ 51%] 2025-12-04T14:34:54.6868691Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_int8 PASSED [0.2194s] [ 51%] 2025-12-04T14:34:54.6869158Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_atanh_cuda_uint8 PASSED [0.2172s] [ 51%] 2025-12-04T14:34:54.6869625Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_ceil_cuda_float64 PASSED [0.2158s] [ 51%] 2025-12-04T14:34:54.6870088Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_ceil_cuda_int16 PASSED [0.2145s] [ 51%] 2025-12-04T14:34:54.6870589Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_bool PASSED [0.2142s] [ 51%] 2025-12-04T14:34:54.6871133Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_complex128 PASSED [0.2142s] [ 51%] 2025-12-04T14:34:54.6871806Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_complex32 PASSED [0.2125s] [ 51%] 2025-12-04T14:34:54.6872474Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_float16 PASSED [0.2190s] [ 51%] 2025-12-04T14:34:54.6873029Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_float64 PASSED [0.2184s] [ 51%] 2025-12-04T14:34:54.6873535Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_int32 PASSED [0.2150s] [ 51%] 2025-12-04T14:34:54.6874048Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_int64 PASSED [0.2139s] [ 51%] 2025-12-04T14:34:54.6874548Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_conj_physical_cuda_int8 PASSED [0.2120s] [ 51%] 2025-12-04T14:34:54.6875029Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_deg2rad_cuda_float64 PASSED [0.2176s] [ 51%] 2025-12-04T14:34:54.6875520Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_deg2rad_cuda_int16 PASSED [0.2207s] [ 51%] 2025-12-04T14:34:54.6875992Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_deg2rad_cuda_int8 PASSED [0.2156s] [ 51%] 2025-12-04T14:34:54.6876467Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erf_cuda_float16 PASSED [0.2184s] [ 51%] 2025-12-04T14:34:54.6876919Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erf_cuda_uint8 PASSED [0.2160s] [ 51%] 2025-12-04T14:34:54.6877403Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_bfloat16 PASSED [0.2147s] [ 51%] 2025-12-04T14:34:54.6877889Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_float64 PASSED [0.2229s] [ 51%] 2025-12-04T14:34:54.6878354Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_int16 PASSED [0.2188s] [ 51%] 2025-12-04T14:34:54.6878829Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_int32 PASSED [0.2191s] [ 51%] 2025-12-04T14:34:54.6879391Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_erfinv_cuda_int8 PASSED [0.2181s] [ 51%] 2025-12-04T14:34:54.6879873Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_bfloat16 PASSED [0.2220s] [ 52%] 2025-12-04T14:34:54.6880338Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_bool PASSED [0.2220s] [ 52%] 2025-12-04T14:34:54.6880815Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_float32 PASSED [0.2184s] [ 52%] 2025-12-04T14:34:54.6881276Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_int8 PASSED [0.2195s] [ 52%] 2025-12-04T14:34:54.6881736Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_expm1_cuda_uint8 PASSED [0.2162s] [ 52%] 2025-12-04T14:34:54.6882210Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_bfloat16 PASSED [0.2272s] [ 52%] 2025-12-04T14:34:54.6882693Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_float32 PASSED [0.2221s] [ 52%] 2025-12-04T14:34:54.6883165Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_float64 PASSED [0.2227s] [ 52%] 2025-12-04T14:34:54.6883631Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_int8 PASSED [0.2208s] [ 52%] 2025-12-04T14:34:54.6884092Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_floor_cuda_uint8 PASSED [0.2218s] [ 52%] 2025-12-04T14:34:54.6884556Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_frac_cuda_float16 PASSED [0.2213s] [ 52%] 2025-12-04T14:34:54.6885029Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_frac_cuda_float64 PASSED [0.2201s] [ 52%] 2025-12-04T14:34:54.6885517Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_bool PASSED [0.2257s] [ 52%] 2025-12-04T14:34:54.6886048Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_complex128 PASSED [0.2194s] [ 52%] 2025-12-04T14:34:54.6886549Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_float16 PASSED [0.2193s] [ 52%] 2025-12-04T14:34:54.6887021Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_float32 PASSED [0.2195s] [ 52%] 2025-12-04T14:34:54.6887498Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_float64 PASSED [0.2179s] [ 52%] 2025-12-04T14:34:54.6888122Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_int64 PASSED [0.2167s] [ 52%] 2025-12-04T14:34:54.6888591Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isinf_cuda_uint8 PASSED [0.2194s] [ 52%] 2025-12-04T14:34:54.6889051Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_bool PASSED [0.2182s] [ 52%] 2025-12-04T14:34:54.6889539Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_complex64 PASSED [0.2193s] [ 52%] 2025-12-04T14:34:54.6890023Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_float16 PASSED [0.2156s] [ 52%] 2025-12-04T14:34:54.6890483Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isnan_cuda_int32 PASSED [0.2157s] [ 52%] 2025-12-04T14:34:54.6890991Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_bfloat16 PASSED [0.2175s] [ 52%] 2025-12-04T14:34:54.6891479Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_float32 PASSED [0.2159s] [ 53%] 2025-12-04T14:34:54.6891955Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_int16 PASSED [0.2277s] [ 53%] 2025-12-04T14:34:54.6892437Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isneginf_cuda_int32 PASSED [0.2225s] [ 53%] 2025-12-04T14:34:54.6893004Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_bfloat16 PASSED [0.2188s] [ 53%] 2025-12-04T14:34:54.6893497Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_float32 PASSED [0.2175s] [ 53%] 2025-12-04T14:34:54.6893973Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_int32 PASSED [0.2165s] [ 53%] 2025-12-04T14:34:54.6894454Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_isposinf_cuda_int64 PASSED [0.2162s] [ 53%] 2025-12-04T14:34:54.6894939Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_bfloat16 PASSED [0.2176s] [ 53%] 2025-12-04T14:34:54.6895429Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_complex128 PASSED [0.2180s] [ 53%] 2025-12-04T14:34:54.6895926Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_complex64 PASSED [0.2170s] [ 53%] 2025-12-04T14:34:54.6896401Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_float32 PASSED [0.2171s] [ 53%] 2025-12-04T14:34:54.6896890Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_float64 PASSED [0.2202s] [ 53%] 2025-12-04T14:34:54.6897375Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int16 PASSED [0.2168s] [ 53%] 2025-12-04T14:34:54.6897842Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int32 PASSED [0.2214s] [ 53%] 2025-12-04T14:34:54.6898401Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int64 PASSED [0.2169s] [ 53%] 2025-12-04T14:34:54.6898871Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_int8 PASSED [0.2221s] [ 53%] 2025-12-04T14:34:54.6899390Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_log1p_cuda_uint8 PASSED [0.2173s] [ 53%] 2025-12-04T14:34:54.6900271Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_float32 SKIPPED [0.0030s] (masked.amax does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6901134Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_float64 SKIPPED [0.0028s] (masked.amax does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6901957Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_int32 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6902763Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_int64 SKIPPED [0.0028s] (masked.amax does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6903583Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amax_cuda_uint8 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6904408Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_bfloat16 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6905225Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_float32 SKIPPED [0.0028s] (masked.amin does not support input with torch.sparse_bsr layout) [ 53%] 2025-12-04T14:34:54.6906044Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_int64 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6906845Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_amin_cuda_uint8 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6907684Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_mean_cuda_bfloat16 SKIPPED [0.0028s] (masked.mean does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6908583Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_mean_cuda_complex128 SKIPPED [0.0026s] (masked.mean does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6909434Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_complex128 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6910252Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_float16 SKIPPED [0.0029s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6911075Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_float32 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6911913Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_float64 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6912721Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_int16 SKIPPED [0.0029s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6913539Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_int32 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6914343Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_prod_cuda_uint8 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6915173Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_complex64 SKIPPED [0.0029s] (masked.sum does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6916069Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_float16 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6916862Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_int16 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6917670Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_masked_sum_cuda_uint8 SKIPPED [0.0029s] (masked.sum does not support input with torch.sparse_bsr layout) [ 54%] 2025-12-04T14:34:54.6918150Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_bfloat16 PASSED [0.2268s] [ 54%] 2025-12-04T14:34:54.6918646Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_complex32 PASSED [0.0141s] [ 54%] 2025-12-04T14:34:54.6919117Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_float16 PASSED [0.0130s] [ 54%] 2025-12-04T14:34:54.6919592Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_float32 PASSED [0.2266s] [ 54%] 2025-12-04T14:34:54.6920075Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_float64 PASSED [0.0137s] [ 54%] 2025-12-04T14:34:54.6920536Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_int16 PASSED [0.0118s] [ 54%] 2025-12-04T14:34:54.6921009Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_mul_cuda_int8 PASSED [0.2249s] [ 54%] 2025-12-04T14:34:54.6921480Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_bfloat16 PASSED [0.2189s] [ 54%] 2025-12-04T14:34:54.6921962Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_complex128 PASSED [0.2326s] [ 54%] 2025-12-04T14:34:54.6922454Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_complex32 PASSED [0.2384s] [ 55%] 2025-12-04T14:34:54.6922953Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_float16 PASSED [0.2246s] [ 55%] 2025-12-04T14:34:54.6923437Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_float32 PASSED [0.2204s] [ 55%] 2025-12-04T14:34:54.6923905Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_float64 PASSED [0.2210s] [ 55%] 2025-12-04T14:34:54.6924361Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_int64 PASSED [0.2206s] [ 55%] 2025-12-04T14:34:54.6924825Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_int8 PASSED [0.2191s] [ 55%] 2025-12-04T14:34:54.6925279Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_neg_cuda_uint8 PASSED [0.2172s] [ 55%] 2025-12-04T14:34:54.6925848Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_bfloat16 PASSED [0.2177s] [ 55%] 2025-12-04T14:34:54.6926397Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_float16 PASSED [0.2276s] [ 55%] 2025-12-04T14:34:54.6926939Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_float64 PASSED [0.2247s] [ 55%] 2025-12-04T14:34:54.6927483Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_int8 PASSED [0.2251s] [ 55%] 2025-12-04T14:34:54.6928019Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_nn_functional_relu_cuda_uint8 PASSED [0.2248s] [ 55%] 2025-12-04T14:34:54.6928534Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_bfloat16 PASSED [0.2182s] [ 55%] 2025-12-04T14:34:54.6929050Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_complex128 PASSED [0.2197s] [ 55%] 2025-12-04T14:34:54.6929627Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_complex32 PASSED [0.2191s] [ 55%] 2025-12-04T14:34:54.6930178Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_complex64 PASSED [0.2191s] [ 55%] 2025-12-04T14:34:54.6930671Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_float16 PASSED [0.2171s] [ 55%] 2025-12-04T14:34:54.6931179Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_float32 PASSED [0.2200s] [ 55%] 2025-12-04T14:34:54.6931660Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_int16 PASSED [0.2176s] [ 55%] 2025-12-04T14:34:54.6932140Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_int64 PASSED [0.2221s] [ 55%] 2025-12-04T14:34:54.6932638Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_positive_cuda_uint8 PASSED [0.2202s] [ 55%] 2025-12-04T14:34:54.6933135Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_bfloat16 PASSED [0.2219s] [ 55%] 2025-12-04T14:34:54.6933638Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_float32 PASSED [0.2250s] [ 55%] 2025-12-04T14:34:54.6934110Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_int16 PASSED [0.2240s] [ 55%] 2025-12-04T14:34:54.6934586Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_int32 PASSED [0.2276s] [ 56%] 2025-12-04T14:34:54.6935071Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_rad2deg_cuda_int8 PASSED [0.2195s] [ 56%] 2025-12-04T14:34:54.6935582Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_randn_like_cuda_bfloat16 PASSED [0.0121s] [ 56%] 2025-12-04T14:34:54.6936109Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_randn_like_cuda_complex64 PASSED [0.2315s] [ 56%] 2025-12-04T14:34:54.6936593Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_bfloat16 PASSED [0.2189s] [ 56%] 2025-12-04T14:34:54.6937104Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_float16 PASSED [0.2178s] [ 56%] 2025-12-04T14:34:54.6937593Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_float32 PASSED [0.2228s] [ 56%] 2025-12-04T14:34:54.6938061Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_int32 PASSED [0.2289s] [ 56%] 2025-12-04T14:34:54.6938651Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_round_cuda_int8 PASSED [0.2232s] [ 56%] 2025-12-04T14:34:54.6939125Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_bfloat16 PASSED [0.2178s] [ 56%] 2025-12-04T14:34:54.6939603Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_complex32 PASSED [0.2230s] [ 56%] 2025-12-04T14:34:54.6940088Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_float16 PASSED [0.2204s] [ 56%] 2025-12-04T14:34:54.6940560Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_float32 PASSED [0.2225s] [ 56%] 2025-12-04T14:34:54.6941039Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_float64 PASSED [0.2226s] [ 56%] 2025-12-04T14:34:54.6941497Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_int64 PASSED [0.2271s] [ 56%] 2025-12-04T14:34:54.6941957Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sgn_cuda_uint8 PASSED [0.2176s] [ 56%] 2025-12-04T14:34:54.6942429Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_bool PASSED [0.2183s] [ 56%] 2025-12-04T14:34:54.6942901Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_float32 PASSED [0.2191s] [ 56%] 2025-12-04T14:34:54.6943441Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_int16 PASSED [0.2200s] [ 56%] 2025-12-04T14:34:54.6943904Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sign_cuda_int64 PASSED [0.2177s] [ 56%] 2025-12-04T14:34:54.6944426Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_bfloat16 PASSED [0.2174s] [ 56%] 2025-12-04T14:34:54.6944923Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_float64 PASSED [0.2264s] [ 56%] 2025-12-04T14:34:54.6945399Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_int32 PASSED [0.2259s] [ 56%] 2025-12-04T14:34:54.6945878Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_int8 PASSED [0.2201s] [ 56%] 2025-12-04T14:34:54.6946354Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_signbit_cuda_uint8 PASSED [0.2195s] [ 57%] 2025-12-04T14:34:54.6946827Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_bfloat16 PASSED [0.2208s] [ 57%] 2025-12-04T14:34:54.6947319Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_complex32 PASSED [0.2196s] [ 57%] 2025-12-04T14:34:54.6947793Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_complex64 PASSED [0.2190s] [ 57%] 2025-12-04T14:34:54.6948268Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_float32 PASSED [0.2187s] [ 57%] 2025-12-04T14:34:54.6948728Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sin_cuda_int16 PASSED [0.2206s] [ 57%] 2025-12-04T14:34:54.6949203Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_bfloat16 PASSED [0.2295s] [ 57%] 2025-12-04T14:34:54.6949695Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_complex32 PASSED [0.2303s] [ 57%] 2025-12-04T14:34:54.6950181Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_complex64 PASSED [0.2295s] [ 57%] 2025-12-04T14:34:54.6950695Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_float32 PASSED [0.2205s] [ 57%] 2025-12-04T14:34:54.6951168Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_float64 PASSED [0.2171s] [ 57%] 2025-12-04T14:34:54.6951628Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_int32 PASSED [0.2172s] [ 57%] 2025-12-04T14:34:54.6952098Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_int64 PASSED [0.2152s] [ 57%] 2025-12-04T14:34:54.6952558Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sinh_cuda_uint8 PASSED [0.2175s] [ 57%] 2025-12-04T14:34:54.6953029Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_bool PASSED [0.2153s] [ 57%] 2025-12-04T14:34:54.6953514Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_complex32 PASSED [0.2222s] [ 57%] 2025-12-04T14:34:54.6953989Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_float64 PASSED [0.2303s] [ 57%] 2025-12-04T14:34:54.6954463Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_int16 PASSED [0.2257s] [ 57%] 2025-12-04T14:34:54.6954921Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sqrt_cuda_int64 PASSED [0.2293s] [ 57%] 2025-12-04T14:34:54.6955406Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_bfloat16 PASSED [0.2330s] [ 57%] 2025-12-04T14:34:54.6955872Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_float64 PASSED [0.2334s] [ 57%] 2025-12-04T14:34:54.6956328Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_int64 PASSED [0.2287s] [ 57%] 2025-12-04T14:34:54.6956831Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_sum_cuda_uint8 PASSED [0.2293s] [ 57%] 2025-12-04T14:34:54.6957338Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_bfloat16 PASSED [0.2195s] [ 57%] 2025-12-04T14:34:54.6957877Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_complex128 PASSED [0.2261s] [ 58%] 2025-12-04T14:34:54.6958356Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_complex32 PASSED [0.2245s] [ 58%] 2025-12-04T14:34:54.6958830Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_complex64 PASSED [0.2191s] [ 58%] 2025-12-04T14:34:54.6959303Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_float16 PASSED [0.2205s] [ 58%] 2025-12-04T14:34:54.6959768Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_float64 PASSED [0.2207s] [ 58%] 2025-12-04T14:34:54.6960240Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_int16 PASSED [0.2212s] [ 58%] 2025-12-04T14:34:54.6960698Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_int32 PASSED [0.2257s] [ 58%] 2025-12-04T14:34:54.6961158Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tan_cuda_int64 PASSED [0.2243s] [ 58%] 2025-12-04T14:34:54.6961661Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_complex128 PASSED [0.2235s] [ 58%] 2025-12-04T14:34:54.6962134Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_float16 PASSED [0.2243s] [ 58%] 2025-12-04T14:34:54.6962604Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_int8 PASSED [0.2205s] [ 58%] 2025-12-04T14:34:54.6963062Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_tanh_cuda_uint8 PASSED [0.2166s] [ 58%] 2025-12-04T14:34:54.6963844Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_bool SKIPPED [0.0030s] (to_sparse does not support input with torch.sparse_bsr layout) [ 58%] 2025-12-04T14:34:54.6964707Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_complex128 SKIPPED [0.0029s] (to_sparse does not support input with torch.sparse_bsr layout) [ 58%] 2025-12-04T14:34:54.6965493Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_float64 SKIPPED [0.0026s] (to_sparse does not support input with torch.sparse_bsr layout) [ 58%] 2025-12-04T14:34:54.6966282Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_int16 SKIPPED [0.0028s] (to_sparse does not support input with torch.sparse_bsr layout) [ 58%] 2025-12-04T14:34:54.6967063Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_to_sparse_cuda_int32 SKIPPED [0.0026s] (to_sparse does not support input with torch.sparse_bsr layout) [ 58%] 2025-12-04T14:34:54.6967544Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_trunc_cuda_float32 PASSED [0.2220s] [ 58%] 2025-12-04T14:34:54.6968027Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_trunc_cuda_int64 PASSED [0.2190s] [ 58%] 2025-12-04T14:34:54.6968539Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_bfloat16 PASSED [0.0105s] [ 58%] 2025-12-04T14:34:54.6969068Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_complex128 PASSED [0.2230s] [ 58%] 2025-12-04T14:34:54.6969581Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_complex32 PASSED [0.0106s] [ 58%] 2025-12-04T14:34:54.6970096Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_complex64 PASSED [0.2198s] [ 58%] 2025-12-04T14:34:54.6970610Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_float64 PASSED [0.0103s] [ 58%] 2025-12-04T14:34:54.6971134Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_int16 PASSED [0.2194s] [ 59%] 2025-12-04T14:34:54.6971671Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_int32 PASSED [0.0100s] [ 59%] 2025-12-04T14:34:54.6972188Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseBSR_zeros_like_cuda_int64 PASSED [0.2190s] [ 59%] 2025-12-04T14:34:54.6972644Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_bool PASSED [0.2386s] [ 59%] 2025-12-04T14:34:54.6973136Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_complex128 PASSED [0.2208s] [ 59%] 2025-12-04T14:34:54.6973588Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_int16 PASSED [0.2204s] [ 59%] 2025-12-04T14:34:54.6974056Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_int32 PASSED [0.2169s] [ 59%] 2025-12-04T14:34:54.6974514Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_abs_cuda_uint8 PASSED [0.2166s] [ 59%] 2025-12-04T14:34:54.6974978Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_bool PASSED [0.2200s] [ 59%] 2025-12-04T14:34:54.6975468Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_float64 PASSED [0.2182s] [ 59%] 2025-12-04T14:34:54.6975930Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_int32 PASSED [0.2172s] [ 59%] 2025-12-04T14:34:54.6976413Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_angle_cuda_uint8 PASSED [0.2189s] [ 59%] 2025-12-04T14:34:54.6976902Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_complex128 PASSED [0.2328s] [ 59%] 2025-12-04T14:34:54.6977384Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_complex32 PASSED [0.2210s] [ 59%] 2025-12-04T14:34:54.6977871Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_float32 PASSED [0.2324s] [ 59%] 2025-12-04T14:34:54.6978461Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_int32 PASSED [0.2249s] [ 59%] 2025-12-04T14:34:54.6978939Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_int64 PASSED [0.2213s] [ 59%] 2025-12-04T14:34:54.6979396Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asin_cuda_int8 PASSED [0.2220s] [ 59%] 2025-12-04T14:34:54.6979878Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_bfloat16 PASSED [0.2159s] [ 59%] 2025-12-04T14:34:54.6980386Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_complex128 PASSED [0.2168s] [ 59%] 2025-12-04T14:34:54.6980871Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_complex32 PASSED [0.2209s] [ 59%] 2025-12-04T14:34:54.6981376Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_complex64 PASSED [0.2254s] [ 59%] 2025-12-04T14:34:54.6981857Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_float16 PASSED [0.2230s] [ 59%] 2025-12-04T14:34:54.6982330Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_int64 PASSED [0.2227s] [ 59%] 2025-12-04T14:34:54.6982812Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_asinh_cuda_uint8 PASSED [0.2126s] [ 60%] 2025-12-04T14:34:54.6983270Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_bool PASSED [0.2196s] [ 60%] 2025-12-04T14:34:54.6983768Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_complex32 PASSED [0.2217s] [ 60%] 2025-12-04T14:34:54.6984238Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_float16 PASSED [0.2166s] [ 60%] 2025-12-04T14:34:54.6984708Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_float64 PASSED [0.2175s] [ 60%] 2025-12-04T14:34:54.6985247Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atan_cuda_int16 PASSED [0.2204s] [ 60%] 2025-12-04T14:34:54.6985763Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_bfloat16 PASSED [0.2190s] [ 60%] 2025-12-04T14:34:54.6986264Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_complex64 PASSED [0.2225s] [ 60%] 2025-12-04T14:34:54.6986737Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_float32 PASSED [0.2275s] [ 60%] 2025-12-04T14:34:54.6987204Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int16 PASSED [0.2165s] [ 60%] 2025-12-04T14:34:54.6987684Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int32 PASSED [0.2262s] [ 60%] 2025-12-04T14:34:54.6988297Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int64 PASSED [0.2238s] [ 60%] 2025-12-04T14:34:54.6988776Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_atanh_cuda_int8 PASSED [0.2286s] [ 60%] 2025-12-04T14:34:54.6989253Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_float16 PASSED [0.2213s] [ 60%] 2025-12-04T14:34:54.7000013Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_float32 PASSED [0.2299s] [ 60%] 2025-12-04T14:34:54.7000614Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_float64 PASSED [0.2488s] [ 60%] 2025-12-04T14:34:54.7001087Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_int32 PASSED [0.2475s] [ 60%] 2025-12-04T14:34:54.7001559Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_ceil_cuda_int8 PASSED [0.2493s] [ 60%] 2025-12-04T14:34:54.7002081Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_bfloat16 PASSED [0.2516s] [ 60%] 2025-12-04T14:34:54.7002633Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_complex128 PASSED [0.2525s] [ 60%] 2025-12-04T14:34:54.7003316Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_complex64 PASSED [0.2525s] [ 60%] 2025-12-04T14:34:54.7003834Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_float16 PASSED [0.2528s] [ 60%] 2025-12-04T14:34:54.7004360Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_float32 PASSED [0.2493s] [ 60%] 2025-12-04T14:34:54.7004862Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int16 PASSED [0.2452s] [ 60%] 2025-12-04T14:34:54.7005367Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int32 PASSED [0.2289s] [ 61%] 2025-12-04T14:34:54.7005883Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int64 PASSED [0.2172s] [ 61%] 2025-12-04T14:34:54.7006383Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_int8 PASSED [0.2204s] [ 61%] 2025-12-04T14:34:54.7006904Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_conj_physical_cuda_uint8 PASSED [0.2197s] [ 61%] 2025-12-04T14:34:54.7007394Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_bfloat16 PASSED [0.2251s] [ 61%] 2025-12-04T14:34:54.7007878Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_float64 PASSED [0.2244s] [ 61%] 2025-12-04T14:34:54.7008365Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_int16 PASSED [0.2229s] [ 61%] 2025-12-04T14:34:54.7008835Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_deg2rad_cuda_int8 PASSED [0.2210s] [ 61%] 2025-12-04T14:34:54.7009371Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erf_cuda_float16 PASSED [0.2211s] [ 61%] 2025-12-04T14:34:54.7009876Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erf_cuda_int32 PASSED [0.2285s] [ 61%] 2025-12-04T14:34:54.7010377Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erf_cuda_int64 PASSED [0.2227s] [ 61%] 2025-12-04T14:34:54.7010869Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_erfinv_cuda_float32 PASSED [0.2191s] [ 61%] 2025-12-04T14:34:54.7011337Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_expm1_cuda_float64 PASSED [0.2172s] [ 61%] 2025-12-04T14:34:54.7011807Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_expm1_cuda_int8 PASSED [0.2188s] [ 61%] 2025-12-04T14:34:54.7012270Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_expm1_cuda_uint8 PASSED [0.2170s] [ 61%] 2025-12-04T14:34:54.7012751Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_bfloat16 PASSED [0.2173s] [ 61%] 2025-12-04T14:34:54.7013227Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int16 PASSED [0.2167s] [ 61%] 2025-12-04T14:34:54.7013697Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int32 PASSED [0.2198s] [ 61%] 2025-12-04T14:34:54.7014157Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int64 PASSED [0.2197s] [ 61%] 2025-12-04T14:34:54.7014629Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_int8 PASSED [0.2162s] [ 61%] 2025-12-04T14:34:54.7015095Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_floor_cuda_uint8 PASSED [0.2235s] [ 61%] 2025-12-04T14:34:54.7015574Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_frac_cuda_float32 PASSED [0.2212s] [ 61%] 2025-12-04T14:34:54.7016056Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_complex32 PASSED [0.2217s] [ 61%] 2025-12-04T14:34:54.7016546Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_complex64 PASSED [0.2167s] [ 61%] 2025-12-04T14:34:54.7017062Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_float16 PASSED [0.2154s] [ 62%] 2025-12-04T14:34:54.7017539Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_float64 PASSED [0.2233s] [ 62%] 2025-12-04T14:34:54.7018011Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_int32 PASSED [0.2183s] [ 62%] 2025-12-04T14:34:54.7018595Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isinf_cuda_uint8 PASSED [0.2199s] [ 62%] 2025-12-04T14:34:54.7019059Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_bool PASSED [0.2161s] [ 62%] 2025-12-04T14:34:54.7019559Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_complex128 PASSED [0.2185s] [ 62%] 2025-12-04T14:34:54.7020031Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_int16 PASSED [0.2159s] [ 62%] 2025-12-04T14:34:54.7020506Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isnan_cuda_int64 PASSED [0.2207s] [ 62%] 2025-12-04T14:34:54.7021006Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_bfloat16 PASSED [0.2204s] [ 62%] 2025-12-04T14:34:54.7021500Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_float16 PASSED [0.2193s] [ 62%] 2025-12-04T14:34:54.7022001Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_float32 PASSED [0.2173s] [ 62%] 2025-12-04T14:34:54.7022482Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_int16 PASSED [0.2178s] [ 62%] 2025-12-04T14:34:54.7022973Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_int64 PASSED [0.2140s] [ 62%] 2025-12-04T14:34:54.7023539Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isneginf_cuda_uint8 PASSED [0.2157s] [ 62%] 2025-12-04T14:34:54.7024040Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_bfloat16 PASSED [0.2116s] [ 62%] 2025-12-04T14:34:54.7024562Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_bool PASSED [0.2170s] [ 62%] 2025-12-04T14:34:54.7025054Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_float16 PASSED [0.2159s] [ 62%] 2025-12-04T14:34:54.7025553Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_isposinf_cuda_float32 PASSED [0.2155s] [ 62%] 2025-12-04T14:34:54.7026034Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_bfloat16 PASSED [0.2137s] [ 62%] 2025-12-04T14:34:54.7026508Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_float32 PASSED [0.2150s] [ 62%] 2025-12-04T14:34:54.7026985Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_int16 PASSED [0.2163s] [ 62%] 2025-12-04T14:34:54.7027448Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_int8 PASSED [0.2180s] [ 62%] 2025-12-04T14:34:54.7027923Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_log1p_cuda_uint8 PASSED [0.2154s] [ 62%] 2025-12-04T14:34:54.7028751Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_bfloat16 SKIPPED [0.0030s] (masked.amax does not support input with torch.sparse_csc layout) [ 62%] 2025-12-04T14:34:54.7029570Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_float16 SKIPPED [0.0028s] (masked.amax does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7030399Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_float64 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7031239Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_int32 SKIPPED [0.0028s] (masked.amax does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7032036Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amax_cuda_int8 SKIPPED [0.0026s] (masked.amax does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7032857Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_float32 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7033669Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_float64 SKIPPED [0.0029s] (masked.amin does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7034471Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_int16 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7035273Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_amin_cuda_int32 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7036107Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_mean_cuda_complex64 SKIPPED [0.0029s] (masked.mean does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7036920Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_mean_cuda_float16 SKIPPED [0.0026s] (masked.mean does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7037744Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_mean_cuda_float32 SKIPPED [0.0026s] (masked.mean does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7038598Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_bool SKIPPED [0.0029s] (masked.prod does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7039462Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_complex64 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7040283Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_float16 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7041086Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_int16 SKIPPED [0.0028s] (masked.prod does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7041895Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_int64 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7042695Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_prod_cuda_uint8 SKIPPED [0.0026s] (masked.prod does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7043517Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_bfloat16 SKIPPED [0.0029s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7044308Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_float32 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7045102Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_float64 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7045894Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_int16 SKIPPED [0.0029s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7046716Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_int32 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7047505Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_int8 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7048289Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_masked_sum_cuda_uint8 SKIPPED [0.0026s] (masked.sum does not support input with torch.sparse_csc layout) [ 63%] 2025-12-04T14:34:54.7048783Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_complex128 PASSED [0.2345s] [ 64%] 2025-12-04T14:34:54.7049262Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_complex32 PASSED [0.2268s] [ 64%] 2025-12-04T14:34:54.7049731Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_float32 PASSED [0.2285s] [ 64%] 2025-12-04T14:34:54.7050197Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_int64 PASSED [0.2234s] [ 64%] 2025-12-04T14:34:54.7050653Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_mul_cuda_int8 PASSED [0.2208s] [ 64%] 2025-12-04T14:34:54.7051139Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_complex128 PASSED [0.2254s] [ 64%] 2025-12-04T14:34:54.7051615Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_complex32 PASSED [0.2217s] [ 64%] 2025-12-04T14:34:54.7052090Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_complex64 PASSED [0.2257s] [ 64%] 2025-12-04T14:34:54.7052566Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_float16 PASSED [0.2213s] [ 64%] 2025-12-04T14:34:54.7053063Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_float32 PASSED [0.2164s] [ 64%] 2025-12-04T14:34:54.7053561Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int16 PASSED [0.2176s] [ 64%] 2025-12-04T14:34:54.7054044Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int32 PASSED [0.2156s] [ 64%] 2025-12-04T14:34:54.7054496Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int64 PASSED [0.2173s] [ 64%] 2025-12-04T14:34:54.7054952Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_int8 PASSED [0.2108s] [ 64%] 2025-12-04T14:34:54.7055404Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_neg_cuda_uint8 PASSED [0.2202s] [ 64%] 2025-12-04T14:34:54.7055958Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_bfloat16 PASSED [0.2225s] [ 64%] 2025-12-04T14:34:54.7056502Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_float16 PASSED [0.2196s] [ 64%] 2025-12-04T14:34:54.7057046Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_float32 PASSED [0.2148s] [ 64%] 2025-12-04T14:34:54.7057599Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_float64 PASSED [0.2243s] [ 64%] 2025-12-04T14:34:54.7058199Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_int16 PASSED [0.2187s] [ 64%] 2025-12-04T14:34:54.7058745Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_int64 PASSED [0.2219s] [ 64%] 2025-12-04T14:34:54.7059275Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_int8 PASSED [0.2235s] [ 64%] 2025-12-04T14:34:54.7059804Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_nn_functional_relu_cuda_uint8 PASSED [0.2171s] [ 64%] 2025-12-04T14:34:54.7060332Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_complex128 PASSED [0.2125s] [ 64%] 2025-12-04T14:34:54.7060881Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_complex32 PASSED [0.2158s] [ 65%] 2025-12-04T14:34:54.7061383Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_float32 PASSED [0.2166s] [ 65%] 2025-12-04T14:34:54.7061863Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_int64 PASSED [0.2180s] [ 65%] 2025-12-04T14:34:54.7062337Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_int8 PASSED [0.2250s] [ 65%] 2025-12-04T14:34:54.7062831Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_positive_cuda_uint8 PASSED [0.2190s] [ 65%] 2025-12-04T14:34:54.7063318Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_float16 PASSED [0.2175s] [ 65%] 2025-12-04T14:34:54.7063821Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_float32 PASSED [0.2177s] [ 65%] 2025-12-04T14:34:54.7064294Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_int16 PASSED [0.2172s] [ 65%] 2025-12-04T14:34:54.7064764Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_int8 PASSED [0.2186s] [ 65%] 2025-12-04T14:34:54.7065248Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_rad2deg_cuda_uint8 PASSED [0.2240s] [ 65%] 2025-12-04T14:34:54.7065759Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_bfloat16 PASSED [0.0132s] [ 65%] 2025-12-04T14:34:54.7066282Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_complex32 PASSED [0.2380s] [ 65%] 2025-12-04T14:34:54.7066820Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_complex64 PASSED [0.0134s] [ 65%] 2025-12-04T14:34:54.7067349Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_randn_like_cuda_float32 PASSED [0.2404s] [ 65%] 2025-12-04T14:34:54.7067866Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_round_cuda_float32 PASSED [0.2421s] [ 65%] 2025-12-04T14:34:54.7068331Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_round_cuda_int64 PASSED [0.2314s] [ 65%] 2025-12-04T14:34:54.7068808Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_round_cuda_uint8 PASSED [0.2233s] [ 65%] 2025-12-04T14:34:54.7069261Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_bool PASSED [0.2222s] [ 65%] 2025-12-04T14:34:54.7069740Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_complex128 PASSED [0.2145s] [ 65%] 2025-12-04T14:34:54.7070225Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_complex64 PASSED [0.2128s] [ 65%] 2025-12-04T14:34:54.7070690Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_float16 PASSED [0.2196s] [ 65%] 2025-12-04T14:34:54.7071167Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_float32 PASSED [0.2137s] [ 65%] 2025-12-04T14:34:54.7071618Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_int64 PASSED [0.2164s] [ 65%] 2025-12-04T14:34:54.7072064Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sgn_cuda_uint8 PASSED [0.2224s] [ 65%] 2025-12-04T14:34:54.7072551Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_bfloat16 PASSED [0.2153s] [ 66%] 2025-12-04T14:34:54.7073017Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_float16 PASSED [0.2179s] [ 66%] 2025-12-04T14:34:54.7073486Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_int32 PASSED [0.2140s] [ 66%] 2025-12-04T14:34:54.7073945Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sign_cuda_int64 PASSED [0.2164s] [ 66%] 2025-12-04T14:34:54.7074464Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_signbit_cuda_float64 PASSED [0.2163s] [ 66%] 2025-12-04T14:34:54.7074945Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_signbit_cuda_int16 PASSED [0.2183s] [ 66%] 2025-12-04T14:34:54.7075411Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_signbit_cuda_int8 PASSED [0.2214s] [ 66%] 2025-12-04T14:34:54.7075878Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_bfloat16 PASSED [0.2196s] [ 66%] 2025-12-04T14:34:54.7076338Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_bool PASSED [0.2130s] [ 66%] 2025-12-04T14:34:54.7076816Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_complex128 PASSED [0.2168s] [ 66%] 2025-12-04T14:34:54.7077300Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_complex64 PASSED [0.2206s] [ 66%] 2025-12-04T14:34:54.7077763Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_float16 PASSED [0.2135s] [ 66%] 2025-12-04T14:34:54.7078223Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_float64 PASSED [0.2148s] [ 66%] 2025-12-04T14:34:54.7078691Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_int16 PASSED [0.2149s] [ 66%] 2025-12-04T14:34:54.7079147Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_int32 PASSED [0.2165s] [ 66%] 2025-12-04T14:34:54.7079609Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sin_cuda_int64 PASSED [0.2192s] [ 66%] 2025-12-04T14:34:54.7080092Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_complex128 PASSED [0.2183s] [ 66%] 2025-12-04T14:34:54.7080604Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_float32 PASSED [0.2184s] [ 66%] 2025-12-04T14:34:54.7081114Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_float64 PASSED [0.2198s] [ 66%] 2025-12-04T14:34:54.7081606Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sinh_cuda_int64 PASSED [0.2165s] [ 66%] 2025-12-04T14:34:54.7082084Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_float16 PASSED [0.2298s] [ 66%] 2025-12-04T14:34:54.7082555Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_float32 PASSED [0.2178s] [ 66%] 2025-12-04T14:34:54.7083015Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_int64 PASSED [0.2192s] [ 66%] 2025-12-04T14:34:54.7083477Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_int8 PASSED [0.2145s] [ 66%] 2025-12-04T14:34:54.7083939Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sqrt_cuda_uint8 PASSED [0.2141s] [ 67%] 2025-12-04T14:34:54.7084404Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_bool PASSED [0.2274s] [ 67%] 2025-12-04T14:34:54.7084885Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_complex128 PASSED [0.2332s] [ 67%] 2025-12-04T14:34:54.7085351Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_float64 PASSED [0.2329s] [ 67%] 2025-12-04T14:34:54.7085813Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_sum_cuda_int32 PASSED [0.2388s] [ 67%] 2025-12-04T14:34:54.7086281Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_bfloat16 PASSED [0.2185s] [ 67%] 2025-12-04T14:34:54.7086741Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_bool PASSED [0.2125s] [ 67%] 2025-12-04T14:34:54.7087214Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_complex64 PASSED [0.2149s] [ 67%] 2025-12-04T14:34:54.7087886Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_float32 PASSED [0.2177s] [ 67%] 2025-12-04T14:34:54.7088353Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_int32 PASSED [0.2214s] [ 67%] 2025-12-04T14:34:54.7088807Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_int64 PASSED [0.2191s] [ 67%] 2025-12-04T14:34:54.7089267Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tan_cuda_int8 PASSED [0.2171s] [ 67%] 2025-12-04T14:34:54.7089747Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_complex32 PASSED [0.2122s] [ 67%] 2025-12-04T14:34:54.7090230Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_complex64 PASSED [0.2197s] [ 67%] 2025-12-04T14:34:54.7090714Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_float16 PASSED [0.2166s] [ 67%] 2025-12-04T14:34:54.7091185Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_float32 PASSED [0.2167s] [ 67%] 2025-12-04T14:34:54.7091663Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int16 PASSED [0.2188s] [ 67%] 2025-12-04T14:34:54.7092125Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int32 PASSED [0.2185s] [ 67%] 2025-12-04T14:34:54.7092586Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int64 PASSED [0.2163s] [ 67%] 2025-12-04T14:34:54.7093051Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_tanh_cuda_int8 PASSED [0.2143s] [ 67%] 2025-12-04T14:34:54.7093546Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_float32 PASSED [0.2300s] [ 67%] 2025-12-04T14:34:54.7094055Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_float64 PASSED [0.2456s] [ 67%] 2025-12-04T14:34:54.7094641Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_int64 PASSED [0.2293s] [ 67%] 2025-12-04T14:34:54.7095165Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_to_sparse_cuda_int8 PASSED [0.2164s] [ 67%] 2025-12-04T14:34:54.7095661Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_bfloat16 PASSED [0.2149s] [ 68%] 2025-12-04T14:34:54.7096135Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_float32 PASSED [0.2164s] [ 68%] 2025-12-04T14:34:54.7096614Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_int16 PASSED [0.2215s] [ 68%] 2025-12-04T14:34:54.7097074Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_int32 PASSED [0.2292s] [ 68%] 2025-12-04T14:34:54.7097532Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_int8 PASSED [0.2182s] [ 68%] 2025-12-04T14:34:54.7098016Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_trunc_cuda_uint8 PASSED [0.2177s] [ 68%] 2025-12-04T14:34:54.7098571Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_bool PASSED [0.0110s] [ 68%] 2025-12-04T14:34:54.7099104Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_complex128 PASSED [0.2172s] [ 68%] 2025-12-04T14:34:54.7099611Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_float64 PASSED [0.0113s] [ 68%] 2025-12-04T14:34:54.7100101Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_int16 PASSED [0.2219s] [ 68%] 2025-12-04T14:34:54.7100601Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSC_zeros_like_cuda_int32 PASSED [0.0110s] [ 68%] 2025-12-04T14:34:54.7101069Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_bfloat16 PASSED [0.2193s] [ 68%] 2025-12-04T14:34:54.7101535Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_bool PASSED [0.2181s] [ 68%] 2025-12-04T14:34:54.7102069Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_complex128 PASSED [0.2156s] [ 68%] 2025-12-04T14:34:54.7102544Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_complex32 PASSED [0.2132s] [ 68%] 2025-12-04T14:34:54.7103029Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_complex64 PASSED [0.2190s] [ 68%] 2025-12-04T14:34:54.7103491Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_float16 PASSED [0.2118s] [ 68%] 2025-12-04T14:34:54.7103966Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_float32 PASSED [0.2203s] [ 68%] 2025-12-04T14:34:54.7104433Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_float64 PASSED [0.2175s] [ 68%] 2025-12-04T14:34:54.7104894Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_int16 PASSED [0.2155s] [ 68%] 2025-12-04T14:34:54.7105362Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_int32 PASSED [0.2137s] [ 68%] 2025-12-04T14:34:54.7105819Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_abs_cuda_uint8 PASSED [0.2160s] [ 68%] 2025-12-04T14:34:54.7106288Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_bool PASSED [0.2186s] [ 68%] 2025-12-04T14:34:54.7106765Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_float32 PASSED [0.2216s] [ 68%] 2025-12-04T14:34:54.7107244Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_float64 PASSED [0.2164s] [ 69%] 2025-12-04T14:34:54.7107719Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_int16 PASSED [0.2160s] [ 69%] 2025-12-04T14:34:54.7108215Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_int32 PASSED [0.2121s] [ 69%] 2025-12-04T14:34:54.7108715Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_int8 PASSED [0.2148s] [ 69%] 2025-12-04T14:34:54.7109281Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_angle_cuda_uint8 PASSED [0.2219s] [ 69%] 2025-12-04T14:34:54.7109753Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_bfloat16 PASSED [0.2408s] [ 69%] 2025-12-04T14:34:54.7110224Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_bool PASSED [0.2291s] [ 69%] 2025-12-04T14:34:54.7110711Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_complex128 PASSED [0.2237s] [ 69%] 2025-12-04T14:34:54.7111207Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_complex32 PASSED [0.2302s] [ 69%] 2025-12-04T14:34:54.7111678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_float32 PASSED [0.2251s] [ 69%] 2025-12-04T14:34:54.7112154Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asin_cuda_float64 PASSED [0.2278s] [ 69%] 2025-12-04T14:34:54.7112656Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asinh_cuda_complex128 PASSED [0.2255s] [ 69%] 2025-12-04T14:34:54.7113122Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_asinh_cuda_uint8 PASSED [0.2261s] [ 69%] 2025-12-04T14:34:54.7113613Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_complex32 PASSED [0.2166s] [ 69%] 2025-12-04T14:34:54.7114094Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_complex64 PASSED [0.2165s] [ 69%] 2025-12-04T14:34:54.7114559Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_float32 PASSED [0.2164s] [ 69%] 2025-12-04T14:34:54.7115030Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_int32 PASSED [0.2190s] [ 69%] 2025-12-04T14:34:54.7115515Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_int8 PASSED [0.2214s] [ 69%] 2025-12-04T14:34:54.7115978Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atan_cuda_uint8 PASSED [0.2193s] [ 69%] 2025-12-04T14:34:54.7116444Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_bool PASSED [0.2190s] [ 69%] 2025-12-04T14:34:54.7116934Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_complex128 PASSED [0.2390s] [ 69%] 2025-12-04T14:34:54.7117429Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_complex64 PASSED [0.2462s] [ 69%] 2025-12-04T14:34:54.7117904Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_float16 PASSED [0.2410s] [ 69%] 2025-12-04T14:34:54.7118377Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_float32 PASSED [0.2244s] [ 69%] 2025-12-04T14:34:54.7118863Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_float64 PASSED [0.2215s] [ 70%] 2025-12-04T14:34:54.7119329Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_int32 PASSED [0.2186s] [ 70%] 2025-12-04T14:34:54.7119802Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_int64 PASSED [0.2179s] [ 70%] 2025-12-04T14:34:54.7120259Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_int8 PASSED [0.2280s] [ 70%] 2025-12-04T14:34:54.7120719Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_atanh_cuda_uint8 PASSED [0.2150s] [ 70%] 2025-12-04T14:34:54.7121197Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_float64 PASSED [0.2178s] [ 70%] 2025-12-04T14:34:54.7121681Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_int16 PASSED [0.2187s] [ 70%] 2025-12-04T14:34:54.7122180Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_int64 PASSED [0.2154s] [ 70%] 2025-12-04T14:34:54.7122668Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_ceil_cuda_uint8 PASSED [0.2139s] [ 70%] 2025-12-04T14:34:54.7123191Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_bfloat16 PASSED [0.2194s] [ 70%] 2025-12-04T14:34:54.7123730Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_complex32 PASSED [0.2134s] [ 70%] 2025-12-04T14:34:54.7124253Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_complex64 PASSED [0.2200s] [ 70%] 2025-12-04T14:34:54.7124768Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_conj_physical_cuda_int32 PASSED [0.2175s] [ 70%] 2025-12-04T14:34:54.7125261Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_deg2rad_cuda_bfloat16 PASSED [0.2202s] [ 70%] 2025-12-04T14:34:54.7125736Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_deg2rad_cuda_int16 PASSED [0.2194s] [ 70%] 2025-12-04T14:34:54.7126228Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_deg2rad_cuda_int8 PASSED [0.2378s] [ 70%] 2025-12-04T14:34:54.7126676Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_bool PASSED [0.2193s] [ 70%] 2025-12-04T14:34:54.7127153Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_float16 PASSED [0.2296s] [ 70%] 2025-12-04T14:34:54.7127607Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_int16 PASSED [0.2367s] [ 70%] 2025-12-04T14:34:54.7128058Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_int32 PASSED [0.2238s] [ 70%] 2025-12-04T14:34:54.7128528Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_int64 PASSED [0.2218s] [ 70%] 2025-12-04T14:34:54.7129030Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erf_cuda_uint8 PASSED [0.2281s] [ 70%] 2025-12-04T14:34:54.7129510Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_bool PASSED [0.2285s] [ 70%] 2025-12-04T14:34:54.7129989Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_float16 PASSED [0.2287s] [ 70%] 2025-12-04T14:34:54.7130468Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_float32 PASSED [0.2260s] [ 71%] 2025-12-04T14:34:54.7130955Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_float64 PASSED [0.2328s] [ 71%] 2025-12-04T14:34:54.7131424Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_int16 PASSED [0.2258s] [ 71%] 2025-12-04T14:34:54.7131899Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_int64 PASSED [0.2239s] [ 71%] 2025-12-04T14:34:54.7132369Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_erfinv_cuda_int8 PASSED [0.2211s] [ 71%] 2025-12-04T14:34:54.7132851Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_bfloat16 PASSED [0.2233s] [ 71%] 2025-12-04T14:34:54.7133354Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_complex128 PASSED [0.2291s] [ 71%] 2025-12-04T14:34:54.7133838Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_complex64 PASSED [0.2439s] [ 71%] 2025-12-04T14:34:54.7134324Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_float16 PASSED [0.2228s] [ 71%] 2025-12-04T14:34:54.7134794Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_float32 PASSED [0.2234s] [ 71%] 2025-12-04T14:34:54.7135265Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_float64 PASSED [0.2164s] [ 71%] 2025-12-04T14:34:54.7135797Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_int32 PASSED [0.2220s] [ 71%] 2025-12-04T14:34:54.7136295Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_int64 PASSED [0.2197s] [ 71%] 2025-12-04T14:34:54.7136764Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_int8 PASSED [0.2174s] [ 71%] 2025-12-04T14:34:54.7137228Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_expm1_cuda_uint8 PASSED [0.2178s] [ 71%] 2025-12-04T14:34:54.7137708Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_bfloat16 PASSED [0.2204s] [ 71%] 2025-12-04T14:34:54.7138252Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_float16 PASSED [0.2215s] [ 71%] 2025-12-04T14:34:54.7138730Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_float32 PASSED [0.2228s] [ 71%] 2025-12-04T14:34:54.7139209Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_int8 PASSED [0.2224s] [ 71%] 2025-12-04T14:34:54.7139678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_floor_cuda_uint8 PASSED [0.2183s] [ 71%] 2025-12-04T14:34:54.7140153Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_frac_cuda_bfloat16 PASSED [0.2186s] [ 71%] 2025-12-04T14:34:54.7140637Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_frac_cuda_float64 PASSED [0.2170s] [ 71%] 2025-12-04T14:34:54.7141101Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_bool PASSED [0.2164s] [ 71%] 2025-12-04T14:34:54.7141578Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_int32 PASSED [0.2287s] [ 71%] 2025-12-04T14:34:54.7142039Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_int8 PASSED [0.2287s] [ 72%] 2025-12-04T14:34:54.7142510Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isinf_cuda_uint8 PASSED [0.2195s] [ 72%] 2025-12-04T14:34:54.7143043Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_bfloat16 PASSED [0.2220s] [ 72%] 2025-12-04T14:34:54.7143507Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_bool PASSED [0.2215s] [ 72%] 2025-12-04T14:34:54.7143982Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_int64 PASSED [0.2170s] [ 72%] 2025-12-04T14:34:54.7144440Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isnan_cuda_uint8 PASSED [0.2178s] [ 72%] 2025-12-04T14:34:54.7144935Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_bfloat16 PASSED [0.2198s] [ 72%] 2025-12-04T14:34:54.7145423Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_bool PASSED [0.2179s] [ 72%] 2025-12-04T14:34:54.7145919Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_float32 PASSED [0.2139s] [ 72%] 2025-12-04T14:34:54.7146426Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_float64 PASSED [0.2196s] [ 72%] 2025-12-04T14:34:54.7146903Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_int32 PASSED [0.2168s] [ 72%] 2025-12-04T14:34:54.7147381Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_int64 PASSED [0.2175s] [ 72%] 2025-12-04T14:34:54.7147872Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_int8 PASSED [0.2184s] [ 72%] 2025-12-04T14:34:54.7148350Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isneginf_cuda_uint8 PASSED [0.2204s] [ 72%] 2025-12-04T14:34:54.7148838Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_bool PASSED [0.2165s] [ 72%] 2025-12-04T14:34:54.7149382Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_int32 PASSED [0.2184s] [ 72%] 2025-12-04T14:34:54.7149867Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_int64 PASSED [0.2144s] [ 72%] 2025-12-04T14:34:54.7150384Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_isposinf_cuda_uint8 PASSED [0.2294s] [ 72%] 2025-12-04T14:34:54.7150863Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_bfloat16 PASSED [0.2247s] [ 72%] 2025-12-04T14:34:54.7151347Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_float16 PASSED [0.2203s] [ 72%] 2025-12-04T14:34:54.7151812Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_int16 PASSED [0.2134s] [ 72%] 2025-12-04T14:34:54.7152269Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_int64 PASSED [0.2171s] [ 72%] 2025-12-04T14:34:54.7152742Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_log1p_cuda_int8 PASSED [0.2198s] [ 72%] 2025-12-04T14:34:54.7153260Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_bfloat16 PASSED [0.3160s] [ 72%] 2025-12-04T14:34:54.7153786Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_float16 PASSED [0.3178s] [ 73%] 2025-12-04T14:34:54.7154295Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_float64 PASSED [0.3123s] [ 73%] 2025-12-04T14:34:54.7155205Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_int8 SKIPPED [0.0031s] (masked.amax does not support input with torch.sparse_csr layout and torch.int8 dtype) [ 73%] 2025-12-04T14:34:54.7156127Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amax_cuda_uint8 SKIPPED [0.0029s] (masked.amax does not support input with torch.sparse_csr layout and torch.uint8 dtype) [ 73%] 2025-12-04T14:34:54.7156647Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_bfloat16 PASSED [0.3138s] [ 73%] 2025-12-04T14:34:54.7157198Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_float64 PASSED [0.3183s] [ 73%] 2025-12-04T14:34:54.7158355Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_int16 SKIPPED [0.0030s] (masked.amin does not support input with torch.sparse_csr layout and torch.int16 dtype) [ 73%] 2025-12-04T14:34:54.7159278Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_int64 SKIPPED [0.0029s] (masked.amin does not support input with torch.sparse_csr layout and torch.int64 dtype) [ 73%] 2025-12-04T14:34:54.7160185Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_amin_cuda_uint8 SKIPPED [0.0026s] (masked.amin does not support input with torch.sparse_csr layout and torch.uint8 dtype) [ 73%] 2025-12-04T14:34:54.7161136Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_mean_cuda_complex64 SKIPPED [0.0028s] (masked.mean does not support input with torch.sparse_csr layout and torch.complex64 dtype) [ 73%] 2025-12-04T14:34:54.7161657Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_mean_cuda_float64 PASSED [0.1014s] [ 73%] 2025-12-04T14:34:54.7162180Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_complex64 PASSED [0.3040s] [ 73%] 2025-12-04T14:34:54.7162688Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_int32 PASSED [0.2912s] [ 73%] 2025-12-04T14:34:54.7163185Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_int64 PASSED [0.2888s] [ 73%] 2025-12-04T14:34:54.7163677Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_prod_cuda_int8 PASSED [0.2844s] [ 73%] 2025-12-04T14:34:54.7164280Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_complex128 PASSED [0.2869s] [ 73%] 2025-12-04T14:34:54.7164786Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_float64 PASSED [0.3260s] [ 73%] 2025-12-04T14:34:54.7165318Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_int16 PASSED [0.2905s] [ 73%] 2025-12-04T14:34:54.7165807Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_int32 PASSED [0.2938s] [ 73%] 2025-12-04T14:34:54.7166293Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_int8 PASSED [0.2878s] [ 73%] 2025-12-04T14:34:54.7166792Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_masked_sum_cuda_uint8 PASSED [0.2855s] [ 73%] 2025-12-04T14:34:54.7167263Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_bfloat16 PASSED [0.2359s] [ 73%] 2025-12-04T14:34:54.7167756Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_complex128 PASSED [0.2317s] [ 73%] 2025-12-04T14:34:54.7168236Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_complex32 PASSED [0.2305s] [ 73%] 2025-12-04T14:34:54.7168706Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_float16 PASSED [0.2347s] [ 74%] 2025-12-04T14:34:54.7169181Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_float32 PASSED [0.2254s] [ 74%] 2025-12-04T14:34:54.7169645Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_float64 PASSED [0.2292s] [ 74%] 2025-12-04T14:34:54.7170115Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_int32 PASSED [0.2243s] [ 74%] 2025-12-04T14:34:54.7170566Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_mul_cuda_int64 PASSED [0.2267s] [ 74%] 2025-12-04T14:34:54.7171040Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_bfloat16 PASSED [0.2190s] [ 74%] 2025-12-04T14:34:54.7171561Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_complex128 PASSED [0.2195s] [ 74%] 2025-12-04T14:34:54.7172038Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_complex32 PASSED [0.2193s] [ 74%] 2025-12-04T14:34:54.7172526Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_complex64 PASSED [0.2214s] [ 74%] 2025-12-04T14:34:54.7172993Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_float32 PASSED [0.2169s] [ 74%] 2025-12-04T14:34:54.7173456Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_float64 PASSED [0.2187s] [ 74%] 2025-12-04T14:34:54.7173921Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_int64 PASSED [0.2168s] [ 74%] 2025-12-04T14:34:54.7174368Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_int8 PASSED [0.2170s] [ 74%] 2025-12-04T14:34:54.7174841Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_neg_cuda_uint8 PASSED [0.2252s] [ 74%] 2025-12-04T14:34:54.7175397Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_bfloat16 PASSED [0.2181s] [ 74%] 2025-12-04T14:34:54.7175941Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_float32 PASSED [0.2209s] [ 74%] 2025-12-04T14:34:54.7176486Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_int16 PASSED [0.2244s] [ 74%] 2025-12-04T14:34:54.7177014Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_nn_functional_relu_cuda_int8 PASSED [0.2203s] [ 74%] 2025-12-04T14:34:54.7177529Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_complex32 PASSED [0.2181s] [ 74%] 2025-12-04T14:34:54.7178095Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_complex64 PASSED [0.2233s] [ 74%] 2025-12-04T14:34:54.7178652Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_float16 PASSED [0.2233s] [ 74%] 2025-12-04T14:34:54.7179208Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_float32 PASSED [0.2182s] [ 74%] 2025-12-04T14:34:54.7179702Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_float64 PASSED [0.2215s] [ 74%] 2025-12-04T14:34:54.7180192Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_int16 PASSED [0.2155s] [ 74%] 2025-12-04T14:34:54.7180673Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_int64 PASSED [0.2184s] [ 75%] 2025-12-04T14:34:54.7181151Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_int8 PASSED [0.2114s] [ 75%] 2025-12-04T14:34:54.7181643Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_positive_cuda_uint8 PASSED [0.2184s] [ 75%] 2025-12-04T14:34:54.7182134Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_bfloat16 PASSED [0.2208s] [ 75%] 2025-12-04T14:34:54.7182619Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_bool PASSED [0.2214s] [ 75%] 2025-12-04T14:34:54.7183093Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_int16 PASSED [0.2144s] [ 75%] 2025-12-04T14:34:54.7183565Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_int32 PASSED [0.2198s] [ 75%] 2025-12-04T14:34:54.7184050Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_rad2deg_cuda_uint8 PASSED [0.2159s] [ 75%] 2025-12-04T14:34:54.7184557Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_bfloat16 PASSED [0.0132s] [ 75%] 2025-12-04T14:34:54.7185092Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_complex128 PASSED [0.2262s] [ 75%] 2025-12-04T14:34:54.7185626Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_float16 PASSED [0.0132s] [ 75%] 2025-12-04T14:34:54.7186127Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_randn_like_cuda_float64 PASSED [0.2259s] [ 75%] 2025-12-04T14:34:54.7186612Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_float64 PASSED [0.2200s] [ 75%] 2025-12-04T14:34:54.7187075Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_int32 PASSED [0.2190s] [ 75%] 2025-12-04T14:34:54.7187543Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_int64 PASSED [0.2210s] [ 75%] 2025-12-04T14:34:54.7188145Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_int8 PASSED [0.2187s] [ 75%] 2025-12-04T14:34:54.7188614Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_round_cuda_uint8 PASSED [0.2199s] [ 75%] 2025-12-04T14:34:54.7189096Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_complex64 PASSED [0.2203s] [ 75%] 2025-12-04T14:34:54.7189562Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_float64 PASSED [0.2156s] [ 75%] 2025-12-04T14:34:54.7190021Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_int16 PASSED [0.2194s] [ 75%] 2025-12-04T14:34:54.7190465Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sgn_cuda_int8 PASSED [0.2238s] [ 75%] 2025-12-04T14:34:54.7190936Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_bfloat16 PASSED [0.2227s] [ 75%] 2025-12-04T14:34:54.7191406Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_float16 PASSED [0.2206s] [ 75%] 2025-12-04T14:34:54.7191936Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_float32 PASSED [0.2232s] [ 75%] 2025-12-04T14:34:54.7192444Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_int32 PASSED [0.2210s] [ 76%] 2025-12-04T14:34:54.7192938Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_int8 PASSED [0.2186s] [ 76%] 2025-12-04T14:34:54.7193398Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sign_cuda_uint8 PASSED [0.2199s] [ 76%] 2025-12-04T14:34:54.7193877Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_bool PASSED [0.2203s] [ 76%] 2025-12-04T14:34:54.7194352Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_int16 PASSED [0.2169s] [ 76%] 2025-12-04T14:34:54.7194837Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_int32 PASSED [0.2198s] [ 76%] 2025-12-04T14:34:54.7195310Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_int8 PASSED [0.2124s] [ 76%] 2025-12-04T14:34:54.7195788Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_signbit_cuda_uint8 PASSED [0.2276s] [ 76%] 2025-12-04T14:34:54.7196247Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_bool PASSED [0.2361s] [ 76%] 2025-12-04T14:34:54.7196717Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_complex32 PASSED [0.2264s] [ 76%] 2025-12-04T14:34:54.7197194Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_float64 PASSED [0.2181s] [ 76%] 2025-12-04T14:34:54.7197643Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sin_cuda_int32 PASSED [0.2189s] [ 76%] 2025-12-04T14:34:54.7198091Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_bool PASSED [0.2197s] [ 76%] 2025-12-04T14:34:54.7198591Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_complex128 PASSED [0.2205s] [ 76%] 2025-12-04T14:34:54.7199113Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_complex64 PASSED [0.2178s] [ 76%] 2025-12-04T14:34:54.7199594Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_float32 PASSED [0.2205s] [ 76%] 2025-12-04T14:34:54.7200049Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_int32 PASSED [0.2188s] [ 76%] 2025-12-04T14:34:54.7200502Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sinh_cuda_int8 PASSED [0.2269s] [ 76%] 2025-12-04T14:34:54.7200987Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_bfloat16 PASSED [0.2264s] [ 76%] 2025-12-04T14:34:54.7201447Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_float64 PASSED [0.2276s] [ 76%] 2025-12-04T14:34:54.7201920Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_int16 PASSED [0.2238s] [ 76%] 2025-12-04T14:34:54.7202380Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sqrt_cuda_int32 PASSED [0.2185s] [ 76%] 2025-12-04T14:34:54.7202847Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_bfloat16 PASSED [0.2355s] [ 76%] 2025-12-04T14:34:54.7203335Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_complex128 PASSED [0.2302s] [ 76%] 2025-12-04T14:34:54.7203786Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_int64 PASSED [0.2325s] [ 77%] 2025-12-04T14:34:54.7204249Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_sum_cuda_uint8 PASSED [0.2381s] [ 77%] 2025-12-04T14:34:54.7204722Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_complex64 PASSED [0.2216s] [ 77%] 2025-12-04T14:34:54.7205212Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_float16 PASSED [0.2124s] [ 77%] 2025-12-04T14:34:54.7205716Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_float32 PASSED [0.2176s] [ 77%] 2025-12-04T14:34:54.7206212Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_float64 PASSED [0.2200s] [ 77%] 2025-12-04T14:34:54.7206678Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tan_cuda_int32 PASSED [0.2162s] [ 77%] 2025-12-04T14:34:54.7207150Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_bfloat16 PASSED [0.2226s] [ 77%] 2025-12-04T14:34:54.7207599Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_bool PASSED [0.2189s] [ 77%] 2025-12-04T14:34:54.7208095Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_complex128 PASSED [0.2156s] [ 77%] 2025-12-04T14:34:54.7208569Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_complex64 PASSED [0.2230s] [ 77%] 2025-12-04T14:34:54.7209048Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_float16 PASSED [0.2244s] [ 77%] 2025-12-04T14:34:54.7209521Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_float32 PASSED [0.2176s] [ 77%] 2025-12-04T14:34:54.7209988Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_float64 PASSED [0.2192s] [ 77%] 2025-12-04T14:34:54.7210455Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_tanh_cuda_int32 PASSED [0.2191s] [ 77%] 2025-12-04T14:34:54.7210962Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_complex64 PASSED [0.2186s] [ 77%] 2025-12-04T14:34:54.7211457Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_float32 PASSED [0.2182s] [ 77%] 2025-12-04T14:34:54.7211946Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_float64 PASSED [0.2290s] [ 77%] 2025-12-04T14:34:54.7212432Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_int16 PASSED [0.2221s] [ 77%] 2025-12-04T14:34:54.7212953Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_int32 PASSED [0.2304s] [ 77%] 2025-12-04T14:34:54.7213433Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_to_sparse_cuda_int64 PASSED [0.2371s] [ 77%] 2025-12-04T14:34:54.7213904Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_float16 PASSED [0.2222s] [ 77%] 2025-12-04T14:34:54.7214385Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_float32 PASSED [0.2213s] [ 77%] 2025-12-04T14:34:54.7214847Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_int16 PASSED [0.2206s] [ 77%] 2025-12-04T14:34:54.7215312Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_trunc_cuda_uint8 PASSED [0.2238s] [ 78%] 2025-12-04T14:34:54.7215796Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_bool PASSED [0.0109s] [ 78%] 2025-12-04T14:34:54.7216303Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_float16 PASSED [0.2226s] [ 78%] 2025-12-04T14:34:54.7216806Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_float32 PASSED [0.0112s] [ 78%] 2025-12-04T14:34:54.7217294Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_int32 PASSED [0.2232s] [ 78%] 2025-12-04T14:34:54.7217792Z test_sparse_csr.py::TestSparseCompressedCUDA::test_consistency_SparseCSR_zeros_like_cuda_uint8 PASSED [0.0109s] [ 78%] 2025-12-04T14:34:54.7218246Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_bool PASSED [0.0126s] [ 78%] 2025-12-04T14:34:54.7218668Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_complex128 PASSED [0.0135s] [ 78%] 2025-12-04T14:34:54.7219152Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_complex64 PASSED [0.0132s] [ 78%] 2025-12-04T14:34:54.7219555Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_float64 PASSED [0.0130s] [ 78%] 2025-12-04T14:34:54.7219985Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSC_cuda_int64 PASSED [0.0126s] [ 78%] 2025-12-04T14:34:54.7220377Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_bool PASSED [0.0123s] [ 78%] 2025-12-04T14:34:54.7220797Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_complex128 PASSED [0.0132s] [ 78%] 2025-12-04T14:34:54.7221217Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_complex64 PASSED [0.0132s] [ 78%] 2025-12-04T14:34:54.7221616Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_float64 PASSED [0.0130s] [ 78%] 2025-12-04T14:34:54.7222014Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int16 PASSED [0.0126s] [ 78%] 2025-12-04T14:34:54.7222407Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int32 PASSED [0.0123s] [ 78%] 2025-12-04T14:34:54.7222799Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int64 PASSED [0.0124s] [ 78%] 2025-12-04T14:34:54.7223194Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseBSR_cuda_int8 PASSED [0.0124s] [ 78%] 2025-12-04T14:34:54.7223580Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_bool PASSED [0.0244s] [ 78%] 2025-12-04T14:34:54.7224004Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_complex128 PASSED [0.0254s] [ 78%] 2025-12-04T14:34:54.7224401Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_float16 PASSED [0.0250s] [ 78%] 2025-12-04T14:34:54.7224790Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSC_cuda_uint8 PASSED [0.0244s] [ 78%] 2025-12-04T14:34:54.7225216Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_complex128 PASSED [0.0252s] [ 78%] 2025-12-04T14:34:54.7225660Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_complex64 PASSED [0.0253s] [ 79%] 2025-12-04T14:34:54.7226069Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_float32 PASSED [0.0253s] [ 79%] 2025-12-04T14:34:54.7226469Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_float64 PASSED [0.0249s] [ 79%] 2025-12-04T14:34:54.7226862Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_int64 PASSED [0.0243s] [ 79%] 2025-12-04T14:34:54.7227263Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_SparseCSR_cuda_uint8 PASSED [0.0244s] [ 79%] 2025-12-04T14:34:54.7227707Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_bfloat16 PASSED [0.0140s] [ 79%] 2025-12-04T14:34:54.7228159Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_complex128 PASSED [0.0133s] [ 79%] 2025-12-04T14:34:54.7228603Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_float32 PASSED [0.0132s] [ 79%] 2025-12-04T14:34:54.7229040Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSC_cuda_float64 PASSED [0.0135s] [ 79%] 2025-12-04T14:34:54.7229466Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_bool PASSED [0.0131s] [ 79%] 2025-12-04T14:34:54.7229914Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_complex64 PASSED [0.0132s] [ 79%] 2025-12-04T14:34:54.7230348Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_float32 PASSED [0.0132s] [ 79%] 2025-12-04T14:34:54.7230781Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseBSR_cuda_int32 PASSED [0.0131s] [ 79%] 2025-12-04T14:34:54.7231220Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_bfloat16 PASSED [0.0069s] [ 79%] 2025-12-04T14:34:54.7231706Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_bool PASSED [0.0072s] [ 79%] 2025-12-04T14:34:54.7232205Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_complex128 PASSED [0.0069s] [ 79%] 2025-12-04T14:34:54.7232669Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_float32 PASSED [0.0069s] [ 79%] 2025-12-04T14:34:54.7233115Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_float64 PASSED [0.0070s] [ 79%] 2025-12-04T14:34:54.7233532Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSC_cuda_int16 PASSED [0.0069s] [ 79%] 2025-12-04T14:34:54.7233976Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_float32 PASSED [0.0066s] [ 79%] 2025-12-04T14:34:54.7234399Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_int16 PASSED [0.0070s] [ 79%] 2025-12-04T14:34:54.7234816Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_int32 PASSED [0.0066s] [ 79%] 2025-12-04T14:34:54.7235246Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_int8 PASSED [0.0066s] [ 79%] 2025-12-04T14:34:54.7235666Z test_sparse_csr.py::TestSparseCompressedCUDA::test_copy_errors_SparseCSR_cuda_uint8 PASSED [0.0066s] [ 79%] 2025-12-04T14:34:54.7236129Z test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseBSC_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 80%] 2025-12-04T14:34:54.7236583Z test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseBSR_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 80%] 2025-12-04T14:34:54.7237030Z test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseCSC_cuda SKIPPED [0.0015s] (Only runs on cpu) [ 80%] 2025-12-04T14:34:54.7237491Z test_sparse_csr.py::TestSparseCompressedCUDA::test_dim_SparseCSR_cuda SKIPPED [0.0018s] (Only runs on cpu) [ 80%] 2025-12-04T14:34:54.7237900Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_bfloat16 PASSED [0.0210s] [ 80%] 2025-12-04T14:34:54.7238307Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_bool PASSED [0.0207s] [ 80%] 2025-12-04T14:34:54.7238759Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_complex64 PASSED [0.0206s] [ 80%] 2025-12-04T14:34:54.7239163Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_float64 PASSED [0.0206s] [ 80%] 2025-12-04T14:34:54.7239567Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int16 PASSED [0.0204s] [ 80%] 2025-12-04T14:34:54.7239960Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int32 PASSED [0.0209s] [ 80%] 2025-12-04T14:34:54.7240364Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int64 PASSED [0.0203s] [ 80%] 2025-12-04T14:34:54.7240761Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_int8 PASSED [0.0205s] [ 80%] 2025-12-04T14:34:54.7241156Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSC_cuda_uint8 PASSED [0.0207s] [ 80%] 2025-12-04T14:34:54.7241587Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSR_cuda_complex64 PASSED [0.0204s] [ 80%] 2025-12-04T14:34:54.7241981Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSR_cuda_int32 PASSED [0.0205s] [ 80%] 2025-12-04T14:34:54.7242384Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_SparseCSR_cuda_int8 PASSED [0.0204s] [ 80%] 2025-12-04T14:34:54.7242839Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_complex64 PASSED [0.0025s] [ 80%] 2025-12-04T14:34:54.7243275Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_float32 PASSED [0.0017s] [ 80%] 2025-12-04T14:34:54.7243727Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_float64 PASSED [0.0016s] [ 80%] 2025-12-04T14:34:54.7244150Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_int32 PASSED [0.0016s] [ 80%] 2025-12-04T14:34:54.7244603Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSC_cuda_int8 PASSED [0.0016s] [ 80%] 2025-12-04T14:34:54.7245103Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_complex128 PASSED [0.0016s] [ 80%] 2025-12-04T14:34:54.7245573Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_float16 PASSED [0.0020s] [ 80%] 2025-12-04T14:34:54.7246025Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_float32 PASSED [0.0016s] [ 80%] 2025-12-04T14:34:54.7246452Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_int16 PASSED [0.0016s] [ 81%] 2025-12-04T14:34:54.7246875Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_int32 PASSED [0.0016s] [ 81%] 2025-12-04T14:34:54.7247310Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_int64 PASSED [0.0016s] [ 81%] 2025-12-04T14:34:54.7247731Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_errors_SparseCSR_cuda_uint8 PASSED [0.0016s] [ 81%] 2025-12-04T14:34:54.7248239Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_bfloat16 PASSED [0.0324s] [ 81%] 2025-12-04T14:34:54.7248708Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_bool PASSED [0.0315s] [ 81%] 2025-12-04T14:34:54.7249211Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_complex128 PASSED [0.0316s] [ 81%] 2025-12-04T14:34:54.7249707Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_float32 PASSED [0.0316s] [ 81%] 2025-12-04T14:34:54.7250177Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_int64 PASSED [0.0316s] [ 81%] 2025-12-04T14:34:54.7250653Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSC_cuda_int8 PASSED [0.0322s] [ 81%] 2025-12-04T14:34:54.7251135Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSR_cuda_float32 PASSED [0.0274s] [ 81%] 2025-12-04T14:34:54.7251605Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSR_cuda_int16 PASSED [0.0262s] [ 81%] 2025-12-04T14:34:54.7252111Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseBSR_cuda_int32 PASSED [0.0262s] [ 81%] 2025-12-04T14:34:54.7252599Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_bfloat16 PASSED [0.0262s] [ 81%] 2025-12-04T14:34:54.7253079Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_bool PASSED [0.0261s] [ 81%] 2025-12-04T14:34:54.7253582Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_complex128 PASSED [0.0265s] [ 81%] 2025-12-04T14:34:54.7254062Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_float16 PASSED [0.0261s] [ 81%] 2025-12-04T14:34:54.7254548Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_float64 PASSED [0.0262s] [ 81%] 2025-12-04T14:34:54.7255017Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_int16 PASSED [0.0262s] [ 81%] 2025-12-04T14:34:54.7255495Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_int32 PASSED [0.0261s] [ 81%] 2025-12-04T14:34:54.7255957Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSC_cuda_int8 PASSED [0.0261s] [ 81%] 2025-12-04T14:34:54.7256458Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSR_cuda_complex128 PASSED [0.0264s] [ 81%] 2025-12-04T14:34:54.7256959Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSR_cuda_complex64 PASSED [0.0260s] [ 81%] 2025-12-04T14:34:54.7257420Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSC_SparseCSR_cuda_int8 PASSED [0.0260s] [ 81%] 2025-12-04T14:34:54.7257917Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_bfloat16 PASSED [0.0262s] [ 82%] 2025-12-04T14:34:54.7258531Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_bool PASSED [0.0262s] [ 82%] 2025-12-04T14:34:54.7259070Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_complex64 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7259561Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_float64 PASSED [0.0265s] [ 82%] 2025-12-04T14:34:54.7260028Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int16 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7260503Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int32 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7260968Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int64 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7261429Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_int8 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7261908Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSC_cuda_uint8 PASSED [0.0265s] [ 82%] 2025-12-04T14:34:54.7262375Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_bool PASSED [0.0314s] [ 82%] 2025-12-04T14:34:54.7262882Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_complex128 PASSED [0.0313s] [ 82%] 2025-12-04T14:34:54.7263351Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_int16 PASSED [0.0314s] [ 82%] 2025-12-04T14:34:54.7263823Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_int64 PASSED [0.0313s] [ 82%] 2025-12-04T14:34:54.7264294Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseBSR_cuda_uint8 PASSED [0.0317s] [ 82%] 2025-12-04T14:34:54.7264782Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_bfloat16 PASSED [0.0262s] [ 82%] 2025-12-04T14:34:54.7265291Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_complex128 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7265803Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_float16 PASSED [0.0260s] [ 82%] 2025-12-04T14:34:54.7266272Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_int16 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7266746Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_int32 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7267213Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_int64 PASSED [0.0264s] [ 82%] 2025-12-04T14:34:54.7267686Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSC_cuda_uint8 PASSED [0.0260s] [ 82%] 2025-12-04T14:34:54.7268174Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_bfloat16 PASSED [0.0260s] [ 82%] 2025-12-04T14:34:54.7268671Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_complex64 PASSED [0.0261s] [ 82%] 2025-12-04T14:34:54.7269162Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_float32 PASSED [0.0262s] [ 82%] 2025-12-04T14:34:54.7269644Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_float64 PASSED [0.0261s] [ 83%] 2025-12-04T14:34:54.7270116Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_int16 PASSED [0.0264s] [ 83%] 2025-12-04T14:34:54.7270585Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseBSR_SparseCSR_cuda_int32 PASSED [0.0261s] [ 83%] 2025-12-04T14:34:54.7271084Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_complex128 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7271603Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_float32 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7272116Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_float64 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7272625Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_int32 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7273098Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_int64 PASSED [0.0244s] [ 83%] 2025-12-04T14:34:54.7273564Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_int8 PASSED [0.0240s] [ 83%] 2025-12-04T14:34:54.7274039Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSC_cuda_uint8 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7274531Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_bfloat16 PASSED [0.0240s] [ 83%] 2025-12-04T14:34:54.7275007Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_bool PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7275494Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_float32 PASSED [0.0249s] [ 83%] 2025-12-04T14:34:54.7275980Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_float64 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7276459Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_int16 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7276928Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_int32 PASSED [0.0241s] [ 83%] 2025-12-04T14:34:54.7277403Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseBSR_cuda_uint8 PASSED [0.0240s] [ 83%] 2025-12-04T14:34:54.7277894Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_bfloat16 PASSED [0.0299s] [ 83%] 2025-12-04T14:34:54.7278395Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_complex128 PASSED [0.0292s] [ 83%] 2025-12-04T14:34:54.7278917Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_float16 PASSED [0.0293s] [ 83%] 2025-12-04T14:34:54.7279400Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_float64 PASSED [0.0293s] [ 83%] 2025-12-04T14:34:54.7279876Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSC_cuda_int16 PASSED [0.0293s] [ 83%] 2025-12-04T14:34:54.7280367Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_bfloat16 PASSED [0.0242s] [ 83%] 2025-12-04T14:34:54.7280832Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_bool PASSED [0.0245s] [ 83%] 2025-12-04T14:34:54.7281341Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_complex128 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7281841Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_complex64 PASSED [0.0240s] [ 84%] 2025-12-04T14:34:54.7282333Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_float16 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7282819Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_float32 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7283290Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSC_SparseCSR_cuda_uint8 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7283781Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_bfloat16 PASSED [0.0244s] [ 84%] 2025-12-04T14:34:54.7284275Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_complex64 PASSED [0.0240s] [ 84%] 2025-12-04T14:34:54.7284758Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_float32 PASSED [0.0240s] [ 84%] 2025-12-04T14:34:54.7285279Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_int16 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7285780Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_int64 PASSED [0.0242s] [ 84%] 2025-12-04T14:34:54.7286282Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSC_cuda_int8 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7286745Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_bool PASSED [0.0244s] [ 84%] 2025-12-04T14:34:54.7287249Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_complex64 PASSED [0.0242s] [ 84%] 2025-12-04T14:34:54.7287870Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_float16 PASSED [0.0242s] [ 84%] 2025-12-04T14:34:54.7288356Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_float32 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7288854Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_float64 PASSED [0.0242s] [ 84%] 2025-12-04T14:34:54.7289327Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseBSR_cuda_int32 PASSED [0.0245s] [ 84%] 2025-12-04T14:34:54.7289828Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_bfloat16 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7290293Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_bool PASSED [0.0240s] [ 84%] 2025-12-04T14:34:54.7290793Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_complex128 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7291274Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_int16 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7291744Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_int32 PASSED [0.0241s] [ 84%] 2025-12-04T14:34:54.7292225Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSC_cuda_int8 PASSED [0.0245s] [ 84%] 2025-12-04T14:34:54.7292774Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_bfloat16 PASSED [0.0294s] [ 84%] 2025-12-04T14:34:54.7293273Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_complex64 PASSED [0.0292s] [ 85%] 2025-12-04T14:34:54.7293765Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_float16 PASSED [0.0294s] [ 85%] 2025-12-04T14:34:54.7294246Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_float32 PASSED [0.0293s] [ 85%] 2025-12-04T14:34:54.7294719Z test_sparse_csr.py::TestSparseCompressedCUDA::test_empty_like_SparseCSR_SparseCSR_cuda_int8 PASSED [0.0292s] [ 85%] 2025-12-04T14:34:54.7295338Z test_sparse_csr.py::TestSparseCompressedCUDA::test_invalid_input_SparseBSC_target_sparse_compressed_tensor_no_size_cuda PASSED [0.0060s] [ 85%] 2025-12-04T14:34:54.7295919Z test_sparse_csr.py::TestSparseCompressedCUDA::test_invalid_input_SparseBSR_target_sparse_compressed_tensor_cuda PASSED [0.0056s] [ 85%] 2025-12-04T14:34:54.7296536Z test_sparse_csr.py::TestSparseCompressedCUDA::test_invalid_input_SparseBSR_target_sparse_compressed_tensor_no_size_cuda PASSED [0.0034s] [ 85%] 2025-12-04T14:34:54.7297009Z test_sparse_csr.py::TestSparseCompressedCUDA::test_layout_SparseBSR_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 85%] 2025-12-04T14:34:54.7297491Z test_sparse_csr.py::TestSparseCompressedCUDA::test_layout_SparseCSR_cuda SKIPPED [0.0014s] (Only runs on cpu) [ 85%] 2025-12-04T14:34:54.7297903Z test_sparse_csr.py::TestSparseCompressedCUDA::test_pickle_SparseBSC_cuda_float64 PASSED [0.3395s] [ 85%] 2025-12-04T14:34:54.7298385Z test_sparse_csr.py::TestSparseCompressedCUDA::test_pickle_SparseBSR_cuda_float64 PASSED [0.3240s] [ 85%] 2025-12-04T14:34:54.7298804Z test_sparse_csr.py::TestSparseCompressedCUDA::test_pickle_SparseCSR_cuda_float64 PASSED [0.3358s] [ 85%] 2025-12-04T14:34:54.7299228Z test_sparse_csr.py::TestSparseCompressedCUDA::test_print_SparseBSR_cuda PASSED [0.5306s] [ 85%] 2025-12-04T14:34:54.7299646Z test_sparse_csr.py::TestSparseCompressedCUDA::test_print_SparseCSR_cuda PASSED [0.4871s] [ 85%] 2025-12-04T14:34:54.7300140Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_bool PASSED [0.4153s] [ 85%] 2025-12-04T14:34:54.7300619Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_complex128 PASSED [0.4872s] [ 85%] 2025-12-04T14:34:54.7301102Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_complex64 PASSED [0.4882s] [ 85%] 2025-12-04T14:34:54.7301564Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_float32 PASSED [0.4763s] [ 85%] 2025-12-04T14:34:54.7302034Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_float64 PASSED [0.4870s] [ 85%] 2025-12-04T14:34:54.7302490Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_int16 PASSED [0.4208s] [ 85%] 2025-12-04T14:34:54.7302949Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_int32 PASSED [0.4289s] [ 85%] 2025-12-04T14:34:54.7303406Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_int8 PASSED [0.4185s] [ 85%] 2025-12-04T14:34:54.7303859Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int32_cuda_uint8 PASSED [0.4219s] [ 85%] 2025-12-04T14:34:54.7304314Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_bool PASSED [0.4158s] [ 85%] 2025-12-04T14:34:54.7304786Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_complex64 PASSED [0.4914s] [ 86%] 2025-12-04T14:34:54.7305246Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_float16 PASSED [0.4787s] [ 86%] 2025-12-04T14:34:54.7305713Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_float64 PASSED [0.4736s] [ 86%] 2025-12-04T14:34:54.7306204Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_int16 PASSED [0.4179s] [ 86%] 2025-12-04T14:34:54.7306662Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_int32 PASSED [0.4166s] [ 86%] 2025-12-04T14:34:54.7307106Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSC_int64_cuda_int8 PASSED [0.4118s] [ 86%] 2025-12-04T14:34:54.7307577Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_complex64 PASSED [0.4852s] [ 86%] 2025-12-04T14:34:54.7308044Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_float16 PASSED [0.4727s] [ 86%] 2025-12-04T14:34:54.7308496Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_int16 PASSED [0.4208s] [ 86%] 2025-12-04T14:34:54.7308954Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_int32 PASSED [0.4263s] [ 86%] 2025-12-04T14:34:54.7309407Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int32_cuda_uint8 PASSED [0.4304s] [ 86%] 2025-12-04T14:34:54.7309886Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_complex128 PASSED [0.4965s] [ 86%] 2025-12-04T14:34:54.7310353Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_float16 PASSED [0.4799s] [ 86%] 2025-12-04T14:34:54.7310810Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_float64 PASSED [0.4813s] [ 86%] 2025-12-04T14:34:54.7311263Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_int32 PASSED [0.4258s] [ 86%] 2025-12-04T14:34:54.7311713Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseBSR_int64_cuda_uint8 PASSED [0.4169s] [ 86%] 2025-12-04T14:34:54.7312203Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_float32 PASSED [0.4696s] [ 86%] 2025-12-04T14:34:54.7312690Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_int32 PASSED [0.4120s] [ 86%] 2025-12-04T14:34:54.7313168Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_int8 PASSED [0.4120s] [ 86%] 2025-12-04T14:34:54.7313616Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int32_cuda_uint8 PASSED [0.4107s] [ 86%] 2025-12-04T14:34:54.7314085Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_float16 PASSED [0.4616s] [ 86%] 2025-12-04T14:34:54.7314535Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_int32 PASSED [0.4146s] [ 86%] 2025-12-04T14:34:54.7314998Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_int64 PASSED [0.4070s] [ 86%] 2025-12-04T14:34:54.7315448Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_int8 PASSED [0.4123s] [ 86%] 2025-12-04T14:34:54.7315904Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSC_int64_cuda_uint8 PASSED [0.4146s] [ 87%] 2025-12-04T14:34:54.7316380Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int32_cuda_float16 PASSED [0.4683s] [ 87%] 2025-12-04T14:34:54.7316833Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int32_cuda_int16 PASSED [0.4098s] [ 87%] 2025-12-04T14:34:54.7317311Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_bfloat16 PASSED [0.4711s] [ 87%] 2025-12-04T14:34:54.7317759Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_bool PASSED [0.4310s] [ 87%] 2025-12-04T14:34:54.7318235Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_complex128 PASSED [0.4872s] [ 87%] 2025-12-04T14:34:54.7318721Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_complex64 PASSED [0.4813s] [ 87%] 2025-12-04T14:34:54.7319190Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_float32 PASSED [0.4659s] [ 87%] 2025-12-04T14:34:54.7319693Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_float64 PASSED [0.4690s] [ 87%] 2025-12-04T14:34:54.7320148Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_int16 PASSED [0.4118s] [ 87%] 2025-12-04T14:34:54.7320598Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_int32 PASSED [0.4116s] [ 87%] 2025-12-04T14:34:54.7321049Z test_sparse_csr.py::TestSparseCompressedCUDA::test_select_copy_SparseCSR_int64_cuda_int8 PASSED [0.4193s] [ 87%] 2025-12-04T14:34:54.7321667Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSC_cuda_complex128 PASSED [0.3499s] [ 87%] 2025-12-04T14:34:54.7322263Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSC_cuda_int64 PASSED [0.2861s] [ 87%] 2025-12-04T14:34:54.7322865Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_float16 PASSED [0.3376s] [ 87%] 2025-12-04T14:34:54.7323469Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_float64 PASSED [0.3440s] [ 87%] 2025-12-04T14:34:54.7324052Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_int32 PASSED [0.2940s] [ 87%] 2025-12-04T14:34:54.7324635Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_int64 PASSED [0.3011s] [ 87%] 2025-12-04T14:34:54.7325225Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseBSR_cuda_uint8 PASSED [0.2982s] [ 87%] 2025-12-04T14:34:54.7325818Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_float32 PASSED [0.3354s] [ 87%] 2025-12-04T14:34:54.7326531Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_int16 PASSED [0.2859s] [ 87%] 2025-12-04T14:34:54.7327143Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_int32 PASSED [0.2803s] [ 87%] 2025-12-04T14:34:54.7327723Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_int64 PASSED [0.2810s] [ 87%] 2025-12-04T14:34:54.7328309Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSC_cuda_uint8 PASSED [0.2814s] [ 87%] 2025-12-04T14:34:54.7328905Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_bfloat16 PASSED [0.3363s] [ 88%] 2025-12-04T14:34:54.7329521Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_complex128 PASSED [0.3479s] [ 88%] 2025-12-04T14:34:54.7330117Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_float16 PASSED [0.3421s] [ 88%] 2025-12-04T14:34:54.7330715Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_float32 PASSED [0.3367s] [ 88%] 2025-12-04T14:34:54.7331314Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_list_SparseCSR_cuda_float64 PASSED [0.3392s] [ 88%] 2025-12-04T14:34:54.7331899Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_bool PASSED [0.3461s] [ 88%] 2025-12-04T14:34:54.7332525Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_complex64 PASSED [0.4601s] [ 88%] 2025-12-04T14:34:54.7333130Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_float16 PASSED [0.4641s] [ 88%] 2025-12-04T14:34:54.7333784Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_float32 PASSED [0.4543s] [ 88%] 2025-12-04T14:34:54.7334390Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_int16 PASSED [0.3423s] [ 88%] 2025-12-04T14:34:54.7334978Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSC_cuda_int8 PASSED [0.3411s] [ 88%] 2025-12-04T14:34:54.7335590Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_bfloat16 PASSED [0.4534s] [ 88%] 2025-12-04T14:34:54.7336178Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_bool PASSED [0.3443s] [ 88%] 2025-12-04T14:34:54.7336791Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_float64 PASSED [0.4525s] [ 88%] 2025-12-04T14:34:54.7337388Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_int16 PASSED [0.3450s] [ 88%] 2025-12-04T14:34:54.7337979Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseBSR_cuda_int64 PASSED [0.3411s] [ 88%] 2025-12-04T14:34:54.7338687Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_bfloat16 PASSED [0.4460s] [ 88%] 2025-12-04T14:34:54.7339276Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_bool PASSED [0.3462s] [ 88%] 2025-12-04T14:34:54.7339908Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_complex128 PASSED [0.4588s] [ 88%] 2025-12-04T14:34:54.7340555Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_float16 PASSED [0.4495s] [ 88%] 2025-12-04T14:34:54.7341174Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_int16 PASSED [0.3426s] [ 88%] 2025-12-04T14:34:54.7341797Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_int32 PASSED [0.3414s] [ 88%] 2025-12-04T14:34:54.7342388Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSC_cuda_int8 PASSED [0.3496s] [ 88%] 2025-12-04T14:34:54.7342984Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_bool PASSED [0.3403s] [ 88%] 2025-12-04T14:34:54.7343598Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_complex64 PASSED [0.4606s] [ 89%] 2025-12-04T14:34:54.7344204Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_float16 PASSED [0.4487s] [ 89%] 2025-12-04T14:34:54.7344812Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_float32 PASSED [0.4476s] [ 89%] 2025-12-04T14:34:54.7345420Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_float64 PASSED [0.4599s] [ 89%] 2025-12-04T14:34:54.7346020Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_int64 PASSED [0.3525s] [ 89%] 2025-12-04T14:34:54.7346610Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_____from_tensor_SparseCSR_cuda_uint8 PASSED [0.3419s] [ 89%] 2025-12-04T14:34:54.7347256Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_bfloat16 PASSED [0.3431s] [ 89%] 2025-12-04T14:34:54.7347907Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_complex128 PASSED [0.3443s] [ 89%] 2025-12-04T14:34:54.7348584Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_complex64 PASSED [0.3443s] [ 89%] 2025-12-04T14:34:54.7349222Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_float32 PASSED [0.3508s] [ 89%] 2025-12-04T14:34:54.7349854Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_float64 PASSED [0.3412s] [ 89%] 2025-12-04T14:34:54.7350477Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_int16 PASSED [0.2866s] [ 89%] 2025-12-04T14:34:54.7351090Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_int32 PASSED [0.2848s] [ 89%] 2025-12-04T14:34:54.7351709Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSC_cuda_int64 PASSED [0.2890s] [ 89%] 2025-12-04T14:34:54.7352357Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_bfloat16 PASSED [0.3393s] [ 89%] 2025-12-04T14:34:54.7352966Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_bool PASSED [0.2897s] [ 89%] 2025-12-04T14:34:54.7353617Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_complex128 PASSED [0.3411s] [ 89%] 2025-12-04T14:34:54.7354242Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_float16 PASSED [0.3414s] [ 89%] 2025-12-04T14:34:54.7354878Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_float64 PASSED [0.3406s] [ 89%] 2025-12-04T14:34:54.7355553Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_int16 PASSED [0.2893s] [ 89%] 2025-12-04T14:34:54.7356197Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseBSR_cuda_int32 PASSED [0.2890s] [ 89%] 2025-12-04T14:34:54.7356835Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_bfloat16 PASSED [0.3406s] [ 89%] 2025-12-04T14:34:54.7357482Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_complex128 PASSED [0.3457s] [ 89%] 2025-12-04T14:34:54.7358100Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_int8 PASSED [0.2885s] [ 89%] 2025-12-04T14:34:54.7358712Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSC_cuda_uint8 PASSED [0.2879s] [ 90%] 2025-12-04T14:34:54.7359329Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_bool PASSED [0.2899s] [ 90%] 2025-12-04T14:34:54.7359981Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_complex64 PASSED [0.3448s] [ 90%] 2025-12-04T14:34:54.7360600Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_float16 PASSED [0.3448s] [ 90%] 2025-12-04T14:34:54.7361227Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_float32 PASSED [0.3441s] [ 90%] 2025-12-04T14:34:54.7361838Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_list_SparseCSR_cuda_uint8 PASSED [0.2879s] [ 90%] 2025-12-04T14:34:54.7362459Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_bool PASSED [0.3416s] [ 90%] 2025-12-04T14:34:54.7363148Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_float16 PASSED [0.4549s] [ 90%] 2025-12-04T14:34:54.7363781Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_float64 PASSED [0.4598s] [ 90%] 2025-12-04T14:34:54.7364408Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSC_cuda_int8 PASSED [0.3485s] [ 90%] 2025-12-04T14:34:54.7365025Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_bool PASSED [0.3479s] [ 90%] 2025-12-04T14:34:54.7365689Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_complex128 PASSED [0.4648s] [ 90%] 2025-12-04T14:34:54.7366344Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_complex64 PASSED [0.4686s] [ 90%] 2025-12-04T14:34:54.7366985Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_float16 PASSED [0.4596s] [ 90%] 2025-12-04T14:34:54.7367632Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_float64 PASSED [0.4661s] [ 90%] 2025-12-04T14:34:54.7368248Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseBSR_cuda_int8 PASSED [0.3546s] [ 90%] 2025-12-04T14:34:54.7368900Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSC_cuda_float16 PASSED [0.4530s] [ 90%] 2025-12-04T14:34:54.7369529Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSC_cuda_int32 PASSED [0.3423s] [ 90%] 2025-12-04T14:34:54.7370242Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_complex128 PASSED [0.4637s] [ 90%] 2025-12-04T14:34:54.7370943Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_complex64 PASSED [0.4625s] [ 90%] 2025-12-04T14:34:54.7371584Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_float16 PASSED [0.4518s] [ 90%] 2025-12-04T14:34:54.7372231Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_float32 PASSED [0.4488s] [ 90%] 2025-12-04T14:34:54.7372871Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_float64 PASSED [0.4511s] [ 90%] 2025-12-04T14:34:54.7373508Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor___factory_from_tensor_SparseCSR_cuda_uint8 PASSED [0.3450s] [ 90%] 2025-12-04T14:34:54.7374334Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_bfloat16 SKIPPED [0.2334s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7375133Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_bool SKIPPED [0.2272s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7375952Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_float32 SKIPPED [0.2263s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7376754Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_int16 SKIPPED [0.2347s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7377559Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_int8 SKIPPED [0.2282s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7378493Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSC_cuda_uint8 SKIPPED [0.2270s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7379320Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_bfloat16 SKIPPED [0.2296s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7380112Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_bool SKIPPED [0.2336s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7380935Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_complex128 SKIPPED [0.2280s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7381776Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_complex64 SKIPPED [0.2271s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7382591Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_float16 SKIPPED [0.2252s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7383400Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_int8 SKIPPED [0.2279s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7384195Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseBSR_cuda_uint8 SKIPPED [0.2296s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7385025Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_bfloat16 SKIPPED [0.2321s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7385895Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_bool SKIPPED [0.2277s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7386776Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_complex128 SKIPPED [0.2271s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7387606Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSC_cuda_complex64 SKIPPED [0.2262s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7388563Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_bfloat16 SKIPPED [0.2252s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7389399Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_complex128 SKIPPED [0.2387s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7390213Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_float32 SKIPPED [0.2284s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7391028Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_int32 SKIPPED [0.2281s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7391826Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_int64 SKIPPED [0.2273s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7392620Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_int8 SKIPPED [0.2272s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7393430Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_list_SparseCSR_cuda_uint8 SKIPPED [0.2260s] (nothing to test) [ 91%] 2025-12-04T14:34:54.7394229Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_bfloat16 PASSED [0.4386s] [ 92%] 2025-12-04T14:34:54.7394954Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_bool PASSED [0.3397s] [ 92%] 2025-12-04T14:34:54.7395705Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_complex128 PASSED [0.4493s] [ 92%] 2025-12-04T14:34:54.7396450Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_complex64 PASSED [0.4511s] [ 92%] 2025-12-04T14:34:54.7397164Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_int8 PASSED [0.3390s] [ 92%] 2025-12-04T14:34:54.7397890Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSC_cuda_uint8 PASSED [0.3411s] [ 92%] 2025-12-04T14:34:54.7398643Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_bfloat16 PASSED [0.4419s] [ 92%] 2025-12-04T14:34:54.7399358Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_bool PASSED [0.3385s] [ 92%] 2025-12-04T14:34:54.7400113Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_complex64 PASSED [0.4500s] [ 92%] 2025-12-04T14:34:54.7400842Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_float64 PASSED [0.4445s] [ 92%] 2025-12-04T14:34:54.7401659Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseBSR_cuda_int32 PASSED [0.3495s] [ 92%] 2025-12-04T14:34:54.7402451Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_complex128 PASSED [0.4470s] [ 92%] 2025-12-04T14:34:54.7403195Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_complex64 PASSED [0.4420s] [ 92%] 2025-12-04T14:34:54.7403928Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_float16 PASSED [0.4339s] [ 92%] 2025-12-04T14:34:54.7404655Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_float32 PASSED [0.4350s] [ 92%] 2025-12-04T14:34:54.7405387Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSC_cuda_uint8 PASSED [0.3353s] [ 92%] 2025-12-04T14:34:54.7406129Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_bfloat16 PASSED [0.4309s] [ 92%] 2025-12-04T14:34:54.7406886Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_complex128 PASSED [0.4436s] [ 92%] 2025-12-04T14:34:54.7407628Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_complex64 PASSED [0.4423s] [ 92%] 2025-12-04T14:34:54.7408355Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_float32 PASSED [0.4313s] [ 92%] 2025-12-04T14:34:54.7409086Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference___from_tensor_SparseCSR_cuda_uint8 PASSED [0.3318s] [ 92%] 2025-12-04T14:34:54.7409979Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_complex64 SKIPPED [0.2290s] (nothing to test) [ 92%] 2025-12-04T14:34:54.7410828Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_float32 SKIPPED [0.2213s] (nothing to test) [ 92%] 2025-12-04T14:34:54.7411658Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_int16 SKIPPED [0.2220s] (nothing to test) [ 92%] 2025-12-04T14:34:54.7412496Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSC_cuda_int64 SKIPPED [0.2210s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7413351Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_complex64 SKIPPED [0.2212s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7414199Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_float16 SKIPPED [0.2213s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7415051Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_float32 SKIPPED [0.2266s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7415889Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_float64 SKIPPED [0.2296s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7416724Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_int8 SKIPPED [0.2228s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7417620Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseBSR_cuda_uint8 SKIPPED [0.2224s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7418585Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_bfloat16 SKIPPED [0.2323s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7419440Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_complex64 SKIPPED [0.2239s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7420281Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_float16 SKIPPED [0.2303s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7421142Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_float32 SKIPPED [0.2188s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7421979Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_float64 SKIPPED [0.2203s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7422820Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_int64 SKIPPED [0.2262s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7423642Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_int8 SKIPPED [0.2227s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7424474Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSC_cuda_uint8 SKIPPED [0.2204s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7425316Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_float32 SKIPPED [0.2165s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7426194Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_int16 SKIPPED [0.2191s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7427020Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_int32 SKIPPED [0.2206s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7427847Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_int64 SKIPPED [0.2224s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7428687Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_list_SparseCSR_cuda_uint8 SKIPPED [0.2204s] (nothing to test) [ 93%] 2025-12-04T14:34:54.7429454Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_bfloat16 PASSED [0.4323s] [ 93%] 2025-12-04T14:34:54.7430229Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_float16 PASSED [0.4434s] [ 93%] 2025-12-04T14:34:54.7430990Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_float64 PASSED [0.4364s] [ 93%] 2025-12-04T14:34:54.7431748Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_int32 PASSED [0.3438s] [ 93%] 2025-12-04T14:34:54.7432494Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSC_cuda_int8 PASSED [0.3374s] [ 94%] 2025-12-04T14:34:54.7433334Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_complex64 PASSED [0.4415s] [ 94%] 2025-12-04T14:34:54.7434136Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_float16 PASSED [0.4364s] [ 94%] 2025-12-04T14:34:54.7434887Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_int64 PASSED [0.3321s] [ 94%] 2025-12-04T14:34:54.7435645Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_int8 PASSED [0.3315s] [ 94%] 2025-12-04T14:34:54.7436396Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseBSR_cuda_uint8 PASSED [0.3329s] [ 94%] 2025-12-04T14:34:54.7437180Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_bfloat16 PASSED [0.4271s] [ 94%] 2025-12-04T14:34:54.7437932Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_bool PASSED [0.3294s] [ 94%] 2025-12-04T14:34:54.7438696Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_float16 PASSED [0.4217s] [ 94%] 2025-12-04T14:34:54.7439453Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_int32 PASSED [0.3214s] [ 94%] 2025-12-04T14:34:54.7440204Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_int64 PASSED [0.3195s] [ 94%] 2025-12-04T14:34:54.7440971Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSC_cuda_uint8 PASSED [0.3227s] [ 94%] 2025-12-04T14:34:54.7441775Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_bfloat16 PASSED [0.4237s] [ 94%] 2025-12-04T14:34:54.7442524Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_bool PASSED [0.3350s] [ 94%] 2025-12-04T14:34:54.7443317Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_complex128 PASSED [0.4353s] [ 94%] 2025-12-04T14:34:54.7444094Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_complex64 PASSED [0.4350s] [ 94%] 2025-12-04T14:34:54.7444861Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_int8 PASSED [0.3289s] [ 94%] 2025-12-04T14:34:54.7445616Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_constructor_shape_and_device_inference_factory_from_tensor_SparseCSR_cuda_uint8 PASSED [0.3348s] [ 94%] 2025-12-04T14:34:54.7446171Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_bool PASSED [0.3761s] [ 94%] 2025-12-04T14:34:54.7446744Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_complex128 PASSED [0.3655s] [ 94%] 2025-12-04T14:34:54.7447300Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_float64 PASSED [0.3605s] [ 94%] 2025-12-04T14:34:54.7447888Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_int16 PASSED [0.3532s] [ 94%] 2025-12-04T14:34:54.7448472Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_int32 PASSED [0.3553s] [ 94%] 2025-12-04T14:34:54.7449051Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_int8 PASSED [0.3555s] [ 94%] 2025-12-04T14:34:54.7449597Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSC_cuda_uint8 PASSED [0.3568s] [ 95%] 2025-12-04T14:34:54.7450150Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_bfloat16 PASSED [0.3601s] [ 95%] 2025-12-04T14:34:54.7450729Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_complex128 PASSED [0.3654s] [ 95%] 2025-12-04T14:34:54.7451277Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_float32 PASSED [0.3876s] [ 95%] 2025-12-04T14:34:54.7451845Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_float64 PASSED [0.3698s] [ 95%] 2025-12-04T14:34:54.7452392Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_int32 PASSED [0.3711s] [ 95%] 2025-12-04T14:34:54.7453162Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_int8 PASSED [0.3625s] [ 95%] 2025-12-04T14:34:54.7453718Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseBSR_cuda_uint8 PASSED [0.3684s] [ 95%] 2025-12-04T14:34:54.7454271Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_bfloat16 PASSED [0.3688s] [ 95%] 2025-12-04T14:34:54.7454855Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_complex128 PASSED [0.3637s] [ 95%] 2025-12-04T14:34:54.7455408Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_float32 PASSED [0.3696s] [ 95%] 2025-12-04T14:34:54.7456008Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSC_cuda_int8 PASSED [0.3602s] [ 95%] 2025-12-04T14:34:54.7456565Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_bfloat16 PASSED [0.3778s] [ 95%] 2025-12-04T14:34:54.7457099Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_bool PASSED [0.3615s] [ 95%] 2025-12-04T14:34:54.7457673Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_complex64 PASSED [0.3633s] [ 95%] 2025-12-04T14:34:54.7458307Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_float32 PASSED [0.3517s] [ 95%] 2025-12-04T14:34:54.7458879Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_float64 PASSED [0.3624s] [ 95%] 2025-12-04T14:34:54.7459426Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_int16 PASSED [0.3472s] [ 95%] 2025-12-04T14:34:54.7459968Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_int32 PASSED [0.3437s] [ 95%] 2025-12-04T14:34:54.7460522Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_int64 PASSED [0.3641s] [ 95%] 2025-12-04T14:34:54.7461059Z test_sparse_csr.py::TestSparseCompressedCUDA::test_sparse_compressed_tensor_with_dims_SparseCSR_cuda_uint8 PASSED [0.3504s] [ 95%] 2025-12-04T14:34:54.7461499Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_bfloat16 PASSED [0.4241s] [ 95%] 2025-12-04T14:34:54.7461935Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_complex128 PASSED [0.4229s] [ 95%] 2025-12-04T14:34:54.7462388Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_float64 PASSED [0.4158s] [ 95%] 2025-12-04T14:34:54.7462840Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_int16 PASSED [0.4166s] [ 96%] 2025-12-04T14:34:54.7463279Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSC_cuda_int32 PASSED [0.4189s] [ 96%] 2025-12-04T14:34:54.7463725Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_complex128 PASSED [0.4210s] [ 96%] 2025-12-04T14:34:54.7464154Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_complex64 PASSED [0.4185s] [ 96%] 2025-12-04T14:34:54.7464568Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_float64 PASSED [0.4155s] [ 96%] 2025-12-04T14:34:54.7464984Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_int8 PASSED [0.4189s] [ 96%] 2025-12-04T14:34:54.7465398Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseBSR_cuda_uint8 PASSED [0.4189s] [ 96%] 2025-12-04T14:34:54.7465834Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_bfloat16 PASSED [0.4300s] [ 96%] 2025-12-04T14:34:54.7466237Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_bool PASSED [0.4300s] [ 96%] 2025-12-04T14:34:54.7466665Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_complex64 PASSED [0.4242s] [ 96%] 2025-12-04T14:34:54.7467087Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_float32 PASSED [0.4166s] [ 96%] 2025-12-04T14:34:54.7467493Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_int16 PASSED [0.4268s] [ 96%] 2025-12-04T14:34:54.7467895Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_int32 PASSED [0.4272s] [ 96%] 2025-12-04T14:34:54.7468306Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSC_cuda_int8 PASSED [0.4314s] [ 96%] 2025-12-04T14:34:54.7468727Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_bfloat16 PASSED [0.4315s] [ 96%] 2025-12-04T14:34:54.7469175Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_bool PASSED [0.4298s] [ 96%] 2025-12-04T14:34:54.7469614Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_complex128 PASSED [0.4387s] [ 96%] 2025-12-04T14:34:54.7470029Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_float16 PASSED [0.4347s] [ 96%] 2025-12-04T14:34:54.7470444Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_int64 PASSED [0.4281s] [ 96%] 2025-12-04T14:34:54.7470850Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_int8 PASSED [0.4316s] [ 96%] 2025-12-04T14:34:54.7471260Z test_sparse_csr.py::TestSparseCompressedCUDA::test_to_dtype_SparseCSR_cuda_uint8 PASSED [0.4325s] [ 96%] 2025-12-04T14:34:54.7471686Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_float16 PASSED [0.3066s] [ 96%] 2025-12-04T14:34:54.7472108Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_float32 PASSED [0.3086s] [ 96%] 2025-12-04T14:34:54.7472544Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_float64 PASSED [0.2977s] [ 96%] 2025-12-04T14:34:54.7472959Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_int16 PASSED [0.2930s] [ 97%] 2025-12-04T14:34:54.7473375Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_int32 PASSED [0.2929s] [ 97%] 2025-12-04T14:34:54.7473782Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSC_cuda_int8 PASSED [0.2909s] [ 97%] 2025-12-04T14:34:54.7474191Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_bool PASSED [0.2946s] [ 97%] 2025-12-04T14:34:54.7474619Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_float32 PASSED [0.3033s] [ 97%] 2025-12-04T14:34:54.7475060Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_int64 PASSED [0.2933s] [ 97%] 2025-12-04T14:34:54.7475508Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseBSR_cuda_uint8 PASSED [0.2901s] [ 97%] 2025-12-04T14:34:54.7475973Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_bfloat16 PASSED [0.2963s] [ 97%] 2025-12-04T14:34:54.7476398Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_float16 PASSED [0.2975s] [ 97%] 2025-12-04T14:34:54.7476834Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_float64 PASSED [0.2917s] [ 97%] 2025-12-04T14:34:54.7477249Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_int16 PASSED [0.2930s] [ 97%] 2025-12-04T14:34:54.7477672Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSC_cuda_int64 PASSED [0.2865s] [ 97%] 2025-12-04T14:34:54.7478112Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_complex64 PASSED [0.2951s] [ 97%] 2025-12-04T14:34:54.7478534Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_float16 PASSED [0.2938s] [ 97%] 2025-12-04T14:34:54.7478957Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int16 PASSED [0.2885s] [ 97%] 2025-12-04T14:34:54.7479370Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int32 PASSED [0.2864s] [ 97%] 2025-12-04T14:34:54.7479781Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int64 PASSED [0.2856s] [ 97%] 2025-12-04T14:34:54.7480198Z test_sparse_csr.py::TestSparseCompressedCUDA::test_validate_SparseCSR_cuda_int8 PASSED [0.2948s] [ 97%] 2025-12-04T14:34:54.7481007Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_16_int64_cuda_float16 SKIPPED [0.0003s] (Skipped for internal with remote GPUs) [ 97%] 2025-12-04T14:34:54.7481810Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_16_int64_cuda_float32 SKIPPED [0.0002s] (Skipped for internal with remote GPUs) [ 97%] 2025-12-04T14:34:54.7482606Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_32_int32_cuda_float16 SKIPPED [0.0002s] (Skipped for internal with remote GPUs) [ 97%] 2025-12-04T14:34:54.7483442Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_32_int32_cuda_float32 SKIPPED [0.0002s] (Skipped for internal with remote GPUs) [ 97%] 2025-12-04T14:34:54.7484222Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_32_int64_cuda_float16 SKIPPED [0.0002s] (Skipped for internal with remote GPUs) [ 97%] 2025-12-04T14:34:54.7485004Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_64_int32_cuda_float32 SKIPPED [0.0002s] (Skipped for internal with remote GPUs) [ 97%] 2025-12-04T14:34:54.7485800Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_dense_bmm_block_size_64_int64_cuda_float32 SKIPPED [0.0002s] (Skipped for internal with remote GPUs) [ 98%] 2025-12-04T14:34:54.7486401Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_16_cuda_float32 PASSED [18.9305s] [ 98%] 2025-12-04T14:34:54.7487002Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_2_cuda_float32 PASSED [2.1766s] [ 98%] 2025-12-04T14:34:54.7487597Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_2x3_cuda_float32 PASSED [2.1636s] [ 98%] 2025-12-04T14:34:54.7488358Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_scatter_mm_blocksize_32_cuda_float32 PASSED [31.3896s] [ 98%] 2025-12-04T14:34:54.7489153Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_softmax_cuda_float16 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 98%] 2025-12-04T14:34:54.7490038Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_bsr_softmax_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 98%] 2025-12-04T14:34:54.7491162Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16_out_dtype_unspecified_cuda_int8 SKIPPED [0.2479s] (triton kernel does not support support int8 blocks smaller than 32) [ 98%] 2025-12-04T14:34:54.7492040Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_int32_cuda_float16 SKIPPED [0.2155s] (out dtype not implemented) [ 98%] 2025-12-04T14:34:54.7493118Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_unspecified_cuda_float16 SKIPPED [0.2182s] (Redundant test: _int_bsr_dense_addmm on torch.float16 tensors) [ 98%] 2025-12-04T14:34:54.7494176Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_unspecified_cuda_float32 SKIPPED [0.2154s] (Redundant test: _int_bsr_dense_addmm on torch.float32 tensors) [ 98%] 2025-12-04T14:34:54.7495267Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_16x32_out_dtype_unspecified_cuda_int8 SKIPPED [0.2188s] (triton kernel does not support support int8 blocks smaller than 32) [ 98%] 2025-12-04T14:34:54.7496106Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 SKIPPED [0.2175s] (out dtype not implemented) [ 98%] 2025-12-04T14:34:54.7497162Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_float16 SKIPPED [0.2160s] (Redundant test: _int_bsr_dense_addmm on torch.float16 tensors) [ 98%] 2025-12-04T14:34:54.7499310Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7499626Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7499726Z ^ 2025-12-04T14:34:54.7499820Z module { 2025-12-04T14:34:54.7504390Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7504578Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7504693Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7504831Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7505006Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7505152Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7505261Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7505391Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7505500Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7505611Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7505730Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7505837Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7505954Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7506070Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7506179Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7506303Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7506407Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7506521Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7506642Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7506748Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7506848Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7506966Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7507073Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7507223Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7507351Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7507458Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7507613Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7507720Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7507827Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7508038Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7508238Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7508349Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7508526Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7508640Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7508756Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7508883Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7509126Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7509274Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7509397Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7509567Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7509770Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7509992Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7510249Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7510432Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7510640Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7510768Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7510905Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7511016Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7511153Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7511287Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7511408Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7511585Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7511776Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7512011Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7512281Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7512492Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7512698Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7512840Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7512979Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7513098Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7513224Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7513347Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7513471Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7513607Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7513739Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7513916Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7514107Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7514372Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7514571Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7514715Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7514840Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7515370Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7515526Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7515643Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7515761Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7515881Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7516017Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7516228Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7516392Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7516646Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7516798Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7517016Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7517162Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7517423Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7517514Z } 2025-12-04T14:34:54.7517671Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7517762Z tt.return 2025-12-04T14:34:54.7517850Z } 2025-12-04T14:34:54.7517949Z } 2025-12-04T14:34:54.7517958Z 2025-12-04T14:34:54.7518043Z {-# 2025-12-04T14:34:54.7518152Z external_resources: { 2025-12-04T14:34:54.7518263Z mlir_reproducer: { 2025-12-04T14:34:54.7524342Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7524504Z disable_threading: false, 2025-12-04T14:34:54.7524633Z verify_each: true 2025-12-04T14:34:54.7524732Z } 2025-12-04T14:34:54.7524818Z } 2025-12-04T14:34:54.7524903Z #-} 2025-12-04T14:34:54.7525631Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7526919Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7527063Z ('RERUN', {'yellow': True}) [0.5389s] [ 98%] 2025-12-04T14:34:54.7529114Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7529365Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7529467Z ^ 2025-12-04T14:34:54.7529558Z module { 2025-12-04T14:34:54.7534117Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7534287Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7534402Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7534512Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7534682Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7534794Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7534904Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7535022Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7535129Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7535242Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7535360Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7535467Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7535581Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7535692Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7535831Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7535954Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7536057Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7536201Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7536318Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7536420Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7536552Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7536665Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7536770Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7536910Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7537047Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7537154Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7537307Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7537415Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7537519Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7537730Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7537926Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7538039Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7538265Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7538386Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7538491Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7538630Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7538862Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7539005Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7539130Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7539303Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7539506Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7539731Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7539978Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7540229Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7540432Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7540556Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7540692Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7540801Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7540933Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7541066Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7541190Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7541369Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7541558Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7541792Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7542037Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7542215Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7542420Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7542533Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7542670Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7542794Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7542919Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7543040Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7543161Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7543294Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7543424Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7543711Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7543944Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7544211Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7544442Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7544577Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7544718Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7545253Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7545413Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7545534Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7545653Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7545780Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7545913Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7546120Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7546265Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7546523Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7546681Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7546901Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7547050Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7547318Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7547406Z } 2025-12-04T14:34:54.7547556Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7547670Z tt.return 2025-12-04T14:34:54.7547757Z } 2025-12-04T14:34:54.7547860Z } 2025-12-04T14:34:54.7547867Z 2025-12-04T14:34:54.7547955Z {-# 2025-12-04T14:34:54.7548095Z external_resources: { 2025-12-04T14:34:54.7548211Z mlir_reproducer: { 2025-12-04T14:34:54.7554274Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7554412Z disable_threading: false, 2025-12-04T14:34:54.7554515Z verify_each: true 2025-12-04T14:34:54.7554615Z } 2025-12-04T14:34:54.7554703Z } 2025-12-04T14:34:54.7554790Z #-} 2025-12-04T14:34:54.7555518Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7556800Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7556977Z ('RERUN', {'yellow': True}) [0.5248s] [ 98%] 2025-12-04T14:34:54.7559093Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7559342Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7559452Z ^ 2025-12-04T14:34:54.7559544Z module { 2025-12-04T14:34:54.7564120Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7564290Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7564403Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7564516Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7564688Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7564802Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7564915Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7565037Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7565147Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7565260Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7565380Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7565484Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7565592Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7565718Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7565825Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7565949Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7566056Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7566172Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7566294Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7566405Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7566510Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7566630Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7566737Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7566877Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7567014Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7567121Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7567276Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7567383Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7567524Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7567736Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7567963Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7568075Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7568253Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7568369Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7568475Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7568611Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7568840Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7568984Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7569109Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7569278Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7569483Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7569707Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7569952Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7570148Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7570348Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7570473Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7570609Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7570723Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7570862Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7570999Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7571124Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7571301Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7571496Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7571791Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7572033Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7572222Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7572430Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7572546Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7572684Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7572807Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7572935Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7573047Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7573184Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7573322Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7573462Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7573649Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7573845Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7574110Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7574311Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7574446Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7574580Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7575108Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7575262Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7575412Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7575530Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7575654Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7575816Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7576062Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7576207Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7576460Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7576617Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7576839Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7576987Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7577251Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7577342Z } 2025-12-04T14:34:54.7577495Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7577600Z tt.return 2025-12-04T14:34:54.7577688Z } 2025-12-04T14:34:54.7577777Z } 2025-12-04T14:34:54.7577796Z 2025-12-04T14:34:54.7577883Z {-# 2025-12-04T14:34:54.7577993Z external_resources: { 2025-12-04T14:34:54.7578110Z mlir_reproducer: { 2025-12-04T14:34:54.7584252Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7584426Z disable_threading: false, 2025-12-04T14:34:54.7584527Z verify_each: true 2025-12-04T14:34:54.7584628Z } 2025-12-04T14:34:54.7584716Z } 2025-12-04T14:34:54.7584802Z #-} 2025-12-04T14:34:54.7585531Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7586815Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7586924Z FAILED [0.5249s] [ 98%] 2025-12-04T14:34:54.7586941Z 2025-12-04T14:34:54.7587085Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.7587674Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7587992Z Traceback (most recent call last): 2025-12-04T14:34:54.7588369Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7588489Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7589027Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7589137Z return bsr_dense_addmm( 2025-12-04T14:34:54.7589709Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7589971Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7590489Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7590619Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7591063Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7591181Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7591608Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7591891Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7592289Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7592595Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7593032Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7593282Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7593733Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7593887Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7594386Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7594712Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7595237Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7595335Z pm.run(mod) 2025-12-04T14:34:54.7595469Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7595475Z 2025-12-04T14:34:54.7595750Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7596631Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7596638Z 2025-12-04T14:34:54.7596921Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7597506Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7597626Z Traceback (most recent call last): 2025-12-04T14:34:54.7598009Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7598131Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7598664Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7598779Z return bsr_dense_addmm( 2025-12-04T14:34:54.7599272Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7599506Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7599980Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7600104Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7600546Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7600665Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7601089Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7601374Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7601821Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7602143Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7602606Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7602854Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7603304Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7603449Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7603962Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7604294Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7604826Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7604928Z pm.run(mod) 2025-12-04T14:34:54.7605065Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7605071Z 2025-12-04T14:34:54.7605298Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7606174Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7606180Z 2025-12-04T14:34:54.7606458Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7606606Z =================================== FAILURES =================================== 2025-12-04T14:34:54.7607193Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7607335Z Traceback (most recent call last): 2025-12-04T14:34:54.7607707Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7607863Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7608400Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7608512Z return bsr_dense_addmm( 2025-12-04T14:34:54.7609021Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7609242Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7609716Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7609848Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7610293Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7610416Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7610846Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7611129Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7611524Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7611830Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7612261Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7612511Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7612961Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7613145Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7613670Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7614002Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7614556Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7614654Z pm.run(mod) 2025-12-04T14:34:54.7614786Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7614799Z 2025-12-04T14:34:54.7615016Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7615888Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7615894Z 2025-12-04T14:34:54.7616170Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7616799Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-bc7545635763bbdc.xml - 2025-12-04T14:34:54.7616988Z =========================== short test summary info ============================ 2025-12-04T14:34:54.7617889Z FAILED [0.5249s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7617895Z 2025-12-04T14:34:54.7618109Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7619060Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7619066Z 2025-12-04T14:34:54.7619330Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7619527Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.7619800Z ======= 1 failed, 2104 passed, 262 skipped, 2 rerun in 467.41s (0:07:47) ======= 2025-12-04T14:34:54.7619898Z Got exit code 1 2025-12-04T14:34:54.7620019Z Retrying single test... 2025-12-04T14:34:54.7620469Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-fad3091b7781ac2f.xml 2025-12-04T14:34:54.7620632Z ============================= test session starts ============================== 2025-12-04T14:34:54.7621000Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.7621108Z cachedir: .pytest_cache 2025-12-04T14:34:54.7621642Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.7621771Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.7621880Z configfile: pytest.ini 2025-12-04T14:34:54.7622492Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.7622727Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.7623704Z stepcurrent: skipping 2366 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7623818Z Running 1 items in this shard 2025-12-04T14:34:54.7623824Z 2025-12-04T14:34:54.7625875Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7626212Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7626313Z ^ 2025-12-04T14:34:54.7626423Z module { 2025-12-04T14:34:54.7631009Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7631151Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7631265Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7631385Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7631548Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7631660Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7631786Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7631897Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7632049Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7632160Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7632273Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7632391Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7632496Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7632611Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7632728Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7632840Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7632944Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7633071Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7633178Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7633286Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7633401Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7633509Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7633629Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7633769Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7633905Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7634025Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7634171Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7634279Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7634400Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7634603Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7634803Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7634924Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7635061Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7635187Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7635294Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7635452Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7635696Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7635866Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7635994Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7636208Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7636407Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7636642Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7636888Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7637074Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7637287Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7637401Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7637541Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7637665Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7637795Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7637931Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7638071Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7638242Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7638447Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7638671Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7638913Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7639106Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7639306Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7639421Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7639568Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7639715Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7639857Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7639969Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7640099Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7640245Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7640371Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7640550Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7640756Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7641006Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7641222Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7641359Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7641485Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7642035Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7642184Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7642302Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7642432Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7642543Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7642688Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7642889Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7643021Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7643282Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7643455Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7643705Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7643868Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7644150Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7644254Z } 2025-12-04T14:34:54.7644404Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7644501Z tt.return 2025-12-04T14:34:54.7644598Z } 2025-12-04T14:34:54.7644686Z } 2025-12-04T14:34:54.7644691Z 2025-12-04T14:34:54.7644778Z {-# 2025-12-04T14:34:54.7644897Z external_resources: { 2025-12-04T14:34:54.7645001Z mlir_reproducer: { 2025-12-04T14:34:54.7651050Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7651173Z disable_threading: false, 2025-12-04T14:34:54.7651288Z verify_each: true 2025-12-04T14:34:54.7651373Z } 2025-12-04T14:34:54.7651462Z } 2025-12-04T14:34:54.7651593Z #-} 2025-12-04T14:34:54.7652315Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7653612Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7653744Z ('RERUN', {'yellow': True}) [3.0811s] [100%] 2025-12-04T14:34:54.7655803Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7656068Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7656171Z ^ 2025-12-04T14:34:54.7656274Z module { 2025-12-04T14:34:54.7660990Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7661157Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7661271Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7661394Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7661557Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7661666Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7661793Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7661906Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7662012Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7662138Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7662249Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7662376Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7662480Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7662592Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7662712Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7662821Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7662927Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7663056Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7663158Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7663267Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7663382Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7663490Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7663643Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7663784Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7663914Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7664032Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7664174Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7664287Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7664403Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7664606Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7664802Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7664926Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7665059Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7665190Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7665297Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7665422Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7665671Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7665808Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7665932Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7666116Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7666310Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7666534Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7666792Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7666978Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7667186Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7667351Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7667520Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7667643Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7667775Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7667943Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7668079Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7668251Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7668459Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7668685Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7668927Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7669121Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7669324Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7669440Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7669591Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7669703Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7669843Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7669954Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7670079Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7670224Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7670348Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7670524Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7670730Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7670981Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7671192Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7671325Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7672016Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7672565Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7672714Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7672829Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7672960Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7673069Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7673202Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7673419Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7673550Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7673813Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7673957Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7674174Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7674338Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7674587Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7674677Z } 2025-12-04T14:34:54.7674837Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7674927Z tt.return 2025-12-04T14:34:54.7675028Z } 2025-12-04T14:34:54.7675111Z } 2025-12-04T14:34:54.7675116Z 2025-12-04T14:34:54.7675202Z {-# 2025-12-04T14:34:54.7675317Z external_resources: { 2025-12-04T14:34:54.7675417Z mlir_reproducer: { 2025-12-04T14:34:54.7681545Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7681692Z disable_threading: false, 2025-12-04T14:34:54.7681804Z verify_each: true 2025-12-04T14:34:54.7681891Z } 2025-12-04T14:34:54.7681975Z } 2025-12-04T14:34:54.7682072Z #-} 2025-12-04T14:34:54.7682786Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7684066Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7684206Z ('RERUN', {'yellow': True}) [0.5091s] [100%] 2025-12-04T14:34:54.7686260Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7686564Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7686661Z ^ 2025-12-04T14:34:54.7686766Z module { 2025-12-04T14:34:54.7691481Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7691621Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7691741Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7691921Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7692080Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7692232Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7692359Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7692471Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7692616Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7692740Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7692847Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7692962Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7693070Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7693183Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7693305Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7693417Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7693520Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7693647Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7693752Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7693858Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7693971Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7694077Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7694183Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7694333Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7694460Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7694578Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7694716Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7694824Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7694938Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7695142Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7695336Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7695454Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7695592Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7695708Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7695871Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7695999Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7696235Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7696370Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7696497Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7696679Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7696872Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7697095Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7697352Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7697537Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7697747Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7697868Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7698000Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7698190Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7698320Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7698452Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7698590Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7698757Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7698957Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7699181Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7699419Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7699655Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7699879Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7699999Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7700154Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7700292Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7700433Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7700542Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7700666Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7700816Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7700939Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7701116Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7701320Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7701572Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7701772Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7701919Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7702044Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7702586Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7702729Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7702849Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7702976Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7703087Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7703221Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7703427Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7703561Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7703825Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7704014Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7704237Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7704398Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7704650Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7704739Z } 2025-12-04T14:34:54.7710223Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7710365Z tt.return 2025-12-04T14:34:54.7710462Z } 2025-12-04T14:34:54.7710545Z } 2025-12-04T14:34:54.7710552Z 2025-12-04T14:34:54.7710635Z {-# 2025-12-04T14:34:54.7710756Z external_resources: { 2025-12-04T14:34:54.7710862Z mlir_reproducer: { 2025-12-04T14:34:54.7716993Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7717231Z disable_threading: false, 2025-12-04T14:34:54.7717340Z verify_each: true 2025-12-04T14:34:54.7717428Z } 2025-12-04T14:34:54.7717514Z } 2025-12-04T14:34:54.7717640Z #-} 2025-12-04T14:34:54.7718360Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7719665Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7719763Z FAILED [0.4946s] [100%] 2025-12-04T14:34:54.7719769Z 2025-12-04T14:34:54.7719910Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.7720511Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7720634Z Traceback (most recent call last): 2025-12-04T14:34:54.7721022Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7721141Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7721665Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7721785Z return bsr_dense_addmm( 2025-12-04T14:34:54.7722276Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7722507Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7722979Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7723095Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7723548Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7723775Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7724191Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7724489Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7725097Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7725425Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7725855Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7726091Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7726562Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7726708Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7727222Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7727554Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7728068Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7728180Z pm.run(mod) 2025-12-04T14:34:54.7728312Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7728319Z 2025-12-04T14:34:54.7728536Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7729417Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7729467Z 2025-12-04T14:34:54.7729769Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7730399Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7730519Z Traceback (most recent call last): 2025-12-04T14:34:54.7730891Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7731022Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7731539Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7731658Z return bsr_dense_addmm( 2025-12-04T14:34:54.7732149Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7732373Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7732858Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7732971Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7733417Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7733539Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7733953Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7734239Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7734632Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7734933Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7735375Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7735609Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7736090Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7736245Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7736742Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7737077Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7737586Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7737682Z pm.run(mod) 2025-12-04T14:34:54.7737819Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7737825Z 2025-12-04T14:34:54.7738045Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7738995Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7739014Z 2025-12-04T14:34:54.7739283Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7739428Z =================================== FAILURES =================================== 2025-12-04T14:34:54.7740018Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7740137Z Traceback (most recent call last): 2025-12-04T14:34:54.7740503Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7740634Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7741151Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7741305Z return bsr_dense_addmm( 2025-12-04T14:34:54.7741827Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7742079Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7742565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7742675Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7743117Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7743242Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7743652Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7743944Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7744325Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7744633Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7745076Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7745309Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7745767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7745905Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7746400Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7746733Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7747247Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7747373Z pm.run(mod) 2025-12-04T14:34:54.7747513Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7747518Z 2025-12-04T14:34:54.7747736Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7748614Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7748620Z 2025-12-04T14:34:54.7748881Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7749504Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-fad3091b7781ac2f.xml - 2025-12-04T14:34:54.7749685Z =========================== short test summary info ============================ 2025-12-04T14:34:54.7750588Z FAILED [0.4946s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7750595Z 2025-12-04T14:34:54.7750822Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7751687Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7751692Z 2025-12-04T14:34:54.7751970Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7752149Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.7752349Z ================= 1 failed, 2400 deselected, 2 rerun in 4.41s ================== 2025-12-04T14:34:54.7752455Z Got exit code 1 2025-12-04T14:34:54.7752562Z Retrying single test... 2025-12-04T14:34:54.7753040Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-74bf247f22e5b0d6.xml 2025-12-04T14:34:54.7753243Z ============================= test session starts ============================== 2025-12-04T14:34:54.7753640Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.7753764Z cachedir: .pytest_cache 2025-12-04T14:34:54.7754285Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.7754407Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.7754523Z configfile: pytest.ini 2025-12-04T14:34:54.7755111Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.7755347Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.7756317Z stepcurrent: skipping 2366 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7756435Z Running 1 items in this shard 2025-12-04T14:34:54.7756440Z 2025-12-04T14:34:54.7758509Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7758756Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7758863Z ^ 2025-12-04T14:34:54.7758951Z module { 2025-12-04T14:34:54.7763523Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7763688Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7763808Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7763919Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7764076Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7764196Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7764304Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7764422Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7764527Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7764636Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7764753Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7764887Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7764994Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7765119Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7765253Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7765364Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7765508Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7765621Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7765726Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7765845Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7765948Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7766063Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7766167Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7766305Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7766441Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7766546Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7766690Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7766801Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7766910Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7767117Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7767322Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7767431Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7767572Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7767686Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7767791Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7767922Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7768148Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7768282Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7768415Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7768588Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7768792Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7769051Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7769295Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7769487Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7769690Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7769802Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7769948Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7770061Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7770194Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7770331Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7770455Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7770632Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7770828Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7771053Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7771304Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7771485Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7771680Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7771803Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7771939Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7772060Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7772185Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7772295Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7772465Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7772597Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7772757Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7772941Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7773160Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7773420Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7773620Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7773753Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7773887Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7774418Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7774567Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7774692Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7774816Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7774935Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7775071Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7775275Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7775414Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7775664Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7775805Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7776024Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7776171Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7776431Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7776520Z } 2025-12-04T14:34:54.7776670Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7776802Z tt.return 2025-12-04T14:34:54.7776888Z } 2025-12-04T14:34:54.7776974Z } 2025-12-04T14:34:54.7776980Z 2025-12-04T14:34:54.7777075Z {-# 2025-12-04T14:34:54.7777185Z external_resources: { 2025-12-04T14:34:54.7777283Z mlir_reproducer: { 2025-12-04T14:34:54.7783450Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7783583Z disable_threading: false, 2025-12-04T14:34:54.7783681Z verify_each: true 2025-12-04T14:34:54.7783770Z } 2025-12-04T14:34:54.7783864Z } 2025-12-04T14:34:54.7783949Z #-} 2025-12-04T14:34:54.7784668Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7786036Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7786201Z ('RERUN', {'yellow': True}) [3.0642s] [100%] 2025-12-04T14:34:54.7788420Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7788671Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7788782Z ^ 2025-12-04T14:34:54.7788874Z module { 2025-12-04T14:34:54.7793440Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7793630Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7793753Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7793857Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7794012Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7794133Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7794241Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7794348Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7794466Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7794579Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7794697Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7794802Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7794910Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7795033Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7795139Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7795252Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7795363Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7795479Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7795588Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7795699Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7795802Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7795913Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7796020Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7796157Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7796295Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7796446Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7796586Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7796758Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7796867Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7797112Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7797324Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7797429Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7797572Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7797689Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7797791Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7797930Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7798157Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7798294Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7798433Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7798600Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7798799Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7799042Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7799290Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7799487Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7799684Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7799796Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7799942Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7800050Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7800175Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7800319Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7800442Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7800652Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7800841Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7801066Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7801314Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7801498Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7801695Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7801815Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7801951Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7802073Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7802200Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7802310Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7802441Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7802578Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7802701Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7802891Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7803083Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7803342Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7803542Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7803677Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7803810Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7804339Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7804543Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7804670Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7804788Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7804934Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7805067Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7805267Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7805408Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7805658Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7805795Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7806023Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7806174Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7806430Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7806521Z } 2025-12-04T14:34:54.7806666Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7806769Z tt.return 2025-12-04T14:34:54.7806857Z } 2025-12-04T14:34:54.7806940Z } 2025-12-04T14:34:54.7806945Z 2025-12-04T14:34:54.7807048Z {-# 2025-12-04T14:34:54.7807152Z external_resources: { 2025-12-04T14:34:54.7807250Z mlir_reproducer: { 2025-12-04T14:34:54.7813293Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7813453Z disable_threading: false, 2025-12-04T14:34:54.7813553Z verify_each: true 2025-12-04T14:34:54.7813636Z } 2025-12-04T14:34:54.7813733Z } 2025-12-04T14:34:54.7813821Z #-} 2025-12-04T14:34:54.7814534Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7815835Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7815967Z ('RERUN', {'yellow': True}) [0.5030s] [100%] 2025-12-04T14:34:54.7818028Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7818356Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7818499Z ^ 2025-12-04T14:34:54.7818596Z module { 2025-12-04T14:34:54.7823213Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7823337Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7823462Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7823572Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7823728Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7823846Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7823953Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7824058Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7824171Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7824282Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7824429Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7824534Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7824638Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7824757Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7824863Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7824970Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7825081Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7825195Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7825296Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7825411Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7825513Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7825618Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7825739Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7825874Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7826013Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7826118Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7826267Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7826377Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7826480Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7826680Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7826881Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7826989Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7827129Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7827241Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7827342Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7827475Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7827702Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7827869Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7828034Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7828205Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7828427Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7828657Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7828901Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7829090Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7829288Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7829402Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7829543Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7829654Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7829782Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7829922Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7830049Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7830222Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7830413Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7830632Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7830882Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7831061Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7831255Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7831377Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7831509Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7831631Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7831756Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7831899Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7832035Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7832173Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7832296Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7832482Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7832673Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7832919Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7833124Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7833255Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7833390Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7833921Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7834069Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7834193Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7834309Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7834417Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7834557Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7834754Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7834890Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7835139Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7835280Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7835557Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7835701Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7835980Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7836077Z } 2025-12-04T14:34:54.7836250Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7836352Z tt.return 2025-12-04T14:34:54.7836433Z } 2025-12-04T14:34:54.7836514Z } 2025-12-04T14:34:54.7836520Z 2025-12-04T14:34:54.7836609Z {-# 2025-12-04T14:34:54.7836710Z external_resources: { 2025-12-04T14:34:54.7836810Z mlir_reproducer: { 2025-12-04T14:34:54.7842892Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7843011Z disable_threading: false, 2025-12-04T14:34:54.7843108Z verify_each: true 2025-12-04T14:34:54.7843191Z } 2025-12-04T14:34:54.7843277Z } 2025-12-04T14:34:54.7843362Z #-} 2025-12-04T14:34:54.7844073Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7845394Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7845491Z FAILED [0.4887s] [100%] 2025-12-04T14:34:54.7845496Z 2025-12-04T14:34:54.7845642Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.7846226Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7846342Z Traceback (most recent call last): 2025-12-04T14:34:54.7846723Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7846842Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7847369Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7847478Z return bsr_dense_addmm( 2025-12-04T14:34:54.7847969Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7848194Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7848668Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7848777Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7849229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7849343Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7849756Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7850104Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7850488Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7850827Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7851260Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7851498Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7851947Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7852084Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7852586Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7852916Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7853431Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7853536Z pm.run(mod) 2025-12-04T14:34:54.7853671Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7853677Z 2025-12-04T14:34:54.7853898Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7854767Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7854773Z 2025-12-04T14:34:54.7855036Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7855621Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7855741Z Traceback (most recent call last): 2025-12-04T14:34:54.7856119Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7856270Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7856791Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7856907Z return bsr_dense_addmm( 2025-12-04T14:34:54.7857396Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7857617Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7858085Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7858296Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7858747Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7858867Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7859282Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7859574Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7859958Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7860268Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7860702Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7860934Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7861394Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7861571Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7862109Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7862437Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7862977Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7863080Z pm.run(mod) 2025-12-04T14:34:54.7863209Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7863215Z 2025-12-04T14:34:54.7863430Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7864301Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7864308Z 2025-12-04T14:34:54.7864578Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7864727Z =================================== FAILURES =================================== 2025-12-04T14:34:54.7865311Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.7865434Z Traceback (most recent call last): 2025-12-04T14:34:54.7865813Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7865929Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7866455Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1144, in _int_bsr_dense_addmm 2025-12-04T14:34:54.7866563Z return bsr_dense_addmm( 2025-12-04T14:34:54.7867052Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7867276Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7867748Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7867902Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7868342Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7868461Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7868875Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7869152Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7869534Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7869836Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7870261Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7870502Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7870952Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7871091Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7871593Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7871921Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7872433Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7872523Z pm.run(mod) 2025-12-04T14:34:54.7872653Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7872659Z 2025-12-04T14:34:54.7872879Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7873809Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7873818Z 2025-12-04T14:34:54.7874122Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7874739Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-74bf247f22e5b0d6.xml - 2025-12-04T14:34:54.7874908Z =========================== short test summary info ============================ 2025-12-04T14:34:54.7875809Z FAILED [0.4887s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7875815Z 2025-12-04T14:34:54.7876025Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7876899Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7876906Z 2025-12-04T14:34:54.7877166Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7877343Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.7877549Z ================= 1 failed, 2400 deselected, 2 rerun in 4.39s ================== 2025-12-04T14:34:54.7877644Z Got exit code 1 2025-12-04T14:34:54.7878436Z FAILED CONSISTENTLY: test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.7878844Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T14:34:54.7879283Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-4c8f61ef5d858418.xml 2025-12-04T14:34:54.7879449Z ============================= test session starts ============================== 2025-12-04T14:34:54.7879843Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.7879950Z cachedir: .pytest_cache 2025-12-04T14:34:54.7880482Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.7880602Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.7880711Z configfile: pytest.ini 2025-12-04T14:34:54.7881300Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.7881533Z collecting ... collected 4932 items / 2367 deselected / 2565 selected 2025-12-04T14:34:54.7881687Z stepcurrent: skipping 2367 already run items. 2025-12-04T14:34:54.7881801Z Running 34 items in this shard 2025-12-04T14:34:54.7881806Z 2025-12-04T14:34:54.7882659Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16_out_dtype_int32_cuda_float16 SKIPPED [0.0236s] (incompatible out dtype) [ 2%] 2025-12-04T14:34:54.7883493Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16_out_dtype_int32_cuda_float32 SKIPPED [0.0016s] (incompatible out dtype) [ 5%] 2025-12-04T14:34:54.7884344Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16x32_out_dtype_int32_cuda_float16 SKIPPED [0.0015s] (incompatible out dtype) [ 8%] 2025-12-04T14:34:54.7885198Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_16x32_out_dtype_int32_cuda_float32 SKIPPED [0.0014s] (incompatible out dtype) [ 11%] 2025-12-04T14:34:54.7886024Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_float16 SKIPPED [0.0014s] (incompatible out dtype) [ 14%] 2025-12-04T14:34:54.7888327Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7888580Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7888685Z ^ 2025-12-04T14:34:54.7888774Z module { 2025-12-04T14:34:54.7893336Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7893503Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7893623Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7893737Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7893892Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7894011Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7894124Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7894242Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7894348Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7894455Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7894569Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7894673Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7894774Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7894896Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7895004Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7895111Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7895231Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7895342Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7895446Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7895564Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7895669Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7895782Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7895890Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7896024Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7896160Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7896265Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7896402Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7896516Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7896618Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7896853Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7897089Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7897197Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7897442Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7897557Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7897660Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7897799Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7898023Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7898237Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7898375Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7898543Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7898734Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7898968Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7899214Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7899406Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7899607Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7899716Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7899856Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7899968Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7900089Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7900228Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7900348Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7900529Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7900720Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7900942Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7901223Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7901401Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7901595Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7901711Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7901845Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7901963Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7902088Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7902193Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7902327Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7902459Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7902584Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7902767Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7902963Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7903218Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7903418Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7903549Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7903683Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7904212Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7904355Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7904480Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7904626Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7904738Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7904901Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7905101Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7905267Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7905519Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7905657Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7905876Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7906025Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7906278Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7906364Z } 2025-12-04T14:34:54.7906512Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7906611Z tt.return 2025-12-04T14:34:54.7906698Z } 2025-12-04T14:34:54.7906781Z } 2025-12-04T14:34:54.7906788Z 2025-12-04T14:34:54.7906880Z {-# 2025-12-04T14:34:54.7906988Z external_resources: { 2025-12-04T14:34:54.7907086Z mlir_reproducer: { 2025-12-04T14:34:54.7913136Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7913289Z disable_threading: false, 2025-12-04T14:34:54.7913386Z verify_each: true 2025-12-04T14:34:54.7913470Z } 2025-12-04T14:34:54.7913564Z } 2025-12-04T14:34:54.7913648Z #-} 2025-12-04T14:34:54.7914363Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7915654Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7915791Z ('RERUN', {'yellow': True}) [3.0272s] [ 17%] 2025-12-04T14:34:54.7917792Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7918040Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7918145Z ^ 2025-12-04T14:34:54.7918233Z module { 2025-12-04T14:34:54.7922848Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7923011Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7923138Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7923248Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7923400Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7923522Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7923631Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7923736Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7923851Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7923961Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7924076Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7924181Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7924285Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7924409Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7924514Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7924655Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7924769Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7924886Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7924987Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7925099Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7925199Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7925303Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7925418Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7925549Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7925686Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7925793Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7925929Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7926047Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7926149Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7926349Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7926560Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7926667Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7926814Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7926926Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7927025Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7927163Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7927387Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7927523Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7927658Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7927823Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7928045Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7928307Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7928552Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7928775Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7928976Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7929088Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7929233Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7929342Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7929466Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7929611Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7929731Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7929908Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7930097Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7930323Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7930573Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7930752Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7930950Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7931069Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7931203Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7931319Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7931441Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7931545Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7931674Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7931808Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7931925Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7932151Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7932342Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7932592Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7932800Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7932930Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7933059Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7933590Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7933736Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7933862Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7933978Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7934091Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7934237Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7934442Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7934582Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7934828Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7934966Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7935188Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7935332Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7935584Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7935711Z } 2025-12-04T14:34:54.7935858Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7935959Z tt.return 2025-12-04T14:34:54.7936045Z } 2025-12-04T14:34:54.7936159Z } 2025-12-04T14:34:54.7936165Z 2025-12-04T14:34:54.7936257Z {-# 2025-12-04T14:34:54.7936362Z external_resources: { 2025-12-04T14:34:54.7936493Z mlir_reproducer: { 2025-12-04T14:34:54.7942670Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7942799Z disable_threading: false, 2025-12-04T14:34:54.7942897Z verify_each: true 2025-12-04T14:34:54.7942983Z } 2025-12-04T14:34:54.7943073Z } 2025-12-04T14:34:54.7943156Z #-} 2025-12-04T14:34:54.7943868Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7945163Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7945334Z ('RERUN', {'yellow': True}) [0.4832s] [ 17%] 2025-12-04T14:34:54.7947331Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.7947576Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.7947677Z ^ 2025-12-04T14:34:54.7947768Z module { 2025-12-04T14:34:54.7952367Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.7952548Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.7952671Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.7952782Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.7952942Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.7953061Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.7953168Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.7953274Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.7953391Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.7953501Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.7953603Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.7953719Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.7953817Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.7953942Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.7954048Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.7954154Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.7954271Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.7954381Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.7954481Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.7954591Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.7954690Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.7954795Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.7954911Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.7955045Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.7955174Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.7955276Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.7955416Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7955527Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.7955666Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.7955865Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.7956069Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.7956176Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.7956304Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.7956424Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.7956524Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.7956652Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.7956877Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.7957009Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7957138Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7957307Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7957500Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7957731Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.7957975Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7958164Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.7958362Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7958473Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.7958611Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.7958722Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.7958843Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.7958984Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7959139Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7959304Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7959545Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7959799Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.7960047Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7960227Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.7960420Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7960540Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.7960674Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.7960784Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.7960922Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.7961030Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.7961167Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.7961301Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.7961431Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.7961623Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.7961818Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.7962070Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7962275Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.7962406Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.7962536Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.7963062Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.7963206Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.7963333Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.7963484Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.7963592Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.7963732Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.7963931Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.7964070Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7964319Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.7964459Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.7964681Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.7964826Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.7965079Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.7965176Z } 2025-12-04T14:34:54.7965329Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.7965426Z tt.return 2025-12-04T14:34:54.7965511Z } 2025-12-04T14:34:54.7965594Z } 2025-12-04T14:34:54.7965600Z 2025-12-04T14:34:54.7965693Z {-# 2025-12-04T14:34:54.7965797Z external_resources: { 2025-12-04T14:34:54.7965895Z mlir_reproducer: { 2025-12-04T14:34:54.7971995Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.7972144Z disable_threading: false, 2025-12-04T14:34:54.7972242Z verify_each: true 2025-12-04T14:34:54.7972330Z } 2025-12-04T14:34:54.7972422Z } 2025-12-04T14:34:54.7972505Z #-} 2025-12-04T14:34:54.7973217Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.7974510Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.7974614Z FAILED [0.4740s] [ 17%] 2025-12-04T14:34:54.7974619Z 2025-12-04T14:34:54.7974764Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.7975301Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.7975420Z Traceback (most recent call last): 2025-12-04T14:34:54.7975802Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7975922Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7976414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7976640Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7977144Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7977264Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7977710Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7977823Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7978341Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7978624Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7979015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7979320Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7979747Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7979991Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7980440Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7980574Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7981078Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7981404Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7981921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7982013Z pm.run(mod) 2025-12-04T14:34:54.7982141Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7982185Z 2025-12-04T14:34:54.7982409Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7983260Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.7983294Z 2025-12-04T14:34:54.7983569Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7984103Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.7984217Z Traceback (most recent call last): 2025-12-04T14:34:54.7984590Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7984708Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7985206Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7985430Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7985899Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7986021Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7986461Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7986574Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7986990Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7987265Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7987656Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7988111Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7988542Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7988851Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7989300Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7989446Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7989937Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7990260Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7990782Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7990876Z pm.run(mod) 2025-12-04T14:34:54.7991005Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7991013Z 2025-12-04T14:34:54.7991239Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.7992056Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.7992064Z 2025-12-04T14:34:54.7992340Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.7992483Z =================================== FAILURES =================================== 2025-12-04T14:34:54.7993010Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.7993135Z Traceback (most recent call last): 2025-12-04T14:34:54.7993502Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.7993631Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.7994125Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.7994383Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.7994898Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.7995055Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.7995512Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.7995626Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.7996036Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.7996329Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.7996705Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.7997003Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.7997445Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.7997679Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.7998132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.7998270Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.7998767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.7999104Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.7999612Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.7999717Z pm.run(mod) 2025-12-04T14:34:54.7999851Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.7999857Z 2025-12-04T14:34:54.8000070Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8000932Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8000938Z 2025-12-04T14:34:54.8001203Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8001830Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-4c8f61ef5d858418.xml - 2025-12-04T14:34:54.8002000Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8002852Z FAILED [0.4740s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8002860Z 2025-12-04T14:34:54.8003075Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8003889Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8003896Z 2025-12-04T14:34:54.8004171Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8004350Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8004573Z ============ 1 failed, 5 skipped, 2367 deselected, 2 rerun in 4.35s ============ 2025-12-04T14:34:54.8004674Z Got exit code 1 2025-12-04T14:34:54.8004777Z Retrying single test... 2025-12-04T14:34:54.8005225Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-c19baf682148544c.xml 2025-12-04T14:34:54.8005390Z ============================= test session starts ============================== 2025-12-04T14:34:54.8005793Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8005910Z cachedir: .pytest_cache 2025-12-04T14:34:54.8006458Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8006609Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8006720Z configfile: pytest.ini 2025-12-04T14:34:54.8007307Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8007551Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.8008459Z stepcurrent: skipping 2372 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8008570Z Running 1 items in this shard 2025-12-04T14:34:54.8008578Z 2025-12-04T14:34:54.8010586Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8010833Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8010937Z ^ 2025-12-04T14:34:54.8011025Z module { 2025-12-04T14:34:54.8015575Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8015727Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8015852Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8015962Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8016115Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8016237Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8016347Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8016462Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8016565Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8016669Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8016784Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8016888Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8016987Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8017107Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8017211Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8017350Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8017460Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8017571Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8017714Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8017820Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8017948Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8018062Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8018265Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8018402Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8018537Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8018642Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8018778Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8018893Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8018995Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8019193Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8019400Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8019509Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8019653Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8019764Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8019870Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8020002Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8020226Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8020360Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8020497Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8020664Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8020866Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8021088Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8021332Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8021563Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8021761Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8021874Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8022017Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8022126Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8022261Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8022396Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8022518Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8022695Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8022887Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8023111Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8023373Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8023557Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8023768Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8023878Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8024013Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8024131Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8024253Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8024362Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8024490Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8024625Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8024749Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8024967Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8025158Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8025448Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8025689Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8025821Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8025956Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8026487Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8026629Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8026755Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8026871Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8026988Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.8027124Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8027326Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8027463Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8027718Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8027858Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8028082Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8028229Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8028486Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8028571Z } 2025-12-04T14:34:54.8028718Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8028819Z tt.return 2025-12-04T14:34:54.8028906Z } 2025-12-04T14:34:54.8028989Z } 2025-12-04T14:34:54.8028994Z 2025-12-04T14:34:54.8029088Z {-# 2025-12-04T14:34:54.8029194Z external_resources: { 2025-12-04T14:34:54.8029325Z mlir_reproducer: { 2025-12-04T14:34:54.8035406Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8035532Z disable_threading: false, 2025-12-04T14:34:54.8035631Z verify_each: true 2025-12-04T14:34:54.8035718Z } 2025-12-04T14:34:54.8035806Z } 2025-12-04T14:34:54.8035892Z #-} 2025-12-04T14:34:54.8036711Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8038113Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8038289Z ('RERUN', {'yellow': True}) [3.0447s] [100%] 2025-12-04T14:34:54.8040349Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8040596Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8040697Z ^ 2025-12-04T14:34:54.8040784Z module { 2025-12-04T14:34:54.8045357Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8045523Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8045642Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8045754Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8045907Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8046026Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8046134Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8046249Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8046356Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8046467Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8046581Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8046684Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8046783Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8046902Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8047010Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8047118Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8047234Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8047349Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8047452Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8047571Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8047670Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8047779Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8047885Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8048017Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8048155Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8048262Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8048403Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8048517Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8048623Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8048853Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8049093Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8049206Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8049376Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8049492Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8049593Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8049727Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8049957Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8050089Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8050225Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8050397Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8050592Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8050829Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8051073Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8051269Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8051474Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8051587Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8051739Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8051850Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8051974Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8052123Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8052248Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8052428Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8052623Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8052846Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8053129Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8053313Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8053510Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8053636Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8053771Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8053895Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8054021Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8054133Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8054269Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8054406Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8054533Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8054715Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8054911Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8055178Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8055378Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8055512Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8055649Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8056175Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8056321Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8056445Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8056594Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8056713Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.8056880Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8057080Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8057244Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8057494Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8057638Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8057862Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8058010Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8058362Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8058449Z } 2025-12-04T14:34:54.8058599Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8058705Z tt.return 2025-12-04T14:34:54.8058790Z } 2025-12-04T14:34:54.8058873Z } 2025-12-04T14:34:54.8058881Z 2025-12-04T14:34:54.8058972Z {-# 2025-12-04T14:34:54.8059079Z external_resources: { 2025-12-04T14:34:54.8059181Z mlir_reproducer: { 2025-12-04T14:34:54.8065269Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8065432Z disable_threading: false, 2025-12-04T14:34:54.8065531Z verify_each: true 2025-12-04T14:34:54.8065617Z } 2025-12-04T14:34:54.8065711Z } 2025-12-04T14:34:54.8065796Z #-} 2025-12-04T14:34:54.8066506Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8067798Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8067934Z ('RERUN', {'yellow': True}) [0.4881s] [100%] 2025-12-04T14:34:54.8069935Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8070184Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8070289Z ^ 2025-12-04T14:34:54.8070380Z module { 2025-12-04T14:34:54.8075043Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8075194Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8075318Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8075428Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8075587Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8075706Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8075814Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8075922Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8076037Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8076148Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8076270Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8076377Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8076488Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8076610Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8076718Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8076858Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8076973Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8077089Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8077196Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8077311Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8077415Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8077516Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8077632Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8077770Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8077906Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8078011Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8078150Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8078263Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8078366Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8078565Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8078773Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8078881Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8079023Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8079137Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8079241Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8079374Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8079602Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8079735Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8079871Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8080038Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8080263Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8080526Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8080767Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8081003Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8081203Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8081317Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8081468Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8081578Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8081701Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8081843Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8081966Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8082148Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8082338Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8082565Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8082817Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8082997Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8083197Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8083317Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8083455Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8083580Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8083705Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8083816Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8083949Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8084084Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8084205Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8084422Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8084617Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8084873Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8085083Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8085217Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8085351Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8085881Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8086026Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8086153Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8086272Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8086382Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.8086522Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8086723Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8086866Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8087117Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8087255Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8087478Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8087627Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8088064Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8088220Z } 2025-12-04T14:34:54.8088368Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8088473Z tt.return 2025-12-04T14:34:54.8088562Z } 2025-12-04T14:34:54.8088682Z } 2025-12-04T14:34:54.8088688Z 2025-12-04T14:34:54.8088786Z {-# 2025-12-04T14:34:54.8088891Z external_resources: { 2025-12-04T14:34:54.8089034Z mlir_reproducer: { 2025-12-04T14:34:54.8095115Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8095244Z disable_threading: false, 2025-12-04T14:34:54.8095342Z verify_each: true 2025-12-04T14:34:54.8095425Z } 2025-12-04T14:34:54.8095520Z } 2025-12-04T14:34:54.8095602Z #-} 2025-12-04T14:34:54.8096311Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8097610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8097749Z FAILED [0.4704s] [100%] 2025-12-04T14:34:54.8097755Z 2025-12-04T14:34:54.8097906Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8098538Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.8098662Z Traceback (most recent call last): 2025-12-04T14:34:54.8099045Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8099164Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8099668Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8099889Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8100365Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8100490Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8100931Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8101047Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8101472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8101753Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8102148Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8102449Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8102877Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8103162Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8103639Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8103825Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8104323Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8104655Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8105178Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8105272Z pm.run(mod) 2025-12-04T14:34:54.8105404Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8105410Z 2025-12-04T14:34:54.8105638Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8106460Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8106468Z 2025-12-04T14:34:54.8106744Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8107273Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.8107393Z Traceback (most recent call last): 2025-12-04T14:34:54.8107770Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8107888Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8108388Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8108605Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8109082Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8109246Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8109687Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8109805Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8110229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8110507Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8110898Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8111201Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8111629Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8111873Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8112324Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8112473Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8112973Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8113301Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8113820Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8113914Z pm.run(mod) 2025-12-04T14:34:54.8114047Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8114063Z 2025-12-04T14:34:54.8114276Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8115155Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8115164Z 2025-12-04T14:34:54.8115439Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8115612Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8116152Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.8116272Z Traceback (most recent call last): 2025-12-04T14:34:54.8116637Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8116763Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8117253Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8117477Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8117954Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8118067Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8118521Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8118635Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8119048Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8119335Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8119714Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8120014Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8120454Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8120688Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8121175Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8121314Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8121811Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8122145Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8122655Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8122755Z pm.run(mod) 2025-12-04T14:34:54.8122886Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8122891Z 2025-12-04T14:34:54.8123108Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8123935Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8123943Z 2025-12-04T14:34:54.8124206Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8124830Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-c19baf682148544c.xml - 2025-12-04T14:34:54.8124999Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8125852Z FAILED [0.4704s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8125859Z 2025-12-04T14:34:54.8126074Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8126969Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8126977Z 2025-12-04T14:34:54.8127248Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8127451Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8127653Z ================= 1 failed, 2400 deselected, 2 rerun in 4.33s ================== 2025-12-04T14:34:54.8127754Z Got exit code 1 2025-12-04T14:34:54.8127859Z Retrying single test... 2025-12-04T14:34:54.8128312Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-a8d85ae14c598d51.xml 2025-12-04T14:34:54.8128468Z ============================= test session starts ============================== 2025-12-04T14:34:54.8128815Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8128929Z cachedir: .pytest_cache 2025-12-04T14:34:54.8129448Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8129573Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8129693Z configfile: pytest.ini 2025-12-04T14:34:54.8130292Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8130535Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.8131448Z stepcurrent: skipping 2372 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8131563Z Running 1 items in this shard 2025-12-04T14:34:54.8131568Z 2025-12-04T14:34:54.8133578Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8133862Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8133969Z ^ 2025-12-04T14:34:54.8134059Z module { 2025-12-04T14:34:54.8138712Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8138876Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8139002Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8139118Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8139332Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8139441Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8139584Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8139701Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8139806Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8139917Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8140034Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8140140Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8140245Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8140366Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8140471Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8140578Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8140691Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8140809Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8140919Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8141030Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8141129Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8141244Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8141353Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8141491Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8141631Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8141735Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8141876Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8141992Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8142095Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8142308Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8142502Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8142613Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8142756Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8142904Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8143010Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8143145Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8143370Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8143506Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8143637Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8143805Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8144012Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8144238Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8144481Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8144673Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8144876Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8144990Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8145130Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8145243Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8145371Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8145507Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8145631Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8145808Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8145998Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8146217Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8146496Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8146706Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8146915Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8147059Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8147197Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8147318Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8147439Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8147548Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8147682Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8147814Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8147937Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8148122Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8148317Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8148580Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8148779Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8148912Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8149042Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8149570Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8149722Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8152177Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8152321Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8152431Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.8152574Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8154576Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8154779Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8155050Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8155200Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8155422Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8155580Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8155831Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8155920Z } 2025-12-04T14:34:54.8156113Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8156207Z tt.return 2025-12-04T14:34:54.8156296Z } 2025-12-04T14:34:54.8156387Z } 2025-12-04T14:34:54.8156392Z 2025-12-04T14:34:54.8156481Z {-# 2025-12-04T14:34:54.8156596Z external_resources: { 2025-12-04T14:34:54.8156696Z mlir_reproducer: { 2025-12-04T14:34:54.8162744Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8162911Z disable_threading: false, 2025-12-04T14:34:54.8163009Z verify_each: true 2025-12-04T14:34:54.8163094Z } 2025-12-04T14:34:54.8163187Z } 2025-12-04T14:34:54.8163268Z #-} 2025-12-04T14:34:54.8163981Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8165283Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8165412Z ('RERUN', {'yellow': True}) [3.0483s] [100%] 2025-12-04T14:34:54.8167413Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8167666Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8167758Z ^ 2025-12-04T14:34:54.8167852Z module { 2025-12-04T14:34:54.8172521Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8172688Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8172810Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8172920Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8173079Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8173196Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8173305Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8173409Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8173527Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8173637Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8173744Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8173855Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8173959Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8174069Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8174181Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8174328Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8174440Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8174557Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8174663Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8174781Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8174883Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8174985Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8175100Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8175239Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8175368Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8175482Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8175620Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8175734Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8175838Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8176035Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8176242Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8176355Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8176487Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8176607Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8176711Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8176835Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8177070Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8177203Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8177340Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8177550Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8177746Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8177976Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8178388Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8178606Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8178813Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8178926Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8179069Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8179179Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8179299Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8179440Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8179565Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8179737Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8179936Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8180161Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8180409Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8180592Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8180788Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8180908Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8181046Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8181156Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8181289Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8181395Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8181521Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8181662Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8181782Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8181998Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8182192Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8182444Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8182650Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8182782Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8182908Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8183450Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8183598Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8183725Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8183840Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8183952Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.8184092Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8184295Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8184426Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8184682Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8184823Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8185048Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8185196Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8185444Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8185573Z } 2025-12-04T14:34:54.8185722Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8185812Z tt.return 2025-12-04T14:34:54.8185908Z } 2025-12-04T14:34:54.8185993Z } 2025-12-04T14:34:54.8185998Z 2025-12-04T14:34:54.8186092Z {-# 2025-12-04T14:34:54.8186228Z external_resources: { 2025-12-04T14:34:54.8186359Z mlir_reproducer: { 2025-12-04T14:34:54.8192628Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8192750Z disable_threading: false, 2025-12-04T14:34:54.8192858Z verify_each: true 2025-12-04T14:34:54.8192944Z } 2025-12-04T14:34:54.8193039Z } 2025-12-04T14:34:54.8193124Z #-} 2025-12-04T14:34:54.8193844Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8195143Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8195342Z ('RERUN', {'yellow': True}) [0.4874s] [100%] 2025-12-04T14:34:54.8197349Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8197596Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8197692Z ^ 2025-12-04T14:34:54.8197791Z module { 2025-12-04T14:34:54.8202437Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8202608Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8202722Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8202839Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8203000Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8203117Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8203225Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8203333Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8203449Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8203557Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8203669Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8203782Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8203885Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8203994Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8204109Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8204218Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8204332Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8204443Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8204548Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8204660Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8204761Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8204864Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8204976Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8205114Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8205239Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8205351Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8205490Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8205595Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8205705Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8205935Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8206140Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8206249Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8206379Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8206504Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8206606Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8206730Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8206968Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8207103Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8207236Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8207403Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8207594Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8207828Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8208073Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8208256Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8208464Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8208578Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8208717Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8214107Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8214296Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8214447Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8214696Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8214878Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8215079Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8215356Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8215631Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8215823Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8216026Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8216141Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8216289Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8216401Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8216528Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8216647Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8216770Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8216905Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8217039Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8217218Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8217418Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8217671Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8217869Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8218009Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8218233Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8218770Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8218927Z %66 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8219046Z %67 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8219222Z %68 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8219332Z %69 = arith.muli %68, %67 : i64 2025-12-04T14:34:54.8219468Z %70 = tt.splat %69 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8219678Z %71 = tt.addptr %50, %70 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8219808Z %72 = tt.load %71 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8220057Z %73 = tt.dot %66, %72, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8220203Z %74 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8220417Z %75 = tt.addptr %arg32, %74 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8220571Z %76 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8220823Z scf.yield %75, %76, %73 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8220909Z } 2025-12-04T14:34:54.8221064Z tt.store %62, %65#2 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8221159Z tt.return 2025-12-04T14:34:54.8221243Z } 2025-12-04T14:34:54.8221331Z } 2025-12-04T14:34:54.8221342Z 2025-12-04T14:34:54.8221427Z {-# 2025-12-04T14:34:54.8221533Z external_resources: { 2025-12-04T14:34:54.8221641Z mlir_reproducer: { 2025-12-04T14:34:54.8227792Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8227953Z disable_threading: false, 2025-12-04T14:34:54.8228052Z verify_each: true 2025-12-04T14:34:54.8228148Z } 2025-12-04T14:34:54.8228232Z } 2025-12-04T14:34:54.8228316Z #-} 2025-12-04T14:34:54.8229046Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8230331Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8230443Z FAILED [0.4778s] [100%] 2025-12-04T14:34:54.8230449Z 2025-12-04T14:34:54.8230589Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8231125Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.8231248Z Traceback (most recent call last): 2025-12-04T14:34:54.8231633Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8231748Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8232246Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8232470Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8232939Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8233085Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8233533Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8233648Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8234065Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8234350Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8234732Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8235040Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8235469Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8235710Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8236160Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8236301Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8236803Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8237133Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8237642Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8237743Z pm.run(mod) 2025-12-04T14:34:54.8237904Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8237911Z 2025-12-04T14:34:54.8238136Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8238982Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8239020Z 2025-12-04T14:34:54.8239287Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8239828Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.8239946Z Traceback (most recent call last): 2025-12-04T14:34:54.8240324Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8240444Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8240937Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8241165Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8241635Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8241757Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8242200Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8242314Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8242730Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8243013Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8243393Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8243700Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8244126Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8244367Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8244846Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8244988Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8245491Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8245818Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8246333Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8246427Z pm.run(mod) 2025-12-04T14:34:54.8246558Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8246565Z 2025-12-04T14:34:54.8246785Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8247609Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8247621Z 2025-12-04T14:34:54.8247894Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8248038Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8248567Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 _ 2025-12-04T14:34:54.8248690Z Traceback (most recent call last): 2025-12-04T14:34:54.8249060Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8249179Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8249714Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8249930Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8250516Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8250657Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8251103Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8251228Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8251641Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8251920Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8252310Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8252613Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8253048Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8253285Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8253735Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8253881Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8254375Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8254708Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8255217Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8255307Z pm.run(mod) 2025-12-04T14:34:54.8255450Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8255456Z 2025-12-04T14:34:54.8255667Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8256533Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8256541Z 2025-12-04T14:34:54.8256805Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8257431Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-a8d85ae14c598d51.xml - 2025-12-04T14:34:54.8257610Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8258541Z FAILED [0.4778s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8258551Z 2025-12-04T14:34:54.8258772Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8259585Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8259597Z 2025-12-04T14:34:54.8259859Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8260045Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8260244Z ================= 1 failed, 2400 deselected, 2 rerun in 4.35s ================== 2025-12-04T14:34:54.8260346Z Got exit code 1 2025-12-04T14:34:54.8261075Z FAILED CONSISTENTLY: test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8 2025-12-04T14:34:54.8261526Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T14:34:54.8261978Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-8c50d88405c7b0b3.xml 2025-12-04T14:34:54.8262141Z ============================= test session starts ============================== 2025-12-04T14:34:54.8262531Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8262666Z cachedir: .pytest_cache 2025-12-04T14:34:54.8263188Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8263319Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8263424Z configfile: pytest.ini 2025-12-04T14:34:54.8264019Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8264257Z collecting ... collected 4932 items / 2373 deselected / 2559 selected 2025-12-04T14:34:54.8264404Z stepcurrent: skipping 2373 already run items. 2025-12-04T14:34:54.8264525Z Running 28 items in this shard 2025-12-04T14:34:54.8264530Z 2025-12-04T14:34:54.8265285Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_float16 PASSED [49.1540s] [ 3%] 2025-12-04T14:34:54.8267334Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8267590Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8267683Z ^ 2025-12-04T14:34:54.8267780Z module { 2025-12-04T14:34:54.8272314Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8272479Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8272588Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8272705Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8272863Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8272970Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8273088Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8273194Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8273306Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8273413Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8273550Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8273664Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8273768Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8273878Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8273993Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8274163Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8274264Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8274386Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8274489Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8274593Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8274705Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8274808Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8274922Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8275059Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8275184Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8275298Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8275437Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8275540Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8275655Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8275853Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8276048Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8276161Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8276293Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8276417Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8276521Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8276642Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8276881Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8277017Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8277143Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8277322Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8277545Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8277782Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8278025Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8278205Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8278413Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8278522Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8278653Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8278774Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8278895Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8279042Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8279167Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8279332Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8279534Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8279760Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8280002Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8280192Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8280396Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8280519Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8280652Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8280765Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8280925Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8281031Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8281149Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8281292Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8281415Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8281619Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8281847Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8282090Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8282292Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8282425Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8282551Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8283089Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8283231Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8283345Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8283470Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8283580Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8283719Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8283920Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8284049Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8284304Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8284443Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8284656Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8284812Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8285059Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8285153Z } 2025-12-04T14:34:54.8285378Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8285516Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8285618Z tt.return 2025-12-04T14:34:54.8285704Z } 2025-12-04T14:34:54.8285786Z } 2025-12-04T14:34:54.8285792Z 2025-12-04T14:34:54.8285882Z {-# 2025-12-04T14:34:54.8285985Z external_resources: { 2025-12-04T14:34:54.8286084Z mlir_reproducer: { 2025-12-04T14:34:54.8292286Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8292413Z disable_threading: false, 2025-12-04T14:34:54.8292510Z verify_each: true 2025-12-04T14:34:54.8292596Z } 2025-12-04T14:34:54.8292686Z } 2025-12-04T14:34:54.8292770Z #-} 2025-12-04T14:34:54.8293550Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8294887Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8295095Z ('RERUN', {'yellow': True}) [0.3047s] [ 7%] 2025-12-04T14:34:54.8297153Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8297401Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8297506Z ^ 2025-12-04T14:34:54.8297597Z module { 2025-12-04T14:34:54.8302246Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8302425Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8302548Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8302655Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8302810Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8302932Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8303043Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8303160Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8303267Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8303378Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8303496Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8303601Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8303705Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8303831Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8303935Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8304043Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8304152Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8304268Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8304371Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8304482Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8304584Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8304700Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8304806Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8304975Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8305117Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8305220Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8305360Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8305502Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8305637Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8305833Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8306037Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8306142Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8306282Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8306396Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8306497Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8306631Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8306856Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8306990Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8307125Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8307295Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8307488Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8307718Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8307958Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8308152Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8308350Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8308459Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8308604Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8308715Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8308838Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8308984Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8309138Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8309318Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8309510Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8309730Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8309977Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8310153Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8310347Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8310464Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8310601Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8310719Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8310840Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8310953Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8311082Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8311219Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8311340Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8311515Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8311703Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8311948Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8312142Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8312275Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8312436Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8312963Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8313139Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8313293Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8313410Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8313525Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8313655Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8313850Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8313989Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8314238Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8314376Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8314604Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8314751Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8315006Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8315093Z } 2025-12-04T14:34:54.8315284Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8315426Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8315517Z tt.return 2025-12-04T14:34:54.8315597Z } 2025-12-04T14:34:54.8315689Z } 2025-12-04T14:34:54.8315695Z 2025-12-04T14:34:54.8315779Z {-# 2025-12-04T14:34:54.8315882Z external_resources: { 2025-12-04T14:34:54.8315988Z mlir_reproducer: { 2025-12-04T14:34:54.8322034Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8322192Z disable_threading: false, 2025-12-04T14:34:54.8322295Z verify_each: true 2025-12-04T14:34:54.8322386Z } 2025-12-04T14:34:54.8322468Z } 2025-12-04T14:34:54.8322550Z #-} 2025-12-04T14:34:54.8323272Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8324552Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8324685Z ('RERUN', {'yellow': True}) [0.5073s] [ 7%] 2025-12-04T14:34:54.8326751Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8327010Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8327133Z ^ 2025-12-04T14:34:54.8327249Z module { 2025-12-04T14:34:54.8331792Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8331916Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8332027Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8332136Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8332300Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8332409Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8332523Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8332659Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8332763Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8332882Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8332991Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8333092Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8333197Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8333306Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8333409Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8333520Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8333619Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8333730Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8333843Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8333945Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8334055Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8334157Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8334258Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8334402Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8334528Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8334634Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8334779Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8334880Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8334983Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8335188Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8335380Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8335493Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8335625Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8335768Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8335880Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8336002Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8336229Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8336413Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8336564Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8336731Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8336925Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8337147Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8337457Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8337740Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8338005Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8338190Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8338324Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8338434Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8338558Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8338694Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8338824Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8338990Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8339176Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8339400Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8339635Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8339814Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8340017Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8340124Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8340264Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8340422Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8340545Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8340666Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8340785Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8340915Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8341043Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8341207Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8341392Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8341639Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8341833Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8341964Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8342085Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8342619Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8342768Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8342881Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8343001Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8343106Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8343235Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8343439Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8343568Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8343861Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8344008Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8344228Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8344416Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8344695Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8344783Z } 2025-12-04T14:34:54.8344987Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8345123Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8345212Z tt.return 2025-12-04T14:34:54.8345304Z } 2025-12-04T14:34:54.8345387Z } 2025-12-04T14:34:54.8345393Z 2025-12-04T14:34:54.8345475Z {-# 2025-12-04T14:34:54.8345588Z external_resources: { 2025-12-04T14:34:54.8345690Z mlir_reproducer: { 2025-12-04T14:34:54.8351754Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8351872Z disable_threading: false, 2025-12-04T14:34:54.8352008Z verify_each: true 2025-12-04T14:34:54.8352090Z } 2025-12-04T14:34:54.8352171Z } 2025-12-04T14:34:54.8352258Z #-} 2025-12-04T14:34:54.8352975Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8354268Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8354365Z FAILED [0.4968s] [ 7%] 2025-12-04T14:34:54.8354370Z 2025-12-04T14:34:54.8354509Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8355088Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8355207Z Traceback (most recent call last): 2025-12-04T14:34:54.8355585Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8355705Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8356199Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8356426Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8356896Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8357004Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8357449Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8357594Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8358017Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8358300Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8358706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8359043Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8359471Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8359711Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8360161Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8360295Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8360800Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8361127Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8361639Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8361744Z pm.run(mod) 2025-12-04T14:34:54.8361871Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8361877Z 2025-12-04T14:34:54.8362099Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8362950Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8362956Z 2025-12-04T14:34:54.8363216Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8363787Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8363905Z Traceback (most recent call last): 2025-12-04T14:34:54.8364308Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8364428Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8364919Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8365143Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8365614Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8365728Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8366168Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8366285Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8366701Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8366979Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8367363Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8367665Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8368091Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8368325Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8368770Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8368905Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8369436Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8369762Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8370311Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8370432Z pm.run(mod) 2025-12-04T14:34:54.8370559Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8370565Z 2025-12-04T14:34:54.8370780Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8371625Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8371631Z 2025-12-04T14:34:54.8371900Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8372048Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8372607Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8372733Z Traceback (most recent call last): 2025-12-04T14:34:54.8373100Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8373219Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8373713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8373930Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8374399Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8374508Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8374950Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8375070Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8375474Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8375788Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8376170Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8376468Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8376898Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8377126Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8377571Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8377713Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8378279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8378616Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8379126Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8379223Z pm.run(mod) 2025-12-04T14:34:54.8379356Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8379362Z 2025-12-04T14:34:54.8379573Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8380430Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8380435Z 2025-12-04T14:34:54.8380750Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8381377Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-8c50d88405c7b0b3.xml - 2025-12-04T14:34:54.8381553Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8382464Z FAILED [0.4968s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8382502Z 2025-12-04T14:34:54.8382720Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8383560Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8383565Z 2025-12-04T14:34:54.8383826Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8384008Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8384223Z ============ 1 failed, 1 passed, 2373 deselected, 2 rerun in 50.80s ============ 2025-12-04T14:34:54.8384327Z Got exit code 1 2025-12-04T14:34:54.8384429Z Retrying single test... 2025-12-04T14:34:54.8384877Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-dad902e1dd428e6d.xml 2025-12-04T14:34:54.8385046Z ============================= test session starts ============================== 2025-12-04T14:34:54.8385397Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8385500Z cachedir: .pytest_cache 2025-12-04T14:34:54.8386023Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8386143Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8386251Z configfile: pytest.ini 2025-12-04T14:34:54.8386841Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8387104Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.8388214Z stepcurrent: skipping 2374 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8388327Z Running 1 items in this shard 2025-12-04T14:34:54.8388333Z 2025-12-04T14:34:54.8390367Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8390610Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8390712Z ^ 2025-12-04T14:34:54.8390800Z module { 2025-12-04T14:34:54.8395466Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8395623Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8395741Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8395849Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8396005Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8396124Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8396233Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8396339Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8396452Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8396559Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8396663Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8396777Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8396880Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8396999Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8397101Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8397206Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8397316Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8397427Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8397528Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8397638Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8397736Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8397839Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8397947Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8398081Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8398211Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8398355Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8398496Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8398607Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8398709Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8398913Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8399109Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8399213Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8399344Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8399459Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8399562Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8399688Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8399912Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8400046Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8400177Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8400346Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8400539Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8400762Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8401000Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8401182Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8401382Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8401490Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8401656Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8401766Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8401886Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8402031Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8402179Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8402375Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8402574Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8402791Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8403033Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8403210Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8403405Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8403524Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8403654Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8403763Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8403896Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8404008Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8404141Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8404275Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8404399Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8404576Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8404764Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8405002Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8405203Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8405337Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8405468Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8406000Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8406176Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8406298Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8406414Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8406520Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8406657Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8406855Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8406991Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8407241Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8407382Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8407602Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8407746Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8407994Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8408086Z } 2025-12-04T14:34:54.8408277Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8408417Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8408506Z tt.return 2025-12-04T14:34:54.8408589Z } 2025-12-04T14:34:54.8408680Z } 2025-12-04T14:34:54.8408686Z 2025-12-04T14:34:54.8408770Z {-# 2025-12-04T14:34:54.8408873Z external_resources: { 2025-12-04T14:34:54.8408980Z mlir_reproducer: { 2025-12-04T14:34:54.8415055Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8415209Z disable_threading: false, 2025-12-04T14:34:54.8415307Z verify_each: true 2025-12-04T14:34:54.8415400Z } 2025-12-04T14:34:54.8415481Z } 2025-12-04T14:34:54.8415562Z #-} 2025-12-04T14:34:54.8416288Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8417563Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8417701Z ('RERUN', {'yellow': True}) [3.1120s] [100%] 2025-12-04T14:34:54.8419795Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8420102Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8420206Z ^ 2025-12-04T14:34:54.8420294Z module { 2025-12-04T14:34:54.8424850Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8424979Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8425087Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8425196Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8425453Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8425560Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8425665Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8425779Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8425914Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8426051Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8426164Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8426267Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8426374Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8426482Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8426585Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8426699Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8426798Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8426908Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8427016Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8427124Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8427223Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8427333Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8427438Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8427573Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8427702Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8427808Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8427948Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8428051Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8428156Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8428358Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8428551Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8428656Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8428790Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8428905Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8429010Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8429133Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8429387Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8429528Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8429647Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8429812Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8430008Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8430226Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8430464Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8430651Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8430855Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8430974Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8431108Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8431221Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8431355Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8431493Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8431615Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8431786Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8431978Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8432203Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8432442Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8432651Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8432852Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8432965Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8433102Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8433246Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8433399Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8433515Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8433635Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8433766Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8433894Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8434062Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8434248Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8434500Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8434695Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8434826Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8434967Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8435499Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8435654Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8435768Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8435885Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8436001Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8436132Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8436333Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8436474Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8436722Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8436871Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8437121Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8437268Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8437527Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8437612Z } 2025-12-04T14:34:54.8437806Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8437952Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8438041Z tt.return 2025-12-04T14:34:54.8438132Z } 2025-12-04T14:34:54.8438214Z } 2025-12-04T14:34:54.8438220Z 2025-12-04T14:34:54.8438303Z {-# 2025-12-04T14:34:54.8438416Z external_resources: { 2025-12-04T14:34:54.8438519Z mlir_reproducer: { 2025-12-04T14:34:54.8444584Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8444701Z disable_threading: false, 2025-12-04T14:34:54.8444806Z verify_each: true 2025-12-04T14:34:54.8444889Z } 2025-12-04T14:34:54.8445030Z } 2025-12-04T14:34:54.8445121Z #-} 2025-12-04T14:34:54.8445832Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8447106Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8447241Z ('RERUN', {'yellow': True}) [0.4958s] [100%] 2025-12-04T14:34:54.8449270Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8449526Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8449622Z ^ 2025-12-04T14:34:54.8449716Z module { 2025-12-04T14:34:54.8454247Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8454409Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8454520Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8454633Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8454791Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8454901Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8455015Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8455121Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8455227Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8455340Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8455446Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8455557Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8455658Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8455768Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8455878Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8455985Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8456114Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8456236Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8456341Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8456444Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8456552Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8456682Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8456829Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8456968Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8457094Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8457203Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8457340Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8457443Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8457552Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8457749Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8457948Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8458067Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8458275Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8458395Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8458509Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8458642Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8458878Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8459013Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8459139Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8459317Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8459509Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8459733Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8459983Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8460164Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8460368Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8460521Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8460660Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8460777Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8460900Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8461034Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8461160Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8461327Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8461525Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8461745Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8461987Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8462171Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8462374Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8462490Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8462631Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8462743Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8462869Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8462981Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8463107Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8463246Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8463367Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8463532Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8463762Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8464004Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8464198Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8464363Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8464517Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8465055Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8465196Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8465308Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8465437Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8465544Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8465679Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8465891Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8466020Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8466277Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8466421Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8466635Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8466792Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8467036Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8467120Z } 2025-12-04T14:34:54.8467320Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8467453Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8467552Z tt.return 2025-12-04T14:34:54.8467634Z } 2025-12-04T14:34:54.8467717Z } 2025-12-04T14:34:54.8467723Z 2025-12-04T14:34:54.8467816Z {-# 2025-12-04T14:34:54.8467918Z external_resources: { 2025-12-04T14:34:54.8468015Z mlir_reproducer: { 2025-12-04T14:34:54.8474066Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8474194Z disable_threading: false, 2025-12-04T14:34:54.8474290Z verify_each: true 2025-12-04T14:34:54.8474370Z } 2025-12-04T14:34:54.8474463Z } 2025-12-04T14:34:54.8474551Z #-} 2025-12-04T14:34:54.8475259Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8476572Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8476669Z FAILED [0.4828s] [100%] 2025-12-04T14:34:54.8476674Z 2025-12-04T14:34:54.8476821Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8477423Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8477570Z Traceback (most recent call last): 2025-12-04T14:34:54.8477948Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8478069Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8478564Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8478782Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8479254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8479371Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8479809Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8479923Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8480341Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8480627Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8481014Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8481315Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8481741Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8481980Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8482428Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8482573Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8483103Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8483433Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8483949Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8484043Z pm.run(mod) 2025-12-04T14:34:54.8484174Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8484180Z 2025-12-04T14:34:54.8484400Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8485254Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8485261Z 2025-12-04T14:34:54.8485528Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8486095Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8486212Z Traceback (most recent call last): 2025-12-04T14:34:54.8486587Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8486703Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8487205Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8487425Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8488048Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8488234Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8488680Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8488796Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8489253Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8489570Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8489956Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8490254Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8490676Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8490922Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8491375Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8491522Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8492020Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8492349Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8492869Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8492963Z pm.run(mod) 2025-12-04T14:34:54.8493094Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8493110Z 2025-12-04T14:34:54.8493325Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8494173Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8494179Z 2025-12-04T14:34:54.8494449Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8494630Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8495197Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8495319Z Traceback (most recent call last): 2025-12-04T14:34:54.8495686Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8495816Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8496305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8496519Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8497003Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8497111Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8497559Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8497681Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8498089Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8498449Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8498824Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8499121Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8499557Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8499823Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8500278Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8500420Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8500953Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8501315Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8501824Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8501924Z pm.run(mod) 2025-12-04T14:34:54.8502052Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8502057Z 2025-12-04T14:34:54.8502270Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8503130Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8503136Z 2025-12-04T14:34:54.8503402Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8504041Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-dad902e1dd428e6d.xml - 2025-12-04T14:34:54.8504212Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8505095Z FAILED [0.4828s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8505101Z 2025-12-04T14:34:54.8505320Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8506166Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8506171Z 2025-12-04T14:34:54.8506437Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8506647Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8506854Z ================= 1 failed, 2400 deselected, 2 rerun in 4.42s ================== 2025-12-04T14:34:54.8506960Z Got exit code 1 2025-12-04T14:34:54.8507063Z Retrying single test... 2025-12-04T14:34:54.8507518Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-d765bceda0ac4437.xml 2025-12-04T14:34:54.8507676Z ============================= test session starts ============================== 2025-12-04T14:34:54.8508023Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8508135Z cachedir: .pytest_cache 2025-12-04T14:34:54.8508657Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8508775Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8508894Z configfile: pytest.ini 2025-12-04T14:34:54.8509485Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8509725Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.8510664Z stepcurrent: skipping 2374 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8510774Z Running 1 items in this shard 2025-12-04T14:34:54.8510779Z 2025-12-04T14:34:54.8512850Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8513132Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8513262Z ^ 2025-12-04T14:34:54.8513352Z module { 2025-12-04T14:34:54.8517921Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8518043Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8518161Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8518270Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8518434Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8518543Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8518682Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8518800Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8518909Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8519017Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8519131Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8519234Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8519337Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8519460Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8519562Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8519669Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8519779Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8519891Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8520007Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8520110Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8520212Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8520323Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8520434Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8520574Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8520710Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8520815Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8520956Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8521065Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8521174Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8521378Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8521571Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8521679Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8521855Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8521968Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8522073Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8522204Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8522466Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8522633Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8522763Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8522935Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8523135Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8523358Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8523597Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8523790Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8523990Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8524102Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8524243Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8524355Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8524490Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8524621Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8524747Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8524924Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8525115Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8525338Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8525583Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8525763Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8525966Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8526108Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8526242Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8526364Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8526488Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8526594Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8526723Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8526856Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8526975Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8527153Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8527342Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8527594Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8527788Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8527918Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8528048Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8528577Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8528725Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8528841Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8528953Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8529068Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8529198Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8529397Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8529563Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8529811Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8529955Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8530204Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8530820Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8531079Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8531166Z } 2025-12-04T14:34:54.8531360Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8531502Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8531594Z tt.return 2025-12-04T14:34:54.8531676Z } 2025-12-04T14:34:54.8531765Z } 2025-12-04T14:34:54.8531771Z 2025-12-04T14:34:54.8531851Z {-# 2025-12-04T14:34:54.8531958Z external_resources: { 2025-12-04T14:34:54.8532065Z mlir_reproducer: { 2025-12-04T14:34:54.8538223Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8538398Z disable_threading: false, 2025-12-04T14:34:54.8538496Z verify_each: true 2025-12-04T14:34:54.8538594Z } 2025-12-04T14:34:54.8538675Z } 2025-12-04T14:34:54.8538756Z #-} 2025-12-04T14:34:54.8539480Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8540769Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8540907Z ('RERUN', {'yellow': True}) [3.0665s] [100%] 2025-12-04T14:34:54.8542939Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8543204Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8543299Z ^ 2025-12-04T14:34:54.8543391Z module { 2025-12-04T14:34:54.8548018Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8548181Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8548297Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8548408Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8548578Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8548694Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8548817Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8548927Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8549032Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8549154Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8549265Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8549368Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8549481Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8549591Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8549696Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8549821Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8549924Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8550048Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8550152Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8550286Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8550402Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8550510Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8550617Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8550767Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8550896Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8551002Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8551151Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8551253Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8551372Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8551572Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8551769Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8551890Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8552020Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8552134Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8552251Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8552376Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8552605Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8552752Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8552875Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8553057Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8553252Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8553474Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8553757Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8553941Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8554141Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8554299Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8554460Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8554583Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8554707Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8554839Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8554974Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8555141Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8555332Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8555567Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8555803Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8555998Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8556200Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8556316Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8556460Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8556573Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8556696Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8556811Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8556932Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8557067Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8557198Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8557366Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8557560Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8557798Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8558028Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8558171Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8558296Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8558823Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8558972Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8559086Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8559209Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8559314Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8559447Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8559651Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8559785Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8560035Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8560185Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8560402Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8560558Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8560804Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8560891Z } 2025-12-04T14:34:54.8561091Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8561228Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8561352Z tt.return 2025-12-04T14:34:54.8561447Z } 2025-12-04T14:34:54.8561531Z } 2025-12-04T14:34:54.8561537Z 2025-12-04T14:34:54.8561620Z {-# 2025-12-04T14:34:54.8561732Z external_resources: { 2025-12-04T14:34:54.8561835Z mlir_reproducer: { 2025-12-04T14:34:54.8567932Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8568081Z disable_threading: false, 2025-12-04T14:34:54.8568188Z verify_each: true 2025-12-04T14:34:54.8568274Z } 2025-12-04T14:34:54.8568357Z } 2025-12-04T14:34:54.8568450Z #-} 2025-12-04T14:34:54.8569162Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8570449Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8570579Z ('RERUN', {'yellow': True}) [0.5041s] [100%] 2025-12-04T14:34:54.8572666Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8572916Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8573009Z ^ 2025-12-04T14:34:54.8573102Z module { 2025-12-04T14:34:54.8577674Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8577814Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8577956Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8578106Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8578330Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8578456Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8578566Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8578678Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8578801Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8578912Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8579021Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8579138Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8579240Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8579353Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8579472Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8579583Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8579699Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8579810Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8579916Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8580031Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8580131Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8580237Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8580354Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8580489Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8580619Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8580734Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8580875Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8580982Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8581094Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8581293Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8581549Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8581660Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8581794Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8581915Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8582019Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8582144Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8582377Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8582512Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8582645Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8582813Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8583009Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8583244Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8583490Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8583674Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8583881Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8583997Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8584139Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8584252Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8584378Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8584518Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8584642Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8584839Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8585040Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8585264Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8585533Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8585754Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8585949Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8586069Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8586205Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8586313Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8586445Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8586551Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8586673Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8586819Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8586941Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8587115Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8587308Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8587546Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8587947Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8588136Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8588263Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8588807Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8588953Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8589086Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8589202Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8589314Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8589594Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8589796Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8589933Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8590198Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8590342Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8590570Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8590716Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8590963Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8591062Z } 2025-12-04T14:34:54.8591256Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8591393Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8591502Z tt.return 2025-12-04T14:34:54.8591593Z } 2025-12-04T14:34:54.8591676Z } 2025-12-04T14:34:54.8591681Z 2025-12-04T14:34:54.8591776Z {-# 2025-12-04T14:34:54.8591885Z external_resources: { 2025-12-04T14:34:54.8591993Z mlir_reproducer: { 2025-12-04T14:34:54.8598109Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8598268Z disable_threading: false, 2025-12-04T14:34:54.8598371Z verify_each: true 2025-12-04T14:34:54.8598459Z } 2025-12-04T14:34:54.8598560Z } 2025-12-04T14:34:54.8598647Z #-} 2025-12-04T14:34:54.8599370Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8600658Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8600762Z FAILED [0.4830s] [100%] 2025-12-04T14:34:54.8600772Z 2025-12-04T14:34:54.8600924Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8601492Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8601626Z Traceback (most recent call last): 2025-12-04T14:34:54.8601997Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8602117Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8602627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8602848Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8603319Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8603481Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8603926Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8604056Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8604466Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8604750Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8605147Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8605450Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8605886Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8606119Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8606566Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8606718Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8607215Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8607541Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8608067Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8608160Z pm.run(mod) 2025-12-04T14:34:54.8608302Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8608308Z 2025-12-04T14:34:54.8608556Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8609408Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8609463Z 2025-12-04T14:34:54.8609759Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8610322Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8610448Z Traceback (most recent call last): 2025-12-04T14:34:54.8610818Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8610933Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8611431Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8611649Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8612130Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8612243Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8612715Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8612877Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8613296Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8613615Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8614016Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8614336Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8614773Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8615034Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8615559Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8615712Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8616218Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8616548Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8617290Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8617398Z pm.run(mod) 2025-12-04T14:34:54.8617532Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8617538Z 2025-12-04T14:34:54.8617765Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8618705Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8618714Z 2025-12-04T14:34:54.8618986Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8619142Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8619707Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8619842Z Traceback (most recent call last): 2025-12-04T14:34:54.8620215Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8620337Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8620897Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8621127Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8621613Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8621759Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8622254Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8622384Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8622797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8623080Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8623475Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8623783Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8624228Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8624469Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8624922Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8625077Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8625574Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8625918Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8626432Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8626530Z pm.run(mod) 2025-12-04T14:34:54.8626674Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8626679Z 2025-12-04T14:34:54.8626899Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8627752Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8627801Z 2025-12-04T14:34:54.8628071Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8628753Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-d765bceda0ac4437.xml - 2025-12-04T14:34:54.8628989Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8629882Z FAILED [0.4830s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8629888Z 2025-12-04T14:34:54.8630118Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8631023Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8631035Z 2025-12-04T14:34:54.8631297Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8631492Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8631697Z ================= 1 failed, 2400 deselected, 2 rerun in 4.39s ================== 2025-12-04T14:34:54.8631809Z Got exit code 1 2025-12-04T14:34:54.8632575Z FAILED CONSISTENTLY: test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8633026Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T14:34:54.8633484Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-42fbcb95d0a0c581.xml 2025-12-04T14:34:54.8633654Z ============================= test session starts ============================== 2025-12-04T14:34:54.8634036Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8634189Z cachedir: .pytest_cache 2025-12-04T14:34:54.8634710Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8634842Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8634951Z configfile: pytest.ini 2025-12-04T14:34:54.8635546Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8635793Z collecting ... collected 4932 items / 2375 deselected / 2557 selected 2025-12-04T14:34:54.8635949Z stepcurrent: skipping 2375 already run items. 2025-12-04T14:34:54.8636063Z Running 26 items in this shard 2025-12-04T14:34:54.8636077Z 2025-12-04T14:34:54.8636914Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16_out_dtype_int32_cuda_int8 SKIPPED [0.0238s] (out dtype not implemented) [ 3%] 2025-12-04T14:34:54.8637671Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16_out_dtype_unspecified_cuda_float32 PASSED [4.4227s] [ 7%] 2025-12-04T14:34:54.8638602Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16_out_dtype_unspecified_cuda_int8 SKIPPED [0.0018s] (bsr_dense_linear does not support int8) [ 11%] 2025-12-04T14:34:54.8639471Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16x32_out_dtype_int32_cuda_float32 SKIPPED [0.0018s] (out dtype not implemented) [ 15%] 2025-12-04T14:34:54.8640496Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_16x32_out_dtype_unspecified_cuda_float32 SKIPPED [0.0015s] (bsr_dense_linear does not support non-square blocks) [ 19%] 2025-12-04T14:34:54.8641384Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_32_out_dtype_int32_cuda_float16 SKIPPED [0.0015s] (out dtype not implemented) [ 23%] 2025-12-04T14:34:54.8642147Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_linear_blocksize_32_out_dtype_unspecified_cuda_float16 PASSED [1.1368s] [ 26%] 2025-12-04T14:34:54.8642970Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16_out_dtype_int32_cuda_float16 SKIPPED [0.0018s] (out dtype not implemented) [ 30%] 2025-12-04T14:34:54.8643776Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16_out_dtype_int32_cuda_int8 SKIPPED [0.0016s] (out dtype not implemented) [ 34%] 2025-12-04T14:34:54.8644506Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16_out_dtype_unspecified_cuda_float32 PASSED [0.9970s] [ 38%] 2025-12-04T14:34:54.8645330Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16x32_out_dtype_int32_cuda_int8 SKIPPED [0.0018s] (out dtype not implemented) [ 42%] 2025-12-04T14:34:54.8646078Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_16x32_out_dtype_unspecified_cuda_float16 PASSED [1.0105s] [ 46%] 2025-12-04T14:34:54.8646909Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_int32_cuda_float16 SKIPPED [0.0018s] (out dtype not implemented) [ 50%] 2025-12-04T14:34:54.8647676Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_float32 PASSED [1.5252s] [ 53%] 2025-12-04T14:34:54.8649716Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8650005Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8650100Z ^ 2025-12-04T14:34:54.8650192Z module { 2025-12-04T14:34:54.8654761Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8654899Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8655052Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8655165Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8655338Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8655453Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8655562Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8655681Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8655785Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8655895Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8656015Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8656121Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8656235Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8656348Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8656458Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8656579Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8656684Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8656802Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8656917Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8657026Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8657131Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8657242Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8657345Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8657495Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8657623Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8657729Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8657878Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8657984Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8658097Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8658423Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8658628Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8658739Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8658921Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8659066Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8659179Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8659309Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8659538Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8659684Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8659812Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8659982Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8660187Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8660415Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8660668Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8660858Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8661060Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8661187Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8661323Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8661436Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8661573Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8661710Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8661837Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8662015Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8662211Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8662436Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8662677Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8662892Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8663103Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8663220Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8663355Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8663480Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8663606Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8663727Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8663851Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8663987Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8664125Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8664297Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8664487Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8664743Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8664940Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8665078Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8665205Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8665740Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8665892Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8666010Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8666161Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8666279Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8666410Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8666619Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8666795Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8667081Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8667230Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8667447Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8667590Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8667845Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8667932Z } 2025-12-04T14:34:54.8668134Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8668274Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8668365Z tt.return 2025-12-04T14:34:54.8668458Z } 2025-12-04T14:34:54.8668545Z } 2025-12-04T14:34:54.8668553Z 2025-12-04T14:34:54.8668636Z {-# 2025-12-04T14:34:54.8668751Z external_resources: { 2025-12-04T14:34:54.8668852Z mlir_reproducer: { 2025-12-04T14:34:54.8674893Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8675044Z disable_threading: false, 2025-12-04T14:34:54.8675150Z verify_each: true 2025-12-04T14:34:54.8675235Z } 2025-12-04T14:34:54.8675320Z } 2025-12-04T14:34:54.8675413Z #-} 2025-12-04T14:34:54.8676125Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8677416Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8677560Z ('RERUN', {'yellow': True}) [0.0760s] [ 57%] 2025-12-04T14:34:54.8679582Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8679843Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8679937Z ^ 2025-12-04T14:34:54.8680035Z module { 2025-12-04T14:34:54.8684646Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8684814Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8684925Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8685044Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8685197Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8685306Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8685425Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8685532Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8685639Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8685755Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8685862Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8685981Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8686086Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8686197Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8686343Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8686455Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8686561Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8686688Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8686797Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8686901Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8687017Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8687120Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8687223Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8687370Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8687497Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8687609Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8687919Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8688023Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8688140Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8688344Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8688544Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8688667Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8688802Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8688929Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8689033Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8689159Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8689399Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8689534Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8689660Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8689902Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8690100Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8690328Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8690622Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8690846Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8691055Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8691170Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8691307Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8691431Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8691555Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8691690Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8691827Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8691995Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8692192Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8692420Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8692663Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8692852Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8693051Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8693162Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8693311Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8693421Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8693553Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8693663Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8693792Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8693934Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8694060Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8694269Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8694470Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8694710Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8694906Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8695043Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8695166Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8695708Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8695857Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8695973Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8696099Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8696213Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8696347Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8696556Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8696688Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8696948Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8697089Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8697307Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8697461Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8697752Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8697841Z } 2025-12-04T14:34:54.8698045Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8698253Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8698358Z tt.return 2025-12-04T14:34:54.8698484Z } 2025-12-04T14:34:54.8698600Z } 2025-12-04T14:34:54.8698607Z 2025-12-04T14:34:54.8698702Z {-# 2025-12-04T14:34:54.8698811Z external_resources: { 2025-12-04T14:34:54.8698933Z mlir_reproducer: { 2025-12-04T14:34:54.8704981Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8705112Z disable_threading: false, 2025-12-04T14:34:54.8705208Z verify_each: true 2025-12-04T14:34:54.8705291Z } 2025-12-04T14:34:54.8705385Z } 2025-12-04T14:34:54.8705467Z #-} 2025-12-04T14:34:54.8706184Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8707522Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8707949Z ('RERUN', {'yellow': True}) [0.2932s] [ 57%] 2025-12-04T14:34:54.8715998Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8716342Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8716440Z ^ 2025-12-04T14:34:54.8716534Z module { 2025-12-04T14:34:54.8721224Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8721400Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8721509Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8721630Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8721793Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8721904Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8722026Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8722133Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8722240Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8722360Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8722470Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8722573Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8722690Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8722801Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8722908Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8723027Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8723131Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8723254Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8723356Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8723461Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8723574Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8723679Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8723786Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8723938Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8724064Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8724172Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8724320Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8724477Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8724595Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8724799Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8724995Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8725117Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8725250Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8725364Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8725479Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8725603Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8725832Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8725979Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8726104Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8726285Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8726482Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8726710Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8727009Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8727195Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8727394Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8727515Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8727650Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8727774Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8727928Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8728064Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8728197Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8728370Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8728603Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8728857Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8729097Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8729285Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8729483Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8729591Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8729733Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8729847Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8729979Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8730089Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8730211Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8730355Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8730480Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8730643Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8730846Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8731091Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8731294Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8731428Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8731551Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8732096Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8732278Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8732394Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8732525Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8732633Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8732775Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8732982Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8733111Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8733371Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8733514Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8733733Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8733897Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8734153Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8734253Z } 2025-12-04T14:34:54.8734449Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8734586Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8734691Z tt.return 2025-12-04T14:34:54.8734777Z } 2025-12-04T14:34:54.8734860Z } 2025-12-04T14:34:54.8734868Z 2025-12-04T14:34:54.8734965Z {-# 2025-12-04T14:34:54.8735068Z external_resources: { 2025-12-04T14:34:54.8735167Z mlir_reproducer: { 2025-12-04T14:34:54.8741433Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8741595Z disable_threading: false, 2025-12-04T14:34:54.8741695Z verify_each: true 2025-12-04T14:34:54.8741780Z } 2025-12-04T14:34:54.8741880Z } 2025-12-04T14:34:54.8741966Z #-} 2025-12-04T14:34:54.8742686Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8743988Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8744090Z FAILED [0.2613s] [ 57%] 2025-12-04T14:34:54.8744097Z 2025-12-04T14:34:54.8744251Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8744806Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8744925Z Traceback (most recent call last): 2025-12-04T14:34:54.8745314Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8745438Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8745926Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8746212Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.8746706Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8746937Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8747410Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8747533Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8747975Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8748094Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8748514Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8748800Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8749184Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8749503Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8749934Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8750181Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8750627Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8750762Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8751299Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8751632Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8752183Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8752310Z pm.run(mod) 2025-12-04T14:34:54.8752443Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8752450Z 2025-12-04T14:34:54.8752677Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8753518Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8753524Z 2025-12-04T14:34:54.8753799Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8754356Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8754476Z Traceback (most recent call last): 2025-12-04T14:34:54.8754869Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8754990Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8755464Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8755696Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.8756188Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8756420Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8756891Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8757008Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8757461Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8757607Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8758022Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8758319Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8758703Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8759016Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8759444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8759676Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8760137Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8760275Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8760783Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8761116Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8761628Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8761731Z pm.run(mod) 2025-12-04T14:34:54.8761862Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8761867Z 2025-12-04T14:34:54.8762082Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8762957Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8762963Z 2025-12-04T14:34:54.8763228Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8763384Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8763963Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8764181Z Traceback (most recent call last): 2025-12-04T14:34:54.8764564Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8764685Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8765170Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8765398Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.8765892Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8766126Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8766601Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8766732Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8767177Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8767296Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8767718Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8768000Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8768383Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8768699Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8769133Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8769413Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8769863Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8770004Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8770510Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8770836Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8771360Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8771454Z pm.run(mod) 2025-12-04T14:34:54.8771586Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8771594Z 2025-12-04T14:34:54.8771820Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8772652Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8772662Z 2025-12-04T14:34:54.8772937Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8773563Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-42fbcb95d0a0c581.xml - 2025-12-04T14:34:54.8773735Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8774618Z FAILED [0.2613s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8774625Z 2025-12-04T14:34:54.8774870Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8775717Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8775724Z 2025-12-04T14:34:54.8776047Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8776226Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8776473Z ====== 1 failed, 5 passed, 9 skipped, 2375 deselected, 2 rerun in 10.12s ======= 2025-12-04T14:34:54.8776572Z Got exit code 1 2025-12-04T14:34:54.8776685Z Retrying single test... 2025-12-04T14:34:54.8777139Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-da54dbb08fbaa628.xml 2025-12-04T14:34:54.8777301Z ============================= test session starts ============================== 2025-12-04T14:34:54.8777662Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8777770Z cachedir: .pytest_cache 2025-12-04T14:34:54.8778376Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8778517Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8778629Z configfile: pytest.ini 2025-12-04T14:34:54.8779234Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8779469Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.8780395Z stepcurrent: skipping 2389 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8780519Z Running 1 items in this shard 2025-12-04T14:34:54.8780527Z 2025-12-04T14:34:54.8782545Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8782844Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8782943Z ^ 2025-12-04T14:34:54.8783035Z module { 2025-12-04T14:34:54.8787624Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8787930Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8788042Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8788162Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8788395Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8788555Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8788677Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8788787Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8788891Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8789015Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8789123Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8789229Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8789343Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8789453Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8789557Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8789682Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8789782Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8789912Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8790017Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8790120Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8790239Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8790344Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8790450Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8790600Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8790727Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8790835Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8790989Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8791093Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8791209Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8791411Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8791612Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8791729Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8791902Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8792015Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8792131Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8792255Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8792486Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8792634Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8792760Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8792938Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8793134Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8793365Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8793615Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8793800Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8794004Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8794133Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8794272Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8794389Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8794514Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8794649Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8794789Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8794959Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8795149Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8795425Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8795673Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8795868Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8796095Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8796239Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8796387Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8796498Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8796623Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8796745Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8796868Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8797003Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8797135Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8797315Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8797517Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8797760Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8797961Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8798110Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8798240Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8798785Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8798930Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8799048Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8799177Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8799294Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8799426Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8799633Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8799797Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8800051Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8800204Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8800424Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8800581Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8800831Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8800919Z } 2025-12-04T14:34:54.8801124Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8801267Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8801358Z tt.return 2025-12-04T14:34:54.8801458Z } 2025-12-04T14:34:54.8801545Z } 2025-12-04T14:34:54.8801550Z 2025-12-04T14:34:54.8801642Z {-# 2025-12-04T14:34:54.8801748Z external_resources: { 2025-12-04T14:34:54.8801850Z mlir_reproducer: { 2025-12-04T14:34:54.8807982Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8808132Z disable_threading: false, 2025-12-04T14:34:54.8808245Z verify_each: true 2025-12-04T14:34:54.8808331Z } 2025-12-04T14:34:54.8808432Z } 2025-12-04T14:34:54.8808514Z #-} 2025-12-04T14:34:54.8809236Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8810535Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8810666Z ('RERUN', {'yellow': True}) [2.8451s] [100%] 2025-12-04T14:34:54.8812705Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8812956Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8813053Z ^ 2025-12-04T14:34:54.8813152Z module { 2025-12-04T14:34:54.8817711Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8817881Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8818003Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8818187Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8818350Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8818475Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8818584Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8818690Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8818809Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8818919Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8819026Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8819140Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8819287Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8819404Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8819522Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8819630Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8819747Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8819889Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8820022Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8820138Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8820243Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8820350Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8820467Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8820606Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8820735Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8820852Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8820991Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8821105Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8821216Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8821416Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8821621Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8821732Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8821870Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8821993Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8822096Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8822220Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8822457Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8822593Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8822727Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8822902Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8823101Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8823333Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8823614Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8823801Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8824011Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8824129Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8824272Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8824389Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8824514Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8824655Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8824782Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8824955Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8825153Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8825378Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8825633Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8825817Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8826014Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8826135Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8826271Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8826379Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8826511Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8826623Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8826743Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8826922Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8827050Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8827226Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8827421Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8827742Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8827949Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8828079Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8828202Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8828745Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8828891Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8829023Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8829142Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8829246Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8829390Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8829592Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8829723Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8829984Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8830121Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8830348Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8830493Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8830744Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8830838Z } 2025-12-04T14:34:54.8831036Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8831173Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8831303Z tt.return 2025-12-04T14:34:54.8831386Z } 2025-12-04T14:34:54.8831480Z } 2025-12-04T14:34:54.8831486Z 2025-12-04T14:34:54.8831570Z {-# 2025-12-04T14:34:54.8831677Z external_resources: { 2025-12-04T14:34:54.8831788Z mlir_reproducer: { 2025-12-04T14:34:54.8837849Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8837975Z disable_threading: false, 2025-12-04T14:34:54.8838076Z verify_each: true 2025-12-04T14:34:54.8838167Z } 2025-12-04T14:34:54.8838251Z } 2025-12-04T14:34:54.8838336Z #-} 2025-12-04T14:34:54.8839062Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8840376Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8840545Z ('RERUN', {'yellow': True}) [0.2673s] [100%] 2025-12-04T14:34:54.8842595Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8842847Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8842948Z ^ 2025-12-04T14:34:54.8843047Z module { 2025-12-04T14:34:54.8847610Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8847775Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8847887Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8847997Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8848162Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8848269Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8848377Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8848491Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8848597Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8848707Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8848826Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8848930Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8849035Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8849158Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8849264Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8849390Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8849490Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8849602Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8849715Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8849817Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8849920Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8850035Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8850141Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8850278Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8850410Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8850541Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8850692Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8850797Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8850899Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8851140Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8851369Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8851478Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8851629Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8851740Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8851843Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8851984Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8852218Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8852366Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8852492Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8852659Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8852866Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8853094Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8853337Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8853532Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8853729Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8853852Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8853984Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8854093Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8854226Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8854360Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8854489Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8854665Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8854883Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8855120Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8855361Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8855542Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8855750Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8855866Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8856003Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8856119Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8856247Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8856356Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8856486Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8856622Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8856757Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8856928Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8857122Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8857366Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8857566Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8857697Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8857836Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8858484Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8858640Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8858756Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8858909Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8859056Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8859217Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8859414Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8859554Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8859803Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8859951Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8860173Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8860321Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8860583Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8860668Z } 2025-12-04T14:34:54.8860863Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8861010Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8861101Z tt.return 2025-12-04T14:34:54.8861194Z } 2025-12-04T14:34:54.8861275Z } 2025-12-04T14:34:54.8861281Z 2025-12-04T14:34:54.8861363Z {-# 2025-12-04T14:34:54.8861476Z external_resources: { 2025-12-04T14:34:54.8861576Z mlir_reproducer: { 2025-12-04T14:34:54.8867650Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8867801Z disable_threading: false, 2025-12-04T14:34:54.8867906Z verify_each: true 2025-12-04T14:34:54.8867993Z } 2025-12-04T14:34:54.8868077Z } 2025-12-04T14:34:54.8868170Z #-} 2025-12-04T14:34:54.8868889Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8870179Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8870293Z FAILED [0.2511s] [100%] 2025-12-04T14:34:54.8870298Z 2025-12-04T14:34:54.8870436Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8871000Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8871121Z Traceback (most recent call last): 2025-12-04T14:34:54.8871494Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8871669Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8872148Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8872383Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.8872915Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8873271Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8873913Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8874030Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8874478Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8874608Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8875030Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8875320Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8875704Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8876012Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8876459Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8876692Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8877149Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8877288Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8877786Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8878121Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8878634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8878793Z pm.run(mod) 2025-12-04T14:34:54.8878930Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8878936Z 2025-12-04T14:34:54.8879150Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8879997Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8880003Z 2025-12-04T14:34:54.8880269Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8880826Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8880950Z Traceback (most recent call last): 2025-12-04T14:34:54.8881316Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8881448Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8881923Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8882145Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.8882642Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8882913Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8883393Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8883505Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8884005Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8884131Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8884544Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8884865Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8885292Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8885592Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8886033Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8886264Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8886710Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8886861Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8887359Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8887860Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8888377Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8888471Z pm.run(mod) 2025-12-04T14:34:54.8888609Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8888614Z 2025-12-04T14:34:54.8888830Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8889679Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8889685Z 2025-12-04T14:34:54.8889954Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8890098Z =================================== FAILURES =================================== 2025-12-04T14:34:54.8890722Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8890846Z Traceback (most recent call last): 2025-12-04T14:34:54.8891227Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8891346Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8891818Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8892051Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.8892542Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.8892762Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.8893245Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.8893361Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.8893810Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.8893926Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.8894336Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.8894625Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.8895009Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.8895318Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.8895788Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.8896021Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.8896482Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.8896698Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.8897198Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.8897539Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.8898052Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.8898222Z pm.run(mod) 2025-12-04T14:34:54.8898363Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8898369Z 2025-12-04T14:34:54.8898588Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8899437Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8899445Z 2025-12-04T14:34:54.8899711Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8900354Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-da54dbb08fbaa628.xml - 2025-12-04T14:34:54.8900524Z =========================== short test summary info ============================ 2025-12-04T14:34:54.8901399Z FAILED [0.2511s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.8901406Z 2025-12-04T14:34:54.8901632Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.8902471Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8902513Z 2025-12-04T14:34:54.8902786Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.8902965Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.8903169Z ================= 1 failed, 2400 deselected, 2 rerun in 3.69s ================== 2025-12-04T14:34:54.8903274Z Got exit code 1 2025-12-04T14:34:54.8903379Z Retrying single test... 2025-12-04T14:34:54.8903832Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-90277c9f3d74bea5.xml 2025-12-04T14:34:54.8903994Z ============================= test session starts ============================== 2025-12-04T14:34:54.8904349Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.8904466Z cachedir: .pytest_cache 2025-12-04T14:34:54.8904991Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.8905110Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.8905227Z configfile: pytest.ini 2025-12-04T14:34:54.8905821Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.8906068Z collecting ... collected 4932 items / 2400 deselected / 2532 selected 2025-12-04T14:34:54.8906994Z stepcurrent: skipping 2389 already run items. Running only test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.8907108Z Running 1 items in this shard 2025-12-04T14:34:54.8907113Z 2025-12-04T14:34:54.8909207Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8909485Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8909587Z ^ 2025-12-04T14:34:54.8909676Z module { 2025-12-04T14:34:54.8914244Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8914370Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8914490Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8914597Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8914813Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8914921Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8915035Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8915150Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8915256Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8915368Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8915488Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8915593Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8915697Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8915814Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8915922Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8916038Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8916139Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8916257Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8916375Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8916478Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8916583Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8916701Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8916809Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8916946Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8917081Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8917188Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8917326Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8917441Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8917546Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8917756Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8917985Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8918095Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8918238Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8918356Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8918459Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8918625Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8918881Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8919029Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8919150Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8919324Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8919527Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8919753Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8919999Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8920187Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8920395Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8920514Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8920666Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8920783Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8920908Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8921058Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8921185Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8921369Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8921560Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8921782Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8922039Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8922223Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8922453Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8922582Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8922721Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8922832Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8922972Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8923083Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8923220Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8923357Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8923479Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8923664Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8923858Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8924103Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8924317Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8924450Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8924594Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8925128Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8925278Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8925406Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8925525Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8925637Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8925780Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8926013Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8926160Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8926413Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8926587Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8926850Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8926999Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8927254Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8927352Z } 2025-12-04T14:34:54.8927547Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8927691Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8927782Z tt.return 2025-12-04T14:34:54.8927871Z } 2025-12-04T14:34:54.8927968Z } 2025-12-04T14:34:54.8927974Z 2025-12-04T14:34:54.8928058Z {-# 2025-12-04T14:34:54.8928163Z external_resources: { 2025-12-04T14:34:54.8928282Z mlir_reproducer: { 2025-12-04T14:34:54.8934359Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8934594Z disable_threading: false, 2025-12-04T14:34:54.8934754Z verify_each: true 2025-12-04T14:34:54.8934894Z } 2025-12-04T14:34:54.8934980Z } 2025-12-04T14:34:54.8935063Z #-} 2025-12-04T14:34:54.8935793Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8937081Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8937220Z ('RERUN', {'yellow': True}) [2.8469s] [100%] 2025-12-04T14:34:54.8939317Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8939586Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8939683Z ^ 2025-12-04T14:34:54.8939777Z module { 2025-12-04T14:34:54.8944475Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8944640Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8944757Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8944868Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8945039Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8945152Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8945264Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8945389Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8945495Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8945619Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8945726Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8945831Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8945950Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8946058Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8946168Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8946291Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8946394Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8946540Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8946658Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8946765Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8946868Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8946988Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8947095Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8947247Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8947381Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8947488Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8947646Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8947753Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8947861Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8948080Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8948280Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8948394Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8948539Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8948655Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8948770Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8948896Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8949126Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8949273Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8949397Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8949568Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8949769Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8950025Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8950284Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8950472Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8950750Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8950877Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8951015Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8951125Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8951261Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8951398Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8951524Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8951706Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8951901Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8952135Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8952386Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8952570Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8952782Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8952894Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8953030Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8953158Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8953282Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8953408Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8953530Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8953663Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8953802Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8953975Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8954166Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8954453Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8954652Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8954804Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8954929Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8955464Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8955619Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8955738Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8955861Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8955979Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8956114Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8956330Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8956466Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8956721Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8956872Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8957093Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8957240Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8957502Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8957593Z } 2025-12-04T14:34:54.8957796Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8957964Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8958058Z tt.return 2025-12-04T14:34:54.8958152Z } 2025-12-04T14:34:54.8958242Z } 2025-12-04T14:34:54.8958249Z 2025-12-04T14:34:54.8958336Z {-# 2025-12-04T14:34:54.8958452Z external_resources: { 2025-12-04T14:34:54.8958582Z mlir_reproducer: { 2025-12-04T14:34:54.8964677Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8964798Z disable_threading: false, 2025-12-04T14:34:54.8964907Z verify_each: true 2025-12-04T14:34:54.8964995Z } 2025-12-04T14:34:54.8965081Z } 2025-12-04T14:34:54.8965176Z #-} 2025-12-04T14:34:54.8965896Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8967191Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8967352Z ('RERUN', {'yellow': True}) [0.2682s] [100%] 2025-12-04T14:34:54.8969374Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2505:26: error: 'arith.extf' op operand #0 must be floating-point-like, but got 'tensor<32x32xi8, #ttg.dot_op<{opIdx = 0, parent = #ttg.blocked<{sizePerThread = [2, 2], threadsPerWarp = [4, 8], warpsPerCTA = [4, 1], order = [1, 0]}>}>>' 2025-12-04T14:34:54.8969636Z values_block, dense_block, allow_tf32=allow_tf32, out_dtype=acc_dtype 2025-12-04T14:34:54.8969733Z ^ 2025-12-04T14:34:54.8969840Z module { 2025-12-04T14:34:54.8974423Z tt.func public @_bsr_strided_addmm_kernel(%arg0: !tt.ptr {tt.divisibility = 16 : i32}, %arg1: i32 {tt.divisibility = 16 : i32}, %arg2: i32 {tt.divisibility = 16 : i32}, %arg3: i32 {tt.divisibility = 16 : i32}, %arg4: !tt.ptr {tt.divisibility = 16 : i32}, %arg5: i32, %arg6: !tt.ptr, %arg7: !tt.ptr {tt.divisibility = 16 : i32}, %arg8: i32 {tt.divisibility = 16 : i32}, %arg9: i32 {tt.divisibility = 16 : i32}, %arg10: i32 {tt.divisibility = 16 : i32}, %arg11: i32 {tt.divisibility = 16 : i32}, %arg12: !tt.ptr {tt.divisibility = 16 : i32}, %arg13: i32 {tt.divisibility = 16 : i32}, %arg14: i32 {tt.divisibility = 16 : i32}, %arg15: i32 {tt.divisibility = 16 : i32}, %arg16: i32 {tt.divisibility = 16 : i32}, %arg17: !tt.ptr {tt.divisibility = 16 : i32}, %arg18: i32 {tt.divisibility = 16 : i32}, %arg19: i32 {tt.divisibility = 16 : i32}, %arg20: i32 {tt.divisibility = 16 : i32}, %arg21: !tt.ptr {tt.divisibility = 16 : i32}, %arg22: i32 {tt.divisibility = 16 : i32}, %arg23: i32 {tt.divisibility = 16 : i32}, %arg24: i32 {tt.divisibility = 16 : i32}, %arg25: !tt.ptr {tt.divisibility = 16 : i32}, %arg26: i32 {tt.divisibility = 16 : i32}, %arg27: i32 {tt.divisibility = 16 : i32}, %arg28: i32 {tt.divisibility = 16 : i32}, %arg29: i32 {tt.divisibility = 16 : i32}, %arg30: i32 {tt.divisibility = 16 : i32}) attributes {noinline = false} { 2025-12-04T14:34:54.8974617Z %c4_i32 = arith.constant 4 : i32 2025-12-04T14:34:54.8974731Z %c1_i64 = arith.constant 1 : i64 2025-12-04T14:34:54.8974853Z %c0_i64 = arith.constant 0 : i64 2025-12-04T14:34:54.8975014Z %cst = arith.constant dense<0> : tensor<32x16xi32> 2025-12-04T14:34:54.8975126Z %c1_i32 = arith.constant 1 : i32 2025-12-04T14:34:54.8975247Z %0 = tt.get_program_id z : i32 2025-12-04T14:34:54.8975359Z %1 = tt.get_program_id x : i32 2025-12-04T14:34:54.8975481Z %2 = tt.get_program_id y : i32 2025-12-04T14:34:54.8975599Z %3 = tt.get_num_programs x : i32 2025-12-04T14:34:54.8975709Z %4 = tt.get_num_programs y : i32 2025-12-04T14:34:54.8975835Z %5 = arith.muli %1, %4 : i32 2025-12-04T14:34:54.8975943Z %6 = arith.addi %5, %2 : i32 2025-12-04T14:34:54.8976055Z %7 = arith.muli %4, %c4_i32 : i32 2025-12-04T14:34:54.8976175Z %8 = arith.divsi %6, %7 : i32 2025-12-04T14:34:54.8976288Z %9 = arith.muli %8, %c4_i32 : i32 2025-12-04T14:34:54.8976394Z %10 = arith.subi %3, %9 : i32 2025-12-04T14:34:54.8976524Z %11 = arith.minsi %10, %c4_i32 : i32 2025-12-04T14:34:54.8976633Z %12 = arith.remsi %6, %7 : i32 2025-12-04T14:34:54.8976741Z %13 = arith.remsi %12, %11 : i32 2025-12-04T14:34:54.8976854Z %14 = arith.addi %9, %13 : i32 2025-12-04T14:34:54.8976961Z %15 = arith.divsi %12, %11 : i32 2025-12-04T14:34:54.8977080Z %16 = arith.muli %arg5, %0 : i32 2025-12-04T14:34:54.8977222Z %17 = tt.addptr %arg4, %16 : !tt.ptr, i32 2025-12-04T14:34:54.8977353Z %18 = tt.addptr %17, %14 : !tt.ptr, i32 2025-12-04T14:34:54.8977472Z %19 = tt.load %18 : !tt.ptr 2025-12-04T14:34:54.8977620Z %20 = tt.addptr %18, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8977726Z %21 = tt.load %20 : !tt.ptr 2025-12-04T14:34:54.8977840Z %22 = arith.subi %21, %19 : i64 2025-12-04T14:34:54.8978075Z %23 = tt.make_range {end = 32 : i32, start = 0 : i32} : tensor<32xi32> 2025-12-04T14:34:54.8978342Z %24 = tt.make_range {end = 16 : i32, start = 0 : i32} : tensor<16xi32> 2025-12-04T14:34:54.8978469Z %25 = arith.muli %arg1, %0 : i32 2025-12-04T14:34:54.8978607Z %26 = tt.addptr %arg0, %25 : !tt.ptr, i32 2025-12-04T14:34:54.8978733Z %27 = arith.extsi %arg2 : i32 to i64 2025-12-04T14:34:54.8978839Z %28 = arith.muli %27, %19 : i64 2025-12-04T14:34:54.8978962Z %29 = tt.addptr %26, %28 : !tt.ptr, i64 2025-12-04T14:34:54.8979199Z %30 = tt.expand_dims %23 {axis = 1 : i32} : tensor<32xi32> -> tensor<32x1xi32> 2025-12-04T14:34:54.8979337Z %31 = tt.splat %arg3 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8979465Z %32 = arith.muli %31, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8979649Z %33 = tt.splat %29 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8979847Z %34 = tt.addptr %33, %32 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8980084Z %35 = tt.expand_dims %23 {axis = 0 : i32} : tensor<32xi32> -> tensor<1x32xi32> 2025-12-04T14:34:54.8980330Z %36 = tt.broadcast %34 : tensor<32x1x!tt.ptr> -> tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8980517Z %37 = tt.broadcast %35 : tensor<1x32xi32> -> tensor<32x32xi32> 2025-12-04T14:34:54.8980732Z %38 = tt.addptr %36, %37 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8980844Z %39 = arith.muli %arg13, %0 : i32 2025-12-04T14:34:54.8980979Z %40 = tt.addptr %arg12, %39 : !tt.ptr, i32 2025-12-04T14:34:54.8981103Z %41 = arith.muli %arg15, %15 : i32 2025-12-04T14:34:54.8981228Z %42 = tt.addptr %40, %41 : !tt.ptr, i32 2025-12-04T14:34:54.8981364Z %43 = tt.splat %arg16 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8981533Z %44 = arith.muli %43, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8981702Z %45 = tt.splat %42 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8981906Z %46 = tt.addptr %45, %44 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8982162Z %47 = tt.expand_dims %24 {axis = 0 : i32} : tensor<16xi32> -> tensor<1x16xi32> 2025-12-04T14:34:54.8982434Z %48 = tt.broadcast %46 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8982624Z %49 = tt.broadcast %47 : tensor<1x16xi32> -> tensor<32x16xi32> 2025-12-04T14:34:54.8982823Z %50 = tt.addptr %48, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8982947Z %51 = arith.muli %arg26, %0 : i32 2025-12-04T14:34:54.8983083Z %52 = tt.addptr %arg25, %51 : !tt.ptr, i32 2025-12-04T14:34:54.8983195Z %53 = arith.muli %arg27, %14 : i32 2025-12-04T14:34:54.8983330Z %54 = tt.addptr %52, %53 : !tt.ptr, i32 2025-12-04T14:34:54.8983443Z %55 = arith.muli %arg28, %15 : i32 2025-12-04T14:34:54.8983566Z %56 = tt.addptr %54, %55 : !tt.ptr, i32 2025-12-04T14:34:54.8983714Z %57 = tt.splat %arg29 : i32 -> tensor<32x1xi32> 2025-12-04T14:34:54.8983837Z %58 = arith.muli %57, %30 : tensor<32x1xi32> 2025-12-04T14:34:54.8984011Z %59 = tt.splat %56 : !tt.ptr -> tensor<32x1x!tt.ptr> 2025-12-04T14:34:54.8984214Z %60 = tt.addptr %59, %58 : tensor<32x1x!tt.ptr>, tensor<32x1xi32> 2025-12-04T14:34:54.8984457Z %61 = tt.broadcast %60 : tensor<32x1x!tt.ptr> -> tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8984766Z %62 = tt.addptr %61, %49 : tensor<32x16x!tt.ptr>, tensor<32x16xi32> 2025-12-04T14:34:54.8984898Z %63 = tt.addptr %arg6, %0 : !tt.ptr, i32 2025-12-04T14:34:54.8985024Z %64 = tt.addptr %63, %19 : !tt.ptr, i64 2025-12-04T14:34:54.8985563Z %65:3 = scf.for %arg31 = %c0_i64 to %22 step %c1_i64 iter_args(%arg32 = %38, %arg33 = %64, %arg34 = %cst) -> (tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32>) : i64 { 2025-12-04T14:34:54.8985708Z %67 = tt.load %arg32 : tensor<32x32x!tt.ptr> 2025-12-04T14:34:54.8985825Z %68 = tt.load %arg33 : !tt.ptr 2025-12-04T14:34:54.8985953Z %69 = arith.extsi %arg14 : i32 to i64 2025-12-04T14:34:54.8986103Z %70 = arith.muli %69, %68 : i64 2025-12-04T14:34:54.8986251Z %71 = tt.splat %70 : i64 -> tensor<32x16xi64> 2025-12-04T14:34:54.8986457Z %72 = tt.addptr %50, %71 : tensor<32x16x!tt.ptr>, tensor<32x16xi64> 2025-12-04T14:34:54.8986589Z %73 = tt.load %72 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8986855Z %74 = tt.dot %67, %73, %arg34 : tensor<32x32xi8> * tensor<32x16xi8> -> tensor<32x16xi32> 2025-12-04T14:34:54.8986998Z %75 = tt.splat %arg2 : i32 -> tensor<32x32xi32> 2025-12-04T14:34:54.8987218Z %76 = tt.addptr %arg32, %75 : tensor<32x32x!tt.ptr>, tensor<32x32xi32> 2025-12-04T14:34:54.8987378Z %77 = tt.addptr %arg33, %c1_i32 : !tt.ptr, i32 2025-12-04T14:34:54.8987631Z scf.yield %76, %77, %74 : tensor<32x32x!tt.ptr>, !tt.ptr, tensor<32x16xi32> 2025-12-04T14:34:54.8987890Z } 2025-12-04T14:34:54.8988087Z %66 = arith.trunci %65#2 : tensor<32x16xi32> to tensor<32x16xi8> 2025-12-04T14:34:54.8988230Z tt.store %62, %66 : tensor<32x16x!tt.ptr> 2025-12-04T14:34:54.8988337Z tt.return 2025-12-04T14:34:54.8988426Z } 2025-12-04T14:34:54.8988515Z } 2025-12-04T14:34:54.8988521Z 2025-12-04T14:34:54.8988624Z {-# 2025-12-04T14:34:54.8988728Z external_resources: { 2025-12-04T14:34:54.8988830Z mlir_reproducer: { 2025-12-04T14:34:54.8995030Z pipeline: "builtin.module(convert-triton-to-tritongpu{enable-source-remat=false num-ctas=1 num-warps=4 target=cuda:75 threads-per-warp=32}, tritongpu-coalesce, triton-nvidia-gpu-plan-cta, tritongpu-remove-layout-conversions, tritongpu-optimize-thread-locality, tritongpu-accelerate-matmul, tritongpu-remove-layout-conversions, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, triton-nvidia-optimize-descriptor-encoding, triton-loop-aware-cse, triton-licm, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true}, triton-loop-aware-cse, tritongpu-prefetch, tritongpu-optimize-dot-operands{hoist-layout-conversion=false}, tritongpu-coalesce-async-copy, triton-nvidia-optimize-tmem-layouts, tritongpu-remove-layout-conversions, triton-nvidia-interleave-tmem, tritongpu-reduce-data-duplication, tritongpu-reorder-instructions, triton-loop-aware-cse, symbol-dce, triton-nvidia-gpu-fence-insertion{compute-capability=75}, triton-nvidia-mma-lowering, sccp, cse, canonicalize{ max-iterations=10 max-num-rewrites=-1 region-simplify=normal test-convergence=false top-down=true})", 2025-12-04T14:34:54.8995197Z disable_threading: false, 2025-12-04T14:34:54.8995300Z verify_each: true 2025-12-04T14:34:54.8995391Z } 2025-12-04T14:34:54.8995493Z } 2025-12-04T14:34:54.8995579Z #-} 2025-12-04T14:34:54.8996303Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: error: Failures have been detected while processing an MLIR pass pipeline 2025-12-04T14:34:54.8997610Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py:2351:0: note: Pipeline failed while executing [`TritonGPUAccelerateMatmul` on 'builtin.module' operation]: reproducer generated at `std::errs, please share the reproducer above with Triton project.` 2025-12-04T14:34:54.8997714Z FAILED [0.2547s] [100%] 2025-12-04T14:34:54.8997719Z 2025-12-04T14:34:54.8997869Z ==================================== RERUNS ==================================== 2025-12-04T14:34:54.8998427Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.8998559Z Traceback (most recent call last): 2025-12-04T14:34:54.8998933Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.8999054Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.8999544Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.8999774Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.9000315Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.9000548Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.9001019Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.9001147Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.9001587Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.9001709Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.9002132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.9002414Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.9002797Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.9003117Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.9003554Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.9003801Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.9004257Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.9004398Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.9004907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.9005269Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.9005801Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.9005903Z pm.run(mod) 2025-12-04T14:34:54.9006035Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.9006069Z 2025-12-04T14:34:54.9006337Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.9007180Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.9007186Z 2025-12-04T14:34:54.9007466Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.9008020Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.9008139Z Traceback (most recent call last): 2025-12-04T14:34:54.9008525Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.9008647Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.9009136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.9009367Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.9009861Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.9010091Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.9010565Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.9010679Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.9011131Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.9011250Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.9011672Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.9011985Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.9012374Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.9012693Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.9013125Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.9013364Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.9013812Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.9013955Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.9014469Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.9014801Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.9015316Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.9015431Z pm.run(mod) 2025-12-04T14:34:54.9015563Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.9015569Z 2025-12-04T14:34:54.9015795Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.9016634Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.9016639Z 2025-12-04T14:34:54.9016905Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.9017090Z =================================== FAILURES =================================== 2025-12-04T14:34:54.9017643Z _ TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 _ 2025-12-04T14:34:54.9017777Z Traceback (most recent call last): 2025-12-04T14:34:54.9018247Z File "/var/lib/jenkins/workspace/test/test_sparse_csr.py", line 4128, in test_triton_kernel 2025-12-04T14:34:54.9018405Z result = operation(*args, **kwargs) 2025-12-04T14:34:54.9018894Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1802, in bsr_dense_mm 2025-12-04T14:34:54.9019126Z return bsr_dense_addmm(out, bsr, dense, alpha=1, beta=0, out=out) 2025-12-04T14:34:54.9019634Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1330, in bsr_dense_addmm 2025-12-04T14:34:54.9019853Z launch_kernel(kernel, tensor_dims_map, full_grid, grid_blocks) 2025-12-04T14:34:54.9020329Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 183, in launch_kernel 2025-12-04T14:34:54.9020458Z kernel(grid, *sliced_tensors) 2025-12-04T14:34:54.9020907Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/sparse/_triton_ops.py", line 1302, in kernel 2025-12-04T14:34:54.9021029Z _bsr_strided_addmm_kernel[grid]( 2025-12-04T14:34:54.9021457Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 419, in 2025-12-04T14:34:54.9021740Z return lambda *args, **kwargs: self.run(grid=grid, warmup=False, *args, **kwargs) 2025-12-04T14:34:54.9022136Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 733, in run 2025-12-04T14:34:54.9022448Z kernel = self._do_compile(key, signature, device, constexprs, options, attrs, warmup) 2025-12-04T14:34:54.9022882Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/runtime/jit.py", line 861, in _do_compile 2025-12-04T14:34:54.9023134Z kernel = self.compile(src, target=target, options=options.__dict__) 2025-12-04T14:34:54.9023589Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/compiler/compiler.py", line 320, in compile 2025-12-04T14:34:54.9023771Z next_module = compile_ir(module, metadata) 2025-12-04T14:34:54.9024279Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 515, in 2025-12-04T14:34:54.9024616Z stages["ttgir"] = lambda src, metadata: self.make_ttgir(src, metadata, options, capability) 2025-12-04T14:34:54.9025144Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/triton/backends/nvidia/compiler.py", line 319, in make_ttgir 2025-12-04T14:34:54.9025244Z pm.run(mod) 2025-12-04T14:34:54.9025378Z RuntimeError: PassManager::run failed 2025-12-04T14:34:54.9025383Z 2025-12-04T14:34:54.9025615Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.9026457Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.9026465Z 2025-12-04T14:34:54.9026747Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.9027371Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-90277c9f3d74bea5.xml - 2025-12-04T14:34:54.9027549Z =========================== short test summary info ============================ 2025-12-04T14:34:54.9028438Z FAILED [0.2547s] test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 - RuntimeError: PassManager::run failed 2025-12-04T14:34:54.9028445Z 2025-12-04T14:34:54.9028660Z To execute this test, run the following from the base repo dir: 2025-12-04T14:34:54.9029542Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_sparse_csr.py TestSparseCompressedTritonKernelsCUDA.test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.9029548Z 2025-12-04T14:34:54.9029818Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T14:34:54.9030050Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T14:34:54.9030285Z ================= 1 failed, 2400 deselected, 2 rerun in 3.70s ================== 2025-12-04T14:34:54.9030386Z Got exit code 1 2025-12-04T14:34:54.9031153Z FAILED CONSISTENTLY: test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8 2025-12-04T14:34:54.9031566Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T14:34:54.9032027Z Test results will be stored in test-reports/python-pytest/test_sparse_csr/test_sparse_csr-559456aeb9be5546.xml 2025-12-04T14:34:54.9032196Z ============================= test session starts ============================== 2025-12-04T14:34:54.9032549Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T14:34:54.9032673Z cachedir: .pytest_cache 2025-12-04T14:34:54.9033201Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T14:34:54.9033326Z rootdir: /var/lib/jenkins/workspace 2025-12-04T14:34:54.9033447Z configfile: pytest.ini 2025-12-04T14:34:54.9034040Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, anyio-4.12.0, typeguard-4.3.0 2025-12-04T14:34:54.9034288Z collecting ... collected 4932 items / 2390 deselected / 2542 selected 2025-12-04T14:34:54.9034438Z stepcurrent: skipping 2390 already run items. 2025-12-04T14:34:54.9034553Z Running 11 items in this shard 2025-12-04T14:34:54.9034559Z 2025-12-04T14:34:54.9035179Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_sampled_addmm_block_size_64_cuda_float16 PASSED [33.8320s] [ 9%] 2025-12-04T14:34:54.9035778Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_sampled_addmm_block_size_64_cuda_float32 PASSED [28.9886s] [ 18%] 2025-12-04T14:34:54.9036492Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_scaled_dot_product_attention_block_size_32_cuda_float16 PASSED [14.7955s] [ 27%] 2025-12-04T14:34:54.9037159Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_scaled_dot_product_attention_block_size_64_cuda_float32 PASSED [17.6467s] [ 36%] 2025-12-04T14:34:54.9037679Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_scatter_mm_cuda_float16 PASSED [0.5437s] [ 45%] 2025-12-04T14:34:54.9038487Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_int32_cuda_float16 SKIPPED [0.0019s] (out dtype not implemented) [ 54%] 2025-12-04T14:34:54.9039278Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_int32_cuda_float32 SKIPPED [0.0016s] (out dtype not implemented) [ 63%] 2025-12-04T14:34:54.9040066Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_int32_cuda_int8 SKIPPED [0.0015s] (out dtype not implemented) [ 72%] 2025-12-04T14:34:54.9040761Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op__int_bsr_dense_addmm_out_dtype_unspecified_cuda_float16 PASSED [5.7650s] [ 81%] 2025-12-04T14:34:54.9041394Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op_bsr_dense_addmm_out_dtype_int32_cuda_int8 PASSED [7.5019s] [ 90%] 2025-12-04T14:34:54.9042062Z test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_tune_op_bsr_dense_addmm_out_dtype_unspecified_cuda_int8 PASSED [8.0606s] [100%] 2025-12-04T14:34:54.9042068Z 2025-12-04T14:34:54.9042741Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-559456aeb9be5546.xml - 2025-12-04T14:34:54.9042987Z ========== 8 passed, 3 skipped, 2390 deselected in 117.48s (0:01:57) =========== 2025-12-04T14:34:54.9046060Z The following tests failed consistently: ['test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op__int_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8', 'test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_int32_cuda_int8', 'test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_addmm_blocksize_32_out_dtype_unspecified_cuda_int8', 'test/test_sparse_csr.py::TestSparseCompressedTritonKernelsCUDA::test_triton_kernel_op_bsr_dense_mm_blocksize_32_out_dtype_unspecified_cuda_int8'] 2025-12-04T14:34:54.9046095Z 2025-12-04T14:34:54.9046538Z FINISHED PRINTING LOG FILE of test_sparse_csr 2/2 (test/test-reports/test_sparse_csr_2.2_9a8f6f98dd0804ce_.log) 2025-12-04T14:34:54.9046544Z 2025-12-04T14:34:54.9046837Z Finished test_sparse_csr 2/2 ... [2025-12-04 14:34:54.331943][19298.453156801], took 13.45min 2025-12-04T14:34:54.9047529Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-bc7545635763bbdc.xml 2025-12-04T14:34:54.9048210Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-fad3091b7781ac2f.xml 2025-12-04T14:34:54.9048928Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-74bf247f22e5b0d6.xml 2025-12-04T14:34:54.9049602Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-4c8f61ef5d858418.xml 2025-12-04T14:34:54.9050273Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-c19baf682148544c.xml 2025-12-04T14:34:54.9050956Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-a8d85ae14c598d51.xml 2025-12-04T14:34:54.9051663Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-8c50d88405c7b0b3.xml 2025-12-04T14:34:54.9052348Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-dad902e1dd428e6d.xml 2025-12-04T14:34:54.9053022Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-d765bceda0ac4437.xml 2025-12-04T14:34:54.9053694Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-42fbcb95d0a0c581.xml 2025-12-04T14:34:54.9054386Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-da54dbb08fbaa628.xml 2025-12-04T14:34:54.9055053Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-90277c9f3d74bea5.xml 2025-12-04T14:34:54.9055740Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-559456aeb9be5546.xml 2025-12-04T14:34:56.0614236Z Uploading logs for 57119749259 to S3 2025-12-04T14:34:56.3735034Z Uploading artifacts took 1.45 seconds 2025-12-04T14:34:56.3735450Z test_sparse_csr 2/2 failed! 2025-12-04T14:34:56.3740142Z Running functorch/test_ops 5/7 ... [2025-12-04 14:34:56.373841][19300.495060647] 2025-12-04T14:34:56.3740676Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T14:34:56.3745241Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_ops.py', '--shard-id=5', '--num-shards=7', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:34:56.374274] 2025-12-04T14:43:16.4421546Z 2025-12-04T14:43:16.4436379Z functorch/test_ops 5/7 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_ops_5.7_4f81116d2780ad42_.log 2025-12-04T14:43:16.5134699Z Running 1517 items in this shard: test/functorch/test_ops.py::TestOperatorsCUDA::test_extremal_numerics_binary_cross_entropy_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_extremal_numerics_log_softmax_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_contiguous_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_copysign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_jiterator_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_lerp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_maximum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_native_dropout_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_binary_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_permute_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_resize_as__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_mm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nonzero_static_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ones_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_positive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_slice_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_shifted_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_uniform_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_zeros_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp___getitem___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_all_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_combinations_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cov_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_matrix_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_huber_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pad_reflect_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_polygamma_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_spherical_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unique_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_vdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_argmax_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_argmin_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_ceil_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_clamp_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_ge_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_ge_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_gt_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_le_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_lt_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_sort_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_flatten_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_mH_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_mT_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_narrow_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_resolve_neg_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_select_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_transpose_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_view_as_complex_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_view_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_all_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_eye_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_max_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_maximum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_pixel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_poisson_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_rms_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ravel_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_legendre_polynomial_p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_true_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_acosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_addmm_decomposed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_any_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_asin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_diff_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_exp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_multinomial_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_padding_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_dropout3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_pixel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_ormqr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scalar_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_laguerre_polynomial_l_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_scaled_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_transpose_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_true_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_zeros_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_addmv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_broadcast_shapes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_int_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_isclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_kron_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_inv_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_min_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_multinomial_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_dropout2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_huber_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_poisson_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_rsub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signbit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmapvmap_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ForwardHasDefaultArgsAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpyCubeNotComposableAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ZeroGradientsGenVmapAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rmul___cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rsub___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__native_batch_norm_legit_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__upsample_bilinear2d_aa_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_acosh_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_amin_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_argmax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atanh_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atleast_1d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bmm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_byte_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_byte_functorch_no_channels_last_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cfloat_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_clone_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_conj_physical_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_constant_pad_nd_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diagflat_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dsplit_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_empty_like_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_erfinv_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_expm1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_hfft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ihfft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_rfft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fill_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_flipud_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_gather_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_grid_sampler_3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_i0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_igamma_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_reduce_amax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isnan_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_binary_return_by_ref_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_cholesky_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eig_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_ldl_factor_ex_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_ldl_solve_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_matrix_rank_hermitian_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_pinv_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_pinv_singular_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_svd_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logaddexp2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logaddexp_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mH_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mT_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_fill_functorch_Scalar_only_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_log_softmax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_sum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_matrix_exp_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_min_binary_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mvlgamma_mvlgamma_p_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nan_to_num_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nanmedian_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_avg_pool2d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_celu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_channel_shuffle_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_no_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_with_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_dropout3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_l1_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_leaky_relu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool3d_grad_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_mish_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multilabel_margin_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_nll_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_constant_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pairwise_distance_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_relu6_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_rrelu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_scaled_dot_product_attention_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_tanhshrink_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_upsample_bilinear_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_norm_fro_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_norm_inf_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_outer_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_permute_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polygamma_polygamma_n_2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_quantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_rand_like_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randint_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_renorm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_repeat_interleave_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_rsub_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_add_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_gaussian_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_kaiser_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sparse_sampled_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sparse_sampled_addmm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_w_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_entr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_k1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_shifted_chebyshev_polynomial_w_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_spherical_bessel_j0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_split_list_args_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_squeeze_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tensordot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tile_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_to_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_to_sparse_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_transpose_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unflatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_uniform_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_var_mean_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_vsplit_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_zero__cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addmv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_all_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_any_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_erf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bool_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cov_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_exp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_lu_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_pinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mvlgamma_mvlgamma_p_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_stride_padding_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_fractional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ravel_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_lerp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_vecdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_min_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_searchsorted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_std_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_unique_consecutive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__segment_reduce_lengths_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_acos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_int_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_ldl_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log10_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_multinomial_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv_transpose2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_dropout2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_dropout3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_gelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_hardtanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_permute_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ravel_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_true_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_float_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_addmv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_bool_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ldexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_log2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logcumsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_resolve_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_searchsorted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_log_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_polygamma_special_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_shifted_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_take_along_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_to_sparse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_vstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_fractional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_silu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_rsub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_modified_bessel_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_combinations_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_contiguous_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_digamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_exp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_int_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_jiterator_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_ldl_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_matrix_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_logical_and_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_lt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_maximum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_permute_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_laguerre_polynomial_l_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_polygamma_special_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_tensor_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_uniform_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unique_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_SelectGenVmapAutogradFunction_cuda_float32 2025-12-04T14:43:16.5828314Z 2025-12-04T14:43:16.5828667Z Finished functorch/test_ops 5/7 ... [2025-12-04 14:43:16.444439][19800.565656517], took 8.33min 2025-12-04T14:43:16.5829871Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/functorch.test_ops/functorch.test_ops-c51c09e8b0651326.xml 2025-12-04T14:43:16.6278329Z Running inductor/test_group_batch_fusion 1/1 ... [2025-12-04 14:43:16.627550][19800.748769011] 2025-12-04T14:43:16.6278935Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T14:43:16.6282112Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_group_batch_fusion.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:43:16.627951] 2025-12-04T14:44:29.5570585Z 2025-12-04T14:44:29.5573460Z inductor/test_group_batch_fusion 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_group_batch_fusion_1.1_8a4a2e0db31add28_.log 2025-12-04T14:44:29.5582436Z Running 13 items in this shard: test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_batch_dropout_pre_grad_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_batch_layer_norm_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_batch_linear_lhs_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_batch_linear_pre_grad_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_gate_fusion_post_grad, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_group_linear_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_group_linear_fusion_different_shapes, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_math_op_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_pointwise_op_fusion, test/inductor/test_group_batch_fusion.py::TestGroupBatchFusion::test_pointwise_op_fusion_post_grad, test/inductor/test_group_batch_fusion.py::TestPostGradBatchLinearFusion::test_batch_linear_post_grad_fusion, test/inductor/test_group_batch_fusion.py::TestFindIndependentSubsetGreedy::test_find_independent_subset_greedy, test/inductor/test_group_batch_fusion.py::TestFindIndependentSubsetGreedy::test_find_independent_subset_greedy_fuse 2025-12-04T14:44:29.5589705Z 2025-12-04T14:44:29.5590106Z Finished inductor/test_group_batch_fusion 1/1 ... [2025-12-04 14:44:29.556838][19873.678058212], took 1.22min 2025-12-04T14:44:29.5876160Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_group_batch_fusion/inductor.test_group_batch_fusion-c943193060216670.xml 2025-12-04T14:44:29.6875223Z Running inductor/test_cpu_repro 2/3 ... [2025-12-04 14:44:29.687291][19873.808511524] 2025-12-04T14:44:29.6875781Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T14:44:29.6879215Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpu_repro.py', '--shard-id=2', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 14:44:29.687669] 2025-12-04T15:00:50.1572789Z 2025-12-04T15:00:50.1574031Z inductor/test_cpu_repro 2/3 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpu_repro_2.3_d08b33d3f2a080e9_.log 2025-12-04T15:00:50.1806400Z Running 285 items in this shard: test/inductor/test_cpu_repro.py::CPUReproTests::test_ModularIndexing_range_issue_103133, test/inductor/test_cpu_repro.py::CPUReproTests::test__adaptive_avg_pool2d, test/inductor/test_cpu_repro.py::CPUReproTests::test_asinh_with_corner_inputs, test/inductor/test_cpu_repro.py::CPUReproTests::test_attention_size_mismatch, test/inductor/test_cpu_repro.py::CPUReproTests::test_auto_simd, test/inductor/test_cpu_repro.py::CPUReproTests::test_bool_max, test/inductor/test_cpu_repro.py::CPUReproTests::test_broadcast_scalar_cpp_tile_2d_kernel, test/inductor/test_cpu_repro.py::CPUReproTests::test_complex_cholesky_mh_view_fallback, test/inductor/test_cpu_repro.py::CPUReproTests::test_concat_inner_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_constant_store, test/inductor/test_cpu_repro.py::CPUReproTests::test_conv2d_packed, test/inductor/test_cpu_repro.py::CPUReproTests::test_conv_stride_constraints, test/inductor/test_cpu_repro.py::CPUReproTests::test_conv_used_from_multiple_places, test/inductor/test_cpu_repro.py::CPUReproTests::test_convert_int64_to_fp32_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_cpp_kernel_profile, test/inductor/test_cpu_repro.py::CPUReproTests::test_decomposed_dequant_relu_quant_int8, test/inductor/test_cpu_repro.py::CPUReproTests::test_dequant_maxpool2d_lowering_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_disabled_amp_is_inference_False, test/inductor/test_cpu_repro.py::CPUReproTests::test_do_not_insert_to_dtype_for_memory_copy_only_kernel, test/inductor/test_cpu_repro.py::CPUReproTests::test_double_reduction_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_eliminate_meaningless_copy, test/inductor/test_cpu_repro.py::CPUReproTests::test_embedding_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_fp8_cast_float16_shape_15,3,13, test/inductor/test_cpu_repro.py::CPUReproTests::test_fp8_cast_float16_shape_4,2048,4096, test/inductor/test_cpu_repro.py::CPUReproTests::test_fp8_cast_float32_shape_15,3,13, test/inductor/test_cpu_repro.py::CPUReproTests::test_frexp, test/inductor/test_cpu_repro.py::CPUReproTests::test_in_out_buffer, test/inductor/test_cpu_repro.py::CPUReproTests::test_index_put, test/inductor/test_cpu_repro.py::CPUReproTests::test_inplace_squeeze_needed, test/inductor/test_cpu_repro.py::CPUReproTests::test_insert_to_dtype_count, test/inductor/test_cpu_repro.py::CPUReproTests::test_int32_reduction_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_int_div, test/inductor/test_cpu_repro.py::CPUReproTests::test_invalid_index_of_empty_tensor, test/inductor/test_cpu_repro.py::CPUReproTests::test_ir_node_str, test/inductor/test_cpu_repro.py::CPUReproTests::test_issue122380, test/inductor/test_cpu_repro.py::CPUReproTests::test_large_mean, test/inductor/test_cpu_repro.py::CPUReproTests::test_linear_used_from_multiple_places, test/inductor/test_cpu_repro.py::CPUReproTests::test_local_buffer_in_outer_loop_fusion, test/inductor/test_cpu_repro.py::CPUReproTests::test_logical_op_store_to_lowp_data_dtype, test/inductor/test_cpu_repro.py::CPUReproTests::test_lowp_fp_neg_abs, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_change_input_sizes_cpu_unbatched_False_input_size_2_hidden_size_5_num_layers_3_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_2_seq_len_3, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_maxpool2d_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_maxpool2d_with_pre_loop_collapse_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_multihead_attention_cpu, test/inductor/test_cpu_repro.py::CPUReproTests::test_nn_param_assign, test/inductor/test_cpu_repro.py::CPUReproTests::test_no_op_squeeze, test/inductor/test_cpu_repro.py::CPUReproTests::test_non_contiguous_load_buf_quant_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_ops_masked_with_bool_input, test/inductor/test_cpu_repro.py::CPUReproTests::test_outer_mean_large_size, test/inductor/test_cpu_repro.py::CPUReproTests::test_pack_padded_sequence_lstm, test/inductor/test_cpu_repro.py::CPUReproTests::test_parallel_num_threads, test/inductor/test_cpu_repro.py::CPUReproTests::test_parallel_reduction_vectorization, test/inductor/test_cpu_repro.py::CPUReproTests::test_per_channel_fake_quant_int8_bf16_input, test/inductor/test_cpu_repro.py::CPUReproTests::test_per_channel_fake_quant_module_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_per_channel_fake_quant_uint8_bf16_input, test/inductor/test_cpu_repro.py::CPUReproTests::test_per_tensor_fake_quant_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_reduce_with_masked, test/inductor/test_cpu_repro.py::CPUReproTests::test_reduction_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_reduction_float_to_int64, test/inductor/test_cpu_repro.py::CPUReproTests::test_redundant_to_node_elimination_lowp_fp, test/inductor/test_cpu_repro.py::CPUReproTests::test_repeat_interleave, test/inductor/test_cpu_repro.py::CPUReproTests::test_scalar_mul_bfloat16, test/inductor/test_cpu_repro.py::CPUReproTests::test_select_tiliing_with_index_expr, test/inductor/test_cpu_repro.py::CPUReproTests::test_share_local_buffers_in_outer_loop_fusion, test/inductor/test_cpu_repro.py::CPUReproTests::test_skip_cpp_codegen, test/inductor/test_cpu_repro.py::CPUReproTests::test_slice_scatter_issue122291, test/inductor/test_cpu_repro.py::CPUReproTests::test_softmax_with_zero_dim, test/inductor/test_cpu_repro.py::CPUReproTests::test_store_reduction, test/inductor/test_cpu_repro.py::CPUReproTests::test_tile2d_load_decomposed_dequant_add_relu_quant_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_torch_linalg_qr_tuple_slice, test/inductor/test_cpu_repro.py::CPUReproTests::test_torch_logit, test/inductor/test_cpu_repro.py::CPUReproTests::test_transpose_copy, test/inductor/test_cpu_repro.py::CPUReproTests::test_transpose_sum2d_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_two_local_buffers_in_outer_loop_fusion_case2, test/inductor/test_cpu_repro.py::CPUReproTests::test_uint32_pointwise_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_bitwise, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_cpu_only_for_all_available_isa, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_dynamic_shapes, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_indirect_load_cse_cache, test/inductor/test_cpu_repro.py::CPUReproTests::test_vector_norm_compile 2025-12-04T15:00:50.2032795Z 2025-12-04T15:00:50.2033160Z Finished inductor/test_cpu_repro 2/3 ... [2025-12-04 15:00:50.157790][20854.27900755], took 16.34min 2025-12-04T15:00:50.2034436Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_cpu_repro/inductor.test_cpu_repro-2a01ea2f1377f4e8.xml 2025-12-04T15:00:51.4384502Z Uploading artifacts took 1.15 seconds 2025-12-04T15:00:51.4388703Z Running inductor/test_smoke 1/1 ... [2025-12-04 15:00:51.438675][20855.559896132] 2025-12-04T15:00:51.4389262Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:00:51.4393202Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_smoke.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:00:51.439092] 2025-12-04T15:01:01.1140107Z 2025-12-04T15:01:01.1141145Z inductor/test_smoke 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_smoke_1.1_b380630b8c684182_.log 2025-12-04T15:01:01.1141941Z 2025-12-04T15:01:01.1142266Z Finished inductor/test_smoke 1/1 ... [2025-12-04 15:01:01.113790][20865.23501065], took 0.16min 2025-12-04T15:01:01.1452181Z Running inductor/test_fused_attention 1/1 ... [2025-12-04 15:01:01.145000][20865.266222292] 2025-12-04T15:01:01.1452788Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:01:01.1456443Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_fused_attention.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:01:01.145376] 2025-12-04T15:04:33.9092230Z 2025-12-04T15:04:33.9095486Z inductor/test_fused_attention 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_fused_attention_1.1_0d21780331237b8e_.log 2025-12-04T15:04:33.9152607Z Running 108 items in this shard: test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_insignificant_strides, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_pattern_fails_with_reuse_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_pattern_fails_with_tensor_factor_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_pattern_fails_with_unsupported_mask_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_prev_13_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_prev_14_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_prev_15_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_10_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_11_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_12_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_13_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_14_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_15_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_17_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_19_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_1_freezing, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_1_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_20_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_21_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_22_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_23_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_24_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_2_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_3_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_4_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_5_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_6_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_7_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_8_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuTests::test_sdpa_rewriter_9_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_insignificant_strides, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_pattern_fails_with_reuse_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_pattern_fails_with_tensor_factor_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_pattern_fails_with_unsupported_mask_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_prev_13_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_prev_14_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_prev_15_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_10_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_11_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_12_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_13_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_14_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_15_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_17_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_19_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_1_freezing, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_1_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_20_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_21_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_22_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_23_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_24_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_2_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_3_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_4_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_5_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_6_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_7_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_8_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterGpuDynamicTests::test_sdpa_rewriter_9_gpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_pattern_fails_with_reuse_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_pattern_fails_with_tensor_factor_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_pattern_fails_with_unsupported_mask_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_prev_13_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_prev_14_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_prev_15_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_11_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_12_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_13_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_14_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_15_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_16_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_16_fp32_mask_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_17_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_18_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_19_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_1_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_20_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_21_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_22_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_23_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_24_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_2_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuTests::test_sdpa_rewriter_5_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_pattern_fails_with_reuse_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_pattern_fails_with_tensor_factor_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_pattern_fails_with_unsupported_mask_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_prev_13_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_prev_14_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_prev_15_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_11_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_12_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_13_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_14_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_15_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_16_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_16_fp32_mask_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_17_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_18_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_19_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_1_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_20_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_21_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_22_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_23_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_24_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_2_cpu, test/inductor/test_fused_attention.py::SDPAPatternRewriterCpuDynamicTests::test_sdpa_rewriter_5_cpu 2025-12-04T15:04:33.9208798Z 2025-12-04T15:04:33.9209193Z Finished inductor/test_fused_attention 1/1 ... [2025-12-04 15:04:33.909191][21078.030405862], took 3.55min 2025-12-04T15:04:33.9407081Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_fused_attention/inductor.test_fused_attention-93034b4b3ea7a646.xml 2025-12-04T15:04:34.0586803Z Running inductor/test_inductor_freezing 1/1 ... [2025-12-04 15:04:34.058319][21078.17953647] 2025-12-04T15:04:34.0587460Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:04:34.0590223Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_freezing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:04:34.058753] 2025-12-04T15:05:40.4700174Z 2025-12-04T15:05:40.4702072Z inductor/test_inductor_freezing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_freezing_1.1_a351f78acb34ecfb_.log 2025-12-04T15:05:40.4724970Z Running 48 items in this shard: test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_aliased_param_return_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_autocast_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_bn_with_multi_bn_share_conv_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_functional_bn_with_multi_bn_share_conv_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_layout_convert_with_view_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_multiple_uses_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_weight_layout_convert_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_with_as_strided_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_cpp_wrapper_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_dont_change_dtype_folding_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_error_on_eager_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_hardswish_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_with_module_sharing_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_functional_bn_with_module_sharing_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_mm_concat_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_mutation_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_param_deallocated_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_redundant_clone_for_layout_convert_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_rng_op_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_static_indices_cudagraph_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_symint_not_folded_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_unequal_bias_horizontal_addmm_fusion_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_unfolded_bn_cpu, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_aliased_param_return_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_autocast_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_bn_with_multi_bn_share_conv_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_functional_bn_with_multi_bn_share_conv_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_layout_convert_with_view_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_multiple_uses_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_weight_layout_convert_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_with_as_strided_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_cpp_wrapper_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_dont_change_dtype_folding_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_error_on_eager_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_bn_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_bn_hardswish_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_bn_with_module_sharing_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_functional_bn_with_module_sharing_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_mm_concat_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_mutation_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_param_deallocated_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_redundant_clone_for_layout_convert_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_rng_op_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_static_indices_cudagraph_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_symint_not_folded_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_unequal_bias_horizontal_addmm_fusion_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_unfolded_bn_cuda 2025-12-04T15:05:40.4747292Z 2025-12-04T15:05:40.4747687Z Finished inductor/test_inductor_freezing 1/1 ... [2025-12-04 15:05:40.469853][21144.591073609], took 1.11min 2025-12-04T15:05:40.5013301Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_inductor_freezing/inductor.test_inductor_freezing-a1df3eccdba273d4.xml 2025-12-04T15:05:40.5780199Z Running dynamo/test_higher_order_ops 1/1 ... [2025-12-04 15:05:40.577739][21144.698958921] 2025-12-04T15:05:40.5780782Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:05:40.5784020Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_higher_order_ops.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:05:40.578128] 2025-12-04T15:06:42.6812370Z 2025-12-04T15:06:42.6813433Z dynamo/test_higher_order_ops 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_higher_order_ops_1.1_d050168149600c3d_.log 2025-12-04T15:06:42.6927038Z Running 232 items in this shard: test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_access_module_attr, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_allow_python_side_effects_utility, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_constants, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_global_num, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_global_num_adds_guard, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_input_num, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_numpy_number, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_tracked, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_tracked_nested, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_untracked_global, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_untracked_global_nested, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_untracked_nonlocal, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_capture_value_created_in_subgraph, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_concat_unbacked_shape_tensor, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_branches_no_arguments, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_branches_no_arguments_no_closure, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_free_variable_in_both_branches, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_graph_break_in_one_branch, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_pytree_operands, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_pytree_operands_with_non_tensor_leaves, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_side_effect_in_one_branches, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_source_fn_stack, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_subgraph_name_is_valid, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_with_constant_pred, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_cond_with_empty_operands, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_dynamic_shapes_over_vmap_batch_size, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_enum_arg, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_error_message_sane, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_fallback_on_graph_break_complicated, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_fallback_on_graph_break_simple, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_flat_list_output, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_fn_with_kwargs_in_torch_ops, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_freevars_as_inputs_to_wrap, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_grad_source_fn_stack, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_hints_wrapper, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_hints_wrapper_incorrect_type, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_hints_wrapper_no_hints, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_hints_wrapper_pytree_inputs, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_hooks, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_hopify_generic_wrap, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_inlined_functions, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_internal_nonlocal, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_lift_tensor_constant, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_lift_tensors_with_compound_expressions, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_lift_tensors_with_shared_symbols, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_make_closure, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_example_value_metadata_consistent_with_eager, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_graph_break, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_kwargs, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_lowers_to_graph, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_multi_return, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_pytree_return, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_side_effect, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_source_fn_stack, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_subgraph_name_is_valid, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_map_symint_input, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_modules, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_nested_tuple_output, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_nested_wrap, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_no_freevars, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_output_with_dict, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_register_mode, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_register_subclass, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_return_captured_var, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_return_captured_var_used_multiple_times, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_return_captured_vars, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_same_freevar_twice, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_del_existing_attr_global_module, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_del_existing_attr_global_obj, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_del_existing_attr_nonlocal_module, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_del_existing_attr_nonlocal_obj, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_in_body, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_local_list_append_no_graph_break, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_global_list, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_global_num, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_global_num_builtin, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_global_tensor, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_global_tensor_builtin, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_nonlocal_num, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_nonlocal_num_builtin, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_nonlocal_tensor, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_mutate_nonlocal_tensor_builtin, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_nested_nonlocal_list_append_graph_break, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_nonlocal_list_append_graph_break, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_existing_attr_global_module, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_existing_attr_global_obj, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_existing_attr_nonlocal_module, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_existing_attr_nonlocal_obj, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_new_attr_global_module, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_new_attr_global_obj, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_new_attr_nonlocal_module, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_side_effect_set_new_attr_nonlocal_obj, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_support_float_in_output, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_symint_in_slice, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_symint_input, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_tensor_and_unbacked_symbol_closure, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_tensor_to_list_closure, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_tensor_with_unbacked_shape_closure, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_unbacked_symbol_closure, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_vmap_multiply_scalar, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_vmap_source_fn_stack, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_all_kwarg, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_allow_local_assign_in_body_fn, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_inductor_compiled_regions_option, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_inductor_compiled_regions_with_backward, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg_default, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg_default_else_branch, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg_default_if_branch, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg_int, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg_only, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_kwarg_recompile, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_pytree_args_nested, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_pytree_args_not_const_symint_tensor, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_pytree_args_with_symint_constant, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_pytree_kwargs, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_source_fn_stack, test/dynamo/test_higher_order_ops.py::HigherOrderOpTests::test_wrap_subgraph_name_is_valid, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_dual_level_guard, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_emit_functorch_guard_if_active, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_grad_guard_fail, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_jvp_guard_fail, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_linearize_recompiles, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_vmap_grad_guard_ok, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_vmap_grad_vmap_guard_fail, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_vmap_guard_fail, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_vmap_guard_fail_different_state, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_vmap_guard_ok, test/dynamo/test_higher_order_ops.py::HigherOrderOpVmapGuardTests::test_vmap_recompile_different_states, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_functional_call, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_functional_call_disable_inline_nn_module, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_functional_call_sequential_params_and_buffers, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_call_compiled_backward_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_call_torch_compile_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_capture_tensor, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_closure_scalar, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_fn_with_kwargs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_freevar_python_scalar, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_freevar_tensor, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_non_tensor_input, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_over_grad, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_pytree, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_recompile, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_two_tensor_all_grad_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_two_tensor_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_with_graph_break, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_grad_with_side_effect, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_hessian, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_hessian_argnums, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacfwd, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacfwd_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacfwd_randomness, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacfwd_two_tensors_argnums, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacrev, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacrev_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jacrev_two_tensors_argnums, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_call_torch_compile_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_freevar_python_scalar, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_freevar_tensor, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_jvp, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_simple, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_two_tensors_disable_enable_disable_grad, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_two_tensors_disable_grad, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_jvp_two_tensors_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_linearize_jvp_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_teardown_resets_nested_graph_breaks, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vjp, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vjp_call_compiled_backward_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vjp_has_aux, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vjp_multiple_outputs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vjp_multiple_outputs_python_struct, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_call_compiled_backward_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_call_torch_compile_fn, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_free_const, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_free_tensor, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_get_wrapped, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_kwargs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_multiple_invocation_in_dims, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_multiple_invocation_out_dims, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_multiple_outputs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_multiple_outputs_diff_dims, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_multiple_outputs_out_dims_tuple, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_new_tensor_implicit_via_op, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_new_tensor_in_body, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_new_tensor_unused_in_body, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_out_dims_None, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_over_vmap_captured, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_over_vmap_two_inputs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_previous_illegal_op_no_graph_break, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_pytree_inputs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_recompile, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_recompile_different_config, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_recompile_same_config, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_recompile_with_randomness, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_side_effects, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_side_effects_append_input, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_two_inputs, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_two_inputs_tuple_in_dims, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_with_conditional_graph_break, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_with_graph_break, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_with_graph_break_2, test/dynamo/test_higher_order_ops.py::FuncTorchHigherOrderOpTests::test_vmap_with_graph_break_lambda, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_cond_with_invalid_kwargs, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_cond_with_kwargs, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_cond_with_mismatched_output, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_dropout, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_dropout_inductor, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_fallback, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_flop_counter_for_cond, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_flop_counter_for_cond_unbalanced_branches, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_flop_counter_for_nested_cond, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_function, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_function_with_kwargs, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_module, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_non_aliasing_util, test/dynamo/test_higher_order_ops.py::ActivationCheckpointingTests::test_override_fallthrough_dispatch_key, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_aot_eager_auto_functionalize_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_aot_eager_cond_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_aot_eager_invoke_quant_packed_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_aot_eager_invoke_quant_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_aot_eager_invoke_subgraph_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_aot_eager_while_loop_stack_output_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_inductor_auto_functionalize_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_inductor_cond_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_inductor_invoke_quant_packed_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_inductor_invoke_quant_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_inductor_invoke_subgraph_simple_cuda_float32, test/dynamo/test_higher_order_ops.py::TestHigherOrderOpsOpInfoCUDA::test_hops_compile_backend_inductor_while_loop_stack_output_simple_cuda_float32 2025-12-04T15:06:42.7037179Z 2025-12-04T15:06:42.7037561Z Finished dynamo/test_higher_order_ops 1/1 ... [2025-12-04 15:06:42.681478][21206.802696309], took 1.04min 2025-12-04T15:06:42.7135425Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_higher_order_ops/dynamo.test_higher_order_ops-53ad11142cf6dd1d.xml 2025-12-04T15:06:42.8093180Z Running inductor/test_aot_inductor_arrayref 1/2 ... [2025-12-04 15:06:42.809096][21206.93031559] 2025-12-04T15:06:42.8093797Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:06:42.8097452Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor_arrayref.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:06:42.809484] 2025-12-04T15:15:00.9187234Z 2025-12-04T15:15:00.9189157Z inductor/test_aot_inductor_arrayref 1/2 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_arrayref_1.2_d25289a5cf5642e3_.log 2025-12-04T15:15:00.9327707Z Running 159 items in this shard: test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_2_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_2_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_1_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_amp_fallback_random_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_constant_tensor_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_constant_tensor_name_collision_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_debug_printer_codegen_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_debug_printer_fp8_dtype_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_debug_printer_sym_inputs_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_debug_printer_user_defined_triton_kernel_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_debug_printing_model_inputs_codegen_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_profiler_enable_kernel_profile_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_profiler_enable_kernel_profile_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_aoti_runtime_asserts_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_autotune_int64_user_defined_triton_kernel_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_backward_no_op_logging_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_bool_input_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_1_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_2_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_3_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_reuse_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_clamp_decomposition_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_codegen_int_array_var_fix_memory_leak_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_composed_dynamic_size_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_cpu_predicate_cuda_operands_max_autotune_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_mismatched_branch_output_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_nested_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_non_tensor_predicates_dynamic_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_share_predicate_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_symint_input_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_unbacked_symint_closure_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_use_buffers_from_outer_scope_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_with_reinterpret_view_inputs_outputs_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_consecutive_compiles_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_constant_folding_with_update_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_conv3d_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_convolution_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_custom_op_in_subgraph_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_d2h_copy_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_deconv_freezing_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_device_moved_constant_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_duplicated_params_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_dynamic_scalar_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_dynamic_smem_above_default_limit_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_empty_cat_dtype_promotion_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_empty_constant_folding_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fake_tensor_device_validation_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fallback_mem_leak_fix_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fill__fallback_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_foreach_multiple_dynamic_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fqn_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_free_inactive_buffer_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_index_put_fallback_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_large_dynamic_dim_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_large_grid_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_large_mmaped_weights_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_libtorch_free_so_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_linear_dynamic_maxautotune_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_missing_cubin_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_missing_output_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_mixed_device_1_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_model_modified_weights_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_multi_device_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_nan_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_nested_tensor_from_jagged_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_non_default_gpu_device_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_none_args_aot_codegen_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_normal_functional_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_on_gpu_device1_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_output_misaligned_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_output_path_1_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_output_path_2_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_poi_multiple_dynamic_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_profile_benchmark_harness_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_proxy_executor_permute_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_quanatized_int8_linear_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_quantized_linear_bias_none_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_repeated_user_defined_triton_kernel_embed_kernel_binary_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_replace_unbacked_symbol_with_backed_expr_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_replicate_on_devices_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_return_view_constant_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_reuse_kernel_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_complex_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_dtype_failed_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_fp8_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_large_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_shape_failed_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_same_backing_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scaled_grouped_mm_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scatter_fallback_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scatter_reduce_fallback_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_sdpa_2_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_sdpa_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_seq_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_simple_dynamic_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_simple_embed_kernel_binary_False_max_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_simple_embed_kernel_binary_True_max_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_simple_embed_kernel_binary_True_max_autotune_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_size_from_multi_output_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_size_with_unbacked_add_expr_transitive_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_stride_with_unbacked_expr_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_subclasses_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_sym_i64_input_codegen_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_symint_item_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_sympy_cpp_printer_min_max_minmax0_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_autotuning_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_bool_param_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_dynamic_grid_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_equal_to_1_float_arg_dynamic_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_equal_to_1_float_arg_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_extern_kernel_arg_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_on_device_tma_dynamic_False_tma_version_old_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_on_device_tma_dynamic_True_tma_version_new_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_reinterpret_view_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_sympy_expr_arg_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_new_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_old_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_new_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_new_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_old_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_new_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_old_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_with_none_inputs_and_equal_to_1_arg_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_mutated_autotuning_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_unbacked_expr_replacements_shift_k_0_use_static_size_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_unbacked_expr_replacements_shift_k_2_use_static_size_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_unbounded_expr_substitutions_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_update_inactive_constant_buffer_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_using_model_name_for_files_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_weight_on_disk_legacy_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_conv_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_mixed_device_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_outer_buffers_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_pytree_inputs_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_sym_expr_cond_dynamic_False_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_sym_expr_cond_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_while_loop_with_unbacked_symint_closure_dynamic_True_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_zero_grid_with_unbacked_symbols_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_zero_size_buffer_cpu_with_stack_allocation, test/inductor/test_aot_inductor_arrayref.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_zero_size_weight_cpu_with_stack_allocation 2025-12-04T15:15:00.9464883Z 2025-12-04T15:15:00.9465303Z Finished inductor/test_aot_inductor_arrayref 1/2 ... [2025-12-04 15:15:00.918955][21705.040173697], took 8.30min 2025-12-04T15:15:00.9506063Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor_arrayref/inductor.test_aot_inductor_arrayref-b86518472a5506a3.xml 2025-12-04T15:15:01.0481428Z Running inductor/test_triton_cpu_backend 1/1 ... [2025-12-04 15:15:01.047811][21705.16902901] 2025-12-04T15:15:01.0482084Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:15:01.0484668Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_triton_cpu_backend.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:15:01.048216] 2025-12-04T15:15:12.8405523Z 2025-12-04T15:15:12.8406680Z inductor/test_triton_cpu_backend 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_triton_cpu_backend_1.1_6ce1ac8654c4e721_.log 2025-12-04T15:15:12.8407597Z 2025-12-04T15:15:12.8408009Z Finished inductor/test_triton_cpu_backend 1/1 ... [2025-12-04 15:15:12.840321][21716.961541561], took 0.20min 2025-12-04T15:15:12.8720213Z Running inductor/test_autoheuristic 1/1 ... [2025-12-04 15:15:12.871801][21716.993023013] 2025-12-04T15:15:12.8720891Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:15:12.8724231Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_autoheuristic.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:15:12.872183] 2025-12-04T15:15:22.8975728Z 2025-12-04T15:15:22.8977143Z inductor/test_autoheuristic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_autoheuristic_1.1_b0f24ef6b05122fc_.log 2025-12-04T15:15:22.8978149Z Running 0 items in this shard: 2025-12-04T15:15:22.8978431Z 2025-12-04T15:15:22.8978825Z Finished inductor/test_autoheuristic 1/1 ... [2025-12-04 15:15:22.897340][21727.018561207], took 0.17min 2025-12-04T15:15:22.9288665Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_autoheuristic/inductor.test_autoheuristic-96f2d55568fe5d93.xml 2025-12-04T15:15:23.0062396Z Running inductor/test_flex_flash 1/1 ... [2025-12-04 15:15:23.005941][21727.127160448] 2025-12-04T15:15:23.0062982Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:15:23.0066211Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_flash.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:15:23.006330] 2025-12-04T15:15:31.2319756Z 2025-12-04T15:15:31.2320752Z inductor/test_flex_flash 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_flash_1.1_f4efdeec9f55a014_.log 2025-12-04T15:15:31.2353622Z Running 58 items in this shard: test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_kernel_called_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_kernel_called_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_mask_mod_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_mask_mod_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_capture_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_capture_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_backward_rejects_score_mod_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_basic_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_basic_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_block_mask_with_score_mod_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_block_mask_with_score_mod_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_kernel_called_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_kernel_called_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_dual_buffers_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_dual_buffers_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_view_buffer_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_mask_mod_with_view_buffer_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_score_mod_with_many_buffer_indexing_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_score_mod_with_many_buffer_indexing_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_127_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_127_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_255_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_255_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_383_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_383_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_511_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_unfriendly_seqlen_with_causal_seq_len_511_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_alibi_learned_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_alibi_learned_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_head_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_batch_head_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_block_mask_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_block_mask_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_doc_mask_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_doc_mask_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_dual_buffer_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_dual_buffer_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_head_scale_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_head_scale_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_mask_mod_buffer_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_mask_mod_buffer_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_pos_bias_table_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_pos_bias_table_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_and_mask_buffers_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_and_mask_buffers_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_causal_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_causal_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_rel_bias_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_rel_bias_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_times_two_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_mod_times_two_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_view_buffer_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_attention_with_score_view_buffer_cuda_float16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_impl_error_with_requires_grad_cuda_bfloat16, test/inductor/test_flex_flash.py::TestFlexFlashCUDA::test_flash_impl_error_with_requires_grad_cuda_float16 2025-12-04T15:15:31.2385739Z 2025-12-04T15:15:31.2386093Z Finished inductor/test_flex_flash 1/1 ... [2025-12-04 15:15:31.231841][21735.353059874], took 0.14min 2025-12-04T15:15:31.2635316Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_flex_flash/inductor.test_flex_flash-19911cd6feadbffe.xml 2025-12-04T15:15:31.3461874Z Running inductor/test_segmented_tree 1/1 ... [2025-12-04 15:15:31.345928][21735.467148062] 2025-12-04T15:15:31.3462514Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:15:31.3465542Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_segmented_tree.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:15:31.346307] 2025-12-04T15:15:37.0183539Z 2025-12-04T15:15:37.0184884Z inductor/test_segmented_tree 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_segmented_tree_1.1_902748e42a4b3c21_.log 2025-12-04T15:15:37.0191091Z Running 12 items in this shard: test/inductor/test_segmented_tree.py::TestSegmentedTree::test_basic_construction, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_boundary_conditions, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_empty_array, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_full_array_range, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_invalid_ranges, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_max_query_matches_naive, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_multiple_operations, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_out_of_bounds, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_overlapping_updates, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_range_update, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_sequential_updates_and_queries, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_single_element_ranges 2025-12-04T15:15:37.0196287Z 2025-12-04T15:15:37.0196672Z Finished inductor/test_segmented_tree 1/1 ... [2025-12-04 15:15:37.018160][21741.139380132], took 0.09min 2025-12-04T15:15:37.0502539Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_segmented_tree/inductor.test_segmented_tree-1f8a73d877a0e0af.xml 2025-12-04T15:15:37.0844286Z Running inductor/test_kernel_optimization 1/1 ... [2025-12-04 15:15:37.084177][21741.205396991] 2025-12-04T15:15:37.0844929Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:15:37.0847914Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_kernel_optimization.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:15:37.084546] 2025-12-04T15:16:00.2314256Z 2025-12-04T15:16:00.2315765Z inductor/test_kernel_optimization 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_kernel_optimization_1.1_b508de360be55d82_.log 2025-12-04T15:16:00.2318142Z Running 1 items in this shard: test/inductor/test_kernel_optimization.py::TestKernelOptimization::test_einsum_to_pointwise 2025-12-04T15:16:00.2318803Z 2025-12-04T15:16:00.2319224Z Finished inductor/test_kernel_optimization 1/1 ... [2025-12-04 15:16:00.231211][21764.352431556], took 0.39min 2025-12-04T15:16:00.2637844Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_kernel_optimization/inductor.test_kernel_optimization-8ba4093a849c99fb.xml 2025-12-04T15:16:00.3476323Z Running inductor/test_metrics 1/1 ... [2025-12-04 15:16:00.347358][21764.46857682] 2025-12-04T15:16:00.3477010Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:16:00.3480387Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_metrics.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:16:00.347754] 2025-12-04T15:16:18.9388136Z 2025-12-04T15:16:18.9389174Z inductor/test_metrics 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_metrics_1.1_0adb01e2e59a8204_.log 2025-12-04T15:16:18.9392023Z Running 6 items in this shard: test/inductor/test_metrics.py::TestMetrics::test_atomic_add, test/inductor/test_metrics.py::TestMetrics::test_count_args, test/inductor/test_metrics.py::TestMetrics::test_count_pattern, test/inductor/test_metrics.py::TestMetrics::test_kernel_args_num_gb, test/inductor/test_metrics.py::TestMetrics::test_parse_proper_kernel_fn_code, test/inductor/test_metrics.py::TestMetrics::test_parse_reduction_hint 2025-12-04T15:16:18.9394182Z 2025-12-04T15:16:18.9394752Z Finished inductor/test_metrics 1/1 ... [2025-12-04 15:16:18.938585][21783.059804546], took 0.31min 2025-12-04T15:16:18.9709153Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_metrics/inductor.test_metrics-ba304e492604cb49.xml 2025-12-04T15:16:19.2580525Z Running export/test_unflatten_training_ir 1/1 ... [2025-12-04 15:16:19.257695][21783.378914058] 2025-12-04T15:16:19.2581308Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:16:19.2583896Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_unflatten_training_ir.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:16:19.258131] 2025-12-04T15:16:44.3081042Z 2025-12-04T15:16:44.3082453Z export/test_unflatten_training_ir 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_unflatten_training_ir_1.1_abb2865a8ff8868a_.log 2025-12-04T15:16:44.3104180Z Running 29 items in this shard: test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_assert_tensor_metadata_stack_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_attr_as_submod_input_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_dedup_sym_size_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_double_nested_submodule_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_duplicate_placeholder_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_fx_trace_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_nested_leaf_non_strict_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_placeholder_and_get_attr_ordering_after_unflattened_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_simple_alias_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_buffer_mutation_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_constant_obj_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_constant_tensor_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_container_type_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_eager_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_empty_branch_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_nested_access_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_nested_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_none_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_param_list_dict_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_preserve_signature_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_preserve_with_unused_input_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_requires_grad_param_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_root_module_type_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_shared_submodule_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_skipped_call_module_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_submodule_ordering_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_with_inplace_compile_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflatten_wrong_input_training_ir, test/export/test_unflatten_training_ir.py::TrainingIRUnflattenTestUnflatten::test_unflattened_module_nodes_has_meta_val_training_ir 2025-12-04T15:16:44.3124599Z 2025-12-04T15:16:44.3125014Z Finished export/test_unflatten_training_ir 1/1 ... [2025-12-04 15:16:44.307984][21808.429202214], took 0.42min 2025-12-04T15:16:44.3408041Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/export.test_unflatten_training_ir/export.test_unflatten_training_ir-dcf5c0e2417d22fe.xml 2025-12-04T15:16:44.4260459Z Running inductor/test_fx_fusion 1/1 ... [2025-12-04 15:16:44.425693][21808.546912586] 2025-12-04T15:16:44.4261147Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:16:44.4264165Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_fx_fusion.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:16:44.426148] 2025-12-04T15:16:50.7493064Z 2025-12-04T15:16:50.7494289Z inductor/test_fx_fusion 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_fx_fusion_1.1_98b847c3047c6d7b_.log 2025-12-04T15:16:50.7496633Z Running 4 items in this shard: test/inductor/test_fx_fusion.py::TestFxFusion::test_linear_permute_fusion, test/inductor/test_fx_fusion.py::TestFxFusion::test_permute_bmm_fusion, test/inductor/test_fx_fusion.py::TestFxFusion::test_permute_linear_fusion, test/inductor/test_fx_fusion.py::TestFxFusion::test_sink_cat_after_pointwise 2025-12-04T15:16:50.7498335Z 2025-12-04T15:16:50.7498716Z Finished inductor/test_fx_fusion 1/1 ... [2025-12-04 15:16:50.749121][21814.87034239], took 0.11min 2025-12-04T15:16:50.7815958Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_fx_fusion/inductor.test_fx_fusion-740800c94204dd1f.xml 2025-12-04T15:16:50.8657225Z Running dynamo/test_inline_and_install 1/1 ... [2025-12-04 15:16:50.865437][21814.98665698] 2025-12-04T15:16:50.8657876Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:16:50.8660744Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_inline_and_install.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:16:50.865835] 2025-12-04T15:17:30.1349550Z 2025-12-04T15:17:30.1352758Z dynamo/test_inline_and_install 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_inline_and_install_1.1_9d45f7d839a55bb8_.log 2025-12-04T15:17:30.1470831Z Running 183 items in this shard: test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_access_class_method_from_user_class_attr_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_access_class_method_from_user_class_builtin_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_byte_tensor_does_not_crash_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_capture_symbolic_tracing_simple_within_fake_mode_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_capture_symbolic_tracing_within_fake_mode_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_free_variables_overlapping_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_op_param_buffer_lifted_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_branch_args_mismatch_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_branch_return_multiple_tensors_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_branch_return_non_tensor_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_mismatch_return_length_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_mismatch_return_tensor_meta_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_missing_args_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_non_list_operands_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_non_tensor_operands_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_raise_user_error_on_unsupported_pred_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_cond_supported_pred_types_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_constraint_violation_error_messages_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dataclass_input_output_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dict_return_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dict_return_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_2_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_2_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_reorder_with_non_tensor_arg_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_reorder_with_non_tensor_arg_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_with_non_tensor_arg_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_with_non_tensor_arg_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_with_non_tensor_output_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_and_bypass_with_non_tensor_output_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dupes_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dynamic_slicing_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dynamic_slicing_simple_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dynamo_enum_in_tuple_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_dynamo_list_index_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_empty_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_enforce_equalities_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_compare_optimize_with_make_fx_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_cond_in_aten_symbolic_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_control_flow_with_getattr_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_decomp_asserts_bad_args_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_decomp_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_defaults_ok_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_dynamic_control_flow_error_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_dynamic_dim_cleanup_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_dynamic_dim_not_1_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_dynamic_dim_range_constraint_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_graph_bypass_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_graph_bypass_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_graph_with_complex_reorder_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_graph_with_complex_reorder_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_graph_with_list_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_graph_with_list_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_identity_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_masking_with_no_grad_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_meta_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_meta_val_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_mismatched_out_2_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_mismatched_out_2_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_mismatched_out_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_mismatched_out_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_module_specify_constraints_signature_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_multi_dynamic_dim_constraint_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_multi_dynamic_dim_unsafe_relationship_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_nn_module_stack_patched_module_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_no_raise_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_no_tensor_computation_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_pass_arg_by_name_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_pass_arg_by_name_star_args_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_persist_assert_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_preserve_constraints_as_metadata_tensor_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_preserves_nn_module_stack_for_get_attr_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_raise_guard_full_constraint_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_raise_guard_partial_constraint_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_raise_on_relationship_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_shape_control_flow_1_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_specialized_int_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_symbolic_shape_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_args_and_empty_kwargs_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_args_with_default_None_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_args_with_default_float_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_args_with_default_tensor_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_args_with_default_tuple_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_builtin_op_on_assume_constant_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_cond_branches_calling_methods_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_cond_closure_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_cond_dynamic_shape_pred_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_cond_with_closed_function_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_dict_values_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_free_function_and_class_method_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_free_function_and_class_method_multiarg_diff_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_free_function_and_class_method_multiarg_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_free_function_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_global_function_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_in_unspecialized_nn_module_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_list_nonzero_free_function_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_list_nonzero_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_method_on_module_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_method_on_module_invoke_twice_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_none_control_flow_free_func_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_none_control_flow_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_not_none_control_flow_free_func_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_not_none_control_flow_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_not_none_control_flow_pos_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_not_return_const_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_constant_tuple_nonzero_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_functools_wrapped_fn_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_functools_wrapped_method_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_kwargs_and_empty_args_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_kwargs_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_kwargs_with_default_None_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_kwargs_with_default_float_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_kwargs_with_default_tensor_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_kwargs_with_default_tuple_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_map_cond_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_map_zero_sized_tensor_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_map_zero_sized_tensor_suppress_errors_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_module_layer_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_nonzero_static_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_shallow_list_copy_with_side_effects_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_shallow_list_copy_wo_side_effects_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_stack_trace_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_symbool_inputs_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_export_with_wrapped_fn_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_exported_graph_serialization_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_func_return_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_func_return_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_fx_pytree_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_immutable_list_dict_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_input_container_type_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_input_global_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_input_global_multiple_access_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_input_nonlocal_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_input_unused_nonlocal_ok_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_list_contains_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_list_not_contains_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_list_unpack_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_list_unpack_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_map_cond_param_buffer_lifted_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_mixed_real_and_fake_inputs_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_multiple_outputs_op_with_evaluator_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_nested_cond_op_param_buffer_lifted_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_no_tensor_computation_2_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_no_tensor_computation_2_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_no_tensor_computation_fail_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_no_tensor_computation_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_not_functionalize_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_param_buffer_safe_from_mutation_recurse_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_param_buffer_safe_from_mutation_simple_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_pre_dispatch_simple_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_predispatch_with_for_out_dtype_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_predispatch_with_for_out_dtype_nested_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_predispatch_with_higher_order_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_predispatch_with_higher_order_nested_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_preserve_fx_node_metadata_graph_break_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_preserve_fx_node_metadata_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_preserve_fx_node_metadata_inline_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_preserve_fx_node_metadata_recompile_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_remove_redundant_dynamic_dim_in_error_message_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_retracibility_dict_container_inp_out_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_retracibility_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_retracibility_nested_list_out_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_round_dynamic_shapes_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_strict_fake_tensor_prop_real_tensors_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_subclass_parameters_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_sum_param_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_sym_contains_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_symbolic_tracing_within_fake_mode_with_constraints_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_symbolic_tracing_within_fake_mode_with_constraints_with_parameters_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_symbool_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_torch_inference_mode_ctx_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_trivial_constraint_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_uncaptured_higher_order_op_error_not_suppresed_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_untracked_inputs_in_constraints_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_zeroes_in_and_out_different_shape_on_test_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_zeroes_in_and_out_different_shape_on_test_with_aten_graph_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_zeroes_in_new_shape_scalar_out_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_zeroes_in_new_shape_scalar_out_permute_dupe_and_bypass_inline_and_install, test/dynamo/test_inline_and_install.py::InlineAndInstallExportTests::test_zeroes_in_new_shape_scalar_out_permute_inline_and_install 2025-12-04T15:17:30.1585979Z 2025-12-04T15:17:30.1586371Z Finished dynamo/test_inline_and_install 1/1 ... [2025-12-04 15:17:30.135554][21854.256772963], took 0.65min 2025-12-04T15:17:30.1683894Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_inline_and_install/dynamo.test_inline_and_install-ee70bed9693bb3f7.xml 2025-12-04T15:17:30.2468655Z Running inductor/test_inductor_scheduler 1/1 ... [2025-12-04 15:17:30.246567][21854.367786148] 2025-12-04T15:17:30.2469283Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:17:30.2472145Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_scheduler.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:17:30.246964] 2025-12-04T15:17:49.3890254Z 2025-12-04T15:17:49.3891392Z inductor/test_inductor_scheduler 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_scheduler_1.1_5f88a9d31f2f290d_.log 2025-12-04T15:17:49.3896891Z Running 8 items in this shard: test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_disable_get_estimated_runtime_logging_cuda_float16, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_disable_get_estimated_runtime_logging_cuda_float32, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options0_cuda_float16, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options0_cuda_float32, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options1_cuda_float16, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options1_cuda_float32, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_fusion_prevent_too_many_reads_and_writes_allows_fusion_cuda, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_fusion_prevent_too_many_reads_and_writes_prevents_fusion_cuda 2025-12-04T15:17:49.3901688Z 2025-12-04T15:17:49.3902094Z Finished inductor/test_inductor_scheduler 1/1 ... [2025-12-04 15:17:49.388598][21873.509818921], took 0.32min 2025-12-04T15:17:49.4214998Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_inductor_scheduler/inductor.test_inductor_scheduler-f799e81908d876d8.xml 2025-12-04T15:17:49.5032183Z Running export/test_export_strict 1/1 ... [2025-12-04 15:17:49.502937][21873.62415599] 2025-12-04T15:17:49.5032799Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:17:49.5035802Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_export_strict.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:17:49.503315] 2025-12-04T15:20:01.2379821Z 2025-12-04T15:20:01.2383125Z export/test_export_strict 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_export_strict_1.1_4ea191e22f87eaf9_.log 2025-12-04T15:20:01.2607013Z Running 440 items in this shard: test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_assume_static_by_default_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_constraints_error_not_in_range_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_constraints_error_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_inline_constraints_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_slice_maxsize_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_slice_unbacked_dim1_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_export_strict_narrow_unbacked_expr_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_no_grad_param_inplace_strict, test/export/test_export_strict.py::StrictExportTestDynamismExpression::test_reshape_view_backed_size_oblivious_strict, test/export/test_export_strict.py::StrictExportTestExport::test__scaled_dot_product_flash_attention_strict, test/export/test_export_strict.py::StrictExportTestExport::test_additional_inputs_constants_strict, test/export/test_export_strict.py::StrictExportTestExport::test_allow_explicit_guards_as_runtime_asserts_strict, test/export/test_export_strict.py::StrictExportTestExport::test_annotate_on_assert_strict, test/export/test_export_strict.py::StrictExportTestExport::test_args_type_checked_strict, test/export/test_export_strict.py::StrictExportTestExport::test_aten_lift_fresh_copy_strict, test/export/test_export_strict.py::StrictExportTestExport::test_attention_strict, test/export/test_export_strict.py::StrictExportTestExport::test_attr_assignment_extra_strict, test/export/test_export_strict.py::StrictExportTestExport::test_automatic_constrain_size_strict, test/export/test_export_strict.py::StrictExportTestExport::test_automatic_dynamic_shapes_constant_relation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_automatic_dynamic_shapes_linear_relation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_automatic_dynamic_shapes_simple_equality_strict, test/export/test_export_strict.py::StrictExportTestExport::test_baddbmm_strict, test/export/test_export_strict.py::StrictExportTestExport::test_basic_non_strict_fake_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_basic_non_strict_real_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_basic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_bincount_strict, test/export/test_export_strict.py::StrictExportTestExport::test_buffer_util_strict, test/export/test_export_strict.py::StrictExportTestExport::test_capture_subclass_constructor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_capture_subclass_constructor_torch_ir_strict, test/export/test_export_strict.py::StrictExportTestExport::test_capture_subclass_wrong_strict, test/export/test_export_strict.py::StrictExportTestExport::test_ccode_python_mod_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cdist_forward_compute_mode_zero_export_strict, test/export/test_export_strict.py::StrictExportTestExport::test_check_specialized_int_strict, test/export/test_export_strict.py::StrictExportTestExport::test_checks_to_constrain_range_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cleanup_dynamic_markers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_colin_unbacked_backed_vr_sub_strict, test/export/test_export_strict.py::StrictExportTestExport::test_colon_parameter_strict, test/export/test_export_strict.py::StrictExportTestExport::test_compiling_state_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_access_identical_symint_closure_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_branches_return_constant_int_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_branches_return_same_int_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_buffers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_contains_unbacked_no_escape_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_int_closure_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_unflatten_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_with_module_stack_export_with_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cond_with_module_stack_export_with_unflatten_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_aliasing_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_input_naming_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_no_user_inp_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_output_dup_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_output_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_requires_grad_const_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_return_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_tensor_mutation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_tensor_with_non_functional_nested_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constant_tensor_with_non_functional_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constrain_decomp_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constrain_size_in_eager_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constrain_size_with_constrain_value_strict, test/export/test_export_strict.py::StrictExportTestExport::test_constrain_size_with_various_cases_strict, test/export/test_export_strict.py::StrictExportTestExport::test_conv_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_crop_like_strict, test/export/test_export_strict.py::StrictExportTestExport::test_cse_for_symint_strict, test/export/test_export_strict.py::StrictExportTestExport::test_custom_op_auto_functionalize_pre_dispatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_custom_op_auto_functionalize_strict, test/export/test_export_strict.py::StrictExportTestExport::test_custom_op_auto_warn_pre_dispatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_custom_op_preserve_strict, test/export/test_export_strict.py::StrictExportTestExport::test_custom_pytree_strict, test/export/test_export_strict.py::StrictExportTestExport::test_custom_tag_metadata_re_export_strict, test/export/test_export_strict.py::StrictExportTestExport::test_decomp_batch_norm_functional_predispatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_decomp_item_in_prim_after_decomposition_strict, test/export/test_export_strict.py::StrictExportTestExport::test_decomp_item_in_prim_before_decomposition_strict, test/export/test_export_strict.py::StrictExportTestExport::test_default_decomposition_core_cia_ops_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_1_2_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_basic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_integer_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_nested_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_out_of_order_repeat_derived_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_out_of_order_simplified_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_out_of_order_strict, test/export/test_export_strict.py::StrictExportTestExport::test_derived_dim_repeat_derived_strict, test/export/test_export_strict.py::StrictExportTestExport::test_detect_leak_nonstrict_strict, test/export/test_export_strict.py::StrictExportTestExport::test_detect_leak_nonstrict_with_stacktrace_strict, test/export/test_export_strict.py::StrictExportTestExport::test_detect_leak_strict_strict, test/export/test_export_strict.py::StrictExportTestExport::test_device_to_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_device_to_gpu_strict, test/export/test_export_strict.py::StrictExportTestExport::test_device_to_mutation_float_strict, test/export/test_export_strict.py::StrictExportTestExport::test_device_to_mutation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_device_to_static_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_1_2_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_auto_and_dim_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_dynamic_divisibility_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_dynamic_specialization_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_hint_range_violations_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dim_hint_ranges_strict, test/export/test_export_strict.py::StrictExportTestExport::test_disable_forced_specializations_errors_strict, test/export/test_export_strict.py::StrictExportTestExport::test_disable_forced_specializations_ok_strict, test/export/test_export_strict.py::StrictExportTestExport::test_distributed_all_gather_into_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_distributed_all_gather_strict, test/export/test_export_strict.py::StrictExportTestExport::test_distributed_all_reduce_strict, test/export/test_export_strict.py::StrictExportTestExport::test_distributed_all_to_all_single_strict, test/export/test_export_strict.py::StrictExportTestExport::test_distributed_reduce_scatter_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dont_duck_size_for_auto_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_double_lifted_constants_strict, test/export/test_export_strict.py::StrictExportTestExport::test_draft_export_checks_aliasing_strict, test/export/test_export_strict.py::StrictExportTestExport::test_draft_export_checks_mutation_list_strict, test/export/test_export_strict.py::StrictExportTestExport::test_draft_export_checks_mutation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_draft_export_checks_mutation_with_nan_strict, test/export/test_export_strict.py::StrictExportTestExport::test_draft_export_fake_kernel_inference_errors_strict, test/export/test_export_strict.py::StrictExportTestExport::test_draft_export_infers_fake_kernel_strict, test/export/test_export_strict.py::StrictExportTestExport::test_duplicate_modules_with_non_persistent_buffers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_lr_shift_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_bounds_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_builder_basic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_builder_kwargs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_builder_pytree_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_dataclass_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_inferred_basic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_serdes_generic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_serdes_user_errors_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_serdes_various_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_spec_with_pytree_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_shapes_wrapped_with_shape_guards_strict, test/export/test_export_strict.py::StrictExportTestExport::test_dynamic_sym_round_strict, test/export/test_export_strict.py::StrictExportTestExport::test_ends_of_bounds_oblivious_strict, test/export/test_export_strict.py::StrictExportTestExport::test_enum_str_strict, test/export/test_export_strict.py::StrictExportTestExport::test_error_does_not_reference_eager_fallback_strict, test/export/test_export_strict.py::StrictExportTestExport::test_error_when_passing_mutating_primitive_op_strict, test/export/test_export_strict.py::StrictExportTestExport::test_exception_strict, test/export/test_export_strict.py::StrictExportTestExport::test_expand_copy_export_handles_implicit_true_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_api_with_dynamic_shapes_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_as_backend_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_associative_scan_lifted_buffers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_associative_scan_symbol_dim_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_associative_scan_symbol_scandim_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_aten_to_unflatten_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_aten_to_unflatten_subclass_pre_dispatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_aten_to_unflatten_subclass_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_cond_preserve_torch_fn_for_subgraphs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_cond_symbool_pred_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_cond_warns_constant_pred_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_custom_decomp_table_basic_pop_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_custom_decomp_table_container_methods_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_custom_op_lib_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_custom_triton_kernel_mutable_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_custom_triton_kernel_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_cyclic_reference_leak_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_decomp_torture_case_1_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_decomp_torture_case_2_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_decomps_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_decomps_simple_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_dynamo_config_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_for_training_run_decomp_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_for_training_with_container_type_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_for_training_with_dynamic_shapes_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_for_training_with_mutation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_for_training_with_state_dict_hooks_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_default_kwargs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_keyword_only_args_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_kwargs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_pytree_kwargs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_var_keyword_args_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_var_keyword_pytree_args_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_func_with_var_postional_args_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_function_schema_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_graph_with_no_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_input_mutation_bug_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_input_mutation_dynamic_shape_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_input_mutation_static_shape_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_leak_compile_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_linear_preserve_dynamic_shape_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_max_nonstrict_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_max_onnx_reported_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_method_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_mod_constraints_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_module_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_preserve_linear_at_aot_level_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_preserve_linear_but_not_custom_op_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_rnn_variants_with_warning_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_scan_pytree_output_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_script_module_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_statically_known_true_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_then_compile_tensor_ctor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_autocast_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_fake_tensor_inputs_on_cuda_devices_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_fake_tensor_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_inline_constraints_complex_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_inline_constraints_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_set_grad_enabled_strict, test/export/test_export_strict.py::StrictExportTestExport::test_export_with_wrong_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_external_call_non_strict_real_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_fake_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_fake_weights_strict, test/export/test_export_strict.py::StrictExportTestExport::test_filter_traceback_frames_strict, test/export/test_export_strict.py::StrictExportTestExport::test_flex_attention_export_strict, test/export/test_export_strict.py::StrictExportTestExport::test_float_conversion_from_int_strict, test/export/test_export_strict.py::StrictExportTestExport::test_float_conversion_strict, test/export/test_export_strict.py::StrictExportTestExport::test_fqn_strict, test/export/test_export_strict.py::StrictExportTestExport::test_from_node_metadata_export_strict, test/export/test_export_strict.py::StrictExportTestExport::test_full_on_scalar_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_function_holding_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_hints_wrapper_strict, test/export/test_export_strict.py::StrictExportTestExport::test_hoo_inline_users_issue_strict, test/export/test_export_strict.py::StrictExportTestExport::test_if_functional_strict, test/export/test_export_strict.py::StrictExportTestExport::test_if_post_autograd_op_preserved_strict, test/export/test_export_strict.py::StrictExportTestExport::test_inductor_backend_inside_nonstrict_strict, test/export/test_export_strict.py::StrictExportTestExport::test_inline_script_class_method_recursive_strict, test/export/test_export_strict.py::StrictExportTestExport::test_inline_script_class_method_strict, test/export/test_export_strict.py::StrictExportTestExport::test_inline_script_function_strict, test/export/test_export_strict.py::StrictExportTestExport::test_inline_script_method_strict, test/export/test_export_strict.py::StrictExportTestExport::test_int_shape_specialization_strict, test/export/test_export_strict.py::StrictExportTestExport::test_intermediate_shape_comp_strict, test/export/test_export_strict.py::StrictExportTestExport::test_invalid_pytree_dynamo_graph_capture_strict, test/export/test_export_strict.py::StrictExportTestExport::test_is_exporting_strict, test/export/test_export_strict.py::StrictExportTestExport::test_is_nonzero_strict, test/export/test_export_strict.py::StrictExportTestExport::test_isnonzero_strict, test/export/test_export_strict.py::StrictExportTestExport::test_issue_113041_strict, test/export/test_export_strict.py::StrictExportTestExport::test_issue_157289_strict, test/export/test_export_strict.py::StrictExportTestExport::test_issue_161902_strict, test/export/test_export_strict.py::StrictExportTestExport::test_istft_op_strict, test/export/test_export_strict.py::StrictExportTestExport::test_keep_composite_ops_invalid_strict, test/export/test_export_strict.py::StrictExportTestExport::test_keep_composite_ops_linear_convd_for_training_ir_strict, test/export/test_export_strict.py::StrictExportTestExport::test_keep_composite_ops_linear_convd_strict, test/export/test_export_strict.py::StrictExportTestExport::test_kwarg_dynamic_shapes_diff_order_strict, test/export/test_export_strict.py::StrictExportTestExport::test_kwargs_reorder_strict, test/export/test_export_strict.py::StrictExportTestExport::test_layer_norm_unbacked_normalized_shape_strict, test/export/test_export_strict.py::StrictExportTestExport::test_layer_sharing_strict, test/export/test_export_strict.py::StrictExportTestExport::test_lazy_module_kwargs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_lifted_constants_strict, test/export/test_export_strict.py::StrictExportTestExport::test_linear_conv_strict, test/export/test_export_strict.py::StrictExportTestExport::test_malformed_fqn_from_source_name_strict, test/export/test_export_strict.py::StrictExportTestExport::test_map_buffers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_map_strict, test/export/test_export_strict.py::StrictExportTestExport::test_mask_nonzero_static_strict, test/export/test_export_strict.py::StrictExportTestExport::test_masked_select_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_math_pow_strict, test/export/test_export_strict.py::StrictExportTestExport::test_mismatched_dynamic_shapes_strict, test/export/test_export_strict.py::StrictExportTestExport::test_mixed_input_strict, test/export/test_export_strict.py::StrictExportTestExport::test_module_dict_key_strict, test/export/test_export_strict.py::StrictExportTestExport::test_module_input_strict, test/export/test_export_strict.py::StrictExportTestExport::test_module_input_subclasses_parameterization_nested_strict, test/export/test_export_strict.py::StrictExportTestExport::test_module_list_slice_strict, test/export/test_export_strict.py::StrictExportTestExport::test_module_strict, test/export/test_export_strict.py::StrictExportTestExport::test_module_with_dict_container_inp_out_strict, test/export/test_export_strict.py::StrictExportTestExport::test_modules_access_for_deleted_submodule_strict, test/export/test_export_strict.py::StrictExportTestExport::test_more_multidimensional_slicing_strict, test/export/test_export_strict.py::StrictExportTestExport::test_multidimensional_slicing_strict, test/export/test_export_strict.py::StrictExportTestExport::test_multinomial_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_multiple_definitions_same_name_dim_strict, test/export/test_export_strict.py::StrictExportTestExport::test_namedtuple_input_export_strict, test/export/test_export_strict.py::StrictExportTestExport::test_native_multi_attention_head_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nested_dynamic_shapes_spec_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nested_module_fake_tensor_leak_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nested_module_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nested_module_with_constant_buffer_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nested_module_with_init_buffer_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nested_module_with_parameter_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nn_module_stack_shared_submodule_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nn_module_stack_strict, test/export/test_export_strict.py::StrictExportTestExport::test_no_check_is_size_error_strict, test/export/test_export_strict.py::StrictExportTestExport::test_no_suggested_fixes_for_data_dependent_errors_strict, test/export/test_export_strict.py::StrictExportTestExport::test_no_tensor_computation_2_strict, test/export/test_export_strict.py::StrictExportTestExport::test_no_tensor_computation_3_strict, test/export/test_export_strict.py::StrictExportTestExport::test_no_tensor_computation_4_strict, test/export/test_export_strict.py::StrictExportTestExport::test_no_tensor_computation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_non_arg_name_dynamic_shapes_api_strict, test/export/test_export_strict.py::StrictExportTestExport::test_non_arg_name_dynamic_shapes_api_with_container_type_strict, test/export/test_export_strict.py::StrictExportTestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg_strict, test/export/test_export_strict.py::StrictExportTestExport::test_non_persistent_buffer_strict, test/export/test_export_strict.py::StrictExportTestExport::test_non_strict_dynamic_shapes_strict, test/export/test_export_strict.py::StrictExportTestExport::test_non_strict_dynamic_shapes_suggested_fixes_strict, test/export/test_export_strict.py::StrictExportTestExport::test_none_buffers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nonstrict_retrace_preserves_metadata_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nonzero_2_strict, test/export/test_export_strict.py::StrictExportTestExport::test_nonzero_dynamic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_not_registered_parameter_strict, test/export/test_export_strict.py::StrictExportTestExport::test_operator_aten_tensor_mode_variant_strict, test/export/test_export_strict.py::StrictExportTestExport::test_output_node_name_strict, test/export/test_export_strict.py::StrictExportTestExport::test_pad_sequence_strict, test/export/test_export_strict.py::StrictExportTestExport::test_param_util_strict, test/export/test_export_strict.py::StrictExportTestExport::test_partial_patched_forward_strict, test/export/test_export_strict.py::StrictExportTestExport::test_placeholder_naming_collisions_hoo_subgraphs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_placeholder_naming_collisions_strict, test/export/test_export_strict.py::StrictExportTestExport::test_placeholder_naming_order_strict, test/export/test_export_strict.py::StrictExportTestExport::test_placeholder_naming_order_variadic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_placeholder_update_preserving_strict, test/export/test_export_strict.py::StrictExportTestExport::test_predispatch_cond_strict, test/export/test_export_strict.py::StrictExportTestExport::test_predispatch_grad_wrappers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_preserve_annotation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_preserve_module_call_signature_unflatten_specialization_strict, test/export/test_export_strict.py::StrictExportTestExport::test_preserve_requires_grad_placeholders_strict, test/export/test_export_strict.py::StrictExportTestExport::test_preserve_shape_dynamism_for_unused_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_profiling_code_strict, test/export/test_export_strict.py::StrictExportTestExport::test_python_asserts_with_sym_int_strict, test/export/test_export_strict.py::StrictExportTestExport::test_pytree_register_data_class_strict, test/export/test_export_strict.py::StrictExportTestExport::test_pytree_register_nested_data_class_strict, test/export/test_export_strict.py::StrictExportTestExport::test_raise_user_error_when_guard_on_data_dependent_operation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_range_constraints_with_replacement_strict, test/export/test_export_strict.py::StrictExportTestExport::test_real_tensor_alias_dtype_mismatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_real_tensor_bool_cast_strict, test/export/test_export_strict.py::StrictExportTestExport::test_real_tensor_errors_on_aliasing_custom_op_strict, test/export/test_export_strict.py::StrictExportTestExport::test_real_tensor_for_max_op_strict, test/export/test_export_strict.py::StrictExportTestExport::test_real_tensor_size_mismatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_redundant_assert_max_upper_bound_strict, test/export/test_export_strict.py::StrictExportTestExport::test_redundant_asserts_strict, test/export/test_export_strict.py::StrictExportTestExport::test_refine_dynamic_shapes_from_suggested_fixes_strict, test/export/test_export_strict.py::StrictExportTestExport::test_register_constant_strict, test/export/test_export_strict.py::StrictExportTestExport::test_repeat_interleave_strict, test/export/test_export_strict.py::StrictExportTestExport::test_replace_unbacked_with_very_large_upperbound_strict, test/export/test_export_strict.py::StrictExportTestExport::test_replaced_unbacked_bindings_strict, test/export/test_export_strict.py::StrictExportTestExport::test_reshape_view_helper_strict, test/export/test_export_strict.py::StrictExportTestExport::test_retracable_ep_strict, test/export/test_export_strict.py::StrictExportTestExport::test_retrace_pre_autograd_strict, test/export/test_export_strict.py::StrictExportTestExport::test_run_decomposition_supports_user_input_mutation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_run_decompositions_keep_metadata_strict, test/export/test_export_strict.py::StrictExportTestExport::test_run_decompositions_keep_tensor_constant_metadata_strict, test/export/test_export_strict.py::StrictExportTestExport::test_runtime_assert_for_prim_strict, test/export/test_export_strict.py::StrictExportTestExport::test_runtime_assert_for_prm_str_strict, test/export/test_export_strict.py::StrictExportTestExport::test_runtime_assert_with_size_strict, test/export/test_export_strict.py::StrictExportTestExport::test_sdpa_gqa_strict, test/export/test_export_strict.py::StrictExportTestExport::test_sequential_slicing_strict, test/export/test_export_strict.py::StrictExportTestExport::test_set_example_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_set_grad_as_side_effect_strict, test/export/test_export_strict.py::StrictExportTestExport::test_set_grad_empty_strict, test/export/test_export_strict.py::StrictExportTestExport::test_set_grad_unflatten_strict, test/export/test_export_strict.py::StrictExportTestExport::test_setgrad_lifted_tensor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_shared_submodule_nn_module_stack_strict, test/export/test_export_strict.py::StrictExportTestExport::test_simple_export_for_training_strict, test/export/test_export_strict.py::StrictExportTestExport::test_simple_unbacked_view_strict, test/export/test_export_strict.py::StrictExportTestExport::test_size_input_strict, test/export/test_export_strict.py::StrictExportTestExport::test_slice_nn_module_stack_strict, test/export/test_export_strict.py::StrictExportTestExport::test_solver_unsupported_sympy_function_strict, test/export/test_export_strict.py::StrictExportTestExport::test_specialize_derived_dim_roots_strict, test/export/test_export_strict.py::StrictExportTestExport::test_split_const_gm_with_lifted_constants_strict, test/export/test_export_strict.py::StrictExportTestExport::test_stack_trace_make_fx_strict, test/export/test_export_strict.py::StrictExportTestExport::test_stack_trace_strict, test/export/test_export_strict.py::StrictExportTestExport::test_state_primitives_strict, test/export/test_export_strict.py::StrictExportTestExport::test_state_shape_attribute_assignment_strict, test/export/test_export_strict.py::StrictExportTestExport::test_state_tensors_strict, test/export/test_export_strict.py::StrictExportTestExport::test_static_dim_constraints_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclass_context_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclass_nested_attr_access_complicated_metadata_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclass_nested_attr_access_const_metadata_not_top_level_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclass_nested_attr_access_const_metadata_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclass_nested_attr_access_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclass_nested_attr_access_submodule_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclasses_parameterization_nested_strict, test/export/test_export_strict.py::StrictExportTestExport::test_subclasses_parameterization_strict, test/export/test_export_strict.py::StrictExportTestExport::test_suggest_torch_checks_with_non_negative_check_strict, test/export/test_export_strict.py::StrictExportTestExport::test_suggest_torch_checks_with_regular_check_strict, test/export/test_export_strict.py::StrictExportTestExport::test_suggested_fixes_for_data_dependent_errors_basic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers_strict, test/export/test_export_strict.py::StrictExportTestExport::test_suggested_fixes_new_roots_strict, test/export/test_export_strict.py::StrictExportTestExport::test_sym_float_operators_strict, test/export/test_export_strict.py::StrictExportTestExport::test_sym_or_sym_and_strict, test/export/test_export_strict.py::StrictExportTestExport::test_sym_sqrt_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symbool_item_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symfloat_item_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_input_additional_inputs_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_input_basic_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_input_ranges_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_input_shapes_collection_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_input_specialization_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_item_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_output_strict, test/export/test_export_strict.py::StrictExportTestExport::test_symint_tensor_return_strict, test/export/test_export_strict.py::StrictExportTestExport::test_tag_ac_export_strict, test/export/test_export_strict.py::StrictExportTestExport::test_tensor_attribute_zero_args_strict, test/export/test_export_strict.py::StrictExportTestExport::test_tensor_constant_aten_to_strict, test/export/test_export_strict.py::StrictExportTestExport::test_tensor_constant_with_wrapped_method_strict, test/export/test_export_strict.py::StrictExportTestExport::test_to_module_with_mutated_buffer_multiple_strict, test/export/test_export_strict.py::StrictExportTestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later_strict, test/export/test_export_strict.py::StrictExportTestExport::test_to_module_with_mutated_buffer_strict, test/export/test_export_strict.py::StrictExportTestExport::test_tolist_strict, test/export/test_export_strict.py::StrictExportTestExport::test_torch_check_eq_commutativity_strict, test/export/test_export_strict.py::StrictExportTestExport::test_torch_fn_strict, test/export/test_export_strict.py::StrictExportTestExport::test_trace_under_fake_strict, test/export/test_export_strict.py::StrictExportTestExport::test_train_eval_on_exported_preautograd_module_strict, test/export/test_export_strict.py::StrictExportTestExport::test_tril_dynamic_diagonal_strict, test/export/test_export_strict.py::StrictExportTestExport::test_triu_dynamic_diagonal_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_3d_matmul_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_bincount_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_bindings_for_divisible_u_symint_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_deferred_runtime_retrace_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_expand_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_infer_size_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_kth_value_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_linear_layer_norm_input_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_noncontig_lin_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_pad_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_scalar_constructor_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_slice_forward_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_slice_simple_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_stack_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_to_cond_passthrough_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_to_cond_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unbacked_unsqueeze_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_asserts_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_buffer_update_child2parent_swap_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_closure_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_isinstance_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_multiple_graphs_dispatch_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_multiple_graphs_preserve_signature_no_error_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_multiple_graphs_shared_submodule_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_multiple_graphs_state_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_no_unroll_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_placeholder_update_child2parent_swap_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_placeholder_update_grandchild2cousin_swap_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_5_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_6_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_buf_8_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_const_preserving_3_1_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_const_preserving_3_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_4_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_6_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_9_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_10_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_5_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_mutating_buf_preserving_7_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unflatten_random_dag_preserving_4_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unused_aliases_strict, test/export/test_export_strict.py::StrictExportTestExport::test_unused_constant_strict, test/export/test_export_strict.py::StrictExportTestExport::test_uplift_common_custom_meta_strict, test/export/test_export_strict.py::StrictExportTestExport::test_uplift_common_custom_meta_with_multiple_calls_strict, test/export/test_export_strict.py::StrictExportTestExport::test_use_embedding_twice_strict, test/export/test_export_strict.py::StrictExportTestExport::test_user_input_and_buffer_mutation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_vmap_custom_autograd_function_strict, test/export/test_export_strict.py::StrictExportTestExport::test_vmap_strict, test/export/test_export_strict.py::StrictExportTestExport::test_vmap_to_assert_strict, test/export/test_export_strict.py::StrictExportTestExport::test_where_decomp_strict, test/export/test_export_strict.py::StrictExportTestExport::test_while_loop_assert_separation_strict, test/export/test_export_strict.py::StrictExportTestExport::test_while_loop_index_assertions_strict, test/export/test_export_strict.py::StrictExportTestExport::test_while_loop_simple_strict, test/export/test_export_strict.py::StrictExportTestExport::test_while_loop_tensor_constant_idx_strict, test/export/test_export_strict.py::StrictExportTestExport::test_wrapper_module_strict 2025-12-04T15:20:01.2829845Z 2025-12-04T15:20:01.2830243Z Finished export/test_export_strict 1/1 ... [2025-12-04 15:20:01.239181][22005.360396381], took 2.20min 2025-12-04T15:20:01.2831539Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/export.test_export_strict/export.test_export_strict-16708e34d1c82f51.xml 2025-12-04T15:20:01.4089185Z Running inductor/test_compiled_optimizers 3/3 ... [2025-12-04 15:20:01.408573][22005.529790306] 2025-12-04T15:20:01.4090017Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:20:01.4093129Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_compiled_optimizers.py', '--shard-id=3', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:20:01.409048] 2025-12-04T15:29:05.1458120Z 2025-12-04T15:29:05.1459561Z inductor/test_compiled_optimizers 3/3 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_compiled_optimizers_3.3_f162c7ad1675ca04_.log 2025-12-04T15:29:05.1589307Z Running 202 items in this shard: test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_capturable_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_capturable_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_rho_weight_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_foreach_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_foreach_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_tensor_lr_capturable_foreach_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_weight_decay_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adadelta_weight_decay_maximize_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_initial_accumulator_value_weight_decay_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_lr_decay_weight_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cpu_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cpu_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_foreach_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_foreach_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_foreach_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_tensor_lr_foreach_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_weight_decay_maximize_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adagrad_weight_decay_maximize_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_recompile, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_amsgrad_capturable_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_amsgrad_capturable_cuda_exponentiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_amsgrad_capturable_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_amsgrad_capturable_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_amsgrad_capturable_foreach_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_exponentiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_capturable_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_capturable_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_capturable_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_capturable_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_capturable_foreach_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_tensor_lr_tensor_betas_capturable_foreach_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_weight_decay_amsgrad_capturable_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_weight_decay_amsgrad_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_weight_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_weight_decay_maximize_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adam_weight_decay_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_capturable_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_maximize_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_foreach_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_tensor_lr_weight_decay_capturable_foreach_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_weight_decay_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_weight_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_weight_decay_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamax_weight_decay_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_foreach_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_foreach_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_amsgrad_capturable_foreach_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_cuda_exponentiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_amsgrad_capturable_foreach_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_foreach_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_foreach_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_foreach_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_foreach_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_tensor_lr_tensor_betas_capturable_foreach_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_adamw_weight_decay_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_capturable_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_lambd_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_lambd_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_maximize_capturable_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_t0_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_tensor_lr_weight_decay_maximize_capturable_cuda_exponentiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_tensor_lr_weight_decay_maximize_capturable_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_tensor_lr_weight_decay_maximize_capturable_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_tensor_lr_weight_decay_maximize_capturable_foreach_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_tensor_lr_weight_decay_maximize_capturable_foreach_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_weight_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_weight_decay_maximize_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_weight_decay_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_asgd_weight_decay_maximize_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_basic_shampoo, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_guard_on_none_grads, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_momentum_decay_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_momentum_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_recompile, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_tensor_lr_weight_decay_momentum_decay_decoupled_weight_decay_capturable_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_tensor_lr_weight_decay_momentum_decay_decoupled_weight_decay_capturable_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_tensor_lr_weight_decay_momentum_decay_decoupled_weight_decay_capturable_foreach_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_tensor_lr_weight_decay_momentum_decay_decoupled_weight_decay_capturable_foreach_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_tensor_lr_weight_decay_momentum_decay_decoupled_weight_decay_capturable_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_tensor_lr_weight_decay_momentum_decay_decoupled_weight_decay_capturable_foreach_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_nadam_weight_decay_momentum_decay_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_eps_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_foreach_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_foreach_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_foreach_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_tensor_lr_capturable_weight_decay_decoupled_weight_decay_foreach_cuda_polynomiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_weight_decay_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_radam_weight_decay_decoupled_weight_decay_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_capturable_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_cuda_cycliclr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_cuda_steplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_foreach_cuda_constantlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_foreach_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_foreach_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_foreach_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_tensor_lr_capturable_foreach_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_weight_decay_centered_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_weight_decay_centered_momentum_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_weight_decay_centered_momentum_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_weight_decay_centered_momentum_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rmsprop_weight_decay_centered_momentum_maximize_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_capturable_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_etas_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_etas_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_step_sizes_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_step_sizes_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_tensor_lr_capturable_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_tensor_lr_capturable_cuda_exponentiallr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_tensor_lr_capturable_cuda_multisteplr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_tensor_lr_capturable_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_tensor_lr_capturable_foreach_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_rprop_tensor_lr_capturable_foreach_cuda_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_momentum_dampening_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_momentum_nesterov_weight_decay_cpu, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_recompile_foreach, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_cpu_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_cpu_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_cpu_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_cpu_onecyclelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_cuda_linearlr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_cuda_multiplicativelr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_foreach_cuda_cosineannealinglr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_foreach_cuda_cosineannealingwarmrestarts, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_foreach_cuda_lambdalr, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_tensor_lr_foreach_cuda_reducelronplateau, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_weight_decay_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_weight_decay_foreach_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerTests::test_sgd_weight_decay_maximize_cuda, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_Adadelta_use_closure_False_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_Adagrad_use_closure_False_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_AdamW_use_closure_False_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_Adamax_use_closure_True_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_Muon_use_closure_True_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_RAdam_use_closure_False_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_RMSprop_use_closure_False_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_Rprop_use_closure_True_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_SGD_use_closure_False_cuda_float32, test/inductor/test_compiled_optimizers.py::CompiledOptimizerParityTestsCUDA::test_correctness_SparseAdam_use_closure_True_cuda_float32 2025-12-04T15:29:05.1716306Z 2025-12-04T15:29:05.1716733Z Finished inductor/test_compiled_optimizers 3/3 ... [2025-12-04 15:29:05.145965][22549.267184671], took 9.06min 2025-12-04T15:29:05.1793534Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compiled_optimizers/inductor.test_compiled_optimizers-8408ff78148caebd.xml 2025-12-04T15:29:06.4633248Z Uploading artifacts took 1.18 seconds 2025-12-04T15:29:06.4637350Z Running dynamo/test_graph_region_tracker 1/1 ... [2025-12-04 15:29:06.463543][22550.584762995] 2025-12-04T15:29:06.4638005Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:29:06.4641937Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_graph_region_tracker.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:29:06.463958] 2025-12-04T15:29:16.0421326Z 2025-12-04T15:29:16.0422475Z dynamo/test_graph_region_tracker 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_graph_region_tracker_1.1_f5ae5cf5f828f9b1_.log 2025-12-04T15:29:16.0429830Z Running 13 items in this shard: test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_get_regions_multiple_region_groups, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_get_regions_single_region_group, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_mismatched_arg_shapes, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_mismatched_dtypes, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_mismatched_global_state, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_mutation_tracking_allow_in_graph, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_mutation_tracking_setitem, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_mutation_tracking_simple, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_nested_args, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_no_duplicate_tracking, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_no_single_node_regions, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_non_tensor_arg_hashing, test/dynamo/test_graph_region_tracker.py::GraphRegionTrackerTests::test_region_sorting 2025-12-04T15:29:16.0436824Z 2025-12-04T15:29:16.0437221Z Finished dynamo/test_graph_region_tracker 1/1 ... [2025-12-04 15:29:16.041937][22560.163156855], took 0.16min 2025-12-04T15:29:16.0751960Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_graph_region_tracker/dynamo.test_graph_region_tracker-e7e3f06a1f8d4793.xml 2025-12-04T15:29:16.1481528Z Running dynamo/test_unittest 1/1 ... [2025-12-04 15:29:16.147922][22560.269142349] 2025-12-04T15:29:16.1482086Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:29:16.1485591Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_unittest.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:29:16.148298] 2025-12-04T15:29:23.3225403Z 2025-12-04T15:29:23.3226987Z dynamo/test_unittest 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_unittest_1.1_d68e34e45e740e8c_.log 2025-12-04T15:29:23.3229167Z Running 1 items in this shard: test/dynamo/test_unittest.py::TestUnittest::test_SkipTest 2025-12-04T15:29:23.3230298Z 2025-12-04T15:29:23.3230922Z Finished dynamo/test_unittest 1/1 ... [2025-12-04 15:29:23.322297][22567.443518048], took 0.12min 2025-12-04T15:29:23.3560786Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_unittest/dynamo.test_unittest-1a49598617666801.xml 2025-12-04T15:29:23.4159447Z Running inductor/test_compile 1/1 ... [2025-12-04 15:29:23.415632][22567.536852579] 2025-12-04T15:29:23.4160381Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:29:23.4164317Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_compile.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:29:23.416064] 2025-12-04T15:29:47.2646797Z 2025-12-04T15:29:47.2647830Z inductor/test_compile 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_compile_1.1_b13c9a106be3e4f3_.log 2025-12-04T15:29:47.2653107Z Running 10 items in this shard: test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_generate_debug_compile, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_generate_debug_symbol, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_bare_module, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_export1, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_export2, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_fx, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_fx_dict_input, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_fx_tensor_return, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_make_fx, test/inductor/test_compile.py::TestStandaloneInductor::test_inductor_via_op_with_multiple_outputs 2025-12-04T15:29:47.2657923Z 2025-12-04T15:29:47.2658348Z Finished inductor/test_compile 1/1 ... [2025-12-04 15:29:47.264479][22591.385699125], took 0.40min 2025-12-04T15:29:47.2987551Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_compile/inductor.test_compile-f393f964f8c8ee63.xml 2025-12-04T15:29:47.3812147Z Running dynamo/test_functions 1/1 ... [2025-12-04 15:29:47.380895][22591.502112598] 2025-12-04T15:29:47.3812736Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:29:47.3815614Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_functions.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:29:47.381278] 2025-12-04T15:30:53.9403324Z 2025-12-04T15:30:53.9404464Z dynamo/test_functions 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_functions_1.1_6ca2c48a93208d7e_.log 2025-12-04T15:30:53.9573785Z Running 478 items in this shard: test/dynamo/test_functions.py::FunctionTests::test_T, test/dynamo/test_functions.py::FunctionTests::test_add, test/dynamo/test_functions.py::FunctionTests::test_add_, test/dynamo/test_functions.py::FunctionTests::test_addcdiv, test/dynamo/test_functions.py::FunctionTests::test_addcdiv_, test/dynamo/test_functions.py::FunctionTests::test_addcmul_, test/dynamo/test_functions.py::FunctionTests::test_are_functorch_transforms_active, test/dynamo/test_functions.py::FunctionTests::test_attrgetter, test/dynamo/test_functions.py::FunctionTests::test_broadcast_foreach_pow, test/dynamo/test_functions.py::FunctionTests::test_build_list_unpack, test/dynamo/test_functions.py::FunctionTests::test_call_dict1, test/dynamo/test_functions.py::FunctionTests::test_call_dict2, test/dynamo/test_functions.py::FunctionTests::test_call_dict3, test/dynamo/test_functions.py::FunctionTests::test_call_dict4, test/dynamo/test_functions.py::FunctionTests::test_call_dict5, test/dynamo/test_functions.py::FunctionTests::test_callable_builtin, test/dynamo/test_functions.py::FunctionTests::test_callable_class, test/dynamo/test_functions.py::FunctionTests::test_callable_lambda, test/dynamo/test_functions.py::FunctionTests::test_callable_list, test/dynamo/test_functions.py::FunctionTests::test_callable_torch, test/dynamo/test_functions.py::FunctionTests::test_chunks1, test/dynamo/test_functions.py::FunctionTests::test_class_dict, test/dynamo/test_functions.py::FunctionTests::test_cls_eq, test/dynamo/test_functions.py::FunctionTests::test_cls_hasattr, test/dynamo/test_functions.py::FunctionTests::test_cls_is, test/dynamo/test_functions.py::FunctionTests::test_compare_constant_and_tensor, test/dynamo/test_functions.py::FunctionTests::test_complex_closure, test/dynamo/test_functions.py::FunctionTests::test_const_tuple_add1, test/dynamo/test_functions.py::FunctionTests::test_const_tuple_add2, test/dynamo/test_functions.py::FunctionTests::test_constant1, test/dynamo/test_functions.py::FunctionTests::test_constant2, test/dynamo/test_functions.py::FunctionTests::test_constant3, test/dynamo/test_functions.py::FunctionTests::test_constant4, test/dynamo/test_functions.py::FunctionTests::test_constant_set, test/dynamo/test_functions.py::FunctionTests::test_context_wrapping_nested_functions_no_closure, test/dynamo/test_functions.py::FunctionTests::test_cublas_allow_tf32, test/dynamo/test_functions.py::FunctionTests::test_custom_dict_kwargs, test/dynamo/test_functions.py::FunctionTests::test_default_dict_closure, test/dynamo/test_functions.py::FunctionTests::test_default_dict_constr, test/dynamo/test_functions.py::FunctionTests::test_default_dict_dict, test/dynamo/test_functions.py::FunctionTests::test_default_dict_lambda, test/dynamo/test_functions.py::FunctionTests::test_default_dict_list, test/dynamo/test_functions.py::FunctionTests::test_default_dict_set, test/dynamo/test_functions.py::FunctionTests::test_default_dict_tuple, test/dynamo/test_functions.py::FunctionTests::test_defaultdict_setdefault1, test/dynamo/test_functions.py::FunctionTests::test_defaultdict_setdefault2, test/dynamo/test_functions.py::FunctionTests::test_defaultdict_setdefault3, test/dynamo/test_functions.py::FunctionTests::test_del, test/dynamo/test_functions.py::FunctionTests::test_deque, test/dynamo/test_functions.py::FunctionTests::test_device, test/dynamo/test_functions.py::FunctionTests::test_device_constant, test/dynamo/test_functions.py::FunctionTests::test_dict_copy, test/dynamo/test_functions.py::FunctionTests::test_dict_fromkeys, test/dynamo/test_functions.py::FunctionTests::test_dict_hasattr, test/dynamo/test_functions.py::FunctionTests::test_dict_id_guard, test/dynamo/test_functions.py::FunctionTests::test_dict_items_sorted, test/dynamo/test_functions.py::FunctionTests::test_dict_key_set1, test/dynamo/test_functions.py::FunctionTests::test_dict_key_set2, test/dynamo/test_functions.py::FunctionTests::test_dict_key_set3, test/dynamo/test_functions.py::FunctionTests::test_dict_keys, test/dynamo/test_functions.py::FunctionTests::test_dict_kwargs, test/dynamo/test_functions.py::FunctionTests::test_dict_mutable_map, test/dynamo/test_functions.py::FunctionTests::test_dict_ops, test/dynamo/test_functions.py::FunctionTests::test_dict_param_keys, test/dynamo/test_functions.py::FunctionTests::test_dict_setdefault1, test/dynamo/test_functions.py::FunctionTests::test_dict_setdefault2, test/dynamo/test_functions.py::FunctionTests::test_dict_setdefault3, test/dynamo/test_functions.py::FunctionTests::test_dict_sorted, test/dynamo/test_functions.py::FunctionTests::test_dict_tuple_lazy_guard, test/dynamo/test_functions.py::FunctionTests::test_dict_update, test/dynamo/test_functions.py::FunctionTests::test_dict_update_kwargs, test/dynamo/test_functions.py::FunctionTests::test_dict_values, test/dynamo/test_functions.py::FunctionTests::test_distributed_is_available, test/dynamo/test_functions.py::FunctionTests::test_distributed_is_initialized, test/dynamo/test_functions.py::FunctionTests::test_dtype, test/dynamo/test_functions.py::FunctionTests::test_dtype_compare, test/dynamo/test_functions.py::FunctionTests::test_elipsis, test/dynamo/test_functions.py::FunctionTests::test_enumerate, test/dynamo/test_functions.py::FunctionTests::test_enumerate_custom, test/dynamo/test_functions.py::FunctionTests::test_enumerate_reconstruct, test/dynamo/test_functions.py::FunctionTests::test_filter, test/dynamo/test_functions.py::FunctionTests::test_filter_fallback, test/dynamo/test_functions.py::FunctionTests::test_filter_graph_break_reconstruct, test/dynamo/test_functions.py::FunctionTests::test_filter_infinite_iterator, test/dynamo/test_functions.py::FunctionTests::test_filter_reconstruct, test/dynamo/test_functions.py::FunctionTests::test_filter_with_graph_break, test/dynamo/test_functions.py::FunctionTests::test_finfo, test/dynamo/test_functions.py::FunctionTests::test_flat_param_same_storage_size, test/dynamo/test_functions.py::FunctionTests::test_float, test/dynamo/test_functions.py::FunctionTests::test_fn_with_self_set, test/dynamo/test_functions.py::FunctionTests::test_foreach_lerp_, test/dynamo/test_functions.py::FunctionTests::test_fstrings1, test/dynamo/test_functions.py::FunctionTests::test_fstrings2, test/dynamo/test_functions.py::FunctionTests::test_fstrings3, test/dynamo/test_functions.py::FunctionTests::test_fstrings4, test/dynamo/test_functions.py::FunctionTests::test_fstrings5, test/dynamo/test_functions.py::FunctionTests::test_fstrings6, test/dynamo/test_functions.py::FunctionTests::test_funcdef_closure, test/dynamo/test_functions.py::FunctionTests::test_functools_cache_guard, test/dynamo/test_functions.py::FunctionTests::test_functools_partial, test/dynamo/test_functions.py::FunctionTests::test_functools_partial_binding, test/dynamo/test_functions.py::FunctionTests::test_generic_namedtuple_hasattr, test/dynamo/test_functions.py::FunctionTests::test_generic_namedtuple_subclass, test/dynamo/test_functions.py::FunctionTests::test_generic_namedtuple_user_methods, test/dynamo/test_functions.py::FunctionTests::test_get_autocast_gpu_dtype, test/dynamo/test_functions.py::FunctionTests::test_get_calculate_correct_fan, test/dynamo/test_functions.py::FunctionTests::test_get_default_dtype, test/dynamo/test_functions.py::FunctionTests::test_get_device_properties_tensor_device, test/dynamo/test_functions.py::FunctionTests::test_get_privateuse1_name, test/dynamo/test_functions.py::FunctionTests::test_getattr, test/dynamo/test_functions.py::FunctionTests::test_getattr_metaclass, test/dynamo/test_functions.py::FunctionTests::test_globalfn, test/dynamo/test_functions.py::FunctionTests::test_globalmodule, test/dynamo/test_functions.py::FunctionTests::test_globalvar, test/dynamo/test_functions.py::FunctionTests::test_import1, test/dynamo/test_functions.py::FunctionTests::test_in_not_in, test/dynamo/test_functions.py::FunctionTests::test_index, test/dynamo/test_functions.py::FunctionTests::test_indexed_range, test/dynamo/test_functions.py::FunctionTests::test_indirect1, test/dynamo/test_functions.py::FunctionTests::test_indirect2, test/dynamo/test_functions.py::FunctionTests::test_indirect3, test/dynamo/test_functions.py::FunctionTests::test_inline_jit__unwrap_optional, test/dynamo/test_functions.py::FunctionTests::test_inline_jit_annotations, test/dynamo/test_functions.py::FunctionTests::test_inline_lru_cache_fn_with_default_args, test/dynamo/test_functions.py::FunctionTests::test_inline_script_if_tracing_fn_with_default_args, test/dynamo/test_functions.py::FunctionTests::test_inline_softmax, test/dynamo/test_functions.py::FunctionTests::test_inline_with_default, test/dynamo/test_functions.py::FunctionTests::test_inner_function, test/dynamo/test_functions.py::FunctionTests::test_is, test/dynamo/test_functions.py::FunctionTests::test_is_any_autocast_enabled, test/dynamo/test_functions.py::FunctionTests::test_is_checkpoint_valid, test/dynamo/test_functions.py::FunctionTests::test_is_complex, test/dynamo/test_functions.py::FunctionTests::test_is_contiguous_frame_counts, test/dynamo/test_functions.py::FunctionTests::test_is_contiguous_memory_format, test/dynamo/test_functions.py::FunctionTests::test_is_floating_point, test/dynamo/test_functions.py::FunctionTests::test_is_fx_tracing, test/dynamo/test_functions.py::FunctionTests::test_is_in_onnx_export, test/dynamo/test_functions.py::FunctionTests::test_is_inference_mode_global_recompilation, test/dynamo/test_functions.py::FunctionTests::test_is_inference_recompilation, test/dynamo/test_functions.py::FunctionTests::test_is_integer, test/dynamo/test_functions.py::FunctionTests::test_is_not, test/dynamo/test_functions.py::FunctionTests::test_is_not_null, test/dynamo/test_functions.py::FunctionTests::test_is_quantized, test/dynamo/test_functions.py::FunctionTests::test_is_sparse, test/dynamo/test_functions.py::FunctionTests::test_isinstance, test/dynamo/test_functions.py::FunctionTests::test_islice_chain, test/dynamo/test_functions.py::FunctionTests::test_itemgetter, test/dynamo/test_functions.py::FunctionTests::test_itertools_chain, test/dynamo/test_functions.py::FunctionTests::test_itertools_chain_from_iterable, test/dynamo/test_functions.py::FunctionTests::test_itertools_combinations, test/dynamo/test_functions.py::FunctionTests::test_itertools_compress, test/dynamo/test_functions.py::FunctionTests::test_itertools_compress_tensors, test/dynamo/test_functions.py::FunctionTests::test_itertools_filterfalse_basic, test/dynamo/test_functions.py::FunctionTests::test_itertools_pairwise, test/dynamo/test_functions.py::FunctionTests::test_itertools_permutations_args, test/dynamo/test_functions.py::FunctionTests::test_itertools_permutations_basic, test/dynamo/test_functions.py::FunctionTests::test_itertools_permutations_various_iterators, test/dynamo/test_functions.py::FunctionTests::test_itertools_product, test/dynamo/test_functions.py::FunctionTests::test_itertools_product_args, test/dynamo/test_functions.py::FunctionTests::test_itertools_product_various_iterators, test/dynamo/test_functions.py::FunctionTests::test_itertools_reconstruct, test/dynamo/test_functions.py::FunctionTests::test_jit_annotate, test/dynamo/test_functions.py::FunctionTests::test_len_constant_dict, test/dynamo/test_functions.py::FunctionTests::test_len_constant_list, test/dynamo/test_functions.py::FunctionTests::test_len_constant_misc_iterables, test/dynamo/test_functions.py::FunctionTests::test_len_tensor, test/dynamo/test_functions.py::FunctionTests::test_list_add, test/dynamo/test_functions.py::FunctionTests::test_list_add_then_mutate, test/dynamo/test_functions.py::FunctionTests::test_list_clear, test/dynamo/test_functions.py::FunctionTests::test_list_compare_polyfill, test/dynamo/test_functions.py::FunctionTests::test_list_compare_polyfill_non_lists, test/dynamo/test_functions.py::FunctionTests::test_list_convert, test/dynamo/test_functions.py::FunctionTests::test_list_expand_lhs, test/dynamo/test_functions.py::FunctionTests::test_list_index_with_constant_tensor, test/dynamo/test_functions.py::FunctionTests::test_list_reversed, test/dynamo/test_functions.py::FunctionTests::test_list_setitem, test/dynamo/test_functions.py::FunctionTests::test_list_setitem_slice, test/dynamo/test_functions.py::FunctionTests::test_list_slice, test/dynamo/test_functions.py::FunctionTests::test_list_slice_assignment, test/dynamo/test_functions.py::FunctionTests::test_list_sorted1, test/dynamo/test_functions.py::FunctionTests::test_list_sorted2, test/dynamo/test_functions.py::FunctionTests::test_list_truth, test/dynamo/test_functions.py::FunctionTests::test_listarg1, test/dynamo/test_functions.py::FunctionTests::test_listarg2, test/dynamo/test_functions.py::FunctionTests::test_listarg3, test/dynamo/test_functions.py::FunctionTests::test_listarg4, test/dynamo/test_functions.py::FunctionTests::test_listarg5, test/dynamo/test_functions.py::FunctionTests::test_load_global_bool, test/dynamo/test_functions.py::FunctionTests::test_lru_cache_warning_issued_during_tracing, test/dynamo/test_functions.py::FunctionTests::test_mT, test/dynamo/test_functions.py::FunctionTests::test_manual_seed, test/dynamo/test_functions.py::FunctionTests::test_map_call_function_ex, test/dynamo/test_functions.py::FunctionTests::test_map_deque_extendleft, test/dynamo/test_functions.py::FunctionTests::test_map_dict_fromkeys, test/dynamo/test_functions.py::FunctionTests::test_map_enumerate, test/dynamo/test_functions.py::FunctionTests::test_map_infinite, test/dynamo/test_functions.py::FunctionTests::test_map_iter, test/dynamo/test_functions.py::FunctionTests::test_map_list, test/dynamo/test_functions.py::FunctionTests::test_map_list_extend, test/dynamo/test_functions.py::FunctionTests::test_map_list_slice_assign, test/dynamo/test_functions.py::FunctionTests::test_map_max, test/dynamo/test_functions.py::FunctionTests::test_map_max_const, test/dynamo/test_functions.py::FunctionTests::test_map_partial_unpack, test/dynamo/test_functions.py::FunctionTests::test_map_reconstruct, test/dynamo/test_functions.py::FunctionTests::test_map_reduce, test/dynamo/test_functions.py::FunctionTests::test_map_return, test/dynamo/test_functions.py::FunctionTests::test_map_set, test/dynamo/test_functions.py::FunctionTests::test_map_sorted, test/dynamo/test_functions.py::FunctionTests::test_map_str_join, test/dynamo/test_functions.py::FunctionTests::test_map_sum, test/dynamo/test_functions.py::FunctionTests::test_map_tuple, test/dynamo/test_functions.py::FunctionTests::test_map_unpack_twice, test/dynamo/test_functions.py::FunctionTests::test_map_unpack_vars, test/dynamo/test_functions.py::FunctionTests::test_map_with_graph_break, test/dynamo/test_functions.py::FunctionTests::test_map_zip_dict, test/dynamo/test_functions.py::FunctionTests::test_match_mapping_and_match_keys, test/dynamo/test_functions.py::FunctionTests::test_match_sequence, test/dynamo/test_functions.py::FunctionTests::test_math_fma, test/dynamo/test_functions.py::FunctionTests::test_math_radians, test/dynamo/test_functions.py::FunctionTests::test_mean_sum_np, test/dynamo/test_functions.py::FunctionTests::test_methodcall1, test/dynamo/test_functions.py::FunctionTests::test_methodcall2, test/dynamo/test_functions.py::FunctionTests::test_methodcall3, test/dynamo/test_functions.py::FunctionTests::test_methodcaller, test/dynamo/test_functions.py::FunctionTests::test_min_max, test/dynamo/test_functions.py::FunctionTests::test_module_constant, test/dynamo/test_functions.py::FunctionTests::test_namedtuple, test/dynamo/test_functions.py::FunctionTests::test_namedtuple_defaults, test/dynamo/test_functions.py::FunctionTests::test_namedtuple_fields, test/dynamo/test_functions.py::FunctionTests::test_namedtuple_hasattr, test/dynamo/test_functions.py::FunctionTests::test_namedtuple_replace, test/dynamo/test_functions.py::FunctionTests::test_namedtuple_subclass, test/dynamo/test_functions.py::FunctionTests::test_namedtuple_user_methods, test/dynamo/test_functions.py::FunctionTests::test_ndarray_builtin_functions, test/dynamo/test_functions.py::FunctionTests::test_ndarray_method, test/dynamo/test_functions.py::FunctionTests::test_ndarray_methods_returning_scalar, test/dynamo/test_functions.py::FunctionTests::test_ndarray_reshape, test/dynamo/test_functions.py::FunctionTests::test_ndarray_transpose, test/dynamo/test_functions.py::FunctionTests::test_ndim, test/dynamo/test_functions.py::FunctionTests::test_no_recompile_inner_function, test/dynamo/test_functions.py::FunctionTests::test_no_recompile_inner_lambda, test/dynamo/test_functions.py::FunctionTests::test_non_inlined_closure, test/dynamo/test_functions.py::FunctionTests::test_not_list, test/dynamo/test_functions.py::FunctionTests::test_np_constant_collections_as_input_int_or_float_float, test/dynamo/test_functions.py::FunctionTests::test_np_constant_collections_as_input_int_or_float_int, test/dynamo/test_functions.py::FunctionTests::test_np_constant_collections_guards_float, test/dynamo/test_functions.py::FunctionTests::test_np_constant_collections_guards_int, test/dynamo/test_functions.py::FunctionTests::test_np_finfo, test/dynamo/test_functions.py::FunctionTests::test_np_iinfo, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_as_integer_ratio_num_type0, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_as_integer_ratio_num_type3, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_bit_length_num_type1, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_conjugate_num_type2, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_conjugate_num_type4, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_hex_num_type5, test/dynamo/test_functions.py::FunctionTests::test_number_method_method_is_integer_num_type6, test/dynamo/test_functions.py::FunctionTests::test_numpy_attributes, test/dynamo/test_functions.py::FunctionTests::test_numpy_dtype_argument_to_function, test/dynamo/test_functions.py::FunctionTests::test_numpy_dtype_call_in_function, test/dynamo/test_functions.py::FunctionTests::test_numpy_fft, test/dynamo/test_functions.py::FunctionTests::test_numpy_linalg, test/dynamo/test_functions.py::FunctionTests::test_numpy_meshgrid, test/dynamo/test_functions.py::FunctionTests::test_numpy_random, test/dynamo/test_functions.py::FunctionTests::test_numpy_size, test/dynamo/test_functions.py::FunctionTests::test_obj_eq, test/dynamo/test_functions.py::FunctionTests::test_obj_is, test/dynamo/test_functions.py::FunctionTests::test_ordered_dict_kwargs, test/dynamo/test_functions.py::FunctionTests::test_partial_across_graph_break_uninvoked, test/dynamo/test_functions.py::FunctionTests::test_partials_as_input_UDF, test/dynamo/test_functions.py::FunctionTests::test_partials_as_input_partials_lambda, test/dynamo/test_functions.py::FunctionTests::test_partials_as_input_partials_mod, test/dynamo/test_functions.py::FunctionTests::test_partials_graph_break_reconstruct, test/dynamo/test_functions.py::FunctionTests::test_partials_graph_break_reconstruct_args_and_kwargs, test/dynamo/test_functions.py::FunctionTests::test_partials_graph_break_reconstruct_mix, test/dynamo/test_functions.py::FunctionTests::test_partials_graph_break_reconstruct_mix_no_source, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___annotations__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___builtins__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___call__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___class__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___closure__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___code__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___defaults__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___delattr__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___dict__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___dir__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___doc__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___eq__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___format__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___ge__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___get__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___getattribute__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___globals__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___gt__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___hash__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___init__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___init_subclass__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___kwdefaults__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___le__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___lt__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___module__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___name__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___ne__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___new__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___qualname__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___reduce__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___reduce_ex__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___repr__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___setattr__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___sizeof__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___str__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr___subclasshook__, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr_args, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr_func, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_attr_keywords, test/dynamo/test_functions.py::FunctionTests::test_partials_hasattr_set_attr, test/dynamo/test_functions.py::FunctionTests::test_partials_lambda, test/dynamo/test_functions.py::FunctionTests::test_partials_recompilation, test/dynamo/test_functions.py::FunctionTests::test_partials_torch_op_arg, test/dynamo/test_functions.py::FunctionTests::test_partials_torch_op_kwarg, test/dynamo/test_functions.py::FunctionTests::test_partials_udf_arg, test/dynamo/test_functions.py::FunctionTests::test_partials_udf_kwarg, test/dynamo/test_functions.py::FunctionTests::test_partials_udf_kwarg_method, test/dynamo/test_functions.py::FunctionTests::test_partials_udf_kwarg_module, test/dynamo/test_functions.py::FunctionTests::test_pop, test/dynamo/test_functions.py::FunctionTests::test_pos, test/dynamo/test_functions.py::FunctionTests::test_pos_only_args_with_same_name_in_star_kwargs, test/dynamo/test_functions.py::FunctionTests::test_pow_int, test/dynamo/test_functions.py::FunctionTests::test_promote_types, test/dynamo/test_functions.py::FunctionTests::test_rand_inlined, test/dynamo/test_functions.py::FunctionTests::test_rand_tensor_partial, test/dynamo/test_functions.py::FunctionTests::test_range1, test/dynamo/test_functions.py::FunctionTests::test_range2, test/dynamo/test_functions.py::FunctionTests::test_range_iterator, test/dynamo/test_functions.py::FunctionTests::test_range_iterator_2, test/dynamo/test_functions.py::FunctionTests::test_range_iterator_graph_break, test/dynamo/test_functions.py::FunctionTests::test_range_iterator_graph_break_2, test/dynamo/test_functions.py::FunctionTests::test_range_length, test/dynamo/test_functions.py::FunctionTests::test_range_with_index, test/dynamo/test_functions.py::FunctionTests::test_range_with_slice_index, test/dynamo/test_functions.py::FunctionTests::test_reduce, test/dynamo/test_functions.py::FunctionTests::test_reduce_with_initial, test/dynamo/test_functions.py::FunctionTests::test_reduce_with_none_initial, test/dynamo/test_functions.py::FunctionTests::test_reduce_with_single, test/dynamo/test_functions.py::FunctionTests::test_reduce_with_single_with_initial, test/dynamo/test_functions.py::FunctionTests::test_return_dict, test/dynamo/test_functions.py::FunctionTests::test_return_dict2, test/dynamo/test_functions.py::FunctionTests::test_return_multiple_numpy_ndarray, test/dynamo/test_functions.py::FunctionTests::test_return_numpy_ndarray, test/dynamo/test_functions.py::FunctionTests::test_return_tuple1, test/dynamo/test_functions.py::FunctionTests::test_return_tuple2, test/dynamo/test_functions.py::FunctionTests::test_returning_recursive_func, test/dynamo/test_functions.py::FunctionTests::test_round, test/dynamo/test_functions.py::FunctionTests::test_set_add, test/dynamo/test_functions.py::FunctionTests::test_set_in_frozenset, test/dynamo/test_functions.py::FunctionTests::test_set_keys_view, test/dynamo/test_functions.py::FunctionTests::test_set_update_bytecode, test/dynamo/test_functions.py::FunctionTests::test_set_update_list_with_duplicated_items, test/dynamo/test_functions.py::FunctionTests::test_shape1, test/dynamo/test_functions.py::FunctionTests::test_shape2, test/dynamo/test_functions.py::FunctionTests::test_size_tuple_add, test/dynamo/test_functions.py::FunctionTests::test_slice1, test/dynamo/test_functions.py::FunctionTests::test_slice2, test/dynamo/test_functions.py::FunctionTests::test_slice3, test/dynamo/test_functions.py::FunctionTests::test_slice4, test/dynamo/test_functions.py::FunctionTests::test_slice5, test/dynamo/test_functions.py::FunctionTests::test_slice6, test/dynamo/test_functions.py::FunctionTests::test_slice_eq, test/dynamo/test_functions.py::FunctionTests::test_sliced_range, test/dynamo/test_functions.py::FunctionTests::test_sorted_const_key_non_const_items, test/dynamo/test_functions.py::FunctionTests::test_sourceless_build_method_type, test/dynamo/test_functions.py::FunctionTests::test_startswith, test/dynamo/test_functions.py::FunctionTests::test_sum, test/dynamo/test_functions.py::FunctionTests::test_sum_shortcut, test/dynamo/test_functions.py::FunctionTests::test_sum_shortcut_with_start_arg, test/dynamo/test_functions.py::FunctionTests::test_sum_shortcut_with_start_kwarg, test/dynamo/test_functions.py::FunctionTests::test_sum_with_start_arg, test/dynamo/test_functions.py::FunctionTests::test_sum_with_start_kwarg, test/dynamo/test_functions.py::FunctionTests::test_symbool_to_int, test/dynamo/test_functions.py::FunctionTests::test_tensor_dim, test/dynamo/test_functions.py::FunctionTests::test_tensor_element_size, test/dynamo/test_functions.py::FunctionTests::test_tensor_is_complex, test/dynamo/test_functions.py::FunctionTests::test_tensor_len, test/dynamo/test_functions.py::FunctionTests::test_tensor_new_with_shape, test/dynamo/test_functions.py::FunctionTests::test_tensor_new_with_size, test/dynamo/test_functions.py::FunctionTests::test_tensor_size, test/dynamo/test_functions.py::FunctionTests::test_tensor_size_indexed_by_symint, test/dynamo/test_functions.py::FunctionTests::test_tensor_type, test/dynamo/test_functions.py::FunctionTests::test_tensor_type2, test/dynamo/test_functions.py::FunctionTests::test_tensor_type3, test/dynamo/test_functions.py::FunctionTests::test_tensor_type4, test/dynamo/test_functions.py::FunctionTests::test_tensor_type5, test/dynamo/test_functions.py::FunctionTests::test_to, test/dynamo/test_functions.py::FunctionTests::test_torch_distributions_functions, test/dynamo/test_functions.py::FunctionTests::test_torch_from_numpy, test/dynamo/test_functions.py::FunctionTests::test_torch_get_device_module, test/dynamo/test_functions.py::FunctionTests::test_torch_size_as_dict_key, test/dynamo/test_functions.py::FunctionTests::test_torch_size_hasattr, test/dynamo/test_functions.py::FunctionTests::test_torch_source, test/dynamo/test_functions.py::FunctionTests::test_transpose_for_scores, test/dynamo/test_functions.py::FunctionTests::test_truth, test/dynamo/test_functions.py::FunctionTests::test_tuple1, test/dynamo/test_functions.py::FunctionTests::test_tuple2, test/dynamo/test_functions.py::FunctionTests::test_tuple_contains, test/dynamo/test_functions.py::FunctionTests::test_tuple_iadd, test/dynamo/test_functions.py::FunctionTests::test_tuple_map, test/dynamo/test_functions.py::FunctionTests::test_tuple_sorted, test/dynamo/test_functions.py::FunctionTests::test_two_point_iter, test/dynamo/test_functions.py::FunctionTests::test_unary_fold_op, test/dynamo/test_functions.py::FunctionTests::test_unary_fold_op_seq, test/dynamo/test_functions.py::FunctionTests::test_unpack1, test/dynamo/test_functions.py::FunctionTests::test_unpack2, test/dynamo/test_functions.py::FunctionTests::test_unpack3, test/dynamo/test_functions.py::FunctionTests::test_unpack_ex1, test/dynamo/test_functions.py::FunctionTests::test_unpack_ex2, test/dynamo/test_functions.py::FunctionTests::test_unpack_ex3, test/dynamo/test_functions.py::FunctionTests::test_unpack_mutable_map, test/dynamo/test_functions.py::FunctionTests::test_unsqueeze_inplace, test/dynamo/test_functions.py::FunctionTests::test_viamethod, test/dynamo/test_functions.py::FunctionTests::test_viatorch, test/dynamo/test_functions.py::FunctionTests::test_zip_longest, test/dynamo/test_functions.py::FunctionTests::test_zip_reconstruct, test/dynamo/test_functions.py::DefaultsTests::test_cast_tensor_single_elem, test/dynamo/test_functions.py::DefaultsTests::test_dataclass_factory, test/dynamo/test_functions.py::DefaultsTests::test_dataclass_nested, test/dynamo/test_functions.py::DefaultsTests::test_fn_with_attr, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_construction, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_illegal_call_method, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_reconstruction, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_return_type_method_name_copy, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_return_type_method_name_difference, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_return_type_method_name_intersection, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_return_type_method_name_symmetric_difference, test/dynamo/test_functions.py::DefaultsTests::test_frozenset_return_type_method_name_union, test/dynamo/test_functions.py::DefaultsTests::test_full_with_tensor_fill_value, test/dynamo/test_functions.py::DefaultsTests::test_func_attrs, test/dynamo/test_functions.py::DefaultsTests::test_func_default_tensor_args, test/dynamo/test_functions.py::DefaultsTests::test_func_default_torch_args, test/dynamo/test_functions.py::DefaultsTests::test_functional_compile, test/dynamo/test_functions.py::DefaultsTests::test_functools_partial_id, test/dynamo/test_functions.py::DefaultsTests::test_fx_immutable_list_mutation_not_allowed, test/dynamo/test_functions.py::DefaultsTests::test_fx_map_aggregate, test/dynamo/test_functions.py::DefaultsTests::test_gpu_current_device, test/dynamo/test_functions.py::DefaultsTests::test_in_set_inplace, test/dynamo/test_functions.py::DefaultsTests::test_in_set_would_fail_broadcast, test/dynamo/test_functions.py::DefaultsTests::test_inspect_method_source, test/dynamo/test_functions.py::DefaultsTests::test_is_init_in_compile_mutated_tensor_tensor, test/dynamo/test_functions.py::DefaultsTests::test_is_init_in_compile_vmapped_mutated_tensor_tensor, test/dynamo/test_functions.py::DefaultsTests::test_is_init_in_compile_vmapped_mutated_tensor_tensor_multi_arg, test/dynamo/test_functions.py::DefaultsTests::test_is_mutated_tensor_tensor, test/dynamo/test_functions.py::DefaultsTests::test_is_mutated_tensor_tensor_across_graph_break, test/dynamo/test_functions.py::DefaultsTests::test_is_not_tensor_tensor, test/dynamo/test_functions.py::DefaultsTests::test_is_tensor_tensor, test/dynamo/test_functions.py::DefaultsTests::test_is_vmapped_mutated_tensor_tensor, test/dynamo/test_functions.py::DefaultsTests::test_keyword, test/dynamo/test_functions.py::DefaultsTests::test_listlike_of_tensors_contains_constant, test/dynamo/test_functions.py::DefaultsTests::test_map_strict, test/dynamo/test_functions.py::DefaultsTests::test_map_strict_with_graph_break, test/dynamo/test_functions.py::DefaultsTests::test_meth_default_tensor_args, test/dynamo/test_functions.py::DefaultsTests::test_property_class_transmute, test/dynamo/test_functions.py::DefaultsTests::test_property_functools_partial, test/dynamo/test_functions.py::DefaultsTests::test_pybind_object, test/dynamo/test_functions.py::DefaultsTests::test_reconstructed_name, test/dynamo/test_functions.py::DefaultsTests::test_set_call___init___frozenset, test/dynamo/test_functions.py::DefaultsTests::test_set_call___init___set, test/dynamo/test_functions.py::DefaultsTests::test_set_construction, test/dynamo/test_functions.py::DefaultsTests::test_skip_function_call_very_weird_value, test/dynamo/test_functions.py::DefaultsTests::test_str_handler_for_user_defined_object, test/dynamo/test_functions.py::DefaultsTests::test_sys_recursionlimit, test/dynamo/test_functions.py::DefaultsTests::test_tree_map, test/dynamo/test_functions.py::DefaultsTests::test_udf_list, test/dynamo/test_functions.py::DefaultsTests::test_udf_list_reconstruction, test/dynamo/test_functions.py::DefaultsTests::test_udf_list_slice, test/dynamo/test_functions.py::DefaultsTests::test_udf_namedtuple, test/dynamo/test_functions.py::DefaultsTests::test_udf_tuple, test/dynamo/test_functions.py::DefaultsTests::test_udf_tuple_construction, test/dynamo/test_functions.py::DefaultsTests::test_udf_tuple_construction_custom_new, test/dynamo/test_functions.py::DefaultsTests::test_udf_tuple_reconstruction, test/dynamo/test_functions.py::DefaultsTests::test_zip_strict 2025-12-04T15:30:53.9741983Z 2025-12-04T15:30:53.9742351Z Finished dynamo/test_functions 1/1 ... [2025-12-04 15:30:53.940796][22658.062014036], took 1.11min 2025-12-04T15:30:53.9747976Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_functions/dynamo.test_functions-e9d213b32d56adfd.xml 2025-12-04T15:30:54.0581704Z Running inductor/test_ordered_set 1/1 ... [2025-12-04 15:30:54.057896][22658.179114764] 2025-12-04T15:30:54.0582325Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:30:54.0585406Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_ordered_set.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:30:54.058284] 2025-12-04T15:31:01.9332668Z 2025-12-04T15:31:01.9333978Z inductor/test_ordered_set 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_ordered_set_1.1_77140872da5a7126_.log 2025-12-04T15:31:01.9484960Z Running 401 items in this shard: test/inductor/test_ordered_set.py::TestJointOps::test_and, test/inductor/test_ordered_set.py::TestJointOps::test_badcmp, test/inductor/test_ordered_set.py::TestJointOps::test_container_iterator, test/inductor/test_ordered_set.py::TestJointOps::test_contains, test/inductor/test_ordered_set.py::TestJointOps::test_cyclical_repr, test/inductor/test_ordered_set.py::TestJointOps::test_deepcopy, test/inductor/test_ordered_set.py::TestJointOps::test_difference, test/inductor/test_ordered_set.py::TestJointOps::test_do_not_rehash_dict_keys, test/inductor/test_ordered_set.py::TestJointOps::test_equality, test/inductor/test_ordered_set.py::TestJointOps::test_free_after_iterating, test/inductor/test_ordered_set.py::TestJointOps::test_gc, test/inductor/test_ordered_set.py::TestJointOps::test_intersection, test/inductor/test_ordered_set.py::TestJointOps::test_isdisjoint, test/inductor/test_ordered_set.py::TestJointOps::test_iterator_pickling, test/inductor/test_ordered_set.py::TestJointOps::test_len, test/inductor/test_ordered_set.py::TestJointOps::test_new_or_init, test/inductor/test_ordered_set.py::TestJointOps::test_or, test/inductor/test_ordered_set.py::TestJointOps::test_pickling, test/inductor/test_ordered_set.py::TestJointOps::test_setOfFrozensets, test/inductor/test_ordered_set.py::TestJointOps::test_sub, test/inductor/test_ordered_set.py::TestJointOps::test_sub_and_super, test/inductor/test_ordered_set.py::TestJointOps::test_subclass_with_custom_hash, test/inductor/test_ordered_set.py::TestJointOps::test_symmetric_difference, test/inductor/test_ordered_set.py::TestJointOps::test_union, test/inductor/test_ordered_set.py::TestJointOps::test_uniquification, test/inductor/test_ordered_set.py::TestJointOps::test_xor, test/inductor/test_ordered_set.py::TestSet::test_add, test/inductor/test_ordered_set.py::TestSet::test_and, test/inductor/test_ordered_set.py::TestSet::test_badcmp, test/inductor/test_ordered_set.py::TestSet::test_clear, test/inductor/test_ordered_set.py::TestSet::test_constructor_identity, test/inductor/test_ordered_set.py::TestSet::test_container_iterator, test/inductor/test_ordered_set.py::TestSet::test_contains, test/inductor/test_ordered_set.py::TestSet::test_copy, test/inductor/test_ordered_set.py::TestSet::test_cyclical_repr, test/inductor/test_ordered_set.py::TestSet::test_deepcopy, test/inductor/test_ordered_set.py::TestSet::test_difference, test/inductor/test_ordered_set.py::TestSet::test_difference_update, test/inductor/test_ordered_set.py::TestSet::test_discard, test/inductor/test_ordered_set.py::TestSet::test_do_not_rehash_dict_keys, test/inductor/test_ordered_set.py::TestSet::test_equality, test/inductor/test_ordered_set.py::TestSet::test_free_after_iterating, test/inductor/test_ordered_set.py::TestSet::test_gc, test/inductor/test_ordered_set.py::TestSet::test_hash, test/inductor/test_ordered_set.py::TestSet::test_iand, test/inductor/test_ordered_set.py::TestSet::test_init, test/inductor/test_ordered_set.py::TestSet::test_inplace_on_self, test/inductor/test_ordered_set.py::TestSet::test_intersection, test/inductor/test_ordered_set.py::TestSet::test_intersection_update, test/inductor/test_ordered_set.py::TestSet::test_ior, test/inductor/test_ordered_set.py::TestSet::test_isdisjoint, test/inductor/test_ordered_set.py::TestSet::test_isub, test/inductor/test_ordered_set.py::TestSet::test_iterator_pickling, test/inductor/test_ordered_set.py::TestSet::test_ixor, test/inductor/test_ordered_set.py::TestSet::test_len, test/inductor/test_ordered_set.py::TestSet::test_new_or_init, test/inductor/test_ordered_set.py::TestSet::test_or, test/inductor/test_ordered_set.py::TestSet::test_pickling, test/inductor/test_ordered_set.py::TestSet::test_pop, test/inductor/test_ordered_set.py::TestSet::test_remove, test/inductor/test_ordered_set.py::TestSet::test_remove_keyerror_set, test/inductor/test_ordered_set.py::TestSet::test_remove_keyerror_unpacking, test/inductor/test_ordered_set.py::TestSet::test_rich_compare, test/inductor/test_ordered_set.py::TestSet::test_setOfFrozensets, test/inductor/test_ordered_set.py::TestSet::test_set_literal, test/inductor/test_ordered_set.py::TestSet::test_set_literal_evaluation_order, test/inductor/test_ordered_set.py::TestSet::test_set_literal_insertion_order, test/inductor/test_ordered_set.py::TestSet::test_sub, test/inductor/test_ordered_set.py::TestSet::test_sub_and_super, test/inductor/test_ordered_set.py::TestSet::test_subclass_with_custom_hash, test/inductor/test_ordered_set.py::TestSet::test_symmetric_difference, test/inductor/test_ordered_set.py::TestSet::test_symmetric_difference_update, test/inductor/test_ordered_set.py::TestSet::test_union, test/inductor/test_ordered_set.py::TestSet::test_uniquification, test/inductor/test_ordered_set.py::TestSet::test_update, test/inductor/test_ordered_set.py::TestSet::test_weakref, test/inductor/test_ordered_set.py::TestSet::test_xor, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_length, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsEmpty::test_union_empty, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_in, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_length, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_not_in, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsSingleton::test_union_empty, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_in, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_length, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_not_in, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsTuple::test_union_empty, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_length, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsTriple::test_union_empty, test/inductor/test_ordered_set.py::TestBasicOpsString::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsString::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsString::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsString::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsString::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsString::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsString::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsString::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsString::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsString::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsString::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsString::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsString::test_length, test/inductor/test_ordered_set.py::TestBasicOpsString::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsString::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsString::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsString::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsString::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsString::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsString::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsString::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsString::test_union_empty, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_length, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsBytes::test_union_empty, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_copy, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_empty_difference, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_empty_difference_rev, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_empty_intersection, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_empty_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_empty_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_empty_union, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_equivalent_equality, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_intersection_empty, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_isdisjoint_empty, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_issue_37219, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_iteration, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_length, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_pickling, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_repr, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_self_difference, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_self_equality, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_self_intersection, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_self_isdisjoint, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_self_symmetric_difference, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_self_union, test/inductor/test_ordered_set.py::TestBasicOpsMixedStringBytes::test_union_empty, test/inductor/test_ordered_set.py::TestExceptionPropagation::test_changingSizeWhileIterating, test/inductor/test_ordered_set.py::TestExceptionPropagation::test_instanceWithException, test/inductor/test_ordered_set.py::TestExceptionPropagation::test_instancesWithoutException, test/inductor/test_ordered_set.py::TestSetOfSets::test_constructor, test/inductor/test_ordered_set.py::TestBinaryOps::test_eq, test/inductor/test_ordered_set.py::TestBinaryOps::test_intersection_non_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_intersection_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_intersection_subset, test/inductor/test_ordered_set.py::TestBinaryOps::test_intersection_superset, test/inductor/test_ordered_set.py::TestBinaryOps::test_isdisjoint_non_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_isdisjoint_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_isdisjoint_subset, test/inductor/test_ordered_set.py::TestBinaryOps::test_isdisjoint_superset, test/inductor/test_ordered_set.py::TestBinaryOps::test_sym_difference_non_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_sym_difference_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_sym_difference_subset, test/inductor/test_ordered_set.py::TestBinaryOps::test_sym_difference_superset, test/inductor/test_ordered_set.py::TestBinaryOps::test_union_non_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_union_overlap, test/inductor/test_ordered_set.py::TestBinaryOps::test_union_subset, test/inductor/test_ordered_set.py::TestBinaryOps::test_union_superset, test/inductor/test_ordered_set.py::TestUpdateOps::test_difference_method_call, test/inductor/test_ordered_set.py::TestUpdateOps::test_difference_non_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_difference_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_difference_subset, test/inductor/test_ordered_set.py::TestUpdateOps::test_difference_superset, test/inductor/test_ordered_set.py::TestUpdateOps::test_intersection_method_call, test/inductor/test_ordered_set.py::TestUpdateOps::test_intersection_non_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_intersection_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_intersection_subset, test/inductor/test_ordered_set.py::TestUpdateOps::test_intersection_superset, test/inductor/test_ordered_set.py::TestUpdateOps::test_sym_difference_method_call, test/inductor/test_ordered_set.py::TestUpdateOps::test_sym_difference_non_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_sym_difference_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_sym_difference_subset, test/inductor/test_ordered_set.py::TestUpdateOps::test_sym_difference_superset, test/inductor/test_ordered_set.py::TestUpdateOps::test_union_method_call, test/inductor/test_ordered_set.py::TestUpdateOps::test_union_non_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_union_overlap, test/inductor/test_ordered_set.py::TestUpdateOps::test_union_subset, test/inductor/test_ordered_set.py::TestUpdateOps::test_union_superset, test/inductor/test_ordered_set.py::TestMutate::test_add_absent, test/inductor/test_ordered_set.py::TestMutate::test_add_present, test/inductor/test_ordered_set.py::TestMutate::test_add_until_full, test/inductor/test_ordered_set.py::TestMutate::test_clear, test/inductor/test_ordered_set.py::TestMutate::test_discard_absent, test/inductor/test_ordered_set.py::TestMutate::test_discard_present, test/inductor/test_ordered_set.py::TestMutate::test_pop, test/inductor/test_ordered_set.py::TestMutate::test_remove_absent, test/inductor/test_ordered_set.py::TestMutate::test_remove_present, test/inductor/test_ordered_set.py::TestMutate::test_remove_until_empty, test/inductor/test_ordered_set.py::TestMutate::test_update_empty_tuple, test/inductor/test_ordered_set.py::TestMutate::test_update_unit_tuple_non_overlap, test/inductor/test_ordered_set.py::TestMutate::test_update_unit_tuple_overlap, test/inductor/test_ordered_set.py::TestSubsets::test_issubset, test/inductor/test_ordered_set.py::TestSubsetEqualEmpty::test_issubset, test/inductor/test_ordered_set.py::TestSubsetEqualNonEmpty::test_issubset, test/inductor/test_ordered_set.py::TestSubsetEmptyNonEmpty::test_issubset, test/inductor/test_ordered_set.py::TestSubsetPartial::test_issubset, test/inductor/test_ordered_set.py::TestSubsetNonOverlap::test_issubset, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_difference, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_eq_ne, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_ge_gt_le_lt, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_intersection, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_intersection_update, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_intersection_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_sym_difference, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_sym_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_sym_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_union, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_update, test/inductor/test_ordered_set.py::TestOnlySetsNumeric::test_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_difference, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_eq_ne, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_ge_gt_le_lt, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_intersection, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_intersection_update, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_intersection_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_sym_difference, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_sym_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_sym_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_union, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_update, test/inductor/test_ordered_set.py::TestOnlySetsDict::test_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_difference, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_eq_ne, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_ge_gt_le_lt, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_intersection, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_intersection_update, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_intersection_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_sym_difference, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_sym_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_sym_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_union, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_update, test/inductor/test_ordered_set.py::TestOnlySetsOperator::test_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_difference, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_eq_ne, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_ge_gt_le_lt, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_intersection, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_intersection_update, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_intersection_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_sym_difference, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_sym_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_sym_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_union, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_update, test/inductor/test_ordered_set.py::TestOnlySetsTuple::test_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsString::test_difference, test/inductor/test_ordered_set.py::TestOnlySetsString::test_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsString::test_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsString::test_eq_ne, test/inductor/test_ordered_set.py::TestOnlySetsString::test_ge_gt_le_lt, test/inductor/test_ordered_set.py::TestOnlySetsString::test_intersection, test/inductor/test_ordered_set.py::TestOnlySetsString::test_intersection_update, test/inductor/test_ordered_set.py::TestOnlySetsString::test_intersection_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsString::test_sym_difference, test/inductor/test_ordered_set.py::TestOnlySetsString::test_sym_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsString::test_sym_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsString::test_union, test/inductor/test_ordered_set.py::TestOnlySetsString::test_update, test/inductor/test_ordered_set.py::TestOnlySetsString::test_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_difference, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_eq_ne, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_ge_gt_le_lt, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_intersection, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_intersection_update, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_intersection_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_sym_difference, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_sym_difference_update, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_sym_difference_update_operator, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_union, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_update, test/inductor/test_ordered_set.py::TestOnlySetsGenerator::test_update_operator, test/inductor/test_ordered_set.py::TestCopyingEmpty::test_copy, test/inductor/test_ordered_set.py::TestCopyingEmpty::test_deep_copy, test/inductor/test_ordered_set.py::TestCopyingSingleton::test_copy, test/inductor/test_ordered_set.py::TestCopyingSingleton::test_deep_copy, test/inductor/test_ordered_set.py::TestCopyingTriple::test_copy, test/inductor/test_ordered_set.py::TestCopyingTriple::test_deep_copy, test/inductor/test_ordered_set.py::TestCopyingTuple::test_copy, test/inductor/test_ordered_set.py::TestCopyingTuple::test_deep_copy, test/inductor/test_ordered_set.py::TestCopyingNested::test_copy, test/inductor/test_ordered_set.py::TestCopyingNested::test_deep_copy, test/inductor/test_ordered_set.py::TestIdentities::test_binopsVsSubsets, test/inductor/test_ordered_set.py::TestIdentities::test_commutativity, test/inductor/test_ordered_set.py::TestIdentities::test_exclusion, test/inductor/test_ordered_set.py::TestIdentities::test_summations, test/inductor/test_ordered_set.py::TestVariousIteratorArgs::test_constructor, test/inductor/test_ordered_set.py::TestVariousIteratorArgs::test_inline_methods, test/inductor/test_ordered_set.py::TestVariousIteratorArgs::test_inplace_methods, test/inductor/test_ordered_set.py::TestWeirdBugs::test_8420_set_merge, test/inductor/test_ordered_set.py::TestWeirdBugs::test_iter_and_mutate, test/inductor/test_ordered_set.py::TestWeirdBugs::test_merge_and_mutate, test/inductor/test_ordered_set.py::TestGraphs::test_cube, test/inductor/test_ordered_set.py::TestGraphs::test_cuboctahedron 2025-12-04T15:31:01.9632648Z 2025-12-04T15:31:01.9633013Z Finished inductor/test_ordered_set 1/1 ... [2025-12-04 15:31:01.933748][22666.054965127], took 0.13min 2025-12-04T15:31:01.9678119Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_ordered_set/inductor.test_ordered_set-b674b38a5211e06f.xml 2025-12-04T15:31:02.0666678Z Running inductor/test_pallas 1/1 ... [2025-12-04 15:31:02.066378][22666.187597779] 2025-12-04T15:31:02.0667246Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:31:02.0670184Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_pallas.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:31:02.066771] 2025-12-04T15:31:13.7502063Z 2025-12-04T15:31:13.7503067Z inductor/test_pallas 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_pallas_1.1_cb3d6eceb8711a40_.log 2025-12-04T15:31:13.7503879Z 2025-12-04T15:31:13.7504437Z Finished inductor/test_pallas 1/1 ... [2025-12-04 15:31:13.749983][22677.871203051], took 0.19min 2025-12-04T15:31:13.7840656Z Running dynamo/test_install_free_tensors 1/1 ... [2025-12-04 15:31:13.783794][22677.905014832] 2025-12-04T15:31:13.7841268Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:31:13.7844765Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_install_free_tensors.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:31:13.784208] 2025-12-04T15:31:35.9785561Z 2025-12-04T15:31:35.9788546Z dynamo/test_install_free_tensors 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_install_free_tensors_1.1_dc3bcfdcc8893d8d_.log 2025-12-04T15:31:35.9801704Z Running 25 items in this shard: test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_breadth_linear, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_nested_linear, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_nets_as_input, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_optimizing_buffer_and_param_in_input, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_optimizing_buffer_in_input, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_optimizing_linear, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_optimizing_params_in_input, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_resnet_structure, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_simple_batchnorm, test/dynamo/test_install_free_tensors.py::InstallParamsAsGraphAttrTests::test_transformer, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_dict_of_tensor, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_global_tensor_export, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_list_of_tensor, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_modify_net_state, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_nested_list_of_tensor, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_nonlocal_closure, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_optimizing_buffer_and_param_in_input, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_optimizing_buffer_in_input, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_optimizing_params_in_input, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_resnet_structure, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_simple_batchnorm, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_simple_linear, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_tensors_as_nn_attr, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_transformer, test/dynamo/test_install_free_tensors.py::InstallParamsWhenExport::test_user_defined_object 2025-12-04T15:31:35.9814384Z 2025-12-04T15:31:35.9814790Z Finished dynamo/test_install_free_tensors 1/1 ... [2025-12-04 15:31:35.978368][22700.099587796], took 0.37min 2025-12-04T15:31:36.0129311Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_install_free_tensors/dynamo.test_install_free_tensors-4c842a4ed6e31f3f.xml 2025-12-04T15:31:36.1000545Z Running inductor/test_torchinductor_codegen_config_overrides 1/1 ... [2025-12-04 15:31:36.099792][22700.221011266] 2025-12-04T15:31:36.1001300Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:31:36.1004271Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_codegen_config_overrides.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:31:36.100164] 2025-12-04T15:31:54.8914636Z 2025-12-04T15:31:54.8916535Z inductor/test_torchinductor_codegen_config_overrides 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_codegen_config_overrides_1.1_1597e2f7db8ab1bf_.log 2025-12-04T15:31:54.8921053Z Running 4 items in this shard: test/inductor/test_torchinductor_codegen_config_overrides.py::CodegenInductorTest::test_cse_make_block_ptr_reduction, test/inductor/test_torchinductor_codegen_config_overrides.py::CodegenInductorTest::test_force_pointwise_cat_force_pointwise_cat_False, test/inductor/test_torchinductor_codegen_config_overrides.py::CodegenInductorTest::test_force_pointwise_cat_force_pointwise_cat_True, test/inductor/test_torchinductor_codegen_config_overrides.py::CodegenInductorTest::test_kernel_fusion_thresholds 2025-12-04T15:31:54.8923893Z 2025-12-04T15:31:54.8924414Z Finished inductor/test_torchinductor_codegen_config_overrides 1/1 ... [2025-12-04 15:31:54.891243][22719.012464519], took 0.31min 2025-12-04T15:31:54.9255912Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchinductor_codegen_config_overrides/inductor.test_torchinductor_codegen_config_overrides-15f0302c88c6ef46.xml 2025-12-04T15:31:55.0448112Z Running export/test_passes 1/1 ... [2025-12-04 15:31:55.044446][22719.165663314] 2025-12-04T15:31:55.0448711Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:31:55.0451488Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_passes.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:31:55.044853] 2025-12-04T15:32:49.7819032Z 2025-12-04T15:32:49.7820081Z export/test_passes 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_passes_1.1_5fdccc59815cb06b_.log 2025-12-04T15:32:49.7830727Z Running 28 items in this shard: test/export/test_passes.py::TestPasses::test_constant_folding_pass, test/export/test_passes.py::TestPasses::test_custom_obj_tuple_out, test/export/test_passes.py::TestPasses::test_fakify_script_objects, test/export/test_passes.py::TestPasses::test_fakify_script_objects_properly_handle_containers, test/export/test_passes.py::TestPasses::test_functionalization_with_view_copy, test/export/test_passes.py::TestPasses::test_inline_, test/export/test_passes.py::TestPasses::test_math_ops, test/export/test_passes.py::TestPasses::test_move_device_example_inputs, test/export/test_passes.py::TestPasses::test_move_device_submod, test/export/test_passes.py::TestPasses::test_move_device_to, test/export/test_passes.py::TestPasses::test_move_to_device_pass, test/export/test_passes.py::TestPasses::test_predispatch_autocast, test/export/test_passes.py::TestPasses::test_predispatch_autocast_and_set_grad, test/export/test_passes.py::TestPasses::test_predispatch_set_grad, test/export/test_passes.py::TestPasses::test_remove_auto_functionalized_pass, test/export/test_passes.py::TestPasses::test_remove_auto_functionalized_pass_tuple, test/export/test_passes.py::TestPasses::test_remove_effect_token_kwargs, test/export/test_passes.py::TestPasses::test_runtime_assert_inline_constraints_for_cond, test/export/test_passes.py::TestPasses::test_runtime_assert_inline_constraints_for_item, test/export/test_passes.py::TestPasses::test_runtime_assert_inline_constraints_for_nonzero, test/export/test_passes.py::TestPasses::test_runtime_assert_multiple_dims, test/export/test_passes.py::TestPasses::test_runtime_assert_one_dim, test/export/test_passes.py::TestPasses::test_runtime_assert_some_dims_not_specified, test/export/test_passes.py::TestPasses::test_runtime_assert_some_inps_not_used, test/export/test_passes.py::TestPasses::test_sequential_split, test/export/test_passes.py::TestPasses::test_sequential_split_graph, test/export/test_passes.py::TestPasses::test_view_to_view_copy, test/export/test_passes.py::TestPasses::test_views_op_having_view_copy 2025-12-04T15:32:49.7841210Z 2025-12-04T15:32:49.7841562Z Finished export/test_passes 1/1 ... [2025-12-04 15:32:49.781766][22773.902983479], took 0.91min 2025-12-04T15:32:49.8163974Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/export.test_passes/export.test_passes-d27457a3d9237a22.xml 2025-12-04T15:32:49.8841837Z Running dynamo/test_autograd_function 1/1 ... [2025-12-04 15:32:49.883881][22774.005100509] 2025-12-04T15:32:49.8842481Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:32:49.8845064Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_autograd_function.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:32:49.884251] 2025-12-04T15:33:13.0809690Z 2025-12-04T15:33:13.0810959Z dynamo/test_autograd_function 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_autograd_function_1.1_12e04931a7d6b73d_.log 2025-12-04T15:33:13.0832377Z Running 41 items in this shard: test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_allow_in_graph, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_amp_custom_fwd_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_assert_is_contiguous_after_matmul, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_assert_is_contiguous_on_grad_output_directly, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_autograd_function_equivalence, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_autograd_function_has_graph_break, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_backward_returns_none_for_tensor_input, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_classmethod, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_data_in_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_default_values, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_enum_arg, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_forward_returns_constant, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_function_context_mark_and_save, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_function_context_save_and_mark, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_function_with_bound_free_variable, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_fwd_no_grad, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_fwd_propogation_correctness, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_linear_setup_context, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_mark_multi_output_non_differentiable, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_mark_non_differentiable, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_materialize_grad, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_multi_output, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_multiple_different_non_tensor_inputs, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_needs_input_grad, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_nonlocal_list_mutation_in_autograd_function, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_once_differentiable, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_print_in_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_repeated_save_for_backward_calls, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_requires_grad_in_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_save_for_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_set_materialize_grads_no_graph_break, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_smoke_from_test_autograd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_smuggle_symint_issue_111031, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_smuggle_tensor_and_complex_structures, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_stride_in_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_tensor_list_as_input, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_tensor_subclass_intermediary_input, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_triton_kernel_basic, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_triton_kernel_multiple_out, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_tuple_arg, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_user_defined_object_as_input 2025-12-04T15:33:13.0851886Z 2025-12-04T15:33:13.0852289Z Finished dynamo/test_autograd_function 1/1 ... [2025-12-04 15:33:13.080834][22797.202052049], took 0.39min 2025-12-04T15:33:13.1153985Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_autograd_function/dynamo.test_autograd_function-bbde79f54df96d93.xml 2025-12-04T15:33:13.2143303Z Running inductor/test_codecache 1/1 ... [2025-12-04 15:33:13.214028][22797.335248175] 2025-12-04T15:33:13.2143905Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:33:13.2147444Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_codecache.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:33:13.214485] 2025-12-04T15:38:58.9413297Z 2025-12-04T15:38:58.9414361Z inductor/test_codecache 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_codecache_1.1_750e07d8e3fe4035_.log 2025-12-04T15:38:58.9575819Z Running 247 items in this shard: test/inductor/test_codecache.py::TestPyCodeCache::test_editable_cached_wrapper, test/inductor/test_codecache.py::TestPyCodeCache::test_linemaps_empty, test/inductor/test_codecache.py::TestFxGraphCache::test_async_compile_cache, test/inductor/test_codecache.py::TestFxGraphCache::test_auto_functionalized_caching_variant_v1, test/inductor/test_codecache.py::TestFxGraphCache::test_auto_functionalized_caching_variant_v2, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_clear, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_guard, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_guard_overspec, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_False_device_cpu_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_False_device_cpu_float32, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_False_device_cuda_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_False_device_cuda_float32, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_True_device_cpu_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_True_device_cpu_float32, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_True_device_cuda_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_caching_precompile_dynamic_True_device_cuda_float32, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cpu_bfloat16_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cpu_bfloat16_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cpu_float32_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cpu_float32_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cuda_bfloat16_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cuda_bfloat16_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cuda_float32_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_device_cuda_float32_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_empty, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_generic, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_pgo, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_pgo_swap_file_names, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_hot_load_repeat, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True_grad_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cpu_float32_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cpu_float32_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cpu_float64_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cpu_float64_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cuda_float32_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cuda_float32_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cuda_float64_dynamic_False, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_model_device_cuda_float64_dynamic_True, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_with_guards_int32_bounds_device_cuda_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_with_guards_int32_bounds_device_cuda_float16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_with_guards_static_bounds_device_cpu_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_with_guards_static_bounds_device_cpu_float32, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_with_guards_static_bounds_device_cuda_bfloat16, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_load_with_guards_static_bounds_device_cuda_float32, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_with_nt, test/inductor/test_codecache.py::TestFxGraphCache::test_cache_with_symint_non_arg_guard, test/inductor/test_codecache.py::TestFxGraphCache::test_constant_handling_device_cpu, test/inductor/test_codecache.py::TestFxGraphCache::test_constant_handling_device_cuda, test/inductor/test_codecache.py::TestFxGraphCache::test_flex_attention_caching, test/inductor/test_codecache.py::TestFxGraphCache::test_freezing_device_cpu_inlinable_False, test/inductor/test_codecache.py::TestFxGraphCache::test_freezing_device_cpu_inlinable_True, test/inductor/test_codecache.py::TestFxGraphCache::test_freezing_device_cuda_inlinable_False, test/inductor/test_codecache.py::TestFxGraphCache::test_freezing_device_cuda_inlinable_True, test/inductor/test_codecache.py::TestFxGraphCache::test_generated_kernel_count, test/inductor/test_codecache.py::TestFxGraphCache::test_higher_order_op_bypass_bundle_triton_False, test/inductor/test_codecache.py::TestFxGraphCache::test_higher_order_op_bypass_bundle_triton_True, test/inductor/test_codecache.py::TestFxGraphCache::test_inductor_counters, test/inductor/test_codecache.py::TestFxGraphCache::test_no_arguments_tensor_device_guards, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cpu_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_bfloat16_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_False_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_remote_cache_load_function_device_cuda_float32_dynamic_True_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_tensor_device_guards_cpu_tensor, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_higher_order_op_bundle_triton_False, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_higher_order_op_bundle_triton_True, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_higher_order_op_different_configs_bundle_triton_False, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_higher_order_op_different_configs_bundle_triton_True, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_op_bundle_triton_False_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_op_bundle_triton_False_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_op_bundle_triton_True_use_static_cuda_launcher_False, test/inductor/test_codecache.py::TestFxGraphCache::test_triton_op_bundle_triton_True_use_static_cuda_launcher_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_dynamic_shapes_from_example_inputs_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_dynamic_shapes_from_example_inputs_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_is_aot_False_dynamic_shapes_from_graph, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_is_aot_False_dynamic_shapes_from_tracing_context, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_is_aot_True_dynamic_shapes_from_graph, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_is_aot_True_dynamic_shapes_from_tracing_context, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_static_shapes_dynamic_shapes_from_example_inputs, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_static_shapes_dynamic_shapes_from_graph, test/inductor/test_codecache.py::TestStandaloneCompile::test_backend_static_shapes_dynamic_shapes_from_tracing_context, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_False_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_False_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_False_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_False_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_True_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_True_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_True_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_binary_dynamic_True_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_False_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_False_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_False_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_False_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_True_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_True_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_True_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cpu_format_unpacked_dynamic_True_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_False_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_False_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_False_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_False_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_True_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_True_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_True_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_binary_dynamic_True_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_False_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_False_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_False_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_False_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_True_graph_partition_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_True_graph_partition_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_True_graph_partition_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_basic_device_cuda_format_unpacked_dynamic_True_graph_partition_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_call_in_backend_dynamic_False_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_call_in_backend_dynamic_False_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_call_in_backend_dynamic_True_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_call_in_backend_dynamic_True_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_custom_pass_handling, test/inductor/test_codecache.py::TestStandaloneCompile::test_different_process, test/inductor/test_codecache.py::TestStandaloneCompile::test_dynamic_shapes_from_example_inputs_is_aot_False_config_patches_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_dynamic_shapes_from_example_inputs_is_aot_False_config_patches_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_dynamic_shapes_from_example_inputs_is_aot_True_config_patches_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_dynamic_shapes_from_example_inputs_is_aot_True_config_patches_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_dynamic_shapes_from_graph_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_dynamic_shapes_from_graph_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_modify_unpacked_file_device_cpu, test/inductor/test_codecache.py::TestStandaloneCompile::test_modify_unpacked_file_device_cuda, test/inductor/test_codecache.py::TestStandaloneCompile::test_save_in_new_path, test/inductor/test_codecache.py::TestStandaloneCompile::test_split_module_is_aot_False, test/inductor/test_codecache.py::TestStandaloneCompile::test_split_module_is_aot_True, test/inductor/test_codecache.py::TestStandaloneCompile::test_static_shapes_is_aot_False_dynamic_shapes_from_example_inputs, test/inductor/test_codecache.py::TestStandaloneCompile::test_static_shapes_is_aot_False_dynamic_shapes_from_graph, test/inductor/test_codecache.py::TestStandaloneCompile::test_static_shapes_is_aot_True_dynamic_shapes_from_example_inputs, test/inductor/test_codecache.py::TestStandaloneCompile::test_static_shapes_is_aot_True_dynamic_shapes_from_graph, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_bypass_unsupported, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_get_hash_for_files, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_config_changes, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_custom_backend_config, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_custom_backend_pass, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_custom_partitioner_fn, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_custom_passes, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_fake_tensors, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_kwargs, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_hash_private_config_changes, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_non_serializable_custom_passes_causes_cache_miss, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_parameter_constants, test/inductor/test_codecache.py::TestFxGraphCacheHashing::test_stable_strings, test/inductor/test_codecache.py::TestCudaCompileCommand::test_cuda_compile_command, test/inductor/test_codecache.py::TestAutotuneCache::test_autotune_cache, test/inductor/test_codecache.py::TestAutotuneCache::test_autotune_cache_warm_start, test/inductor/test_codecache.py::TestAutotuneCache::test_bundled_autotune_remote_cache, test/inductor/test_codecache.py::TestAutotuneCache::test_modified_autotune_cache_remote_cache_False, test/inductor/test_codecache.py::TestAutotuneCache::test_modified_autotune_cache_remote_cache_True, test/inductor/test_codecache.py::TestRemoteAOTAutogradCache::test_autograd_remote_cache, test/inductor/test_codecache.py::TestRemoteAOTAutogradCache::test_autograd_remote_lazy_backward, test/inductor/test_codecache.py::TestUtils::test_force_disable_coordinate_descent, test/inductor/test_codecache.py::TestUtils::test_fresh_cache 2025-12-04T15:38:58.9736536Z 2025-12-04T15:38:58.9736921Z Finished inductor/test_codecache 1/1 ... [2025-12-04 15:38:58.941202][23143.062421678], took 5.76min 2025-12-04T15:38:58.9767887Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_codecache/inductor.test_codecache-d532a0bd02778de2.xml 2025-12-04T15:38:59.1154625Z Running inductor/test_auto_functionalize 1/1 ... [2025-12-04 15:38:59.115174][23143.23639297] 2025-12-04T15:38:59.1155266Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:38:59.1158242Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_auto_functionalize.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:38:59.115560] 2025-12-04T15:39:37.2350877Z 2025-12-04T15:39:37.2352723Z inductor/test_auto_functionalize 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_auto_functionalize_1.1_de84d14995b669d1_.log 2025-12-04T15:39:37.2392957Z Running 39 items in this shard: test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_alias, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_alias2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_alias2_dynamic, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_alias_id_input_to_custom_op, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_alias_id_output, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_can_with_default, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_can_with_none_return, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_extra1, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_extra2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_extra3, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_extra4, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_extra5, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_old, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_on_view, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_optional_old, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_optional_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_self_as_mutate_arg, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_tensorlist, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_with_returns_old, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_auto_functionalize_with_returns_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_can_auto_functionalize, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_dynamic2_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_dynamic3_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_dynamic_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_graph_input_is_view, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_inference_mode1_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_inference_mode2_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_inference_mode3_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_inference_mode4_v2, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_inference_mode_view, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_recompile, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_scheduling_with_multiple_mutates, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_slice, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_slice_dynamic, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_split, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_split_dynamic, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_try_use_slice, test/inductor/test_auto_functionalize.py::AutoFunctionalizeTests::test_unbacked_auto_functionalize_op 2025-12-04T15:39:37.2430116Z 2025-12-04T15:39:37.2430872Z Finished inductor/test_auto_functionalize 1/1 ... [2025-12-04 15:39:37.234907][23181.356126452], took 0.64min 2025-12-04T15:39:37.2726832Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_auto_functionalize/inductor.test_auto_functionalize-1caf9ad6930ca9ac.xml 2025-12-04T15:39:37.3517326Z Running inductor/test_efficient_conv_bn_eval 1/1 ... [2025-12-04 15:39:37.351442][23181.472661285] 2025-12-04T15:39:37.3518341Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:39:37.3522178Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_efficient_conv_bn_eval.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:39:37.351878] 2025-12-04T15:39:49.2327005Z 2025-12-04T15:39:49.2328185Z inductor/test_efficient_conv_bn_eval 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_efficient_conv_bn_eval_1.1_efda8428304aed40_.log 2025-12-04T15:39:49.2330124Z Running 2 items in this shard: test/inductor/test_efficient_conv_bn_eval.py::EfficientConvBNEvalCpuTests::test_basic_cpu, test/inductor/test_efficient_conv_bn_eval.py::EfficientConvBNEvalGpuTests::test_basic_cuda 2025-12-04T15:39:49.2331244Z 2025-12-04T15:39:49.2331869Z Finished inductor/test_efficient_conv_bn_eval 1/1 ... [2025-12-04 15:39:49.232491][23193.353709592], took 0.20min 2025-12-04T15:39:49.2676709Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_efficient_conv_bn_eval/inductor.test_efficient_conv_bn_eval-93e786f7489a3e47.xml 2025-12-04T15:39:49.3430680Z Running inductor/test_torchbind 1/1 ... [2025-12-04 15:39:49.342764][23193.463983718] 2025-12-04T15:39:49.3431306Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:39:49.3433998Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchbind.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:39:49.343168] 2025-12-04T15:40:43.5346727Z 2025-12-04T15:40:43.5347795Z inductor/test_torchbind 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchbind_1.1_fc2e846084c707a2_.log 2025-12-04T15:40:43.5355150Z Running 16 items in this shard: test/inductor/test_torchbind.py::TestTorchbind::test_aoti_torchbind_name_collision, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_aot_compile, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_aot_compile_constant_folding, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_aoti, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_compile, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_compile_gpu_op_symint_graph_partition, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_compile_symint, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_config_not_generated, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_get_buf_bytes, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_hop_schema, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_hop_schema_no_input, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_hop_schema_no_output, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_inductor, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_input_aot_compile, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_list_return_aot_compile, test/inductor/test_torchbind.py::TestTorchbind::test_torchbind_queue 2025-12-04T15:40:43.5362311Z 2025-12-04T15:40:43.5362668Z Finished inductor/test_torchbind 1/1 ... [2025-12-04 15:40:43.534558][23247.655778627], took 0.90min 2025-12-04T15:40:43.5704674Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_torchbind/inductor.test_torchbind-9cdc6632b9077562.xml 2025-12-04T15:40:43.6620543Z Running dynamo/test_flat_apply 1/1 ... [2025-12-04 15:40:43.661759][23247.782977428] 2025-12-04T15:40:43.6621173Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:40:43.6624680Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_flat_apply.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:40:43.662218] 2025-12-04T15:40:51.3876108Z 2025-12-04T15:40:51.3877177Z dynamo/test_flat_apply 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_flat_apply_1.1_ce4f5252908fcc19_.log 2025-12-04T15:40:51.3879614Z Running 4 items in this shard: test/dynamo/test_flat_apply.py::FlatApplyTests::test_non_tensor_output, test/dynamo/test_flat_apply.py::FlatApplyTests::test_nonstrict_trace_captured_tensor_post_aot_graph, test/dynamo/test_flat_apply.py::FlatApplyTests::test_nonstrict_trace_dynamo_graph, test/dynamo/test_flat_apply.py::FlatApplyTests::test_simple 2025-12-04T15:40:51.3881338Z 2025-12-04T15:40:51.3881697Z Finished dynamo/test_flat_apply 1/1 ... [2025-12-04 15:40:51.387418][23255.508639192], took 0.13min 2025-12-04T15:40:51.4229748Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_flat_apply/dynamo.test_flat_apply-3a6f00a2fe27363c.xml 2025-12-04T15:40:51.4981519Z Running dynamo/test_input_attr_tracking 1/1 ... [2025-12-04 15:40:51.497903][23255.619122331] 2025-12-04T15:40:51.4982144Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:40:51.4986016Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_input_attr_tracking.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:40:51.498331] 2025-12-04T15:41:08.4864746Z 2025-12-04T15:41:08.4865817Z dynamo/test_input_attr_tracking 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_input_attr_tracking_1.1_c0c1747b0b64a184_.log 2025-12-04T15:41:08.4873224Z Running 12 items in this shard: test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_complex_attr_access_with_graph_breaks, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_complex_attr_access_with_inline_reconstruct, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_complex_attr_access_without_graph_breaks, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_const_property_assigned_on_tensor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_const_property_on_tensor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_guards_correctly_property_assigned_on_tensor_type_change, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_guards_correctly_property_assigned_on_tensor_type_change_inductor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_set_data_on_input_tensor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_set_data_on_scoped_tensor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_set_data_on_user_defined_class_input_tensor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_tensor_property_assigned_on_tensor, test/dynamo/test_input_attr_tracking.py::TestInputAttrTracking::test_tensor_property_on_tensor 2025-12-04T15:41:08.4880072Z 2025-12-04T15:41:08.4880459Z Finished dynamo/test_input_attr_tracking 1/1 ... [2025-12-04 15:41:08.486278][23272.607499066], took 0.28min 2025-12-04T15:41:08.5218884Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_input_attr_tracking/dynamo.test_input_attr_tracking-e306635654774769.xml 2025-12-04T15:41:08.6037145Z Running dynamo/test_graph_deduplication 1/1 ... [2025-12-04 15:41:08.603448][23272.724667733] 2025-12-04T15:41:08.6037787Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:41:08.6040673Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_graph_deduplication.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:41:08.603817] 2025-12-04T15:41:27.8453580Z 2025-12-04T15:41:27.8454764Z dynamo/test_graph_deduplication 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_graph_deduplication_1.1_1391af197500fccc_.log 2025-12-04T15:41:27.8464913Z Running 18 items in this shard: test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_autocast_ordering, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_cycle_detection_arg_and_additional_deps, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_cycle_detection_complex, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_cycle_detection_no_cycle, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_cycle_detection_simple, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_cycle_detection_single_node, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_cycle_detection_two_node, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_dependent_subgraphs, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_input_aliasing, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_input_mutation, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_multiple_subgraphs, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_mutation_ordering, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_output_nodes_last, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_param_transfer_to_submodule, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_single_subgraph, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_single_subgraph2, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_tuple_inputs, test/dynamo/test_graph_deduplication.py::GraphDededuplicationTests::test_tuple_return 2025-12-04T15:41:27.8474273Z 2025-12-04T15:41:27.8474674Z Finished dynamo/test_graph_deduplication 1/1 ... [2025-12-04 15:41:27.845169][23291.966388223], took 0.32min 2025-12-04T15:41:27.8809695Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_graph_deduplication/dynamo.test_graph_deduplication-72be7e1eac6e503d.xml 2025-12-04T15:41:27.9674869Z Running inductor/test_distributed_patterns 1/1 ... [2025-12-04 15:41:27.967265][23292.088484247] 2025-12-04T15:41:27.9675511Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:41:27.9678894Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_distributed_patterns.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:41:27.967655] 2025-12-04T15:42:04.9358241Z 2025-12-04T15:42:04.9359999Z inductor/test_distributed_patterns 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_distributed_patterns_1.1_a3ab6d55458abecf_.log 2025-12-04T15:42:04.9373285Z Running 20 items in this shard: test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_fake_distributed_aot_eager, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_fake_distributed_inductor, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_intermediate_hook_with_closure, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_intermediate_hook_with_nested_closure, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_aot, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_eager, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_inductor, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_multi_layers, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return1, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return2, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return3, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return4, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_nonzero_cpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_nonzero_gpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_zero_cpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_zero_gpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_preserve_version_counter1, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_preserve_version_counter2, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_set_version_counter1, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_set_version_counter2 2025-12-04T15:42:04.9384374Z 2025-12-04T15:42:04.9384811Z Finished inductor/test_distributed_patterns 1/1 ... [2025-12-04 15:42:04.935623][23329.056844512], took 0.62min 2025-12-04T15:42:04.9719865Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_distributed_patterns/inductor.test_distributed_patterns-7db9a3d44802273a.xml 2025-12-04T15:42:05.0666925Z Running dynamo/test_bytecode_utils 1/1 ... [2025-12-04 15:42:05.066402][23329.187620639] 2025-12-04T15:42:05.0667513Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:42:05.0670602Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_bytecode_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:42:05.066819] 2025-12-04T15:42:20.5035336Z 2025-12-04T15:42:20.5036431Z dynamo/test_bytecode_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_bytecode_utils_1.1_b447a5fdf5287190_.log 2025-12-04T15:42:20.5045558Z Running 20 items in this shard: test/dynamo/test_bytecode_utils.py::BytecodeTests::test_bytecode_analysis_jump_backward_no_interrupt, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_bytecode_from_template, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_bytecode_from_template_noprefix, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_bytecode_from_template_noreturn1, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_bytecode_from_template_noreturn2, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_bytecode_from_template_noreturn_const, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_compute_exception_table_nested, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_exception_table_e2e, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_exception_table_e2e_2, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_exception_table_encode_varint, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_exception_table_entry_propagation, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_exception_table_parsing, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_extended_args_starts_line, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_if_tensor_is_none, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_linetable_310_writer, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_linetable_311_writer1, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_linetable_311_writer2, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_py311_jump_offset, test/dynamo/test_bytecode_utils.py::BytecodeTests::test_remove_dead_code_with_exn_table_entries, test/dynamo/test_bytecode_utils.py::BytecodeHookTests::test_bytecode_hook 2025-12-04T15:42:20.5054352Z 2025-12-04T15:42:20.5054710Z Finished dynamo/test_bytecode_utils 1/1 ... [2025-12-04 15:42:20.503350][23344.62457134], took 0.26min 2025-12-04T15:42:20.5394335Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/dynamo.test_bytecode_utils/dynamo.test_bytecode_utils-4132ad7230faf1a5.xml 2025-12-04T15:42:20.6317375Z Running export/test_nativert 1/1 ... [2025-12-04 15:42:20.631475][23344.752693923] 2025-12-04T15:42:20.6317958Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:42:20.6320987Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_nativert.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:42:20.631875] 2025-12-04T15:42:32.6129786Z 2025-12-04T15:42:32.6130811Z export/test_nativert 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_nativert_1.1_85e109ef4797d863_.log 2025-12-04T15:42:32.6133877Z Running 6 items in this shard: test/export/test_nativert.py::TestNativeRT::test_aoti_0_cpu, test/export/test_nativert.py::TestNativeRT::test_aoti_1_cpu, test/export/test_nativert.py::TestNativeRT::test_aoti_2_cpu, test/export/test_nativert.py::TestNativeRT::test_aoti_3_cuda, test/export/test_nativert.py::TestNativeRT::test_aoti_4_cuda, test/export/test_nativert.py::TestNativeRT::test_aoti_5_cuda 2025-12-04T15:42:32.6135819Z 2025-12-04T15:42:32.6136336Z Finished export/test_nativert 1/1 ... [2025-12-04 15:42:32.612798][23356.734019105], took 0.20min 2025-12-04T15:42:32.6486903Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/export.test_nativert/export.test_nativert-bbcdef66a0abda26.xml 2025-12-04T15:42:32.7253403Z Running export/test_hop 1/1 ... [2025-12-04 15:42:32.725085][23356.846304327] 2025-12-04T15:42:32.7253955Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:42:32.7256892Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_hop.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:42:32.725448] 2025-12-04T15:42:55.6215591Z 2025-12-04T15:42:55.6216715Z export/test_hop 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_hop_1.1_4e0a4ce0cd3157fe_.log 2025-12-04T15:42:55.6238480Z Running 44 items in this shard: test/export/test_hop.py::TestHOPCUDA::test_aot_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_aot_export_while_loop_stack_output_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_pre_dispatch_export_while_loop_stack_output_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_retrace_export_while_loop_stack_output_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_auto_functionalize_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_cond_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_flex_attention_backward_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_flex_attention_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_invoke_quant_packed_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_invoke_quant_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_invoke_subgraph_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_local_map_hop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_scan_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_while_loop_simple_cuda_float32, test/export/test_hop.py::TestHOPCUDA::test_serialize_export_while_loop_stack_output_simple_cuda_float32 2025-12-04T15:42:55.6258707Z 2025-12-04T15:42:55.6259018Z Finished export/test_hop 1/1 ... [2025-12-04 15:42:55.622006][23379.743224886], took 0.38min 2025-12-04T15:42:55.6585545Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/export.test_hop/export.test_hop-06f913442140df32.xml 2025-12-04T15:42:55.7616158Z Running export/test_export 1/1 ... [2025-12-04 15:42:55.761337][23379.88255618] 2025-12-04T15:42:55.7616705Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:42:55.7620118Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_export.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:42:55.761739] 2025-12-04T15:44:32.0144987Z 2025-12-04T15:44:32.0148558Z export/test_export 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_export_1.1_0d008746ac2eaaa3_.log 2025-12-04T15:44:32.0319177Z Running 470 items in this shard: test/export/test_export.py::TestDynamismExpression::test_export_assume_static_by_default, test/export/test_export.py::TestDynamismExpression::test_export_constraints_error, test/export/test_export.py::TestDynamismExpression::test_export_constraints_error_not_in_range, test/export/test_export.py::TestDynamismExpression::test_export_inline_constraints, test/export/test_export.py::TestDynamismExpression::test_export_slice_maxsize, test/export/test_export.py::TestDynamismExpression::test_export_slice_unbacked_dim1, test/export/test_export.py::TestDynamismExpression::test_export_strict_narrow_unbacked_expr, test/export/test_export.py::TestDynamismExpression::test_no_grad_param_inplace, test/export/test_export.py::TestDynamismExpression::test_reshape_view_backed_size_oblivious, test/export/test_export.py::TestExport::test__scaled_dot_product_flash_attention, test/export/test_export.py::TestExport::test_additional_inputs_constants, test/export/test_export.py::TestExport::test_allow_explicit_guards_as_runtime_asserts, test/export/test_export.py::TestExport::test_annotate_on_assert, test/export/test_export.py::TestExport::test_args_type_checked, test/export/test_export.py::TestExport::test_aten_lift_fresh_copy, test/export/test_export.py::TestExport::test_attention, test/export/test_export.py::TestExport::test_attr_assignment_extra, test/export/test_export.py::TestExport::test_automatic_constrain_size, test/export/test_export.py::TestExport::test_automatic_dynamic_shapes_constant_relation, test/export/test_export.py::TestExport::test_automatic_dynamic_shapes_linear_relation, test/export/test_export.py::TestExport::test_automatic_dynamic_shapes_simple_equality, test/export/test_export.py::TestExport::test_baddbmm, test/export/test_export.py::TestExport::test_basic, test/export/test_export.py::TestExport::test_basic_non_strict_fake_tensor, test/export/test_export.py::TestExport::test_basic_non_strict_real_tensor, test/export/test_export.py::TestExport::test_bincount, test/export/test_export.py::TestExport::test_buffer_util, test/export/test_export.py::TestExport::test_capture_subclass_constructor, test/export/test_export.py::TestExport::test_capture_subclass_constructor_torch_ir, test/export/test_export.py::TestExport::test_capture_subclass_wrong, test/export/test_export.py::TestExport::test_ccode_python_mod, test/export/test_export.py::TestExport::test_cdist_forward_compute_mode_zero_export, test/export/test_export.py::TestExport::test_check_specialized_int, test/export/test_export.py::TestExport::test_checks_to_constrain_range, test/export/test_export.py::TestExport::test_cleanup_dynamic_markers, test/export/test_export.py::TestExport::test_colin_unbacked_backed_vr_sub, test/export/test_export.py::TestExport::test_colon_parameter, test/export/test_export.py::TestExport::test_compiling_state, test/export/test_export.py::TestExport::test_cond_access_identical_symint_closure, test/export/test_export.py::TestExport::test_cond_branches_return_constant_int, test/export/test_export.py::TestExport::test_cond_branches_return_same_int, test/export/test_export.py::TestExport::test_cond_buffers, test/export/test_export.py::TestExport::test_cond_contains_unbacked_no_escape, test/export/test_export.py::TestExport::test_cond_int_closure, test/export/test_export.py::TestExport::test_cond_unflatten, test/export/test_export.py::TestExport::test_cond_with_module_stack_export_with, test/export/test_export.py::TestExport::test_cond_with_module_stack_export_with_unflatten, test/export/test_export.py::TestExport::test_constant_aliasing, test/export/test_export.py::TestExport::test_constant_input_naming, test/export/test_export.py::TestExport::test_constant_no_user_inp, test/export/test_export.py::TestExport::test_constant_output, test/export/test_export.py::TestExport::test_constant_output_dup, test/export/test_export.py::TestExport::test_constant_requires_grad_const, test/export/test_export.py::TestExport::test_constant_return, test/export/test_export.py::TestExport::test_constant_tensor_mutation, test/export/test_export.py::TestExport::test_constant_tensor_with_non_functional, test/export/test_export.py::TestExport::test_constant_tensor_with_non_functional_nested, test/export/test_export.py::TestExport::test_constrain_decomp, test/export/test_export.py::TestExport::test_constrain_size_in_eager, test/export/test_export.py::TestExport::test_constrain_size_with_constrain_value, test/export/test_export.py::TestExport::test_constrain_size_with_various_cases, test/export/test_export.py::TestExport::test_conv_dynamic, test/export/test_export.py::TestExport::test_crop_like, test/export/test_export.py::TestExport::test_cse_for_symint, test/export/test_export.py::TestExport::test_custom_op_auto_functionalize, test/export/test_export.py::TestExport::test_custom_op_auto_functionalize_pre_dispatch, test/export/test_export.py::TestExport::test_custom_op_auto_warn_pre_dispatch, test/export/test_export.py::TestExport::test_custom_op_preserve, test/export/test_export.py::TestExport::test_custom_pytree, test/export/test_export.py::TestExport::test_custom_tag_metadata_re_export, test/export/test_export.py::TestExport::test_decomp_batch_norm_functional_predispatch, test/export/test_export.py::TestExport::test_decomp_item_in_prim_after_decomposition, test/export/test_export.py::TestExport::test_decomp_item_in_prim_before_decomposition, test/export/test_export.py::TestExport::test_default_decomposition_core_cia_ops, test/export/test_export.py::TestExport::test_derived_dim_1_2, test/export/test_export.py::TestExport::test_derived_dim_basic, test/export/test_export.py::TestExport::test_derived_dim_integer, test/export/test_export.py::TestExport::test_derived_dim_nested, test/export/test_export.py::TestExport::test_derived_dim_out_of_order, test/export/test_export.py::TestExport::test_derived_dim_out_of_order_repeat_derived, test/export/test_export.py::TestExport::test_derived_dim_out_of_order_simplified, test/export/test_export.py::TestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived, test/export/test_export.py::TestExport::test_derived_dim_repeat_derived, test/export/test_export.py::TestExport::test_detect_leak_nonstrict, test/export/test_export.py::TestExport::test_detect_leak_nonstrict_with_stacktrace, test/export/test_export.py::TestExport::test_detect_leak_strict, test/export/test_export.py::TestExport::test_device_to_dynamic, test/export/test_export.py::TestExport::test_device_to_gpu, test/export/test_export.py::TestExport::test_device_to_mutation, test/export/test_export.py::TestExport::test_device_to_mutation_float, test/export/test_export.py::TestExport::test_device_to_static, test/export/test_export.py::TestExport::test_dim_1_2, test/export/test_export.py::TestExport::test_dim_auto_and_dim, test/export/test_export.py::TestExport::test_dim_dynamic, test/export/test_export.py::TestExport::test_dim_dynamic_divisibility, test/export/test_export.py::TestExport::test_dim_dynamic_specialization, test/export/test_export.py::TestExport::test_dim_hint_range_violations, test/export/test_export.py::TestExport::test_dim_hint_ranges, test/export/test_export.py::TestExport::test_disable_forced_specializations_errors, test/export/test_export.py::TestExport::test_disable_forced_specializations_ok, test/export/test_export.py::TestExport::test_distributed_all_gather, test/export/test_export.py::TestExport::test_distributed_all_gather_into_tensor, test/export/test_export.py::TestExport::test_distributed_all_reduce, test/export/test_export.py::TestExport::test_distributed_all_to_all_single, test/export/test_export.py::TestExport::test_distributed_reduce_scatter_tensor, test/export/test_export.py::TestExport::test_dont_duck_size_for_auto_dynamic, test/export/test_export.py::TestExport::test_double_lifted_constants, test/export/test_export.py::TestExport::test_draft_export_checks_aliasing, test/export/test_export.py::TestExport::test_draft_export_checks_mutation, test/export/test_export.py::TestExport::test_draft_export_checks_mutation_list, test/export/test_export.py::TestExport::test_draft_export_checks_mutation_with_nan, test/export/test_export.py::TestExport::test_draft_export_fake_kernel_inference_errors, test/export/test_export.py::TestExport::test_draft_export_infers_fake_kernel, test/export/test_export.py::TestExport::test_duplicate_modules_with_non_persistent_buffers, test/export/test_export.py::TestExport::test_dynamic_lr_shift, test/export/test_export.py::TestExport::test_dynamic_shapes_bounds, test/export/test_export.py::TestExport::test_dynamic_shapes_builder_basic, test/export/test_export.py::TestExport::test_dynamic_shapes_builder_kwargs, test/export/test_export.py::TestExport::test_dynamic_shapes_builder_pytree, test/export/test_export.py::TestExport::test_dynamic_shapes_dataclass, test/export/test_export.py::TestExport::test_dynamic_shapes_inferred_basic, test/export/test_export.py::TestExport::test_dynamic_shapes_serdes_generic, test/export/test_export.py::TestExport::test_dynamic_shapes_serdes_user_errors, test/export/test_export.py::TestExport::test_dynamic_shapes_serdes_various, test/export/test_export.py::TestExport::test_dynamic_shapes_spec_with_pytree, test/export/test_export.py::TestExport::test_dynamic_shapes_wrapped_with_shape_guards, test/export/test_export.py::TestExport::test_dynamic_sym_round, test/export/test_export.py::TestExport::test_ends_of_bounds_oblivious, test/export/test_export.py::TestExport::test_enum_str, test/export/test_export.py::TestExport::test_error_does_not_reference_eager_fallback, test/export/test_export.py::TestExport::test_error_when_passing_mutating_primitive_op, test/export/test_export.py::TestExport::test_exception, test/export/test_export.py::TestExport::test_expand_copy_export_handles_implicit_true, test/export/test_export.py::TestExport::test_export_api_with_dynamic_shapes, test/export/test_export.py::TestExport::test_export_as_backend, test/export/test_export.py::TestExport::test_export_associative_scan_lifted_buffers, test/export/test_export.py::TestExport::test_export_associative_scan_symbol_dim, test/export/test_export.py::TestExport::test_export_associative_scan_symbol_scandim, test/export/test_export.py::TestExport::test_export_aten_to_unflatten, test/export/test_export.py::TestExport::test_export_aten_to_unflatten_subclass, test/export/test_export.py::TestExport::test_export_aten_to_unflatten_subclass_pre_dispatch, test/export/test_export.py::TestExport::test_export_cond_preserve_torch_fn_for_subgraphs, test/export/test_export.py::TestExport::test_export_cond_symbool_pred, test/export/test_export.py::TestExport::test_export_cond_warns_constant_pred, test/export/test_export.py::TestExport::test_export_custom_decomp_table_basic_pop, test/export/test_export.py::TestExport::test_export_custom_decomp_table_container_methods, test/export/test_export.py::TestExport::test_export_custom_op_lib, test/export/test_export.py::TestExport::test_export_custom_triton_kernel, test/export/test_export.py::TestExport::test_export_custom_triton_kernel_mutable, test/export/test_export.py::TestExport::test_export_cyclic_reference_leak, test/export/test_export.py::TestExport::test_export_decomp_torture_case_1, test/export/test_export.py::TestExport::test_export_decomp_torture_case_2, test/export/test_export.py::TestExport::test_export_decomps_dynamic, test/export/test_export.py::TestExport::test_export_decomps_simple, test/export/test_export.py::TestExport::test_export_dynamo_config, test/export/test_export.py::TestExport::test_export_for_training_run_decomp, test/export/test_export.py::TestExport::test_export_for_training_with_container_type, test/export/test_export.py::TestExport::test_export_for_training_with_dynamic_shapes, test/export/test_export.py::TestExport::test_export_for_training_with_mutation, test/export/test_export.py::TestExport::test_export_for_training_with_state_dict_hooks, test/export/test_export.py::TestExport::test_export_func_with_default_kwargs, test/export/test_export.py::TestExport::test_export_func_with_keyword_only_args, test/export/test_export.py::TestExport::test_export_func_with_kwargs, test/export/test_export.py::TestExport::test_export_func_with_pytree_kwargs, test/export/test_export.py::TestExport::test_export_func_with_var_keyword_args, test/export/test_export.py::TestExport::test_export_func_with_var_keyword_pytree_args, test/export/test_export.py::TestExport::test_export_func_with_var_postional_args, test/export/test_export.py::TestExport::test_export_function_schema, test/export/test_export.py::TestExport::test_export_graph_with_no_inputs, test/export/test_export.py::TestExport::test_export_input_mutation_bug, test/export/test_export.py::TestExport::test_export_input_mutation_dynamic_shape, test/export/test_export.py::TestExport::test_export_input_mutation_static_shape, test/export/test_export.py::TestExport::test_export_leak_compile, test/export/test_export.py::TestExport::test_export_linear_preserve_dynamic_shape, test/export/test_export.py::TestExport::test_export_max_nonstrict, test/export/test_export.py::TestExport::test_export_max_onnx_reported, test/export/test_export.py::TestExport::test_export_method, test/export/test_export.py::TestExport::test_export_mod_constraints, test/export/test_export.py::TestExport::test_export_module, test/export/test_export.py::TestExport::test_export_preserve_linear_at_aot_level, test/export/test_export.py::TestExport::test_export_preserve_linear_but_not_custom_op, test/export/test_export.py::TestExport::test_export_rnn_variants_with_warning, test/export/test_export.py::TestExport::test_export_scan_pytree_output, test/export/test_export.py::TestExport::test_export_script_module, test/export/test_export.py::TestExport::test_export_statically_known_true, test/export/test_export.py::TestExport::test_export_then_compile_tensor_ctor, test/export/test_export.py::TestExport::test_export_with_autocast, test/export/test_export.py::TestExport::test_export_with_fake_tensor_inputs, test/export/test_export.py::TestExport::test_export_with_fake_tensor_inputs_on_cuda_devices, test/export/test_export.py::TestExport::test_export_with_inline_constraints, test/export/test_export.py::TestExport::test_export_with_inline_constraints_complex, test/export/test_export.py::TestExport::test_export_with_set_grad_enabled, test/export/test_export.py::TestExport::test_export_with_wrong_inputs, test/export/test_export.py::TestExport::test_external_call_non_strict_real_tensor, test/export/test_export.py::TestExport::test_fake_inputs, test/export/test_export.py::TestExport::test_fake_weights, test/export/test_export.py::TestExport::test_filter_traceback_frames, test/export/test_export.py::TestExport::test_flex_attention_export, test/export/test_export.py::TestExport::test_float_conversion, test/export/test_export.py::TestExport::test_float_conversion_from_int, test/export/test_export.py::TestExport::test_fqn, test/export/test_export.py::TestExport::test_from_node_metadata_export, test/export/test_export.py::TestExport::test_full_on_scalar_tensor, test/export/test_export.py::TestExport::test_function_holding_tensor, test/export/test_export.py::TestExport::test_hints_wrapper, test/export/test_export.py::TestExport::test_hoo_inline_users_issue, test/export/test_export.py::TestExport::test_if_functional, test/export/test_export.py::TestExport::test_if_post_autograd_op_preserved, test/export/test_export.py::TestExport::test_inductor_backend_inside_nonstrict, test/export/test_export.py::TestExport::test_inline_script_class_method, test/export/test_export.py::TestExport::test_inline_script_class_method_recursive, test/export/test_export.py::TestExport::test_inline_script_function, test/export/test_export.py::TestExport::test_inline_script_method, test/export/test_export.py::TestExport::test_int_shape_specialization, test/export/test_export.py::TestExport::test_intermediate_shape_comp, test/export/test_export.py::TestExport::test_invalid_pytree_dynamo_graph_capture, test/export/test_export.py::TestExport::test_is_exporting, test/export/test_export.py::TestExport::test_is_nonzero, test/export/test_export.py::TestExport::test_isnonzero, test/export/test_export.py::TestExport::test_issue_113041, test/export/test_export.py::TestExport::test_issue_157289, test/export/test_export.py::TestExport::test_issue_161902, test/export/test_export.py::TestExport::test_istft_op, test/export/test_export.py::TestExport::test_keep_composite_ops_invalid, test/export/test_export.py::TestExport::test_keep_composite_ops_linear_convd, test/export/test_export.py::TestExport::test_keep_composite_ops_linear_convd_for_training_ir, test/export/test_export.py::TestExport::test_kwarg_dynamic_shapes_diff_order, test/export/test_export.py::TestExport::test_kwargs_reorder, test/export/test_export.py::TestExport::test_layer_norm_unbacked_normalized_shape, test/export/test_export.py::TestExport::test_layer_sharing, test/export/test_export.py::TestExport::test_lazy_module_kwargs, test/export/test_export.py::TestExport::test_lifted_constants, test/export/test_export.py::TestExport::test_linear_conv, test/export/test_export.py::TestExport::test_malformed_fqn_from_source_name, test/export/test_export.py::TestExport::test_map, test/export/test_export.py::TestExport::test_map_buffers, test/export/test_export.py::TestExport::test_mask_nonzero_static, test/export/test_export.py::TestExport::test_masked_select_dynamic, test/export/test_export.py::TestExport::test_math_pow, test/export/test_export.py::TestExport::test_mismatched_dynamic_shapes, test/export/test_export.py::TestExport::test_mixed_input, test/export/test_export.py::TestExport::test_module, test/export/test_export.py::TestExport::test_module_dict_key, test/export/test_export.py::TestExport::test_module_input, test/export/test_export.py::TestExport::test_module_input_subclasses_parameterization_nested, test/export/test_export.py::TestExport::test_module_list_slice, test/export/test_export.py::TestExport::test_module_with_dict_container_inp_out, test/export/test_export.py::TestExport::test_modules_access_for_deleted_submodule, test/export/test_export.py::TestExport::test_more_multidimensional_slicing, test/export/test_export.py::TestExport::test_multidimensional_slicing, test/export/test_export.py::TestExport::test_multinomial_dynamic, test/export/test_export.py::TestExport::test_multiple_definitions_same_name_dim, test/export/test_export.py::TestExport::test_namedtuple_input_export, test/export/test_export.py::TestExport::test_native_multi_attention_head, test/export/test_export.py::TestExport::test_nested_dynamic_shapes_spec, test/export/test_export.py::TestExport::test_nested_module, test/export/test_export.py::TestExport::test_nested_module_fake_tensor_leak, test/export/test_export.py::TestExport::test_nested_module_with_constant_buffer, test/export/test_export.py::TestExport::test_nested_module_with_init_buffer, test/export/test_export.py::TestExport::test_nested_module_with_parameter, test/export/test_export.py::TestExport::test_nn_module_stack, test/export/test_export.py::TestExport::test_nn_module_stack_shared_submodule, test/export/test_export.py::TestExport::test_no_check_is_size_error, test/export/test_export.py::TestExport::test_no_suggested_fixes_for_data_dependent_errors, test/export/test_export.py::TestExport::test_no_tensor_computation, test/export/test_export.py::TestExport::test_no_tensor_computation_2, test/export/test_export.py::TestExport::test_no_tensor_computation_3, test/export/test_export.py::TestExport::test_no_tensor_computation_4, test/export/test_export.py::TestExport::test_non_arg_name_dynamic_shapes_api, test/export/test_export.py::TestExport::test_non_arg_name_dynamic_shapes_api_with_container_type, test/export/test_export.py::TestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg, test/export/test_export.py::TestExport::test_non_persistent_buffer, test/export/test_export.py::TestExport::test_non_strict_dynamic_shapes, test/export/test_export.py::TestExport::test_non_strict_dynamic_shapes_suggested_fixes, test/export/test_export.py::TestExport::test_none_buffers, test/export/test_export.py::TestExport::test_nonstrict_retrace_preserves_metadata, test/export/test_export.py::TestExport::test_nonzero_2, test/export/test_export.py::TestExport::test_nonzero_dynamic, test/export/test_export.py::TestExport::test_not_registered_parameter, test/export/test_export.py::TestExport::test_operator_aten_tensor_mode_variant, test/export/test_export.py::TestExport::test_output_node_name, test/export/test_export.py::TestExport::test_pad_sequence, test/export/test_export.py::TestExport::test_param_util, test/export/test_export.py::TestExport::test_partial_patched_forward, test/export/test_export.py::TestExport::test_placeholder_naming_collisions, test/export/test_export.py::TestExport::test_placeholder_naming_collisions_hoo_subgraphs, test/export/test_export.py::TestExport::test_placeholder_naming_order, test/export/test_export.py::TestExport::test_placeholder_naming_order_variadic, test/export/test_export.py::TestExport::test_placeholder_update_preserving, test/export/test_export.py::TestExport::test_predispatch_cond, test/export/test_export.py::TestExport::test_predispatch_grad_wrappers, test/export/test_export.py::TestExport::test_preserve_annotation, test/export/test_export.py::TestExport::test_preserve_module_call_signature_unflatten_specialization, test/export/test_export.py::TestExport::test_preserve_requires_grad_placeholders, test/export/test_export.py::TestExport::test_preserve_shape_dynamism_for_unused_inputs, test/export/test_export.py::TestExport::test_profiling_code, test/export/test_export.py::TestExport::test_python_asserts_with_sym_int, test/export/test_export.py::TestExport::test_pytree_register_data_class, test/export/test_export.py::TestExport::test_pytree_register_nested_data_class, test/export/test_export.py::TestExport::test_raise_user_error_when_guard_on_data_dependent_operation, test/export/test_export.py::TestExport::test_range_constraints_with_replacement, test/export/test_export.py::TestExport::test_real_tensor_alias_dtype_mismatch, test/export/test_export.py::TestExport::test_real_tensor_bool_cast, test/export/test_export.py::TestExport::test_real_tensor_errors_on_aliasing_custom_op, test/export/test_export.py::TestExport::test_real_tensor_for_max_op, test/export/test_export.py::TestExport::test_real_tensor_size_mismatch, test/export/test_export.py::TestExport::test_redundant_assert_max_upper_bound, test/export/test_export.py::TestExport::test_redundant_asserts, test/export/test_export.py::TestExport::test_refine_dynamic_shapes_from_suggested_fixes, test/export/test_export.py::TestExport::test_register_constant, test/export/test_export.py::TestExport::test_repeat_interleave, test/export/test_export.py::TestExport::test_replace_unbacked_with_very_large_upperbound, test/export/test_export.py::TestExport::test_replaced_unbacked_bindings, test/export/test_export.py::TestExport::test_reshape_view_helper, test/export/test_export.py::TestExport::test_retracable_ep, test/export/test_export.py::TestExport::test_retrace_pre_autograd, test/export/test_export.py::TestExport::test_run_decomposition_supports_user_input_mutation, test/export/test_export.py::TestExport::test_run_decompositions_keep_metadata, test/export/test_export.py::TestExport::test_run_decompositions_keep_tensor_constant_metadata, test/export/test_export.py::TestExport::test_runtime_assert_for_prim, test/export/test_export.py::TestExport::test_runtime_assert_for_prm_str, test/export/test_export.py::TestExport::test_runtime_assert_with_size, test/export/test_export.py::TestExport::test_sdpa_gqa, test/export/test_export.py::TestExport::test_sequential_slicing, test/export/test_export.py::TestExport::test_set_example_inputs, test/export/test_export.py::TestExport::test_set_grad_as_side_effect, test/export/test_export.py::TestExport::test_set_grad_empty, test/export/test_export.py::TestExport::test_set_grad_unflatten, test/export/test_export.py::TestExport::test_setgrad_lifted_tensor, test/export/test_export.py::TestExport::test_shared_submodule_nn_module_stack, test/export/test_export.py::TestExport::test_simple_export_for_training, test/export/test_export.py::TestExport::test_simple_unbacked_view, test/export/test_export.py::TestExport::test_size_input, test/export/test_export.py::TestExport::test_slice_nn_module_stack, test/export/test_export.py::TestExport::test_solver_unsupported_sympy_function, test/export/test_export.py::TestExport::test_specialize_derived_dim_roots, test/export/test_export.py::TestExport::test_split_const_gm_with_lifted_constants, test/export/test_export.py::TestExport::test_stack_trace, test/export/test_export.py::TestExport::test_stack_trace_make_fx, test/export/test_export.py::TestExport::test_state_primitives, test/export/test_export.py::TestExport::test_state_shape_attribute_assignment, test/export/test_export.py::TestExport::test_state_tensors, test/export/test_export.py::TestExport::test_static_dim_constraints, test/export/test_export.py::TestExport::test_subclass_context, test/export/test_export.py::TestExport::test_subclass_nested_attr_access, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_complicated_metadata, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_const_metadata, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_const_metadata_not_top_level, test/export/test_export.py::TestExport::test_subclass_nested_attr_access_submodule, test/export/test_export.py::TestExport::test_subclasses_parameterization, test/export/test_export.py::TestExport::test_subclasses_parameterization_nested, test/export/test_export.py::TestExport::test_suggest_torch_checks_with_non_negative_check, test/export/test_export.py::TestExport::test_suggest_torch_checks_with_regular_check, test/export/test_export.py::TestExport::test_suggested_fixes_for_data_dependent_errors_basic, test/export/test_export.py::TestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers, test/export/test_export.py::TestExport::test_suggested_fixes_new_roots, test/export/test_export.py::TestExport::test_sym_float_operators, test/export/test_export.py::TestExport::test_sym_or_sym_and, test/export/test_export.py::TestExport::test_sym_sqrt, test/export/test_export.py::TestExport::test_symbool_item, test/export/test_export.py::TestExport::test_symfloat_item, test/export/test_export.py::TestExport::test_symint_input_additional_inputs, test/export/test_export.py::TestExport::test_symint_input_basic, test/export/test_export.py::TestExport::test_symint_input_ranges, test/export/test_export.py::TestExport::test_symint_input_shapes_collection, test/export/test_export.py::TestExport::test_symint_input_specialization, test/export/test_export.py::TestExport::test_symint_item, test/export/test_export.py::TestExport::test_symint_output, test/export/test_export.py::TestExport::test_symint_tensor_return, test/export/test_export.py::TestExport::test_tag_ac_export, test/export/test_export.py::TestExport::test_tensor_attribute_zero_args, test/export/test_export.py::TestExport::test_tensor_constant_aten_to, test/export/test_export.py::TestExport::test_tensor_constant_with_wrapped_method, test/export/test_export.py::TestExport::test_to_module_with_mutated_buffer, test/export/test_export.py::TestExport::test_to_module_with_mutated_buffer_multiple, test/export/test_export.py::TestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later, test/export/test_export.py::TestExport::test_tolist, test/export/test_export.py::TestExport::test_torch_check_eq_commutativity, test/export/test_export.py::TestExport::test_torch_fn, test/export/test_export.py::TestExport::test_trace_under_fake, test/export/test_export.py::TestExport::test_train_eval_on_exported_preautograd_module, test/export/test_export.py::TestExport::test_tril_dynamic_diagonal, test/export/test_export.py::TestExport::test_triu_dynamic_diagonal, test/export/test_export.py::TestExport::test_unbacked_3d_matmul, test/export/test_export.py::TestExport::test_unbacked_bincount, test/export/test_export.py::TestExport::test_unbacked_bindings_for_divisible_u_symint, test/export/test_export.py::TestExport::test_unbacked_deferred_runtime_retrace, test/export/test_export.py::TestExport::test_unbacked_expand, test/export/test_export.py::TestExport::test_unbacked_infer_size, test/export/test_export.py::TestExport::test_unbacked_kth_value, test/export/test_export.py::TestExport::test_unbacked_linear_layer_norm_input, test/export/test_export.py::TestExport::test_unbacked_noncontig_lin, test/export/test_export.py::TestExport::test_unbacked_pad, test/export/test_export.py::TestExport::test_unbacked_scalar_constructor, test/export/test_export.py::TestExport::test_unbacked_slice_forward, test/export/test_export.py::TestExport::test_unbacked_slice_simple, test/export/test_export.py::TestExport::test_unbacked_stack, test/export/test_export.py::TestExport::test_unbacked_to_cond, test/export/test_export.py::TestExport::test_unbacked_to_cond_passthrough, test/export/test_export.py::TestExport::test_unbacked_unsqueeze, test/export/test_export.py::TestExport::test_unflatten_asserts, test/export/test_export.py::TestExport::test_unflatten_buffer_update_child2parent_swap, test/export/test_export.py::TestExport::test_unflatten_closure, test/export/test_export.py::TestExport::test_unflatten_isinstance, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_dispatch, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_preserve_signature_no_error, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_shared_submodule, test/export/test_export.py::TestExport::test_unflatten_multiple_graphs_state, test/export/test_export.py::TestExport::test_unflatten_no_unroll, test/export/test_export.py::TestExport::test_unflatten_placeholder_update_child2parent_swap, test/export/test_export.py::TestExport::test_unflatten_placeholder_update_grandchild2cousin_swap, test/export/test_export.py::TestExport::test_unflatten_random_dag_5, test/export/test_export.py::TestExport::test_unflatten_random_dag_6, test/export/test_export.py::TestExport::test_unflatten_random_dag_buf_8, test/export/test_export.py::TestExport::test_unflatten_random_dag_const_preserving_3, test/export/test_export.py::TestExport::test_unflatten_random_dag_const_preserving_3_1, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_4, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_6, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_9, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_10, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_4, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_5, test/export/test_export.py::TestExport::test_unflatten_random_dag_mutating_buf_preserving_7, test/export/test_export.py::TestExport::test_unflatten_random_dag_preserving_4, test/export/test_export.py::TestExport::test_unused_aliases, test/export/test_export.py::TestExport::test_unused_constant, test/export/test_export.py::TestExport::test_uplift_common_custom_meta, test/export/test_export.py::TestExport::test_uplift_common_custom_meta_with_multiple_calls, test/export/test_export.py::TestExport::test_use_embedding_twice, test/export/test_export.py::TestExport::test_user_input_and_buffer_mutation, test/export/test_export.py::TestExport::test_vmap, test/export/test_export.py::TestExport::test_vmap_custom_autograd_function, test/export/test_export.py::TestExport::test_vmap_to_assert, test/export/test_export.py::TestExport::test_where_decomp, test/export/test_export.py::TestExport::test_while_loop_assert_separation, test/export/test_export.py::TestExport::test_while_loop_index_assertions, test/export/test_export.py::TestExport::test_while_loop_simple, test/export/test_export.py::TestExport::test_while_loop_tensor_constant_idx, test/export/test_export.py::TestExport::test_wrapper_module, test/export/test_export.py::TestOneOffModelExportResult::test_assert_tensor_metadata_device_index, test/export/test_export.py::TestOneOffModelExportResult::test_constant_fqn, test/export/test_export.py::TestOneOffModelExportResult::test_constant_name, test/export/test_export.py::TestOneOffModelExportResult::test_duplicated_getitem, test/export/test_export.py::TestOneOffModelExportResult::test_export_with_dict_input_nested_in_args, test/export/test_export.py::TestOneOffModelExportResult::test_hf_logging_logger, test/export/test_export.py::TestOneOffModelExportResult::test_input_output_no_stacktrace, test/export/test_export.py::TestOneOffModelExportResult::test_int_list_output, test/export/test_export.py::TestOneOffModelExportResult::test_logging_logger, test/export/test_export.py::TestOneOffModelExportResult::test_nested_retrace, test/export/test_export.py::TestOneOffModelExportResult::test_none_input_output, test/export/test_export.py::TestOneOffModelExportResult::test_primitive_constant_output, test/export/test_export.py::TestOneOffModelExportResult::test_print, test/export/test_export.py::TestOneOffModelExportResult::test_print_graph_signature, test/export/test_export.py::TestOneOffModelExportResult::test_scaled_dot_product_attention_cpu, test/export/test_export.py::TestOneOffModelExportResult::test_scaled_dot_product_attention_cuda, test/export/test_export.py::TestOneOffModelExportResult::test_strict_export_with_shared_parameters, test/export/test_export.py::TestOneOffModelExportResult::test_torchrec_jagged_tensor, test/export/test_export.py::TestOneOffModelExportResult::test_unbacked_sdpa, test/export/test_export.py::TestOneOffModelExportResult::test_warning, test/export/test_export.py::TestExportCustomClass::test_export_script_module, test/export/test_export.py::TestExportCustomClass::test_export_unbacked_lt, test/export/test_export.py::TestExportCustomClass::test_int_lift_constant, test/export/test_export.py::TestExportCustomClass::test_is_fx_tracing, test/export/test_export.py::TestExportCustomClass::test_item, test/export/test_export.py::TestExportCustomClass::test_lift_custom_obj, test/export/test_export.py::TestExportCustomClass::test_preserve_cia_op, test/export/test_export.py::TestExportCustomClass::test_preserve_non_cia_op, test/export/test_export.py::TestExportCustomClass::test_unbacked_contiguous, test/export/test_export.py::TestExportCustomClass::test_unbacked_select_index 2025-12-04T15:44:32.0487018Z 2025-12-04T15:44:32.0487355Z Finished export/test_export 1/1 ... [2025-12-04 15:44:32.015277][23476.136493631], took 1.60min 2025-12-04T15:44:32.0524271Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/export.test_export/export.test_export-f3f94f66dfdfed6b.xml 2025-12-04T15:44:32.1659954Z Running test_utils_config_module 1/1 ... [2025-12-04 15:44:32.165721][23476.286939567] 2025-12-04T15:44:32.1660526Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:44:32.1663737Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_utils_config_module.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:44:32.166117] 2025-12-04T15:44:37.4880249Z 2025-12-04T15:44:37.4881315Z test_utils_config_module 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_utils_config_module_1.1_ab32cc157b19bec7_.log 2025-12-04T15:44:37.4890403Z Running 22 items in this shard: test/test_utils_config_module.py::TestConfigModule::test_alias, test/test_utils_config_module.py::TestConfigModule::test_bad_jk_type, test/test_utils_config_module.py::TestConfigModule::test_base_value_loading, test/test_utils_config_module.py::TestConfigModule::test_codegen_config, test/test_utils_config_module.py::TestConfigModule::test_codegen_config_function, test/test_utils_config_module.py::TestConfigModule::test_dict_copy_semantics, test/test_utils_config_module.py::TestConfigModule::test_env_name_semantics, test/test_utils_config_module.py::TestConfigModule::test_env_name_string_semantics, test/test_utils_config_module.py::TestConfigModule::test_get_hash, test/test_utils_config_module.py::TestConfigModule::test_invalid_config_float, test/test_utils_config_module.py::TestConfigModule::test_invalid_config_int, test/test_utils_config_module.py::TestConfigModule::test_make_closur_patcher, test/test_utils_config_module.py::TestConfigModule::test_multi_env, test/test_utils_config_module.py::TestConfigModule::test_none_override_semantics, test/test_utils_config_module.py::TestConfigModule::test_overrides, test/test_utils_config_module.py::TestConfigModule::test_patch, test/test_utils_config_module.py::TestConfigModule::test_reference_is_default, test/test_utils_config_module.py::TestConfigModule::test_reference_semantics, test/test_utils_config_module.py::TestConfigModule::test_save_config, test/test_utils_config_module.py::TestConfigModule::test_save_config_portable, test/test_utils_config_module.py::TestConfigModule::test_type_loading, test/test_utils_config_module.py::TestConfigModule::test_unittest_patch 2025-12-04T15:44:37.4899769Z 2025-12-04T15:44:37.4900136Z Finished test_utils_config_module 1/1 ... [2025-12-04 15:44:37.487639][23481.608858775], took 0.09min 2025-12-04T15:44:37.5243106Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_utils_config_module/test_utils_config_module-5af1ad5b3b73495f.xml 2025-12-04T15:44:37.5544074Z Running test_hop_infra 1/1 ... [2025-12-04 15:44:37.554206][23481.675426856] 2025-12-04T15:44:37.5544597Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:44:37.5548027Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_hop_infra.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:44:37.554573] 2025-12-04T15:44:43.2767841Z 2025-12-04T15:44:43.2769118Z test_hop_infra 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_hop_infra_1.1_6308f2f4df0add62_.log 2025-12-04T15:44:43.2770839Z Running 3 items in this shard: test/test_hop_infra.py::TestHOPInfra::test_all_hops_are_imported, test/test_hop_infra.py::TestHOPInfra::test_all_hops_have_opinfo, test/test_hop_infra.py::TestHOPInfra::test_imports_from_all_work 2025-12-04T15:44:43.2772093Z 2025-12-04T15:44:43.2772393Z Finished test_hop_infra 1/1 ... [2025-12-04 15:44:43.276580][23487.397800316], took 0.10min 2025-12-04T15:44:43.3129661Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_hop_infra/test_hop_infra-7733a804aa6d4c14.xml 2025-12-04T15:44:43.3436202Z Running test_appending_byte_serializer 1/1 ... [2025-12-04 15:44:43.343420][23487.464639831] 2025-12-04T15:44:43.3436820Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:44:43.3440260Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_appending_byte_serializer.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:44:43.343796] 2025-12-04T15:44:48.6655024Z 2025-12-04T15:44:48.6656094Z test_appending_byte_serializer 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_appending_byte_serializer_1.1_b51fec74da87aec6_.log 2025-12-04T15:44:48.6658539Z Running 3 items in this shard: test/test_appending_byte_serializer.py::TestAppendingByteSerializer::test_checksum, test/test_appending_byte_serializer.py::TestAppendingByteSerializer::test_write_and_read_class, test/test_appending_byte_serializer.py::TestAppendingByteSerializer::test_write_and_read_int 2025-12-04T15:44:48.6660122Z 2025-12-04T15:44:48.6660529Z Finished test_appending_byte_serializer 1/1 ... [2025-12-04 15:44:48.665319][23492.786539262], took 0.09min 2025-12-04T15:44:48.7019654Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_appending_byte_serializer/test_appending_byte_serializer-65501f8364394f57.xml 2025-12-04T15:44:48.7448749Z Running test_license 1/1 ... [2025-12-04 15:44:48.744664][23492.865884176] 2025-12-04T15:44:48.7449266Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:44:48.7453046Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_license.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:44:48.745066] 2025-12-04T15:44:53.9667329Z 2025-12-04T15:44:53.9668240Z test_license 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_license_1.1_af7f8f91f8ff69a5_.log 2025-12-04T15:44:53.9669579Z Running 2 items in this shard: test/test_license.py::TestLicense::test_distinfo_license, test/test_license.py::TestLicense::test_license_for_wheel 2025-12-04T15:44:53.9670340Z 2025-12-04T15:44:53.9670625Z Finished test_license 1/1 ... [2025-12-04 15:44:53.966521][23498.087740495], took 0.09min 2025-12-04T15:44:54.0032964Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_license/test_license-56f2b4cb04cabe36.xml 2025-12-04T15:44:54.0364410Z Running test_ao_sparsity 1/1 ... [2025-12-04 15:44:54.036208][23498.157427596] 2025-12-04T15:44:54.0364950Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:44:54.0368433Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ao_sparsity.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:44:54.036616] 2025-12-04T15:45:12.5264277Z 2025-12-04T15:45:12.5265280Z test_ao_sparsity 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_ao_sparsity_1.1_435c6c94cb7afb29_.log 2025-12-04T15:45:12.5298684Z Running 88 items in this shard: test/test_ao_sparsity.py::TestQuantizedSparseKernels::test_sparse_qlinear, test/test_ao_sparsity.py::TestQuantizedSparseLayers::test_sparse_qlinear, test/test_ao_sparsity.py::TestQuantizedSparseLayers::test_sparse_qlinear_serdes, test/test_ao_sparsity.py::TestFakeSparsity::test_jit_trace, test/test_ao_sparsity.py::TestFakeSparsity::test_masking_logic, test/test_ao_sparsity.py::TestFakeSparsity::test_state_dict_preserved, test/test_ao_sparsity.py::TestFakeSparsity::test_weights_parametrized, test/test_ao_sparsity.py::TestCubicScheduler::test_constructor, test/test_ao_sparsity.py::TestCubicScheduler::test_step, test/test_ao_sparsity.py::TestScheduler::test_constructor, test/test_ao_sparsity.py::TestScheduler::test_lambda_scheduler, test/test_ao_sparsity.py::TestScheduler::test_order_of_steps, test/test_ao_sparsity.py::TestScheduler::test_step, test/test_ao_sparsity.py::TestBaseSparsifier::test_constructor, test/test_ao_sparsity.py::TestBaseSparsifier::test_convert, test/test_ao_sparsity.py::TestBaseSparsifier::test_mask_squash, test/test_ao_sparsity.py::TestBaseSparsifier::test_mask_squash_with_params1, test/test_ao_sparsity.py::TestBaseSparsifier::test_mask_squash_with_params2, test/test_ao_sparsity.py::TestBaseSparsifier::test_mask_squash_with_params3, test/test_ao_sparsity.py::TestBaseSparsifier::test_prepare_config, test/test_ao_sparsity.py::TestBaseSparsifier::test_state_dict, test/test_ao_sparsity.py::TestBaseSparsifier::test_step, test/test_ao_sparsity.py::TestNearlyDiagonalSparsifier::test_constructor, test/test_ao_sparsity.py::TestNearlyDiagonalSparsifier::test_mask_squash, test/test_ao_sparsity.py::TestNearlyDiagonalSparsifier::test_prepare, test/test_ao_sparsity.py::TestNearlyDiagonalSparsifier::test_sparsity_levels, test/test_ao_sparsity.py::TestNearlyDiagonalSparsifier::test_step, test/test_ao_sparsity.py::TestWeightNormSparsifier::test_constructor, test/test_ao_sparsity.py::TestWeightNormSparsifier::test_mask_squash, test/test_ao_sparsity.py::TestWeightNormSparsifier::test_prepare, test/test_ao_sparsity.py::TestWeightNormSparsifier::test_sparsity_levels, test/test_ao_sparsity.py::TestWeightNormSparsifier::test_step, test/test_ao_sparsity.py::TestWeightNormSparsifier::test_step_2_of_4, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_complex_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_constructor, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prepare_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prepare_linear, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_conv2d_activation_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_conv2d_bias_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_conv2d_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_conv2d_padding_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_conv2d_pool_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_linear_activation_linear, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_linear_bias_linear, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_linear_linear, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_lstm_layernorm_linear_multiple_layer, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_lstm_layernorm_linear_single_layer, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_lstm_linear_multiple_layer, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_prune_lstm_linear_single_layer, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_step_conv2d, test/test_ao_sparsity.py::TestBaseStructuredSparsifier::test_step_linear, test/test_ao_sparsity.py::TestFPGMPruner::test_compute_distance, test/test_ao_sparsity.py::TestFPGMPruner::test_update_mask, test/test_ao_sparsity.py::TestSaliencyPruner::test_lstm_saliency_pruner_update_mask, test/test_ao_sparsity.py::TestSaliencyPruner::test_saliency_pruner_update_mask, test/test_ao_sparsity.py::TestComposability::test_convert_without_squash_mask, test/test_ao_sparsity.py::TestComposability::test_fusion_before_s_prep, test/test_ao_sparsity.py::TestComposability::test_q_prep_before_s_prep, test/test_ao_sparsity.py::TestComposability::test_qat_prep_before_s_prep, test/test_ao_sparsity.py::TestComposability::test_s_prep_before_fusion, test/test_ao_sparsity.py::TestComposability::test_s_prep_before_q_prep, test/test_ao_sparsity.py::TestComposability::test_s_prep_before_qat_prep, test/test_ao_sparsity.py::TestFxComposability::test_q_prep_fx_before_s_prep, test/test_ao_sparsity.py::TestFxComposability::test_q_prep_fx_s_prep_ref_conv, test/test_ao_sparsity.py::TestFxComposability::test_s_prep_before_q_prep_fx, test/test_ao_sparsity.py::TestFxComposability::test_s_prep_before_qat_prep_fx, test/test_ao_sparsity.py::TestFxComposability::test_s_prep_q_prep_fx_ref, test/test_ao_sparsity.py::TestActivationSparsifier::test_activation_sparsifier, test/test_ao_sparsity.py::TestBaseDataScheduler::test_constructor, test/test_ao_sparsity.py::TestBaseDataScheduler::test_order_of_steps, test/test_ao_sparsity.py::TestBaseDataScheduler::test_state_dict, test/test_ao_sparsity.py::TestBaseDataScheduler::test_step, test/test_ao_sparsity.py::TestBaseDataSparsifier::test_nn_embeddings, test/test_ao_sparsity.py::TestBaseDataSparsifier::test_nn_parameters, test/test_ao_sparsity.py::TestBaseDataSparsifier::test_tensors, test/test_ao_sparsity.py::TestNormDataSparsifiers::test_nn_embeddings, test/test_ao_sparsity.py::TestNormDataSparsifiers::test_nn_parameters, test/test_ao_sparsity.py::TestNormDataSparsifiers::test_tensors, test/test_ao_sparsity.py::TestQuantizationUtils::test_ptq_quantize_first, test/test_ao_sparsity.py::TestQuantizationUtils::test_ptq_sparsify_first, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_fqn_to_module, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_fqn_to_module_fail, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_fqn_to_module_for_tensors, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_get_arg_info_from_tensor_fqn, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_get_arg_info_from_tensor_fqn_fail, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_module_to_fqn, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_module_to_fqn_fail, test/test_ao_sparsity.py::TestSparsityUtilFunctions::test_module_to_fqn_root 2025-12-04T15:45:12.5331137Z 2025-12-04T15:45:12.5331447Z Finished test_ao_sparsity 1/1 ... [2025-12-04 15:45:12.526345][23516.647562938], took 0.31min 2025-12-04T15:45:12.5631241Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_ao_sparsity/test_ao_sparsity-c7bfc0a40ffe86a3.xml 2025-12-04T15:45:12.6379175Z Running test_proxy_tensor 1/1 ... [2025-12-04 15:45:12.637664][23516.758883118] 2025-12-04T15:45:12.6379709Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:45:12.6383667Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_proxy_tensor.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:45:12.638076] 2025-12-04T15:45:40.9919770Z 2025-12-04T15:45:40.9920803Z test_proxy_tensor 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_proxy_tensor_1.1_6f0b68fe764e3723_.log 2025-12-04T15:45:40.9992243Z Running 176 items in this shard: test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_T244632748, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_allclose, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_amp_cache, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_blowup, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_proxy_tensor_mut, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_random, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_constant_unbind, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_decomp_of_capture, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_decomposition_interpreter, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_empty_like_doesnt_burn_in_defaults, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_inplace_metadata, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_isolated_graphmodule, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_model_double_param, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_model_fwd_bwd, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_model_fwd_bwd_wgtupdate, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_overloads, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_reentrant_dispatch, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_make_fx_simple, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_mode_tracing_factory_function, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_partial_decomp, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pickle_issue89626, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pr_86917, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_functionalization, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_functionalization_view_op, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_linear, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_mode_stack, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_pre_dispatch_no_grad, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_proxy_tensor, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_proxy_tensor_mode_with_decomp_table_preserves_proxy, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_resnet18_backward_trace, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_scalar_device, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_strides, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_tensor_constants, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_trace_subclasses, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_val_metadata_mutation, test/test_proxy_tensor.py::TestGenericProxyTensorReal::test_varargs, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_T244632748, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_allclose, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_amp_cache, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_blowup, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_proxy_tensor_mut, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_random, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_constant_unbind, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_decomp_of_capture, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_decomposition_interpreter, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_empty_like_doesnt_burn_in_defaults, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_inplace_metadata, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_isolated_graphmodule, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_model_double_param, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_model_fwd_bwd, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_model_fwd_bwd_wgtupdate, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_overloads, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_reentrant_dispatch, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_make_fx_simple, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_mode_tracing_factory_function, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_partial_decomp, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pickle_issue89626, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pr_86917, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_functionalization, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_functionalization_view_op, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_linear, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_mode_stack, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_pre_dispatch_no_grad, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_proxy_tensor, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_proxy_tensor_mode_with_decomp_table_preserves_proxy, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_resnet18_backward_trace, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_scalar_device, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_strides, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_tensor_constants, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_trace_subclasses, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_val_metadata_mutation, test/test_proxy_tensor.py::TestGenericProxyTensorFake::test_varargs, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_T244632748, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_allclose, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_amp_cache, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_blowup, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_proxy_tensor_mut, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_random, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_constant_unbind, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_decomp_of_capture, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_decomposition_interpreter, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_empty_like_doesnt_burn_in_defaults, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_inplace_metadata, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_isolated_graphmodule, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_model_double_param, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_model_fwd_bwd, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_model_fwd_bwd_wgtupdate, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_overloads, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_reentrant_dispatch, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_make_fx_simple, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_mode_tracing_factory_function, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_partial_decomp, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pickle_issue89626, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pr_86917, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_functionalization, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_functionalization_view_op, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_linear, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_mode_stack, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_pre_dispatch_no_grad, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_proxy_tensor, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_proxy_tensor_mode_with_decomp_table_preserves_proxy, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_resnet18_backward_trace, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_scalar_device, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_strides, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_tensor_constants, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_trace_subclasses, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_val_metadata_mutation, test/test_proxy_tensor.py::TestGenericProxyTensorSymbolic::test_varargs, test/test_proxy_tensor.py::TestRealProxyTensor::test_error_on_data_dependent_ops, test/test_proxy_tensor.py::TestFakeProxyTensor::test_alias, test/test_proxy_tensor.py::TestFakeProxyTensor::test_fake_tensor_mode, test/test_proxy_tensor.py::TestFakeProxyTensor::test_free_fake, test/test_proxy_tensor.py::TestFakeProxyTensor::test_fused_adam, test/test_proxy_tensor.py::TestFakeProxyTensor::test_issue82547, test/test_proxy_tensor.py::TestFakeProxyTensor::test_meta, test/test_proxy_tensor.py::TestFakeProxyTensor::test_use_fake_and_tensor, test/test_proxy_tensor.py::TestSymbolicTracing::test_adv_index_batch, test/test_proxy_tensor.py::TestSymbolicTracing::test_arange_unbacked_output_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_binary_broadcast, test/test_proxy_tensor.py::TestSymbolicTracing::test_boolean_index, test/test_proxy_tensor.py::TestSymbolicTracing::test_broadcast_shapes, test/test_proxy_tensor.py::TestSymbolicTracing::test_cat, test/test_proxy_tensor.py::TestSymbolicTracing::test_constant_specialization, test/test_proxy_tensor.py::TestSymbolicTracing::test_cpu_scalar_cuda, test/test_proxy_tensor.py::TestSymbolicTracing::test_cumsum_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_debug_interpreter, test/test_proxy_tensor.py::TestSymbolicTracing::test_deduped_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_dynamic_pointwise_scalar, test/test_proxy_tensor.py::TestSymbolicTracing::test_elementwise_meta_with_sym_numbers, test/test_proxy_tensor.py::TestSymbolicTracing::test_expand, test/test_proxy_tensor.py::TestSymbolicTracing::test_fake_tensor_as_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_lowerbound_range_refinement, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_lowerbound_range_refinement_multivariate, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_upperbound_range_refinement, test/test_proxy_tensor.py::TestSymbolicTracing::test_guard_upperbound_range_refinement_multivariate, test/test_proxy_tensor.py::TestSymbolicTracing::test_guards_equal, test/test_proxy_tensor.py::TestSymbolicTracing::test_int_input, test/test_proxy_tensor.py::TestSymbolicTracing::test_invalidate_nonzero, test/test_proxy_tensor.py::TestSymbolicTracing::test_invalidate_nonzero_propagate_real_tensors, test/test_proxy_tensor.py::TestSymbolicTracing::test_item, test/test_proxy_tensor.py::TestSymbolicTracing::test_item_to_constructor, test/test_proxy_tensor.py::TestSymbolicTracing::test_make_fx_with_custom_tracer_preserving_nn_module_stack, test/test_proxy_tensor.py::TestSymbolicTracing::test_mega_guard, test/test_proxy_tensor.py::TestSymbolicTracing::test_metadata, test/test_proxy_tensor.py::TestSymbolicTracing::test_metadata_fresh, test/test_proxy_tensor.py::TestSymbolicTracing::test_mod_gcd_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_multiply_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_neg_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_new_empty, test/test_proxy_tensor.py::TestSymbolicTracing::test_non_deduped_shape, test/test_proxy_tensor.py::TestSymbolicTracing::test_non_symint_size_spec, test/test_proxy_tensor.py::TestSymbolicTracing::test_nonidentity_transitive_guards, test/test_proxy_tensor.py::TestSymbolicTracing::test_reflect_r_over_x, test/test_proxy_tensor.py::TestSymbolicTracing::test_repeat_interleave, test/test_proxy_tensor.py::TestSymbolicTracing::test_repeat_interleave_unbacked_output_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_reshape_divisibility_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_resize_from_zero, test/test_proxy_tensor.py::TestSymbolicTracing::test_return_symint, test/test_proxy_tensor.py::TestSymbolicTracing::test_rmethod, test/test_proxy_tensor.py::TestSymbolicTracing::test_setitem_symint, test/test_proxy_tensor.py::TestSymbolicTracing::test_size_with_tensor, test/test_proxy_tensor.py::TestSymbolicTracing::test_split_unbacked_sizes, test/test_proxy_tensor.py::TestSymbolicTracing::test_sqrt_size, test/test_proxy_tensor.py::TestSymbolicTracing::test_sym_storage_offset, test/test_proxy_tensor.py::TestSymbolicTracing::test_symbolic_repeat_interleave, test/test_proxy_tensor.py::TestSymbolicTracing::test_symint_to_tensor, test/test_proxy_tensor.py::TestSymbolicTracing::test_tensor_symfloat, test/test_proxy_tensor.py::TestSymbolicTracing::test_unary, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_batch_resnet, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_slice, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unification, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unify_dependency_violation, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unify_guard, test/test_proxy_tensor.py::TestSymbolicTracing::test_unbacked_unify_guard_transitivity, test/test_proxy_tensor.py::TestSymbolicTracing::test_view_divisibility_unbacked, test/test_proxy_tensor.py::TestSymbolicTracing::test_view_divisibility_unbacked_relatively_prime 2025-12-04T15:45:41.0062222Z 2025-12-04T15:45:41.0062536Z Finished test_proxy_tensor 1/1 ... [2025-12-04 15:45:40.992059][23545.113278048], took 0.47min 2025-12-04T15:45:41.0295555Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_proxy_tensor/test_proxy_tensor-25f8d712045ddeef.xml 2025-12-04T15:45:41.1100216Z Running torch_np/test_dtype 1/1 ... [2025-12-04 15:45:41.109786][23545.231005542] 2025-12-04T15:45:41.1100756Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:45:41.1104101Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/test_dtype.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:45:41.110172] 2025-12-04T15:45:46.6319729Z 2025-12-04T15:45:46.6321117Z torch_np/test_dtype 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.test_dtype_1.1_d9bfb103178d1533_.log 2025-12-04T15:45:46.6339657Z Running 44 items in this shard: test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'bool_', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'complex128', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'complex64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'float16', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'float32', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'float64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'int16', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'int32', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'int64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'int8', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'uint16', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'uint32', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'uint64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_'uint8', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_bool, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'bool_', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'complex128', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'complex64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'float16', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'float32', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'float64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'int16', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'int32', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'int64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'int8', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'uint16', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'uint32', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'uint64', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.'uint8', test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.bool_, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.complex128, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.complex64, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.dtype('bool'), test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.float16, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.float32, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.float64, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.int16, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.int32, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.int64, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.int8, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.uint16, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.uint32, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.uint64, test/torch_np/test_dtype.py::TestConvertDType::test_convert_np_dtypes_numpy.uint8 2025-12-04T15:45:46.6357529Z 2025-12-04T15:45:46.6357849Z Finished torch_np/test_dtype 1/1 ... [2025-12-04 15:45:46.631792][23550.753013749], took 0.09min 2025-12-04T15:45:46.6697188Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/torch_np.test_dtype/torch_np.test_dtype-a1152bdff0bb5e89.xml 2025-12-04T15:45:46.7023493Z Running test_file_check 1/1 ... [2025-12-04 15:45:46.702117][23550.8233374] 2025-12-04T15:45:46.7024186Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:45:46.7027850Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_file_check.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:45:46.702480] 2025-12-04T15:45:53.9268443Z 2025-12-04T15:45:53.9269731Z test_file_check 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_file_check_1.1_1900e06915def041_.log 2025-12-04T15:45:53.9271065Z Running 2 items in this shard: test/test_file_check.py::TestFileCheck::test_all_python_api, test/test_file_check.py::TestFileCheck::test_not_run 2025-12-04T15:45:53.9271777Z 2025-12-04T15:45:53.9272089Z Finished test_file_check 1/1 ... [2025-12-04 15:45:53.926661][23558.047880258], took 0.12min 2025-12-04T15:45:53.9646909Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_file_check/test_file_check-21e904b3669672b6.xml 2025-12-04T15:45:54.0513832Z Running test_flop_counter 1/1 ... [2025-12-04 15:45:54.051109][23558.172327589] 2025-12-04T15:45:54.0514542Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:45:54.0518047Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_flop_counter.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:45:54.051515] 2025-12-04T15:46:00.1244835Z 2025-12-04T15:46:00.1245963Z test_flop_counter 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_flop_counter_1.1_4d9092c128608eaa_.log 2025-12-04T15:46:00.1253626Z Running 22 items in this shard: test/test_flop_counter.py::TestFlopCounter::test_addmm_out, test/test_flop_counter.py::TestFlopCounter::test_autograd_op, test/test_flop_counter.py::TestFlopCounter::test_backward, test/test_flop_counter.py::TestFlopCounter::test_backward_reset, test/test_flop_counter.py::TestFlopCounter::test_conv_backwards_as_decomposition, test/test_flop_counter.py::TestFlopCounter::test_conv_transpose_loop, test/test_flop_counter.py::TestFlopCounter::test_convs, test/test_flop_counter.py::TestFlopCounter::test_custom, test/test_flop_counter.py::TestFlopCounter::test_custom_op, test/test_flop_counter.py::TestFlopCounter::test_flop_counter_variety, test/test_flop_counter.py::TestFlopCounter::test_hook_registration, test/test_flop_counter.py::TestFlopCounter::test_inference_mode, test/test_flop_counter.py::TestFlopCounter::test_module, test/test_flop_counter.py::TestFlopCounter::test_nested_attention_fake_tensors, test/test_flop_counter.py::TestFlopCounter::test_noop, test/test_flop_counter.py::TestFlopCounter::test_op, test/test_flop_counter.py::TestFlopCounter::test_pytrees, test/test_flop_counter.py::TestFlopCounter::test_scaled_mm, test/test_flop_counter.py::TestFlopCounter::test_sdpa, test/test_flop_counter.py::TestFlopCounter::test_sdpa_nested_tensor, test/test_flop_counter.py::TestFlopCounter::test_torchscript, test/test_flop_counter.py::TestFlopCounter::test_warning 2025-12-04T15:46:00.1260724Z 2025-12-04T15:46:00.1261046Z Finished test_flop_counter 1/1 ... [2025-12-04 15:46:00.124299][23564.245520302], took 0.10min 2025-12-04T15:46:00.1621475Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_flop_counter/test_flop_counter-83e0741caa94eff2.xml 2025-12-04T15:46:00.2417863Z Running backends/xeon/test_launch 1/1 ... [2025-12-04 15:46:00.241518][23564.362737577] 2025-12-04T15:46:00.2418574Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:46:00.2421598Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'backends/xeon/test_launch.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:46:00.241905] 2025-12-04T15:46:07.3159902Z 2025-12-04T15:46:07.3161607Z backends/xeon/test_launch 1/1 was successful, full logs can be found in artifacts with path test/test-reports/backends.xeon.test_launch_1.1_fba14d6e358f5987_.log 2025-12-04T15:46:07.3164933Z Running 2 items in this shard: test/backends/xeon/test_launch.py::TestTorchrun::test_cpu_info, test/backends/xeon/test_launch.py::TestTorchrun::test_multi_threads 2025-12-04T15:46:07.3166581Z 2025-12-04T15:46:07.3167269Z Finished backends/xeon/test_launch 1/1 ... [2025-12-04 15:46:07.315788][23571.437007751], took 0.12min 2025-12-04T15:46:07.3544867Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/backends.xeon.test_launch/backends.xeon.test_launch-35338ec25f37484c.xml 2025-12-04T15:46:07.4417955Z Running test_show_pickle 1/1 ... [2025-12-04 15:46:07.441471][23571.562691227] 2025-12-04T15:46:07.4419015Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:46:07.4422828Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_show_pickle.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:46:07.441917] 2025-12-04T15:46:12.6638670Z 2025-12-04T15:46:12.6639806Z test_show_pickle 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_show_pickle_1.1_979efe7958d3bd6b_.log 2025-12-04T15:46:12.6640934Z Running 1 items in this shard: test/test_show_pickle.py::TestShowPickle::test_scripted_model 2025-12-04T15:46:12.6641426Z 2025-12-04T15:46:12.6641748Z Finished test_show_pickle 1/1 ... [2025-12-04 15:46:12.663644][23576.784864601], took 0.09min 2025-12-04T15:46:12.7017254Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_show_pickle/test_show_pickle-865976a650f0e2c8.xml 2025-12-04T15:46:12.7246308Z Running test_openmp 1/1 ... [2025-12-04 15:46:12.724437][23576.84565774] 2025-12-04T15:46:12.7249329Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:46:12.7251805Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_openmp.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:46:12.724815] 2025-12-04T15:46:22.4027378Z 2025-12-04T15:46:22.4028658Z test_openmp 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_openmp_1.1_db907b0eb0691969_.log 2025-12-04T15:46:22.4030016Z Running 2 items in this shard: test/test_openmp.py::TestOpenMP_ParallelFor::test_n_threads, test/test_openmp.py::TestOpenMP_ParallelFor::test_one_thread 2025-12-04T15:46:22.4030806Z 2025-12-04T15:46:22.4031087Z Finished test_openmp 1/1 ... [2025-12-04 15:46:22.402533][23586.52375379], took 0.16min 2025-12-04T15:46:22.4407660Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_openmp/test_openmp-ab09b4ea4a98cceb.xml 2025-12-04T15:46:22.5214986Z Running test_expanded_weights 1/1 ... [2025-12-04 15:46:22.521268][23586.642487152] 2025-12-04T15:46:22.5215563Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:46:22.5219173Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_expanded_weights.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:46:22.521677] 2025-12-04T15:46:51.5264897Z 2025-12-04T15:46:51.5266228Z test_expanded_weights 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_expanded_weights_1.1_db5bdda645780846_.log 2025-12-04T15:46:51.5396654Z Running 220 items in this shard: test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_forward_helper_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_forward_helper_failure_args_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_set_grad_sample_if_exists_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_set_grad_sample_if_exists_failure_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_sum_over_all_but_batch_and_last_n_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_unpack_expanded_weight_or_tensor_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_unpack_expanded_weight_or_tensor_failure_cuda, test/test_expanded_weights.py::TestExpandedWeightHelperFunctionCUDA::test_unpack_expanded_weight_or_tensor_with_custom_function_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_cnn_model_mean_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_cnn_model_sum_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_embedding_model_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_error_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_complex128, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_complex32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_complex64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv1d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_complex128, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_complex32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_complex64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv2d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_complex128, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_complex32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_complex64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_conv3d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_embedding_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_embedding_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_embedding_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_embedding_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_group_norm_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_group_norm_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_group_norm_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_group_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_instance_norm_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_instance_norm_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_instance_norm_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_instance_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_layer_norm_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_layer_norm_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_layer_norm_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_layer_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_linear_cuda_bfloat16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_linear_cuda_complex128, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_linear_cuda_complex64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_linear_cuda_float16, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_linear_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_forward_nn_functional_linear_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_conv1d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_conv2d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_conv3d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_embedding_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_group_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_instance_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_layer_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_mean_nn_functional_linear_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_conv1d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_conv2d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_conv3d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_embedding_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_group_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_instance_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_layer_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weight_per_sample_grad_sum_nn_functional_linear_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_conv1d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_conv2d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_conv3d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_embedding_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_group_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_instance_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_layer_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_expanded_weights_per_sample_grad_input_no_grad_nn_functional_linear_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_group_norm_error_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_group_norm_model_num_dim_1_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_group_norm_model_num_dim_2_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_group_norm_model_num_dim_3_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_instance_norm_model_num_dim_1_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_instance_norm_model_num_dim_2_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_instance_norm_model_num_dim_3_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_layer_norm_model_num_dim_1_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_layer_norm_model_num_dim_2_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_layer_norm_model_num_dim_3_cuda, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_conv1d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_conv2d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_conv3d_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_embedding_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_group_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_instance_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_layer_norm_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightFunctionalCUDA::test_unsupported_expand_weights_nn_functional_linear_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_circular_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_circular_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_circular_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad1_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad1_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad1_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad1size1_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad1size1_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad1size1_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad2size1_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad2size1_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_pad2size1_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_reflect_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_reflect_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_reflect_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_replicate_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_replicate_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_replicate_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_stride_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_stride_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_stride_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_zero_batch_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_zero_batch_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_zero_batch_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_zeros_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_zeros_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv1d_zeros_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_circular_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_circular_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_circular_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_dilated_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_dilated_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_dilated_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_no_bias_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_no_bias_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_no_bias_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_padding_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_padding_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_padding_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_reflect_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_reflect_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_reflect_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_replicate_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_replicate_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_replicate_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_strided_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_strided_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_strided_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_zero_batch_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_zero_batch_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_zero_batch_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_zeros_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_zeros_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv2d_zeros_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_1x1x1_no_bias_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_1x1x1_no_bias_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_1x1x1_no_bias_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_circular_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_circular_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_circular_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_no_bias_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_no_bias_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_no_bias_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_replicate_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_replicate_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_replicate_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_stride_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_stride_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_stride_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_stride_padding_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_stride_padding_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_stride_padding_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_zero_batch_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_zero_batch_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_zero_batch_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_zeros_stride2_pad2_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_zeros_stride2_pad2_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Conv3d_zeros_stride2_pad2_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Embedding_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Embedding_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Embedding_discontiguous_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Embedding_discontiguous_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Embedding_discontiguous_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Embedding_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_LayerNorm_3d_no_affine_large_feature_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_LayerNorm_3d_no_affine_large_feature_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_LayerNorm_3d_no_affine_large_feature_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_no_batch_dim_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_no_batch_dim_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_no_batch_dim_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_no_bias_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_no_bias_cuda_double_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_Linear_no_bias_multiple_inputs_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_GRU_eval_mode_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_GRU_eval_mode_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_GRU_train_mode_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_GRU_train_mode_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_LSTM_eval_mode_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_LSTM_eval_mode_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_LSTM_train_mode_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_LSTM_train_mode_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_RNN_eval_mode_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_RNN_eval_mode_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_RNN_train_mode_cuda_float32, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_module_nn_RNN_train_mode_cuda_float64, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_per_sample_api_compute_batch_size_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_per_sample_api_compute_batch_size_not_pytreeable_cuda, test/test_expanded_weights.py::TestExpandedWeightModuleCUDA::test_per_sample_api_failing_cuda 2025-12-04T15:46:51.5523325Z 2025-12-04T15:46:51.5523751Z Finished test_expanded_weights 1/1 ... [2025-12-04 15:46:51.526786][23615.648001768], took 0.48min 2025-12-04T15:46:51.5708115Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_expanded_weights/test_expanded_weights-e8def21a8b982d80.xml 2025-12-04T15:46:51.6860823Z Running functorch/test_ac_knapsack 1/1 ... [2025-12-04 15:46:51.685769][23615.806988338] 2025-12-04T15:46:51.6861415Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:46:51.6864368Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_ac_knapsack.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:46:51.686189] 2025-12-04T15:46:57.0579940Z 2025-12-04T15:46:57.0581039Z functorch/test_ac_knapsack 1/1 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_ac_knapsack_1.1_69e4982e1e9d819e_.log 2025-12-04T15:46:57.0591134Z Running 17 items in this shard: test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_full_joint_nx_graph, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_get_knapsack_memory_input, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_get_knapsack_runtime_input, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_get_non_ac_peak_memory, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_get_theoretical_max_runtime, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_inialize_from_graph, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_recomputable_node_only_graph, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_recomputable_node_only_graph_with_larger_graph_context, test/functorch/test_ac_knapsack.py::TestGraphInfoProvider::test_simplified_fx_joint_graph, test/functorch/test_ac_knapsack.py::TestKnapsackEvaluator::test_evaluate_distribution_of_results_for_knapsack_algo, test/functorch/test_ac_knapsack.py::TestKnapsackEvaluator::test_evaluate_knapsack_output_accounting_for_backward_pass, test/functorch/test_ac_knapsack.py::TestKnapsackEvaluator::test_evaluate_knapsack_output_not_accounting_for_backward_pass, test/functorch/test_ac_knapsack.py::TestKnapsackEvaluator::test_evaluate_knapsack_output_with_wrong_sized_values, test/functorch/test_ac_knapsack.py::TestKnapsackEvaluator::test_get_backward_memory_from_topologically_sorted_graph, test/functorch/test_ac_knapsack.py::TestKnapsackEvaluator::test_get_knee_point_memory_budget, test/functorch/test_ac_knapsack.py::TestActivationCheckpointingKnapsack::test_dp_knapsack, test/functorch/test_ac_knapsack.py::TestActivationCheckpointingKnapsack::test_dp_knapsack_sliding_hirschberg 2025-12-04T15:46:57.0600369Z 2025-12-04T15:46:57.0600740Z Finished functorch/test_ac_knapsack 1/1 ... [2025-12-04 15:46:57.057826][23621.179039272], took 0.09min 2025-12-04T15:46:57.0964550Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/functorch.test_ac_knapsack/functorch.test_ac_knapsack-e93133af03c79b6b.xml 2025-12-04T15:46:57.1296215Z Running torch_np/test_nep50_examples 1/1 ... [2025-12-04 15:46:57.129373][23621.250593681] 2025-12-04T15:46:57.1296822Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:46:57.1299911Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/test_nep50_examples.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:46:57.129731] 2025-12-04T15:47:04.8548801Z 2025-12-04T15:47:04.8549982Z torch_np/test_nep50_examples 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.test_nep50_examples_1.1_7e6125dc6a887b5d_.log 2025-12-04T15:47:04.9411842Z Running 1573 items in this shard: test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_3j + array(3, complex64), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_True + uint8(2), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array(1_0, float32) + 1e-14 == 1_0, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([0_1], float32) == float64(0_1), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([100], uint8) + 200, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1], uint8) + 1, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1], uint8) + 200, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1], uint8) + 300, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1], uint8) + array(1, int64), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1], uint8) + int64(1), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1_0], float32) + 1e-14 == 1_0, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1_], float32) + 3, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1_], float32) + array(1_, float64), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1_], float32) + float64(1_), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_array([1_], float32) + int64(3), test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_bool_(True) + 1, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_float32(1) + 1j, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_float32(1) + 3e100, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_float32(5) + 5j, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_int16(2) + 2, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_int16(4) + 4j, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_int32(1) + 5j, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_uint8(1) + 2, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_uint8(1) + 300, test/torch_np/test_nep50_examples.py::TestNEP50Table::test_nep50_exceptions_example_uint8(100) + 200, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_add_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_arctan2_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_and_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_or_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_bitwise_xor_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_copysign_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divide_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_divmod_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_equal_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_float_power_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_floor_divide_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmax_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmin_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_fmod_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_gcd_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_equal_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_greater_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_heaviside_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_hypot_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_lcm_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_ldexp_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_left_shift_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_equal_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_less_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp2_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logaddexp_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_and_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_or_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_logical_xor_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_matmul_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_maximum_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_minimum_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_mod_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_modf_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_multiply_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_nextafter_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_not_equal_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_power_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_remainder_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_right_shift_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_subtract_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar27_array27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar28_array28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar29_array29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar30_array30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar31_array31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar32_array32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar33_array33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar34_array34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar35_array35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_1_array9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_2_0_array26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_compare_ufuncs_name_true_divide_scalar_True_array8, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar27_array27_dtype27, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar28_array28_dtype28, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar29_array29_dtype29, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar30_array30_dtype30, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar31_array31_dtype31, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar32_array32_dtype32, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar33_array33_dtype33, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar34_array34_dtype34, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar35_array35_dtype35, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array10_dtype10, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array11_dtype11, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array12_dtype12, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array13_dtype13, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array14_dtype14, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array15_dtype15, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array16_dtype16, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array17_dtype17, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_1_array9_dtype9, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array18_dtype18, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array19_dtype19, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array20_dtype20, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array21_dtype21, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array22_dtype22, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array23_dtype23, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array24_dtype24, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array25_dtype25, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_2_0_array26_dtype26, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array0_dtype0, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array1_dtype1, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array2_dtype2, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array3_dtype3, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array4_dtype4, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array5_dtype5, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array6_dtype6, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array7_dtype7, test/torch_np/test_nep50_examples.py::TestCompareToNumpy::test_direct_compare_scalar_True_array8_dtype8 2025-12-04T15:47:05.0265004Z 2025-12-04T15:47:05.0265402Z Finished torch_np/test_nep50_examples 1/1 ... [2025-12-04 15:47:04.857335][23628.978550147], took 0.13min 2025-12-04T15:47:05.0266928Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/torch_np.test_nep50_examples/torch_np.test_nep50_examples-4b9983ced205ee32.xml 2025-12-04T15:47:05.0268120Z Running test_torch 1/1 ... [2025-12-04 15:47:05.020231][23629.14145036] 2025-12-04T15:47:05.0268605Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:47:05.0269846Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_torch.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:47:05.020624] 2025-12-04T15:48:49.6968933Z 2025-12-04T15:48:49.6969858Z test_torch 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_torch_1.1_3a3aff20d8da01a9_.log 2025-12-04T15:48:49.7337855Z Running 976 items in this shard: test/test_torch.py::TestBasicVitalSigns::test_basic_vitals, test/test_torch.py::TestBasicVitalSigns::test_basic_vitals_read_write, test/test_torch.py::TestBasicVitalSigns::test_dataloader_vitals, test/test_torch.py::TestTorch::test_RNGState, test/test_torch.py::TestTorch::test_RNGStateAliasing, test/test_torch.py::TestTorch::test_RNG_after_pickle, test/test_torch.py::TestTorch::test_Size, test/test_torch.py::TestTorch::test_Size_concat_non_tuple_sequence, test/test_torch.py::TestTorch::test_Size_concat_wildcard, test/test_torch.py::TestTorch::test_Size_iter, test/test_torch.py::TestTorch::test_Size_scalar, test/test_torch.py::TestTorch::test_add_meta_scalar, test/test_torch.py::TestTorch::test_allow_tensor_metadata_change, test/test_torch.py::TestTorch::test_apply, test/test_torch.py::TestTorch::test_as_subclass, test/test_torch.py::TestTorch::test_assert_async, test/test_torch.py::TestTorch::test_backward_hooks_traverse, test/test_torch.py::TestTorch::test_batch_norm_cpu_inference, test/test_torch.py::TestTorch::test_bf16_supported_on_cpu, test/test_torch.py::TestTorch::test_bmm_multithreaded, test/test_torch.py::TestTorch::test_boxMullerState, test/test_torch.py::TestTorch::test_cat_neg_dim, test/test_torch.py::TestTorch::test_check, test/test_torch.py::TestTorch::test_chunk_neg_dim, test/test_torch.py::TestTorch::test_conj_neg_tolist, test/test_torch.py::TestTorch::test_conj_physical_meta_stride, test/test_torch.py::TestTorch::test_contains, test/test_torch.py::TestTorch::test_copy_broadcast, test/test_torch.py::TestTorch::test_copy_dtypes, test/test_torch.py::TestTorch::test_copy_float16, test/test_torch.py::TestTorch::test_copy_many_to_one, test/test_torch.py::TestTorch::test_copy_transpose, test/test_torch.py::TestTorch::test_cuda_not_built, test/test_torch.py::TestTorch::test_cummax_neg_dim, test/test_torch.py::TestTorch::test_cummin_neg_dim, test/test_torch.py::TestTorch::test_cumprod_neg_dim, test/test_torch.py::TestTorch::test_cumsum_neg_dim, test/test_torch.py::TestTorch::test_cxx_flags, test/test_torch.py::TestTorch::test_data_ptr_of_empty_tensor_with_storage, test/test_torch.py::TestTorch::test_data_ptr_of_empty_view_with_storage, test/test_torch.py::TestTorch::test_deepcopy_gradient, test/test_torch.py::TestTorch::test_deepcopy_parameter, test/test_torch.py::TestTorch::test_deterministic_fill_uninitialized_memory, test/test_torch.py::TestTorch::test_deterministic_flag, test/test_torch.py::TestTorch::test_device, test/test_torch.py::TestTorch::test_dim_order, test/test_torch.py::TestTorch::test_dir, test/test_torch.py::TestTorch::test_doc, test/test_torch.py::TestTorch::test_doc_template, test/test_torch.py::TestTorch::test_dot_data_use, test/test_torch.py::TestTorch::test_dtype_is_signed, test/test_torch.py::TestTorch::test_element_size, test/test_torch.py::TestTorch::test_empty_meta, test/test_torch.py::TestTorch::test_empty_storage_view, test/test_torch.py::TestTorch::test_equal, test/test_torch.py::TestTorch::test_error_msg_type_translation, test/test_torch.py::TestTorch::test_fill_diagonal, test/test_torch.py::TestTorch::test_format_scalar_meta, test/test_torch.py::TestTorch::test_from_buffer, test/test_torch.py::TestTorch::test_from_file, test/test_torch.py::TestTorch::test_gather_neg_dim, test/test_torch.py::TestTorch::test_generator_cpu, test/test_torch.py::TestTorch::test_get_cpu_capability, test/test_torch.py::TestTorch::test_has_internal_overlap, test/test_torch.py::TestTorch::test_has_storage, test/test_torch.py::TestTorch::test_index_add, test/test_torch.py::TestTorch::test_index_add_all_dtypes, test/test_torch.py::TestTorch::test_index_add_cornercase, test/test_torch.py::TestTorch::test_index_add_correctness, test/test_torch.py::TestTorch::test_index_add_neg_dim, test/test_torch.py::TestTorch::test_index_copy_neg_dim, test/test_torch.py::TestTorch::test_index_fill_neg_dim, test/test_torch.py::TestTorch::test_index_select_neg_dim, test/test_torch.py::TestTorch::test_invalid_arg_error_handling, test/test_torch.py::TestTorch::test_invalid_generator_raises, test/test_torch.py::TestTorch::test_is_nonzero, test/test_torch.py::TestTorch::test_is_same_size, test/test_torch.py::TestTorch::test_iter, test/test_torch.py::TestTorch::test_kthvalue_neg_dim, test/test_torch.py::TestTorch::test_linspace_logspace, test/test_torch.py::TestTorch::test_logcumsumexp_neg_dim, test/test_torch.py::TestTorch::test_manual_seed, test/test_torch.py::TestTorch::test_map, test/test_torch.py::TestTorch::test_map2, test/test_torch.py::TestTorch::test_max_neg_dim, test/test_torch.py::TestTorch::test_mean_neg_dim, test/test_torch.py::TestTorch::test_median_neg_dim, test/test_torch.py::TestTorch::test_memory_format, test/test_torch.py::TestTorch::test_memory_format_contiguous_returns_same_tensor_if_already_satisfies, test/test_torch.py::TestTorch::test_memory_format_empty, test/test_torch.py::TestTorch::test_min_neg_dim, test/test_torch.py::TestTorch::test_mode_neg_dim, test/test_torch.py::TestTorch::test_multinomial_invalid_probs, test/test_torch.py::TestTorch::test_nanmedian_neg_dim, test/test_torch.py::TestTorch::test_narrow_neg_dim, test/test_torch.py::TestTorch::test_nbytes, test/test_torch.py::TestTorch::test_ndim, test/test_torch.py::TestTorch::test_new, test/test_torch.py::TestTorch::test_newaxis_numpy_comparison, test/test_torch.py::TestTorch::test_newindex, test/test_torch.py::TestTorch::test_no_cuda_monkeypatch, test/test_torch.py::TestTorch::test_norm_neg_dim, test/test_torch.py::TestTorch::test_normal_shape, test/test_torch.py::TestTorch::test_numel, test/test_torch.py::TestTorch::test_parallel_info, test/test_torch.py::TestTorch::test_parsing_double, test/test_torch.py::TestTorch::test_parsing_int64, test/test_torch.py::TestTorch::test_parsing_intlist, test/test_torch.py::TestTorch::test_permute, test/test_torch.py::TestTorch::test_pickle, test/test_torch.py::TestTorch::test_pickle_dtype, test/test_torch.py::TestTorch::test_pickle_function, test/test_torch.py::TestTorch::test_pickle_generator, test/test_torch.py::TestTorch::test_pickle_parameter, test/test_torch.py::TestTorch::test_pickle_parameter_no_requires_grad, test/test_torch.py::TestTorch::test_pickle_size, test/test_torch.py::TestTorch::test_pin_memory, test/test_torch.py::TestTorch::test_print, test/test_torch.py::TestTorch::test_prod_neg_dim, test/test_torch.py::TestTorch::test_pyobj_preserved, test/test_torch.py::TestTorch::test_qengine, test/test_torch.py::TestTorch::test_renorm_neg_dim, test/test_torch.py::TestTorch::test_resizable, test/test_torch.py::TestTorch::test_reversed, test/test_torch.py::TestTorch::test_scatter_neg_dim, test/test_torch.py::TestTorch::test_select_neg_dim, test/test_torch.py::TestTorch::test_set_flush_denormal, test/test_torch.py::TestTorch::test_setting_real_imag_to_a_number, test/test_torch.py::TestTorch::test_show_config, test/test_torch.py::TestTorch::test_size_neg_dim, test/test_torch.py::TestTorch::test_size_stride, test/test_torch.py::TestTorch::test_sizeof, test/test_torch.py::TestTorch::test_slice, test/test_torch.py::TestTorch::test_slow_test, test/test_torch.py::TestTorch::test_sobolengine_bounds, test/test_torch.py::TestTorch::test_sobolengine_bounds_scrambled, test/test_torch.py::TestTorch::test_sobolengine_continuing, test/test_torch.py::TestTorch::test_sobolengine_continuing_scrambled, test/test_torch.py::TestTorch::test_sobolengine_default_dtype, test/test_torch.py::TestTorch::test_sobolengine_distribution, test/test_torch.py::TestTorch::test_sobolengine_distribution_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw, test/test_torch.py::TestTorch::test_sobolengine_draw_base2, test/test_torch.py::TestTorch::test_sobolengine_draw_base2_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw_scrambled, test/test_torch.py::TestTorch::test_sobolengine_fast_forward, test/test_torch.py::TestTorch::test_sobolengine_fast_forward_scrambled, test/test_torch.py::TestTorch::test_sobolengine_first_point, test/test_torch.py::TestTorch::test_sobolengine_high_dim, test/test_torch.py::TestTorch::test_sobolengine_raise, test/test_torch.py::TestTorch::test_sobolengine_reset, test/test_torch.py::TestTorch::test_sobolengine_reset_scrambled, test/test_torch.py::TestTorch::test_sort_neg_dim, test/test_torch.py::TestTorch::test_split_neg_dim, test/test_torch.py::TestTorch::test_split_with_sizes_copy_out, test/test_torch.py::TestTorch::test_squeeze_neg_dim, test/test_torch.py::TestTorch::test_std_neg_dim, test/test_torch.py::TestTorch::test_storage_base_init, test/test_torch.py::TestTorch::test_storage_base_new, test/test_torch.py::TestTorch::test_storage_byteswap, test/test_torch.py::TestTorch::test_storage_casts, test/test_torch.py::TestTorch::test_storage_cycle_via_dict, test/test_torch.py::TestTorch::test_storage_cycle_via_slots, test/test_torch.py::TestTorch::test_storage_dead_weak_ref, test/test_torch.py::TestTorch::test_storage_dealloc, test/test_torch.py::TestTorch::test_storage_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_zombie, test/test_torch.py::TestTorch::test_storage_dict_dealloc, test/test_torch.py::TestTorch::test_storage_error, test/test_torch.py::TestTorch::test_storage_error_no_attribute, test/test_torch.py::TestTorch::test_storage_finalizer_dealloc, test/test_torch.py::TestTorch::test_storage_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_zombie, test/test_torch.py::TestTorch::test_storage_preserve_nonhermetic_in_hermetic_context, test/test_torch.py::TestTorch::test_storage_resurrected_weak_ref, test/test_torch.py::TestTorch::test_storage_slot_dealloc, test/test_torch.py::TestTorch::test_storage_thread_safety, test/test_torch.py::TestTorch::test_storage_weakref_dealloc, test/test_torch.py::TestTorch::test_structseq_repr, test/test_torch.py::TestTorch::test_subclass_preserved, test/test_torch.py::TestTorch::test_subclass_tensors, test/test_torch.py::TestTorch::test_sum_neg_dim, test/test_torch.py::TestTorch::test_swap_basic, test/test_torch.py::TestTorch::test_swap_fail_slots, test/test_torch.py::TestTorch::test_t_not_2d_error, test/test_torch.py::TestTorch::test_tensor_base_init, test/test_torch.py::TestTorch::test_tensor_base_new, test/test_torch.py::TestTorch::test_tensor_ctor_scalar, test/test_torch.py::TestTorch::test_tensor_cycle_via_dict, test/test_torch.py::TestTorch::test_tensor_cycle_via_slots, test/test_torch.py::TestTorch::test_tensor_dead_weak_ref, test/test_torch.py::TestTorch::test_tensor_dict_dealloc, test/test_torch.py::TestTorch::test_tensor_finalizer_dealloc, test/test_torch.py::TestTorch::test_tensor_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_tensor_item_no_warning, test/test_torch.py::TestTorch::test_tensor_ressurecting_clear, test/test_torch.py::TestTorch::test_tensor_resurrected_weak_ref, test/test_torch.py::TestTorch::test_tensor_set, test/test_torch.py::TestTorch::test_tensor_set_errors, test/test_torch.py::TestTorch::test_tensor_slot_dealloc, test/test_torch.py::TestTorch::test_tensor_weakref_dealloc, test/test_torch.py::TestTorch::test_tensor_where_scalar, test/test_torch.py::TestTorch::test_tensor_with_grad_to_scalar_warning, test/test_torch.py::TestTorch::test_tensoriterator_output_setup, test/test_torch.py::TestTorch::test_terminate_handler_on_crash, test/test_torch.py::TestTorch::test_to, test/test_torch.py::TestTorch::test_to_with_tensor, test/test_torch.py::TestTorch::test_topk_neg_dim, test/test_torch.py::TestTorch::test_torch_from_file, test/test_torch.py::TestTorch::test_transpose_neg_dim, test/test_torch.py::TestTorch::test_type, test/test_torch.py::TestTorch::test_type_alias, test/test_torch.py::TestTorch::test_type_conversion_via_dtype_name, test/test_torch.py::TestTorch::test_typed_storage_deprecation_warning, test/test_torch.py::TestTorch::test_typed_storage_internal_no_warning, test/test_torch.py::TestTorch::test_unbind_neg_dim, test/test_torch.py::TestTorch::test_unflatten, test/test_torch.py::TestTorch::test_unfold_neg_dim, test/test_torch.py::TestTorch::test_unsqueeze_neg_dim, test/test_torch.py::TestTorch::test_upsample_nearest1d_meta, test/test_torch.py::TestTorch::test_upsample_nearest2d_meta, test/test_torch.py::TestTorch::test_var_neg_dim, test/test_torch.py::TestTorch::test_warn_types, test/test_torch.py::TestTorch::test_wildcard_import, test/test_torch.py::TestVitalSignsCudaCUDA::test_cuda_vitals_gpu_only_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test__local_scalar_dense_with_empty_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_errors_with_cpu_scalars_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_False_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_use_cpu_scalar_True_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_assertRaisesRegex_ignore_msg_non_native_device_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bfloat16_neg_abs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bool_tensor_value_change_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_add_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcdiv_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcmul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_atan2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_copy_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_div_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_eq_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_fmod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ge_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_gt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_le_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lerp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_fill_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_scatter_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_select_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_min_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_mul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ne_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_pow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_remainder_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_sub_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_cuda_backward_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_euclidean_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_grad_p_lt_1_no_nan_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_same_inputs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_all_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_internal_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_not_memory_dense_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_zero_stride_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_complex_half_experimental_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_constants_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_backward_agnostic_to_memory_format_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_math_view_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cov_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cpp_warnings_have_python_context_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_cummin_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummin_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumprod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_64bit_indexing_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_outer_dim_64bit_indexing_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_replication_pad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_device_guard_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dim_function_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_discontiguous_out_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dtypetensor_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_expected_failure_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_deterministic_path_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_one_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scale_will_not_overflow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_deprecated_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_pass_itself_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_accumulation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_separate_unscale_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_multiple_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_penalty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_state_dict_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_sparse_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_update_scale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_type_promotion_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_hook_remove_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_large_inputs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_int64_upsample3d_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_invalid_shapes_grid_sampler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_set_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_signed_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e4m3fn, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e4m3fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e5m2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e5m2fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumprod_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumsum_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_logcumsumexp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lognormal_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_inplace_noncontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_large_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_clone_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_consistency_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_cpu_and_cuda_ops_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_empty_like_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_factory_like_functions_preserve_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_operators_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_preserved_after_permute_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_propagation_rules_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_shortcuts_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_module_share_memory_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_w_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_wo_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_gpu_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_rng_state_advance_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_copy_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_no_nondeterministic_alert_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_no_nondeterministic_alert_interpolate_trilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_CTCLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_EmbeddingBag_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_NLLLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_bincount_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_histc_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_histc_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bicubic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_linear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_trilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_median_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_accumulate_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint2x4, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nullary_op_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pairwise_distance_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_and_graph_partition_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_and_graph_partition_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_and_graph_partition_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_norm_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pickle_gradscaler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pin_memory_from_constructor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_repeat_interleave_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scalar_check_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_non_unique_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_one_dim_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_zero_size_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_serialization_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_default_tensor_type_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_shift_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_skip_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_non_blocking_False_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_non_blocking_True_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_use_count_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_strides_propagation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_sync_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_set_errors_multigpu_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_shape_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_ternary_op_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_untyped_storage_meta_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_warn_always_caught_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_where_scalar_handcrafted_values_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_cpu_devices_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_devices_error_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_broadcast_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_noncontig_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_cuda_device_idx_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_device_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int8, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_uint8, test/test_torch.py::TestDevicePrecisionCUDA::test_index_add_bfloat16_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_multidevice_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_type_conversions_same_device_cuda 2025-12-04T15:48:49.7695924Z 2025-12-04T15:48:49.7696240Z Finished test_torch 1/1 ... [2025-12-04 15:48:49.698047][23733.819265245], took 1.74min 2025-12-04T15:48:49.7697305Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_torch/test_torch-f1fe0f79113cbdd9.xml 2025-12-04T15:48:49.8336021Z Running higher_order_ops/test_with_effects 1/1 ... [2025-12-04 15:48:49.833350][23733.954569541] 2025-12-04T15:48:49.8336638Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:48:49.8339571Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'higher_order_ops/test_with_effects.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:48:49.833723] 2025-12-04T15:49:20.8932036Z 2025-12-04T15:49:20.8935530Z higher_order_ops/test_with_effects 1/1 was successful, full logs can be found in artifacts with path test/test-reports/higher_order_ops.test_with_effects_1.1_2e0c561acdbd87a0_.log 2025-12-04T15:49:20.8945281Z Running 19 items in this shard: test/higher_order_ops/test_with_effects.py::TestWithEffects::test_alias_op, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_compile_aot_eager, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_compile_aot_eager_requires_grad, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_compile_inductor, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_compile_inductor_external_op_return_none, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_effectful_custom_op_with_subclasses, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_effectful_op_in_backward, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_effects_and_aliased_outputs, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_effects_and_input_mutation_is_output, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_effects_and_input_mutation_return, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_effects_and_input_output_view_simple, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_export_invoke_subgraph, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_print, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_print_with_buffer_mutations, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_print_with_input_mutations, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_register_effectful_custom_op, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_regular_effectful_op_in_forward_and_backward, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_regular_effectful_op_only_in_backward, test/higher_order_ops/test_with_effects.py::TestWithEffects::test_torchbind_custom_op 2025-12-04T15:49:20.8954494Z 2025-12-04T15:49:20.8954906Z Finished higher_order_ops/test_with_effects 1/1 ... [2025-12-04 15:49:20.893034][23765.014254113], took 0.52min 2025-12-04T15:49:20.9313987Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/higher_order_ops.test_with_effects/higher_order_ops.test_with_effects-e6b12b25e8a894fd.xml 2025-12-04T15:49:22.3348883Z Uploading artifacts took 1.31 seconds 2025-12-04T15:49:22.3353123Z Running test_weak 1/1 ... [2025-12-04 15:49:22.335114][23766.456333465] 2025-12-04T15:49:22.3353705Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:49:22.3357585Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_weak.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:49:22.335527] 2025-12-04T15:49:31.4635441Z 2025-12-04T15:49:31.4636696Z test_weak 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_weak_1.1_9028b921eb7afade_.log 2025-12-04T15:49:31.4649858Z Running 39 items in this shard: test/test_weak.py::WeakTest::test_make_weak_keyed_dict_from_dict, test/test_weak.py::WeakTest::test_make_weak_keyed_dict_from_weak_keyed_dict, test/test_weak.py::WeakTest::test_make_weak_keyed_dict_repr, test/test_weak.py::WeakTest::test_threaded_weak_key_dict_copy, test/test_weak.py::WeakTest::test_threaded_weak_key_dict_deepcopy, test/test_weak.py::WeakTest::test_weak_keyed_bad_delitem, test/test_weak.py::WeakTest::test_weak_keyed_delitem, test/test_weak.py::WeakTest::test_weak_keyed_dict_popitem, test/test_weak.py::WeakTest::test_weak_keyed_dict_setdefault, test/test_weak.py::WeakTest::test_weak_keyed_dict_update, test/test_weak.py::WeakTest::test_weak_keyed_union_operators, test/test_weak.py::WeakKeyDictionaryTestCase::test_bool, test/test_weak.py::WeakKeyDictionaryTestCase::test_constructor, test/test_weak.py::WeakKeyDictionaryTestCase::test_get, test/test_weak.py::WeakKeyDictionaryTestCase::test_getitem, test/test_weak.py::WeakKeyDictionaryTestCase::test_items, test/test_weak.py::WeakKeyDictionaryTestCase::test_keys, test/test_weak.py::WeakKeyDictionaryTestCase::test_len, test/test_weak.py::WeakKeyDictionaryTestCase::test_pop, test/test_weak.py::WeakKeyDictionaryTestCase::test_popitem, test/test_weak.py::WeakKeyDictionaryTestCase::test_read, test/test_weak.py::WeakKeyDictionaryTestCase::test_setdefault, test/test_weak.py::WeakKeyDictionaryTestCase::test_update, test/test_weak.py::WeakKeyDictionaryTestCase::test_values, test/test_weak.py::WeakKeyDictionaryTestCase::test_write, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_bool, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_constructor, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_get, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_getitem, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_items, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_keys, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_len, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_pop, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_popitem, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_read, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_setdefault, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_update, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_values, test/test_weak.py::WeakKeyDictionaryScriptObjectTestCase::test_write 2025-12-04T15:49:31.4662996Z 2025-12-04T15:49:31.4663265Z Finished test_weak 1/1 ... [2025-12-04 15:49:31.463398][23775.584618884], took 0.15min 2025-12-04T15:49:31.5021947Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_weak/test_weak-d0e879aa5e6c216f.xml 2025-12-04T15:49:31.6079481Z Running test_complex 1/1 ... [2025-12-04 15:49:31.607677][23775.728896239] 2025-12-04T15:49:31.6080014Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:49:31.6083113Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_complex.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:49:31.608038] 2025-12-04T15:49:38.7829741Z 2025-12-04T15:49:38.7830859Z test_complex 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_complex_1.1_f2afd153be3495f4_.log 2025-12-04T15:49:38.7837435Z Running 15 items in this shard: test/test_complex.py::TestComplexTensorCUDA::test_all_cuda_complex128, test/test_complex.py::TestComplexTensorCUDA::test_all_cuda_complex64, test/test_complex.py::TestComplexTensorCUDA::test_any_cuda_complex128, test/test_complex.py::TestComplexTensorCUDA::test_any_cuda_complex64, test/test_complex.py::TestComplexTensorCUDA::test_conj_copy_cuda_complex128, test/test_complex.py::TestComplexTensorCUDA::test_conj_copy_cuda_complex64, test/test_complex.py::TestComplexTensorCUDA::test_dtype_inference_cuda_float16, test/test_complex.py::TestComplexTensorCUDA::test_dtype_inference_cuda_float32, test/test_complex.py::TestComplexTensorCUDA::test_dtype_inference_cuda_float64, test/test_complex.py::TestComplexTensorCUDA::test_eq_cuda_complex128, test/test_complex.py::TestComplexTensorCUDA::test_eq_cuda_complex64, test/test_complex.py::TestComplexTensorCUDA::test_ne_cuda_complex128, test/test_complex.py::TestComplexTensorCUDA::test_ne_cuda_complex64, test/test_complex.py::TestComplexTensorCUDA::test_to_list_cuda_complex128, test/test_complex.py::TestComplexTensorCUDA::test_to_list_cuda_complex64 2025-12-04T15:49:38.7843354Z 2025-12-04T15:49:38.7843644Z Finished test_complex 1/1 ... [2025-12-04 15:49:38.782777][23782.903997081], took 0.12min 2025-12-04T15:49:38.8215940Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_complex/test_complex-85789b75771d41fd.xml 2025-12-04T15:49:38.9163778Z Running test_optim 1/1 ... [2025-12-04 15:49:38.916155][23783.037375017] 2025-12-04T15:49:38.9164275Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:49:38.9167516Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_optim.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:49:38.916528] 2025-12-04T15:55:09.2703422Z 2025-12-04T15:55:09.2704385Z test_optim 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_optim_1.1_ef0be9bf78c331a7_.log 2025-12-04T15:55:09.3193334Z Running 1015 items in this shard: test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_T_cur_reset, test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_lr1_T_mult_1, test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_lr1_T_mult_2, test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_lr1_T_mult_4, test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_lr2, test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_lr3, test/test_optim.py::TestLRScheduler::test_CosineAnnealingWarmRestarts_lr_state_dict, test/test_optim.py::TestLRScheduler::test_add_param_group_does_not_break_reduce_lr_on_plateau_min_lr_list, test/test_optim.py::TestLRScheduler::test_add_param_group_does_not_break_reduce_lr_on_plateau_min_lr_scalar, test/test_optim.py::TestLRScheduler::test_add_param_group_errors_reduce_lr_on_plateau, test/test_optim.py::TestLRScheduler::test_chained_lr1, test/test_optim.py::TestLRScheduler::test_chained_lr2, test/test_optim.py::TestLRScheduler::test_chained_lr2_get_last_lr_before_step, test/test_optim.py::TestLRScheduler::test_chained_lr3, test/test_optim.py::TestLRScheduler::test_chained_lr4, test/test_optim.py::TestLRScheduler::test_chained_lr5, test/test_optim.py::TestLRScheduler::test_closed_form_constantlr, test/test_optim.py::TestLRScheduler::test_closed_form_cos_anneal_lr, test/test_optim.py::TestLRScheduler::test_closed_form_exp_lr, test/test_optim.py::TestLRScheduler::test_closed_form_linearlr, test/test_optim.py::TestLRScheduler::test_closed_form_multi_step_lr, test/test_optim.py::TestLRScheduler::test_closed_form_poly_lr, test/test_optim.py::TestLRScheduler::test_closed_form_step_lr, test/test_optim.py::TestLRScheduler::test_compound_cosanneal_and_exp_lr, test/test_optim.py::TestLRScheduler::test_compound_cosanneal_and_linearlr, test/test_optim.py::TestLRScheduler::test_compound_cosanneal_and_multistep_lr, test/test_optim.py::TestLRScheduler::test_compound_cosanneal_and_step_lr, test/test_optim.py::TestLRScheduler::test_compound_exp_and_linearlr, test/test_optim.py::TestLRScheduler::test_compound_exp_and_multistep_lr, test/test_optim.py::TestLRScheduler::test_compound_linearlr_and_multistep_lr, test/test_optim.py::TestLRScheduler::test_compound_reduce_lr_on_plateau1, test/test_optim.py::TestLRScheduler::test_compound_reduce_lr_on_plateau2, test/test_optim.py::TestLRScheduler::test_compound_reduce_lr_on_plateau3, test/test_optim.py::TestLRScheduler::test_compound_reduce_lr_on_plateau4, test/test_optim.py::TestLRScheduler::test_compound_reduce_lr_on_plateau5, test/test_optim.py::TestLRScheduler::test_compound_step_and_constantlr, test/test_optim.py::TestLRScheduler::test_compound_step_and_exp_lr, test/test_optim.py::TestLRScheduler::test_compound_step_and_multistep_lr, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass0, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass1, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass2, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass3, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass4, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass5, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass6, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass7, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass8, test/test_optim.py::TestLRScheduler::test_constant_initial_lr_LRClass9, test/test_optim.py::TestLRScheduler::test_constant_initial_params_cyclelr, test/test_optim.py::TestLRScheduler::test_constant_initial_params_onecyclelr, test/test_optim.py::TestLRScheduler::test_constant_initial_params_swalr, test/test_optim.py::TestLRScheduler::test_constantlr, test/test_optim.py::TestLRScheduler::test_constantlr_is_constant_for_constant_epoch, test/test_optim.py::TestLRScheduler::test_constantlr_with_epoch, test/test_optim.py::TestLRScheduler::test_cos_anneal_lr, test/test_optim.py::TestLRScheduler::test_cos_anneal_lr_continue, test/test_optim.py::TestLRScheduler::test_cosine_lr_state_dict, test/test_optim.py::TestLRScheduler::test_cosine_then_cyclic, test/test_optim.py::TestLRScheduler::test_cycle_lr_cycle_momentum_fail_with_momentumless_optimizer, test/test_optim.py::TestLRScheduler::test_cycle_lr_cycle_momentum_with_beta1_optimizer, test/test_optim.py::TestLRScheduler::test_cycle_lr_exp_range_mode, test/test_optim.py::TestLRScheduler::test_cycle_lr_exp_range_mode_one_lr, test/test_optim.py::TestLRScheduler::test_cycle_lr_exp_range_mode_step_size_up_down, test/test_optim.py::TestLRScheduler::test_cycle_lr_invalid_mode, test/test_optim.py::TestLRScheduler::test_cycle_lr_removed_after_out_of_scope, test/test_optim.py::TestLRScheduler::test_cycle_lr_scale_fn_restored_from_state_dict, test/test_optim.py::TestLRScheduler::test_cycle_lr_state_dict_picklable, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular2_mode, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular2_mode_one_lr, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular2_mode_step_size_up_down, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular_mode, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular_mode_one_lr, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular_mode_one_lr_no_momentum, test/test_optim.py::TestLRScheduler::test_cycle_lr_triangular_mode_step_size_up_down, test/test_optim.py::TestLRScheduler::test_cycle_lr_with_adam, test/test_optim.py::TestLRScheduler::test_cycle_lr_with_momentumless_optimizer, test/test_optim.py::TestLRScheduler::test_error_when_getlr_has_epoch, test/test_optim.py::TestLRScheduler::test_exp_lr, test/test_optim.py::TestLRScheduler::test_exp_step_lr_state_dict, test/test_optim.py::TestLRScheduler::test_exponential_lr_is_constant_for_constant_epoch, test/test_optim.py::TestLRScheduler::test_get_last_lr_constantlr, test/test_optim.py::TestLRScheduler::test_get_last_lr_linearlr, test/test_optim.py::TestLRScheduler::test_get_last_lr_multi_step_lr, test/test_optim.py::TestLRScheduler::test_get_last_lr_sequentiallr, test/test_optim.py::TestLRScheduler::test_get_last_lr_step_lr, test/test_optim.py::TestLRScheduler::test_lambda_lr, test/test_optim.py::TestLRScheduler::test_lambda_lr_state_dict_fn, test/test_optim.py::TestLRScheduler::test_lambda_lr_state_dict_obj, test/test_optim.py::TestLRScheduler::test_linear_linearlr_is_constant_for_constant_epoch, test/test_optim.py::TestLRScheduler::test_linearlr, test/test_optim.py::TestLRScheduler::test_linearlr_start_factor_limits1, test/test_optim.py::TestLRScheduler::test_linearlr_start_factor_limits2, test/test_optim.py::TestLRScheduler::test_linearlr_with_epoch, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass0, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass1, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass10, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass11, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass12, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass2, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass3, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass4, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass5, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass6, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass7, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass8, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_LRClass9, test/test_optim.py::TestLRScheduler::test_lr_scheduler_checkpoint_on_plateau, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass0_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass0_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass10_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass10_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass11_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass11_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass12_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass12_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass13_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass13_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass14_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass14_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass15_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass15_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass1_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass1_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass2_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass2_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass3_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass3_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass4_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass4_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass5_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass5_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass6_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass6_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass7_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass7_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass8_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass8_weights_only_True, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass9_weights_only_False, test/test_optim.py::TestLRScheduler::test_lr_scheduler_state_dict_load_LRClass9_weights_only_True, test/test_optim.py::TestLRScheduler::test_multi_step_lr, test/test_optim.py::TestLRScheduler::test_multi_step_lr_state_dict, test/test_optim.py::TestLRScheduler::test_multi_step_lr_with_epoch, test/test_optim.py::TestLRScheduler::test_multiplicative_lr, test/test_optim.py::TestLRScheduler::test_multiplicative_lr_with_lr_lambda, test/test_optim.py::TestLRScheduler::test_new_pattern_no_warning, test/test_optim.py::TestLRScheduler::test_new_pattern_no_warning_with_arg, test/test_optim.py::TestLRScheduler::test_new_pattern_no_warning_with_overridden_optim_step, test/test_optim.py::TestLRScheduler::test_no_cyclic_references, test/test_optim.py::TestLRScheduler::test_no_cyclic_references_in_step, test/test_optim.py::TestLRScheduler::test_old_pattern_warning, test/test_optim.py::TestLRScheduler::test_old_pattern_warning_resuming, test/test_optim.py::TestLRScheduler::test_old_pattern_warning_resuming_with_arg, test/test_optim.py::TestLRScheduler::test_old_pattern_warning_with_arg, test/test_optim.py::TestLRScheduler::test_old_pattern_warning_with_overridden_optim_step, test/test_optim.py::TestLRScheduler::test_onecycle_lr_cannot_calculate_total_steps, test/test_optim.py::TestLRScheduler::test_onecycle_lr_cosine_annealing, test/test_optim.py::TestLRScheduler::test_onecycle_lr_invalid_anneal_strategy, test/test_optim.py::TestLRScheduler::test_onecycle_lr_invalid_pct_start, test/test_optim.py::TestLRScheduler::test_onecycle_lr_legacy_state_dict, test/test_optim.py::TestLRScheduler::test_onecycle_lr_linear_annealing, test/test_optim.py::TestLRScheduler::test_onecycle_lr_linear_annealing_three_phases, test/test_optim.py::TestLRScheduler::test_poly_lr, test/test_optim.py::TestLRScheduler::test_polynomial_lr_is_constant_for_constant_epoch, test/test_optim.py::TestLRScheduler::test_raise_error_when_last_epoch_is_greater_than_0_and_initial_lr_is_not_specified, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau1, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau2, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau3, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau4, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau5, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau6, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau7, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau8, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau_get_last_lr_before_step, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau_preserves_lr_type, test/test_optim.py::TestLRScheduler::test_reduce_lr_on_plateau_state_dict, test/test_optim.py::TestLRScheduler::test_sequentiallr1, test/test_optim.py::TestLRScheduler::test_sequentiallr2, test/test_optim.py::TestLRScheduler::test_sequentiallr3, test/test_optim.py::TestLRScheduler::test_sequentiallr4, test/test_optim.py::TestLRScheduler::test_sequentiallr5, test/test_optim.py::TestLRScheduler::test_sequentiallr_does_not_alias_lr_and_initial_lr, test/test_optim.py::TestLRScheduler::test_sequentiallr_no_warnings, test/test_optim.py::TestLRScheduler::test_step_lr, test/test_optim.py::TestLRScheduler::test_step_lr_is_constant_for_constant_epoch, test/test_optim.py::TestLRScheduler::test_step_lr_state_dict, test/test_optim.py::TestLRScheduler::test_swa_lr_state_dict, test/test_optim.py::TestLRScheduler::test_swalr_cosine_anneal_after_multiplicative, test/test_optim.py::TestLRScheduler::test_swalr_hypers, test/test_optim.py::TestLRScheduler::test_swalr_linear_anneal_after_multiplicative, test/test_optim.py::TestLRScheduler::test_swalr_no_anneal, test/test_optim.py::TestDifferentiableOptimizer::test_adadelta, test/test_optim.py::TestDifferentiableOptimizer::test_adagrad, test/test_optim.py::TestDifferentiableOptimizer::test_adam, test/test_optim.py::TestDifferentiableOptimizer::test_adam_differentiable_all_hyperparams, test/test_optim.py::TestDifferentiableOptimizer::test_adam_differentiable_betas, test/test_optim.py::TestDifferentiableOptimizer::test_adam_differentiable_lr, test/test_optim.py::TestDifferentiableOptimizer::test_adam_differentiable_weight_decay, test/test_optim.py::TestDifferentiableOptimizer::test_adamax, test/test_optim.py::TestDifferentiableOptimizer::test_adamw, test/test_optim.py::TestDifferentiableOptimizer::test_adamw_differentiable_all_hyperparams, test/test_optim.py::TestDifferentiableOptimizer::test_adamw_differentiable_betas, test/test_optim.py::TestDifferentiableOptimizer::test_adamw_differentiable_lr, test/test_optim.py::TestDifferentiableOptimizer::test_adamw_differentiable_weight_decay, test/test_optim.py::TestDifferentiableOptimizer::test_asgd, test/test_optim.py::TestDifferentiableOptimizer::test_differentiable_lr, test/test_optim.py::TestDifferentiableOptimizer::test_differentiable_weight_decay, test/test_optim.py::TestDifferentiableOptimizer::test_differentiable_weight_decay_and_lr, test/test_optim.py::TestDifferentiableOptimizer::test_nadam, test/test_optim.py::TestDifferentiableOptimizer::test_radam, test/test_optim.py::TestDifferentiableOptimizer::test_rmsprop, test/test_optim.py::TestDifferentiableOptimizer::test_rprop, test/test_optim.py::TestDifferentiableOptimizer::test_sgd, test/test_optim.py::TestSWAUtils::test_averaged_model_all_devices_ema_False, test/test_optim.py::TestSWAUtils::test_averaged_model_all_devices_ema_True, test/test_optim.py::TestSWAUtils::test_averaged_model_default_avg_fn_picklable, test/test_optim.py::TestSWAUtils::test_averaged_model_exponential_use_multi_avg_fn_False_use_buffers_False, test/test_optim.py::TestSWAUtils::test_averaged_model_exponential_use_multi_avg_fn_False_use_buffers_True, test/test_optim.py::TestSWAUtils::test_averaged_model_exponential_use_multi_avg_fn_True_use_buffers_False, test/test_optim.py::TestSWAUtils::test_averaged_model_exponential_use_multi_avg_fn_True_use_buffers_True, test/test_optim.py::TestSWAUtils::test_averaged_model_mixed_device_ema_False, test/test_optim.py::TestSWAUtils::test_averaged_model_mixed_device_ema_True, test/test_optim.py::TestSWAUtils::test_averaged_model_state_dict, test/test_optim.py::TestSWAUtils::test_bn_update_eval_momentum, test/test_optim.py::TestSWAUtils::test_update_bn_cnn, test/test_optim.py::TestSWAUtils::test_update_bn_dnn, test/test_optim.py::TestOptimRenewedCUDA::test_adamw_serialization_cuda, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_False_is_named_optim1_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_from_to_named_state_dict_is_named_optim0_True_is_named_optim1_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_can_load_older_state_dict_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_ASGD_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_Adadelta_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_Adagrad_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_AdamW_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_Adam_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_Adamax_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_LBFGS_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_NAdam_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_RAdam_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_RMSprop_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_Rprop_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_2d_SGD_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_ASGD_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_Adadelta_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_Adagrad_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_AdamW_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_Adam_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_Adamax_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_LBFGS_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_NAdam_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_RAdam_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_RMSprop_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_Rprop_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_complex_SGD_cuda_complex64, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_capturable_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_capturable_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_capturable_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_capturable_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_fused_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_fused_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_fused_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_cpu_load_state_dict_impl_fused_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_deepcopy_copies_all_public_attrs_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_defaults_changed_to_foreach_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_errors_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_ASGD_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_Adadelta_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_Adafactor_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_Adagrad_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_AdamW_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_Adam_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_Adamax_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_NAdam_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_RAdam_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_RMSprop_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_Rprop_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_large_tensor_SGD_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_ASGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_Adadelta_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_Adafactor_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_Adagrad_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_AdamW_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_Adam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_Adamax_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_NAdam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_RAdam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_RMSprop_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_Rprop_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_foreach_matches_forloop_SGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_False_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_False_with_lrsched_True_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_False_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_contiguous_True_with_lrsched_True_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_False_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_forloop_goes_right_direction_multigpu_with_lrsched_True_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_AdamW_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_AdamW_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_AdamW_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_Adam_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_Adam_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_Adam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_SGD_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_SGD_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_cpu_matches_cuda_SGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_fused_does_not_step_if_foundinf_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_does_not_step_if_foundinf_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_does_not_step_if_foundinf_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_does_not_step_if_foundinf_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_error_on_params_on_meta_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_error_on_params_on_meta_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_error_on_params_on_meta_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_error_on_params_on_meta_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_large_tensor_Adagrad_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_large_tensor_AdamW_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_large_tensor_Adam_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_large_tensor_SGD_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adagrad_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adagrad_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adagrad_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_AdamW_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_AdamW_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_AdamW_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adam_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adam_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_Adam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_SGD_cuda_bfloat16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_SGD_cuda_float16, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_fused_matches_forloop_SGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_grads_are_never_inplaced_into_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_nontensor_step_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_post_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_hook_and_prepend_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_load_state_dict_pre_post_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_foreach_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_mixed_device_dtype_impl_fused_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_non_empty_state_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optim_infos_do_not_specify_global_cliquey_kwargs_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_optimizer_can_be_printed_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_group_with_lrscheduler_goes_right_direction_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_lr_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_param_groups_weight_decay_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_peak_memory_foreach_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_rosenbrock_sparse_with_lrsched_False_Adagrad_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_rosenbrock_sparse_with_lrsched_False_SGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_rosenbrock_sparse_with_lrsched_False_SparseAdam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_rosenbrock_sparse_with_lrsched_True_Adagrad_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_rosenbrock_sparse_with_lrsched_True_SGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_rosenbrock_sparse_with_lrsched_True_SparseAdam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_False_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_save_load_equality_with_weights_only_is_named_optim_True_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_second_order_optims_return_consistent_types_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_ASGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_Adadelta_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_Adafactor_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_Adagrad_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_AdamW_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_Adam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_Adamax_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_NAdam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_RAdam_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_RMSprop_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_Rprop_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_set_default_dtype_works_with_foreach_SGD_cuda_float64, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_False_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_False_is_named_optim1_True_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_False_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_deterministic_is_named_optim0_True_is_named_optim1_True_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_post_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_pre_post_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_state_dict_with_cuda_params_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_all_hooks_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_for_zero_grads_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_is_noop_when_params_have_no_grad_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_iteration_float32_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_post_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_step_pre_hook_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_0_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_1_SparseAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_ASGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Adadelta_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Adafactor_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Adagrad_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_AdamW_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Adam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Adamax_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_LBFGS_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Muon_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_NAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_RAdam_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_RMSprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_Rprop_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_SGD_cuda_float32, test/test_optim.py::TestOptimRenewedCUDA::test_tensor_lr_num_dim_2_SparseAdam_cuda_float32 2025-12-04T15:55:09.3668050Z 2025-12-04T15:55:09.3668370Z Finished test_optim 1/1 ... [2025-12-04 15:55:09.271854][24113.393070016], took 5.51min 2025-12-04T15:55:09.3669426Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_optim/test_optim-e2224a41ca12c5b8.xml 2025-12-04T15:55:09.4157024Z Running xpu/test_conv 1/1 ... [2025-12-04 15:55:09.415486][24113.536705965] 2025-12-04T15:55:09.4157544Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:55:09.4160877Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'xpu/test_conv.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:55:09.415860] 2025-12-04T15:55:14.8523393Z 2025-12-04T15:55:14.8524347Z xpu/test_conv 1/1 was successful, full logs can be found in artifacts with path test/test-reports/xpu.test_conv_1.1_02560067eb16cf7d_.log 2025-12-04T15:55:14.8525193Z Running 0 items in this shard: 2025-12-04T15:55:14.8525414Z 2025-12-04T15:55:14.8525721Z Finished xpu/test_conv 1/1 ... [2025-12-04 15:55:14.852184][24118.973403282], took 0.09min 2025-12-04T15:55:14.8917918Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/xpu.test_conv/xpu.test_conv-725b55c77ef2b365.xml 2025-12-04T15:55:14.9175680Z Running nn/test_packed_sequence 1/1 ... [2025-12-04 15:55:14.917351][24119.038572019] 2025-12-04T15:55:14.9176248Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:55:14.9179776Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/test_packed_sequence.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:55:14.917723] 2025-12-04T15:55:22.1418431Z 2025-12-04T15:55:22.1419477Z nn/test_packed_sequence 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.test_packed_sequence_1.1_69614db4f9bab953_.log 2025-12-04T15:55:22.1425258Z Running 13 items in this shard: test/nn/test_packed_sequence.py::PackedSequenceTest::test_empty_packed_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pack_padded_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pack_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pad_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pad_sequence_with_non_iterable_sequences, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pad_sequence_with_tensor_sequences, test/nn/test_packed_sequence.py::PackedSequenceTest::test_to, test/nn/test_packed_sequence.py::PackedSequenceTest::test_to_memory_format, test/nn/test_packed_sequence.py::PackedSequenceTest::test_total_length, test/nn/test_packed_sequence.py::PackedSequenceTest::test_type_casts, test/nn/test_packed_sequence.py::PackedSequenceTest::test_unpack_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_unpad_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_wrong_order 2025-12-04T15:55:22.1430605Z 2025-12-04T15:55:22.1430951Z Finished nn/test_packed_sequence 1/1 ... [2025-12-04 15:55:22.141656][24126.262876142], took 0.12min 2025-12-04T15:55:22.1811865Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/nn.test_packed_sequence/nn.test_packed_sequence-cbb1d818037c9c38.xml 2025-12-04T15:55:22.2708299Z Running test_numa_binding 1/1 ... [2025-12-04 15:55:22.270559][24126.391778899] 2025-12-04T15:55:22.2708860Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:55:22.2711842Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_numa_binding.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:55:22.270928] 2025-12-04T15:55:33.5003774Z 2025-12-04T15:55:33.5005085Z test_numa_binding 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_numa_binding_1.1_69471f0d5f25dce0_.log 2025-12-04T15:55:33.5014537Z Running 21 items in this shard: test/test_numa_binding.py::NumaBindingTest::test_bind_all_threads_in_current_process_to_logical_cpus, test/test_numa_binding.py::NumaBindingTest::test_binds_to_node_0_if_node_stored_as_minus_one, test/test_numa_binding.py::NumaBindingTest::test_callable_entrypoint_basic, test/test_numa_binding.py::NumaBindingTest::test_core_complex_numa_binding_with_extra_l3, test/test_numa_binding.py::NumaBindingTest::test_core_complex_numa_binding_with_fewer_l3_than_gpu, test/test_numa_binding.py::NumaBindingTest::test_core_complex_prefers_caches_with_more_cpus, test/test_numa_binding.py::NumaBindingTest::test_core_complex_tiebreak_prefers_lower_cache_key, test/test_numa_binding.py::NumaBindingTest::test_default_numa_binding, test/test_numa_binding.py::NumaBindingTest::test_exclusive_numa_binding, test/test_numa_binding.py::NumaBindingTest::test_exclusive_raises_if_too_few_physical_cores, test/test_numa_binding.py::NumaBindingTest::test_explicit_numa_options_overrides_default, test/test_numa_binding.py::NumaBindingTest::test_fallback, test/test_numa_binding.py::NumaBindingTest::test_fallback_if_numactl_not_available, test/test_numa_binding.py::NumaBindingTest::test_get_range_str_from_ints, test/test_numa_binding.py::NumaBindingTest::test_get_set_of_int_from_ranges_str, test/test_numa_binding.py::NumaBindingTest::test_no_numa_binding_if_numa_options_not_provided, test/test_numa_binding.py::NumaBindingTest::test_node_numa_binding, test/test_numa_binding.py::NumaBindingTest::test_nproc_must_equal_cuda_device_count_to_use_default_numa_options, test/test_numa_binding.py::NumaBindingTest::test_raises_if_binding_to_empty_set, test/test_numa_binding.py::NumaBindingTest::test_socket_numa_binding_with_multiple_numa_per_socket, test/test_numa_binding.py::NumaBindingTest::test_socket_numa_binding_with_single_numa_per_socket 2025-12-04T15:55:33.5023699Z 2025-12-04T15:55:33.5024032Z Finished test_numa_binding 1/1 ... [2025-12-04 15:55:33.500362][24137.621581753], took 0.19min 2025-12-04T15:55:33.5399398Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_numa_binding/test_numa_binding-1b1d6199d4053b8e.xml 2025-12-04T15:55:33.6105822Z Running test_pruning_op 1/1 ... [2025-12-04 15:55:33.610364][24137.731585349] 2025-12-04T15:55:33.6106362Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:55:33.6109439Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_pruning_op.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:55:33.610713] 2025-12-04T15:55:39.0324550Z 2025-12-04T15:55:39.0325561Z test_pruning_op 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_pruning_op_1.1_67dd68b7168ba8bb_.log 2025-12-04T15:55:39.0327110Z Running 2 items in this shard: test/test_pruning_op.py::PruningOpTest::test_rowwise_prune_op_32bit_indices, test/test_pruning_op.py::PruningOpTest::test_rowwise_prune_op_64bit_indices 2025-12-04T15:55:39.0328020Z 2025-12-04T15:55:39.0328333Z Finished test_pruning_op 1/1 ... [2025-12-04 15:55:39.032274][24143.153494936], took 0.09min 2025-12-04T15:55:39.0719878Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_pruning_op/test_pruning_op-4970668223f4717e.xml 2025-12-04T15:55:39.1024593Z Running test_jit_fuser_te 1/1 ... [2025-12-04 15:55:39.102260][24143.223480092] 2025-12-04T15:55:39.1025122Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T15:55:39.1028677Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_jit_fuser_te.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 15:55:39.102613] 2025-12-04T16:01:57.7230791Z 2025-12-04T16:01:57.7231954Z test_jit_fuser_te 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_jit_fuser_te_1.1_986011f95e3b5bd3_.log 2025-12-04T16:01:58.0298661Z Running 6823 items in this shard: test/test_jit_fuser_te.py::TestFuserCommon::test_autodiff_fallback, test/test_jit_fuser_te.py::TestTEFuserStatic::test_abs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_adaptive_avg_pool2d, test/test_jit_fuser_te.py::TestTEFuserStatic::test_add_bool, test/test_jit_fuser_te.py::TestTEFuserStatic::test_addcmul, test/test_jit_fuser_te.py::TestTEFuserStatic::test_arg_configurations_smoke, test/test_jit_fuser_te.py::TestTEFuserStatic::test_autocast_down, test/test_jit_fuser_te.py::TestTEFuserStatic::test_autocast_up, test/test_jit_fuser_te.py::TestTEFuserStatic::test_batch_norm, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_div_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_pow, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_bitwise_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_broadcast, test/test_jit_fuser_te.py::TestTEFuserStatic::test_cat_2k_args, test/test_jit_fuser_te.py::TestTEFuserStatic::test_cat_graph_opt, test/test_jit_fuser_te.py::TestTEFuserStatic::test_channels_last_dims_dynamic, test/test_jit_fuser_te.py::TestTEFuserStatic::test_checks_cat_inputs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_correctness, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_distributes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_motion_deduplicates_inputs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_mul_one, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_multiple, test/test_jit_fuser_te.py::TestTEFuserStatic::test_clamp, test/test_jit_fuser_te.py::TestTEFuserStatic::test_clamp_double, test/test_jit_fuser_te.py::TestTEFuserStatic::test_clamp_int, test/test_jit_fuser_te.py::TestTEFuserStatic::test_comparison_eq_ne, test/test_jit_fuser_te.py::TestTEFuserStatic::test_comparison_ge_le, test/test_jit_fuser_te.py::TestTEFuserStatic::test_comparison_gt_lt, test/test_jit_fuser_te.py::TestTEFuserStatic::test_concat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_concat_invariant, test/test_jit_fuser_te.py::TestTEFuserStatic::test_constant_chunk_shapes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_conv2d, test/test_jit_fuser_te.py::TestTEFuserStatic::test_conv2d_depthwise, test/test_jit_fuser_te.py::TestTEFuserStatic::test_cuda_half, test/test_jit_fuser_te.py::TestTEFuserStatic::test_dims, test/test_jit_fuser_te.py::TestTEFuserStatic::test_disabled, test/test_jit_fuser_te.py::TestTEFuserStatic::test_div_bool, test/test_jit_fuser_te.py::TestTEFuserStatic::test_dynamic_cat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_dynamic_shapes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_eq_unsqueeze_type_as, test/test_jit_fuser_te.py::TestTEFuserStatic::test_erf, test/test_jit_fuser_te.py::TestTEFuserStatic::test_exhaust_specializations, test/test_jit_fuser_te.py::TestTEFuserStatic::test_exp, test/test_jit_fuser_te.py::TestTEFuserStatic::test_fusion_reuse_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_gelu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_hardsigmoid_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserStatic::test_hardswish_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserStatic::test_inlined_optimized_graph, test/test_jit_fuser_te.py::TestTEFuserStatic::test_isnan, test/test_jit_fuser_te.py::TestTEFuserStatic::test_kernel_cache_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lerp, test/test_jit_fuser_te.py::TestTEFuserStatic::test_list_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm_concat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm_gates_permutations, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm_traced, test/test_jit_fuser_te.py::TestTEFuserStatic::test_masked_fill, test/test_jit_fuser_te.py::TestTEFuserStatic::test_matmul, test/test_jit_fuser_te.py::TestTEFuserStatic::test_milstm, test/test_jit_fuser_te.py::TestTEFuserStatic::test_minmax, test/test_jit_fuser_te.py::TestTEFuserStatic::test_minmax_int_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_mul_bool, test/test_jit_fuser_te.py::TestTEFuserStatic::test_neg_pow, test/test_jit_fuser_te.py::TestTEFuserStatic::test_nonzero_device_cuda, test/test_jit_fuser_te.py::TestTEFuserStatic::test_nop, test/test_jit_fuser_te.py::TestTEFuserStatic::test_pow_multiple_dtype, test/test_jit_fuser_te.py::TestTEFuserStatic::test_profiler, test/test_jit_fuser_te.py::TestTEFuserStatic::test_rand_broadcast_cuda, test/test_jit_fuser_te.py::TestTEFuserStatic::test_rand_cuda, test/test_jit_fuser_te.py::TestTEFuserStatic::test_rand_diamond, test/test_jit_fuser_te.py::TestTEFuserStatic::test_relu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_relu_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserStatic::test_remove_output_used_only_in_size, test/test_jit_fuser_te.py::TestTEFuserStatic::test_scalar, test/test_jit_fuser_te.py::TestTEFuserStatic::test_scalar_arg, test/test_jit_fuser_te.py::TestTEFuserStatic::test_scalar_only_inputs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_skip_grad_in_check, test/test_jit_fuser_te.py::TestTEFuserStatic::test_small_constant, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sub_gt_and, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sum_dim, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sum_keepdim_cast, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sum_simple, test/test_jit_fuser_te.py::TestTEFuserStatic::test_superslomo, test/test_jit_fuser_te.py::TestTEFuserStatic::test_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_ternary_norm_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_ternary_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_threshold, test/test_jit_fuser_te.py::TestTEFuserStatic::test_to_device, test/test_jit_fuser_te.py::TestTEFuserStatic::test_to_dtype, test/test_jit_fuser_te.py::TestTEFuserStatic::test_torch_to, test/test_jit_fuser_te.py::TestTEFuserStatic::test_type_as_cat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_typecheck, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unary_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unrolled_cat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unsqueeze_size_calculation, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unsqueeze_var_dim, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unsupported_dtypes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_where_and_typing, test/test_jit_fuser_te.py::TestTEFuserStatic::test_where_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_with_strict_fusion, test/test_jit_fuser_te.py::TestTEFuserStatic::test_zero_element_tensors, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_abs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_adaptive_avg_pool2d, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_add_bool, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_addcmul, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_arg_configurations_smoke, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_autocast_down, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_autocast_up, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_batch_norm, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_div_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_pow, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_bitwise_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_broadcast, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_cat_2k_args, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_cat_graph_opt, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_channels_last_dims_dynamic, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_checks_cat_inputs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_correctness, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_distributes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_motion_deduplicates_inputs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_mul_one, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_multiple, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_clamp, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_clamp_double, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_clamp_int, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_comparison_eq_ne, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_comparison_ge_le, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_comparison_gt_lt, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_concat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_concat_invariant, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_constant_chunk_shapes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_conv2d, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_conv2d_depthwise, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_cuda_half, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_dims, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_disabled, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_div_bool, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_dynamic_cat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_dynamic_shapes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_eq_unsqueeze_type_as, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_erf, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_exhaust_specializations, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_exp, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_fusion_reuse_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_gelu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_hardsigmoid_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_hardswish_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_inlined_optimized_graph, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_isnan, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_kernel_cache_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lerp, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_list_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm_concat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm_gates_permutations, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm_traced, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_masked_fill, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_matmul, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_milstm, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_minmax, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_minmax_int_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_mul_bool, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_neg_pow, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_nonzero_device_cuda, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_nop, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_pow_multiple_dtype, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_profiler, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_rand_broadcast_cuda, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_rand_cuda, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_rand_diamond, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_relu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_relu_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_remove_output_used_only_in_size, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_scalar, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_scalar_arg, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_scalar_only_inputs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_skip_grad_in_check, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_small_constant, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sub_gt_and, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sum_dim, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sum_keepdim_cast, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sum_simple, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_superslomo, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_ternary_norm_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_ternary_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_threshold, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_to_device, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_to_dtype, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_torch_to, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_type_as_cat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_typecheck, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unary_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unrolled_cat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unsqueeze_size_calculation, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unsqueeze_var_dim, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unsupported_dtypes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_where_and_typing, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_where_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_with_strict_fusion, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_zero_element_tensors, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_failures___rmatmul___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_failures_frac_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_failures_matmul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_shapes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdist_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_complex_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_complex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e4m3fn, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e4m3fnuz, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e5m2, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e5m2fnuz, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_uint16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_uint32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igamma_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igammac_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igammac_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_imag_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_imag_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_imag_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_istft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_istft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanquantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanquantile_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_ctc_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_ctc_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_one_hot_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pdist_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_complex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_complex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polar_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polar_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_quantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_quantile_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_bartlett_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_bartlett_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_blackman_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_blackman_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_cosine_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_exponential_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_gaussian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_gaussian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_cosine_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_hamming_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hamming_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hann_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hann_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_kaiser_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_kaiser_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_nuttall_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_nuttall_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch__scaled_mm_cuda_float8_e4m3fn, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_indices_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_indices_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_indices_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_indices_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_complex_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_complex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_real_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_real_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_H_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_T_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported___getitem___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported___rpow___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported___rsub___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__batch_norm_with_update_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__chunk_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__native_batch_norm_legit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__segment_reduce_lengths_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__segment_reduce_offsets_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__softmax_backward_data_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__unsafe_masked_index_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__unsafe_masked_index_put_accumulate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__upsample_bilinear2d_aa_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_acosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addcdiv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addmv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_alias_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_all_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_allclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_aminmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_angle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_any_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_arange_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argsort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argwhere_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_partial_views_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_asinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atleast_1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atleast_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atleast_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_baddbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bernoulli_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bfloat16_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_block_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_broadcast_shapes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_broadcast_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_broadcast_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bucketize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cartesian_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cauchy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cdouble_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cfloat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_chalf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cholesky_inverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cholesky_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_clamp_max_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_clamp_min_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_clone_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_column_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_combinations_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_conj_physical_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_constant_pad_nd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_copysign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_corrcoef_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_count_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cov_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cummax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cummin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cumulative_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_deg2rad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diag_embed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagflat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagonal_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagonal_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diff_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_digamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_einsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_permuted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_equal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_erfinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_exp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_expand_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_eye_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_hfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_hfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_hfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ihfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ihfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ihfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_irfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_irfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_irfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_rfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_rfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_rfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_flatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_flip_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fliplr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_flipud_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_float_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_floor_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_frexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_full_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_gather_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_geometric_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_geqrf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_gradient_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_grid_sampler_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_grid_sampler_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hash_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_heaviside_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_histc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hypot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_igamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_igammac_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_inner_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isfinite_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isneginf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isposinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isreal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_item_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_2inputs_2outputs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_4inputs_with_extra_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_binary_return_by_ref_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_unary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_kron_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_kthvalue_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ldexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cholesky_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cond_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_det_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eig_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eigh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eigvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eigvalsh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_householder_product_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_inv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_inv_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_ldl_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_ldl_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_ldl_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lstsq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lstsq_grad_oriented_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_rank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_rank_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_multi_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_norm_subgradients_at_zero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_pinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_pinv_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_pinv_singular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_slogdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_solve_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_solve_triangular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_svdvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_tensorinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_tensorsolve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_vander_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_vecdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_vector_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_log_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_log_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logaddexp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logcumsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_and_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_not_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_or_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_xor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_lu_unpack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mH_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mT_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_matrix_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_max_pool2d_with_indices_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_max_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_max_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_maximum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_meshgrid_list_of_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_meshgrid_variadic_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_min_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_min_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_minimum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_movedim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_msort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_multinomial_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nan_to_num_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nanmean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nanmedian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nanquantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nansum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_narrow_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_narrow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_native_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_native_dropout_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_native_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nextafter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_alpha_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_binary_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_celu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_channel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv_transpose1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv_transpose2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv_transpose3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_cosine_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_cosine_similarity_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_ctc_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_dropout2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_dropout3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_elu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_embedding_bag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_embedding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_fractional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_fractional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_gaussian_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_gelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_glu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_grid_sample_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_group_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_hinge_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_huber_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_instance_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_area_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_bicubic_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_trilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_kl_div_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_local_response_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_logsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_margin_ranking_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool1d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool2d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool3d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_mish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_mse_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multi_head_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multi_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multilabel_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_circular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_constant_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_reflect_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_replicate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_replicate_negative_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pairwise_distance_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pixel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pixel_unshuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_poisson_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_prelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_rms_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_rrelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_selu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_silu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_smooth_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_softmin_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_softshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_triplet_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_upsample_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_upsample_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nonzero_static_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_fro_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_inf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_nuc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_normal_in_place_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_normal_number_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ones_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ormqr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_outer_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_pca_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_permute_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_pinverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polar_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_4_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_positive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_quantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_rad2deg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_rand_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randint_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randint_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randn_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ravel_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_real_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_renorm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_repeat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_repeat_interleave_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resize__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resize_as__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resolve_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resolve_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_roll_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_rot90_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_round_decimals_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_round_decimals_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_round_decimals_neg_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scalar_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_searchsorted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_select_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sgn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_bartlett_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_blackman_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_gaussian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_general_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_general_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_hann_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_kaiser_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_nuttall_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signbit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sinc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_slice_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_slice_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sparse_mm_reduce_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sparse_sampled_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_airy_ai_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_j1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_y0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_y1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_entr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_erfcx_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_hermite_polynomial_h_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_hermite_polynomial_he_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_i0e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_i1e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_laguerre_polynomial_l_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_legendre_polynomial_p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_log_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_ndtri_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_scaled_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_scaled_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_spherical_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_xlog1py_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_zeta_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_list_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_with_sizes_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_with_sizes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_square_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_squeeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_squeeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_squeeze_multiple_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_stft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sum_to_size_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_svd_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_t_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_take_along_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_take_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tensor_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tensordot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_to_sparse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_topk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_trace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_transpose_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_trapz_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_triangular_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tril_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_triu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unbind_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unbind_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unflatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unfold_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_uniform_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unique_consecutive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unique_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unsafe_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unsafe_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unsqueeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_vdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_view_as_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_view_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_vsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_vstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_xlogy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_zero__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_zeros_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___radd___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___rdiv___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___rmod___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___rmul___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_abs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_acos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_addcmul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_addmm_decomposed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_asin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_atan2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_atan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_bool_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_byte_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_ceil_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_char_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_clamp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_contiguous_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_cos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_cosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_div_floor_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_div_no_rounding_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_div_trunc_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_double_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_eq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_erf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_erfc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_expand_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_expand_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_expm1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_float_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_floor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_fmod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_ge_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_gt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_half_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_int_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_isnan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_le_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_lerp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_lgamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log10_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log1p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_long_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_lt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_masked_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_max_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_min_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_mm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_mul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_ne_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardswish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardtanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_leaky_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_relu6_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_softplus_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_softsign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_tanhshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_threshold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_permute_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_pow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_reciprocal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_remainder_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_reshape_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_reshape_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_round_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_rsqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_rsub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_short_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_tan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_tanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_transpose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_true_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_trunc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_unsqueeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_view_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_view_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_where_cuda_float32 2025-12-04T16:01:58.3270918Z 2025-12-04T16:01:58.3271274Z Finished test_jit_fuser_te 1/1 ... [2025-12-04 16:01:57.733691][24521.854905385], took 6.31min 2025-12-04T16:01:58.3272460Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_jit_fuser_te/test_jit_fuser_te-c194bc30f4bbb26c.xml 2025-12-04T16:01:58.3273560Z Running test_sympy_utils 1/1 ... [2025-12-04 16:01:58.049268][24522.170486108] 2025-12-04T16:01:58.3274086Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T16:01:58.3275286Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_sympy_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 16:01:58.049684] 2025-12-04T16:02:36.4651010Z 2025-12-04T16:02:36.4652093Z test_sympy_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_sympy_utils_1.1_8a60d70855d2b894_.log 2025-12-04T16:02:36.4726782Z Running 217 items in this shard: test/test_sympy_utils.py::TestNumbers::test_float_cast, test/test_sympy_utils.py::TestNumbers::test_int_infinity, test/test_sympy_utils.py::TestNumbers::test_lt_self, test/test_sympy_utils.py::TestNumbers::test_mixed_oo_int_oo, test/test_sympy_utils.py::TestNumbers::test_relation, test/test_sympy_utils.py::TestValueRanges::test_binary_bool_ref_range_fn_and_, test/test_sympy_utils.py::TestValueRanges::test_binary_bool_ref_range_fn_bitwise_and, test/test_sympy_utils.py::TestValueRanges::test_binary_bool_ref_range_fn_bitwise_or, test/test_sympy_utils.py::TestValueRanges::test_binary_bool_ref_range_fn_bitwise_xor, test/test_sympy_utils.py::TestValueRanges::test_binary_bool_ref_range_fn_or_, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_add_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_add_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_bitwise_and_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_bitwise_and_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_bitwise_or_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_bitwise_or_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_bitwise_xor_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_bitwise_xor_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_floordiv_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_floordiv_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_maximum_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_maximum_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_minimum_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_minimum_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_mod_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_mod_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_mul_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_mul_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_pow_by_natural_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_pow_by_natural_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_pow_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_pow_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_sub_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_sub_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_truediv_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_fn_truediv_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_add, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_bitwise_and, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_bitwise_or, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_bitwise_xor, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_eq, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_floordiv, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_ge, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_gt, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_le, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_lt, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_maximum, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_minimum, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_mod, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_mul, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_ne, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_pow, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_pow_by_natural, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_sub, test/test_sympy_utils.py::TestValueRanges::test_binary_ref_range_fn_truediv, test/test_sympy_utils.py::TestValueRanges::test_bitwise_ref_range_fn_bitwise_and, test/test_sympy_utils.py::TestValueRanges::test_bitwise_ref_range_fn_bitwise_or, test/test_sympy_utils.py::TestValueRanges::test_bitwise_ref_range_fn_bitwise_xor, test/test_sympy_utils.py::TestValueRanges::test_mul_zero_unknown, test/test_sympy_utils.py::TestValueRanges::test_pow_half, test/test_sympy_utils.py::TestValueRanges::test_unary_bool_ref_range_fn_not_, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_abs_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_abs_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_ceil_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_ceil_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_exp_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_exp_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_floor_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_floor_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_log_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_log_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_neg_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_neg_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_reciprocal_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_reciprocal_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_sqrt_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_sqrt_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_square_dtype_float, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_fn_square_dtype_int, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_abs, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_ceil, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_exp, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_floor, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_log, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_neg, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_reciprocal, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_sqrt, test/test_sympy_utils.py::TestValueRanges::test_unary_ref_range_fn_square, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_abs, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_add, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_and_, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_bitwise_and, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_bitwise_or, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_bitwise_xor, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_ceil, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_eq, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_exp, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_floor, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_floordiv, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_ge, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_gt, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_le, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_log, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_lt, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_maximum, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_minimum, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_mod, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_mul, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_ne, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_neg, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_not_, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_or_, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_pow, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_pow_by_natural, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_reciprocal, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_sqrt, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_square, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_sub, test/test_sympy_utils.py::TestSympyInterp::test_interp_fn_truediv, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_abs, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_add, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_and_, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_bitwise_and, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_bitwise_or, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_bitwise_xor, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_ceil, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_eq, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_exp, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_floor, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_floordiv, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_ge, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_gt, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_le, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_log, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_lt, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_maximum, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_minimum, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_mod, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_mul, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_ne, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_neg, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_not_, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_or_, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_pow, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_pow_by_natural, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_reciprocal, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_sqrt, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_square, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_sub, test/test_sympy_utils.py::TestSympyInterp::test_python_interp_fx_fn_truediv, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_abs, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_add, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_and_, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_bitwise_and, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_bitwise_or, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_bitwise_xor, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_ceil, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_eq, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_exp, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_floor, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_floordiv, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_ge, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_gt, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_le, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_log, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_lt, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_maximum, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_minimum, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_mod, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_mul, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_ne, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_neg, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_not_, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_or_, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_pow, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_pow_by_natural, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_reciprocal, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_sqrt, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_square, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_sub, test/test_sympy_utils.py::TestSympyInterp::test_tensor_interp_fn_truediv, test/test_sympy_utils.py::TestSympySolve::test_addition, test/test_sympy_utils.py::TestSympySolve::test_floordiv_Equality, test/test_sympy_utils.py::TestSympySolve::test_floordiv_GreaterThan, test/test_sympy_utils.py::TestSympySolve::test_floordiv_LessThan, test/test_sympy_utils.py::TestSympySolve::test_floordiv_StrictGreaterThan, test/test_sympy_utils.py::TestSympySolve::test_floordiv_StrictLessThan, test/test_sympy_utils.py::TestSympySolve::test_floordiv_Unequality, test/test_sympy_utils.py::TestSympySolve::test_floordiv_eq_simplify, test/test_sympy_utils.py::TestSympySolve::test_give_up, test/test_sympy_utils.py::TestSympySolve::test_multiplication_division_Equality, test/test_sympy_utils.py::TestSympySolve::test_multiplication_division_Unequality, test/test_sympy_utils.py::TestSympySolve::test_multiplication_division_inequality_GreaterThan, test/test_sympy_utils.py::TestSympySolve::test_multiplication_division_inequality_LessThan, test/test_sympy_utils.py::TestSympySolve::test_multiplication_division_inequality_StrictGreaterThan, test/test_sympy_utils.py::TestSympySolve::test_multiplication_division_inequality_StrictLessThan, test/test_sympy_utils.py::TestSympySolve::test_noop_Equality, test/test_sympy_utils.py::TestSympySolve::test_noop_GreaterThan, test/test_sympy_utils.py::TestSympySolve::test_noop_LessThan, test/test_sympy_utils.py::TestSympySolve::test_noop_StrictGreaterThan, test/test_sympy_utils.py::TestSympySolve::test_noop_StrictLessThan, test/test_sympy_utils.py::TestSympySolve::test_noop_Unequality, test/test_sympy_utils.py::TestSympySolve::test_noop_rhs_Equality, test/test_sympy_utils.py::TestSympySolve::test_noop_rhs_GreaterThan, test/test_sympy_utils.py::TestSympySolve::test_noop_rhs_LessThan, test/test_sympy_utils.py::TestSympySolve::test_noop_rhs_StrictGreaterThan, test/test_sympy_utils.py::TestSympySolve::test_noop_rhs_StrictLessThan, test/test_sympy_utils.py::TestSympySolve::test_noop_rhs_Unequality, test/test_sympy_utils.py::TestSympySolve::test_simple_floordiv_gcd, test/test_sympy_utils.py::TestSympySolve::test_z3_proof_floordiv_eq_simplify, test/test_sympy_utils.py::TestSympyFunctions::test_pickle, test/test_sympy_utils.py::TestSingletonInt::test_basic, test/test_sympy_utils.py::TestIdentity::test_cast_identity_float, test/test_sympy_utils.py::TestIdentity::test_cast_identity_illegal, test/test_sympy_utils.py::TestIdentity::test_cast_identity_int, test/test_sympy_utils.py::TestIdentity::test_expand_identity, test/test_sympy_utils.py::TestTypedExpr::test_typed_expr 2025-12-04T16:02:36.4800485Z 2025-12-04T16:02:36.4800799Z Finished test_sympy_utils 1/1 ... [2025-12-04 16:02:36.465192][24560.586413093], took 0.64min 2025-12-04T16:02:36.5057092Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_sympy_utils/test_sympy_utils-922738ab31cc19f0.xml 2025-12-04T16:02:36.6041540Z Running test_mkldnn_fusion 1/1 ... [2025-12-04 16:02:36.603897][24560.725116231] 2025-12-04T16:02:36.6042099Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T16:02:36.6045002Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_mkldnn_fusion.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 16:02:36.604276] 2025-12-04T16:02:54.6525962Z 2025-12-04T16:02:54.6526946Z test_mkldnn_fusion 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_mkldnn_fusion_1.1_e7401bbd89a7874f_.log 2025-12-04T16:02:54.6530591Z Running 8 items in this shard: test/test_mkldnn_fusion.py::TestMkldnnFusion::test_conv_binary_fusion_ops, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_conv_transpose_unary_fusion_ops, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_conv_unary_fusion_nnc, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_conv_unary_fusion_ops, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_linear_binary_fusion_ops, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_linear_unary_fusion_ops, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_single_conv, test/test_mkldnn_fusion.py::TestMkldnnFusion::test_unsupported_conv 2025-12-04T16:02:54.6533620Z 2025-12-04T16:02:54.6533966Z Finished test_mkldnn_fusion 1/1 ... [2025-12-04 16:02:54.652398][24578.773616584], took 0.30min 2025-12-04T16:02:54.6926047Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_mkldnn_fusion/test_mkldnn_fusion-51238045ef241025.xml 2025-12-04T16:02:54.7715167Z Running test_schema_check 1/1 ... [2025-12-04 16:02:54.771250][24578.892470665] 2025-12-04T16:02:54.7715717Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T16:02:54.7719190Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_schema_check.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 16:02:54.771639] 2025-12-04T16:14:34.5977900Z 2025-12-04T16:14:34.5978940Z test_schema_check 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_schema_check_1.1_a822f87ec4dc8cdd_.log 2025-12-04T16:14:34.9355152Z Running 5994 items in this shard: test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_custom_ops_output_is_input, test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_custom_ops_secretly_aliasing, test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_custom_ops_secretly_mutating, test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_multiple_operators, test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_multiple_operators_centered, test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_outputs_unexpectedly_aliasing, test/test_schema_check.py::TestSchemaCheck::test_alias_check_fail_simple, test/test_schema_check.py::TestSchemaCheck::test_is_alias_of_basic, test/test_schema_check.py::TestSchemaCheck::test_is_alias_of_empty_container, test/test_schema_check.py::TestSchemaCheck::test_mutation_check_fail, test/test_schema_check.py::TestSchemaCheck::test_mutation_check_fail_multiple_operators, test/test_schema_check.py::TestSchemaCheck::test_overlaps_basic, test/test_schema_check.py::TestSchemaCheck::test_overlaps_empty_container, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_empty_list_input, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_aliasing_inputs, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_default_replaced, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_device_input, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_kwarg_tensor, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_list_input, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_mutable_inputs, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_nested_training_op, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_training_op, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_wildcard_after, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_with_multiple_outputs, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_functionality_with_multiple_outputs_aliasing, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_aliasing_inputs, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_aliasing_outputs, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_as_strided, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_multiple_outputs, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_mutation, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_none, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_mutated_aliasing_resize_, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_operator_order, test/test_schema_check.py::TestSchemaCheck::test_schema_check_mode_operator_order_without_grad, test/test_schema_check.py::TestSchemaCheck::test_schema_info_bind_basic, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_H_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_T_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___getitem___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___radd___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rand___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rand___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rand___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rand___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rand___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rand___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rdiv___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmatmul___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmatmul___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmatmul___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmatmul___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmatmul___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmatmul___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmod___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rmul___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___ror___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___ror___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___ror___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___ror___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___ror___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___ror___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rpow___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rsub___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rxor___cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rxor___cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rxor___cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rxor___cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rxor___cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness___rxor___cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__batch_norm_with_update_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__batch_norm_with_update_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__batch_norm_with_update_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__batch_norm_with_update_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__chunk_cat_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__native_batch_norm_legit_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__native_batch_norm_legit_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__native_batch_norm_legit_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__native_batch_norm_legit_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_lengths_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_lengths_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_lengths_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_lengths_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_offsets_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_offsets_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_offsets_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__segment_reduce_offsets_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__softmax_backward_data_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__softmax_backward_data_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__softmax_backward_data_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__softmax_backward_data_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__upsample_bilinear2d_aa_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__upsample_bilinear2d_aa_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__upsample_bilinear2d_aa_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness__upsample_bilinear2d_aa_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_abs_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acos_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_acosh_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_add_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addbmm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addbmm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addbmm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addbmm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addbmm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addbmm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcdiv_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcdiv_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcdiv_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcdiv_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcdiv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcdiv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addcmul_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_decomposed_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_decomposed_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_decomposed_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_decomposed_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_decomposed_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmm_decomposed_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmv_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmv_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmv_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmv_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addmv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_addr_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_alias_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_all_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_allclose_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_allclose_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_allclose_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_allclose_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_allclose_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_allclose_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_amin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_aminmax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_angle_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_any_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_arange_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argmin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argsort_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_argwhere_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_partial_views_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_as_strided_scatter_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_asinh_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atan_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atanh_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_1d_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_2d_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_atleast_3d_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_baddbmm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_baddbmm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_baddbmm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_baddbmm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_baddbmm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_baddbmm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bernoulli_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bernoulli_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bernoulli_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bernoulli_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bfloat16_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bincount_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bincount_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bincount_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bincount_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bincount_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_and_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_and_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_and_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_and_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_and_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_and_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_left_shift_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_left_shift_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_left_shift_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_left_shift_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_left_shift_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_not_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_not_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_not_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_not_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_not_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_not_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_or_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_or_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_or_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_or_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_or_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_or_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_right_shift_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_right_shift_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_right_shift_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_right_shift_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_right_shift_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_xor_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_xor_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_xor_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_xor_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_xor_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bitwise_xor_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_block_diag_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bmm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bmm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bmm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bmm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bmm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bmm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bool_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_shapes_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_tensors_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_broadcast_to_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_bucketize_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_byte_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cartesian_prod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cat_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cauchy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cauchy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cauchy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cauchy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdist_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdist_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cdouble_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ceil_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cfloat_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chalf_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_char_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_inverse_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_inverse_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_inverse_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_inverse_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_solve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_solve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_solve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cholesky_solve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_chunk_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_max_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clamp_min_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_clone_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_column_stack_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_combinations_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_complex_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_complex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_complex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_conj_physical_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_constant_pad_nd_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_contiguous_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_copysign_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_corrcoef_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cos_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cosh_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_count_nonzero_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cov_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cross_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cummin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumprod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumsum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_cumulative_trapezoid_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_deg2rad_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diag_embed_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagflat_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diagonal_scatter_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_diff_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_digamma_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dist_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dist_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dist_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dist_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dist_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dist_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_floor_rounding_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_no_rounding_mode_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_div_trunc_rounding_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dot_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dot_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dot_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dot_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dot_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dot_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_double_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dsplit_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_dstack_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_einsum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_einsum_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_einsum_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_einsum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_einsum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_einsum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_like_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_permuted_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_empty_strided_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eq_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_equal_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erf_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfc_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_erfinv_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exp_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_as_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expand_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_expm1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exponential_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exponential_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exponential_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_exponential_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float8_e4m3fn, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float8_e4m3fnuz, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float8_e5m2, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_float8_e5m2fnuz, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_eye_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fft_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_fftshift_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfft_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_hfftn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifft_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ifftshift_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfft_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_ihfftn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfft_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_irfftn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfft_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fft_rfftn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fill_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flatten_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flip_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fliplr_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_flipud_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_float_power_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_floor_divide_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_fmod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frac_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frac_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frac_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frac_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_frexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_uint16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_uint32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_full_like_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gather_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gcd_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gcd_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gcd_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gcd_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gcd_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ge_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geometric_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geqrf_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geqrf_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geqrf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_geqrf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gradient_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_grid_sampler_3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_gt_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_half_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hash_tensor_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_heaviside_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_histc_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hsplit_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hstack_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hypot_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hypot_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hypot_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_hypot_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_i0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_igamma_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_igamma_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_igammac_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_igammac_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_imag_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_imag_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_imag_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_add_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_fill_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_put_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_amin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_mean_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_reduce_prod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_index_select_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_inner_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_inner_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_inner_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_inner_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_inner_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_inner_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_int_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isclose_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isfinite_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isinf_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isnan_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isneginf_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isposinf_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_isreal_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_istft_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_istft_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_item_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_2inputs_2outputs_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_binary_return_by_ref_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_jiterator_unary_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kron_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_kthvalue_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lcm_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lcm_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lcm_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lcm_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lcm_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ldexp_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_le_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lerp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lgamma_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_ex_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_ex_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_ex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cholesky_ex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cond_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cond_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cond_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cond_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_cross_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_det_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_det_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_det_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_det_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_diagonal_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eig_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eig_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eig_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eig_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvals_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvals_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvals_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvals_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvalsh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvalsh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvalsh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_eigvalsh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_householder_product_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_householder_product_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_householder_product_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_householder_product_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_ex_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_ex_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_ex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_inv_ex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_ex_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_ex_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_ex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_factor_ex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_solve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_solve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_solve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_ldl_solve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_grad_oriented_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_grad_oriented_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_grad_oriented_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lstsq_grad_oriented_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_ex_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_ex_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_ex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_factor_ex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_solve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_solve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_solve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_lu_solve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_norm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_norm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_power_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_power_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_power_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_power_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_hermitian_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_hermitian_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_hermitian_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_matrix_rank_hermitian_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_multi_dot_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_multi_dot_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_multi_dot_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_multi_dot_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_multi_dot_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_multi_dot_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_subgradients_at_zero_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_subgradients_at_zero_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_norm_subgradients_at_zero_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_hermitian_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_hermitian_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_hermitian_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_hermitian_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_singular_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_singular_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_singular_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_pinv_singular_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_qr_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_qr_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_qr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_qr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_slogdet_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_slogdet_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_slogdet_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_slogdet_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_ex_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_ex_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_ex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_ex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_triangular_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_triangular_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_triangular_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_solve_triangular_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svd_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svd_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svd_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svd_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svdvals_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svdvals_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svdvals_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_svdvals_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorinv_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorinv_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorinv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorinv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorsolve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorsolve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorsolve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_tensorsolve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vander_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vecdot_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vecdot_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vecdot_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vecdot_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vecdot_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vecdot_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vector_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vector_norm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vector_norm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vector_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vector_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linalg_vector_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_linspace_tensor_overload_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log10_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log1p_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_normal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_normal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_normal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_normal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_log_softmax_with_dtype_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp2_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logaddexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logcumsumexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logcumsumexp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logcumsumexp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logcumsumexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logcumsumexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logcumsumexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logdet_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logdet_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logdet_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logdet_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_and_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_not_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_or_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logical_xor_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logit_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logspace_tensor_overload_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_logsumexp_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_long_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lt_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_solve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_solve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_solve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_solve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_unpack_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_unpack_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_unpack_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_lu_unpack_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mH_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mT_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_amin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_argmin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumprod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_cumsum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_fill_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_log_softmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_log_softmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_log_softmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_log_softmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logaddexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logaddexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logaddexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logaddexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_logsumexp_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_mean_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_mean_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_median_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_median_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_median_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_median_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_normalize_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_normalize_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_normalize_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_normalize_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_normalize_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_normalize_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_prod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_scatter_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_select_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_softmin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_std_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_sum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_masked_var_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matmul_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matmul_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matmul_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matmul_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matmul_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matmul_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matrix_exp_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matrix_exp_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matrix_exp_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matrix_exp_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matrix_exp_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_matrix_exp_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_binary_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_pool2d_with_indices_backward_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_pool2d_with_indices_backward_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_pool2d_with_indices_backward_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_no_dim_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_max_reduction_with_dim_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_maximum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mean_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mean_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_median_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_list_of_tensors_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_meshgrid_variadic_tensors_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_binary_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_no_dim_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_min_reduction_with_dim_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_minimum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mode_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_movedim_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_msort_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mul_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_multinomial_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_multinomial_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_multinomial_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_multinomial_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mv_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mv_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mv_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mv_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mv_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mv_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nan_to_num_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanmedian_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanquantile_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nanquantile_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nansum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_narrow_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_batch_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_batch_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_batch_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_batch_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_dropout_backward_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_dropout_backward_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_dropout_backward_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_dropout_backward_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_layer_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_layer_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_layer_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_native_layer_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ne_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_neg_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_empty_strided_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_full_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_ones_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_new_zeros_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nextafter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nextafter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nextafter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nextafter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_alpha_dropout_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_alpha_dropout_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_alpha_dropout_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_alpha_dropout_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_avg_pool3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_without_cudnn_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_bilinear_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_bilinear_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_bilinear_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_bilinear_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_celu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_celu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_celu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_celu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_channel_shuffle_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_conv_transpose3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_similarity_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_similarity_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_similarity_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cosine_similarity_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cross_entropy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cross_entropy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cross_entropy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_cross_entropy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_ctc_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_ctc_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_dropout_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_elu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_elu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_elu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_elu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_bag_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_bag_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_bag_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_bag_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_embedding_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_fractional_max_pool3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gaussian_nll_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gaussian_nll_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gaussian_nll_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gelu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gelu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gelu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_gelu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_glu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_glu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_glu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_glu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_grid_sample_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_grid_sample_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_grid_sample_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_grid_sample_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_group_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_group_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_group_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_group_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardshrink_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardshrink_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardshrink_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardshrink_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardsigmoid_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardsigmoid_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardsigmoid_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardsigmoid_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardswish_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardswish_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardswish_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardswish_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hardtanh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hinge_embedding_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hinge_embedding_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_hinge_embedding_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_huber_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_huber_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_huber_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_huber_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_instance_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_instance_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_instance_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_instance_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_area_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_area_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_area_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_area_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bicubic_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bicubic_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bicubic_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bilinear_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bilinear_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_bilinear_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_linear_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_linear_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_linear_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_linear_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_nearest_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_trilinear_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_trilinear_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_interpolate_trilinear_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_kl_div_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_kl_div_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_kl_div_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_kl_div_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_l1_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_l1_loss_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_l1_loss_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_l1_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_l1_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_l1_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_layer_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_layer_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_layer_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_layer_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_leaky_relu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_leaky_relu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_leaky_relu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_leaky_relu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_linear_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_linear_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_linear_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_linear_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_linear_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_linear_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_local_response_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_local_response_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_local_response_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_local_response_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_logsigmoid_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_logsigmoid_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_logsigmoid_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_logsigmoid_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_margin_ranking_loss_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_pool3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_grad_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_grad_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool1d_grad_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_grad_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_grad_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool2d_grad_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_grad_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_grad_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_max_unpool3d_grad_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mish_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mish_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mish_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mish_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mse_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mse_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mse_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_mse_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_head_attention_forward_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_head_attention_forward_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_head_attention_forward_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_margin_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_margin_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multi_margin_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_margin_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_margin_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_margin_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_nll_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_nll_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_nll_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_nll_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_normalize_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_normalize_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_normalize_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_normalize_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_normalize_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_normalize_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_one_hot_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_circular_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_constant_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_reflect_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pad_replicate_negative_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pairwise_distance_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pdist_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pdist_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_shuffle_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_pixel_unshuffle_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_poisson_nll_loss_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_prelu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_prelu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_prelu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_prelu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu6_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_relu_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rms_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rms_norm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rms_norm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rms_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rms_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rms_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rrelu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rrelu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rrelu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_rrelu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_selu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_selu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_selu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_selu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_silu_complex_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_silu_complex_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_silu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_silu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_silu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_silu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_smooth_l1_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_smooth_l1_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_smooth_l1_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_soft_margin_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_soft_margin_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_soft_margin_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softmin_with_dtype_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softplus_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softplus_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softplus_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softplus_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softshrink_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softshrink_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softshrink_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softshrink_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_softsign_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_tanhshrink_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_threshold_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_loss_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_unfold_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_bilinear_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_bilinear_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_bilinear_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_nearest_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_nearest_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_nearest_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_nearest_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nn_functional_upsample_nearest_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_nonzero_static_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_fro_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_fro_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_fro_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_fro_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_fro_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_fro_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_inf_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_inf_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_inf_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_inf_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_inf_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_inf_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_nuc_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_nuc_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_nuc_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_norm_nuc_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_in_place_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_in_place_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_in_place_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_in_place_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_in_place_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_in_place_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_number_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_number_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_number_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_normal_number_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ones_like_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ormqr_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ormqr_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ormqr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ormqr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_outer_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pca_lowrank_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pca_lowrank_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pca_lowrank_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pca_lowrank_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_permute_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pinverse_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pinverse_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pinverse_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pinverse_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polar_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polar_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_2_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_3_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_polygamma_polygamma_n_4_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_positive_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_pow_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_prod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_put_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_qr_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_qr_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_qr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_qr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_quantile_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_quantile_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rad2deg_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rand_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randint_like_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_randn_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_ravel_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_real_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reciprocal_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_remainder_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_renorm_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_renorm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_renorm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_renorm_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_renorm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_renorm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_repeat_interleave_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_as_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_reshape_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize__cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resize_as__cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_conj_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_resolve_neg_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_roll_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rot90_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_0_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_0_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_3_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_3_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_3_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_3_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_neg_3_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_neg_3_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_neg_3_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_round_decimals_neg_3_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsqrt_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_rsub_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scalar_tensor_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_add_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amax_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_amin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_mean_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_prod_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_scatter_reduce_sum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_searchsorted_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_select_scatter_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sgn_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_short_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sigmoid_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sign_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_bartlett_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_bartlett_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_blackman_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_blackman_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_cosine_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_cosine_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_exponential_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_exponential_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_gaussian_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_gaussian_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_general_cosine_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_general_cosine_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_general_hamming_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_general_hamming_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_hamming_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_hamming_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_hann_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_hann_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_kaiser_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_kaiser_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_nuttall_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signal_windows_nuttall_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_signbit_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sin_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinc_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sinh_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_slice_scatter_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_softmax_with_dtype_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sort_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_mm_reduce_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_mm_reduce_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_mm_reduce_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_mm_reduce_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_sampled_addmm_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_sampled_addmm_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_sampled_addmm_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sparse_sampled_addmm_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_airy_ai_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_j1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_bessel_y1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_t_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_u_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_v_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_chebyshev_polynomial_w_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_entr_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_erfcx_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_h_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_hermite_polynomial_he_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i0e_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_i1e_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_laguerre_polynomial_l_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_legendre_polynomial_p_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_log_ndtr_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_i1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_modified_bessel_k1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtr_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_ndtri_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_scaled_modified_bessel_k1_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_spherical_bessel_j0_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_xlog1py_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_special_zeta_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_list_args_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_split_with_sizes_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sqrt_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_square_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_squeeze_multiple_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stack_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_unbiased_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_unbiased_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_unbiased_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_unbiased_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_unbiased_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_mean_unbiased_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_unbiased_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_unbiased_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_unbiased_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_unbiased_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_unbiased_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_std_unbiased_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stft_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stft_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stft_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_stft_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sub_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_sum_to_size_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_lowrank_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_lowrank_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_lowrank_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_svd_lowrank_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_t_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_along_dim_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_take_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tan_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tanh_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensor_split_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensordot_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensordot_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensordot_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensordot_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensordot_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tensordot_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tile_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_to_sparse_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_topk_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch__scaled_mm_cuda_float8_e4m3fn, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trace_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_transpose_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapezoid_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trapz_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triangular_solve_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triangular_solve_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triangular_solve_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triangular_solve_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_indices_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_tril_indices_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_indices_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_triu_indices_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_true_divide_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_trunc_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unbind_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unflatten_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unfold_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_uniform_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_uniform_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_uniform_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_uniform_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_uniform_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_uniform_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_consecutive_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_uint16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_uint32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_uint64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unique_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unravel_index_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unravel_index_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unravel_index_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unravel_index_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unravel_index_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_chunk_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsafe_split_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_unsqueeze_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_unbiased_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_unbiased_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_unbiased_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_unbiased_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_unbiased_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_mean_unbiased_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_unbiased_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_unbiased_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_unbiased_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_unbiased_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_unbiased_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_var_unbiased_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vdot_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vdot_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vdot_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vdot_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vdot_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vdot_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_complex_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_complex_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_complex_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_real_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_as_real_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_copy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_view_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vsplit_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_vstack_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_where_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_xlogy_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zero__cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_cuda_uint8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_bfloat16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_bool, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_complex128, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_complex32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_complex64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_float16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_float32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_float64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_int16, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_int32, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_int64, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_int8, test/test_schema_check.py::TestSchemaCheckModeOpInfoCUDA::test_schema_correctness_zeros_like_cuda_uint8 2025-12-04T16:14:35.2666983Z 2025-12-04T16:14:35.2667338Z Finished test_schema_check 1/1 ... [2025-12-04 16:14:34.607495][25278.728710564], took 11.66min 2025-12-04T16:14:35.2668540Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_schema_check/test_schema_check-489dd02163956a84.xml 2025-12-04T16:14:36.1304778Z Uploading artifacts took 1.29 seconds 2025-12-04T16:14:36.1309143Z Running test_mkldnn 1/1 ... [2025-12-04 16:14:36.130725][25280.251943602] 2025-12-04T16:14:36.1309650Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T16:14:36.1314284Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_mkldnn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 16:14:36.131165] 2025-12-04T16:14:41.5561056Z 2025-12-04T16:14:41.5561972Z test_mkldnn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_mkldnn_1.1_15c2989d3d5eef99_.log 2025-12-04T16:14:41.5562781Z Running 0 items in this shard: 2025-12-04T16:14:41.5563017Z 2025-12-04T16:14:41.5563296Z Finished test_mkldnn 1/1 ... [2025-12-04 16:14:41.555910][25285.677130679], took 0.09min 2025-12-04T16:14:41.5964987Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_mkldnn/test_mkldnn-2c6972170cf2deca.xml 2025-12-04T16:14:41.6238681Z Running test_unary_ufuncs 1/1 ... [2025-12-04 16:14:41.623668][25285.744888365] 2025-12-04T16:14:41.6239221Z SCRIBE_GRAPHQL_ACCESS_TOKEN is set 2025-12-04T16:14:41.6242681Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_unary_ufuncs.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 16:14:41.624030] 2025-12-04T16:22:08.6389246Z 2025-12-04T16:22:08.6390259Z test_unary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_unary_ufuncs_1.1_47fb7c62f96091ec_.log 2025-12-04T16:22:09.8649003Z Running 25078 items in this shard: test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_big_number_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_complex_edge_values_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_slow_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_assert_raises_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_inplace_integer_error_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_inplace_integer_error_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_integer_promotion_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mvlgamma_integer_promotion_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_bfloat16_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_narrow_dtypes_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_narrow_dtypes_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_empty_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_large_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_static_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_static_large_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_op_invert_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_polygamma_neg_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_tan_complex_cuda_matches_numpy_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_tanh_complex_cuda_matches_numpy_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_unary_out_op_mem_overlap_cuda_float64 2025-12-04T16:22:11.1185145Z 2025-12-04T16:22:11.1185498Z Finished test_unary_ufuncs 1/1 ... [2025-12-04 16:22:08.679269][25732.800482517], took 7.45min 2025-12-04T16:22:11.1186947Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-52555684d34478cf.xml 2025-12-04T16:22:11.1188234Z Running test_autoload_enable 1/1 ... [2025-12-04 16:22:09.411378][25733.532596488] 2025-12-04T16:22:11.1188875Z Processing /var/lib/jenkins/workspace/test/cpp_extensions 2025-12-04T16:22:14.5596877Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T16:22:14.5620160Z [?25hBuilding wheels for collected packages: torch_test_cpp_extension 2025-12-04T16:23:46.7140024Z Building wheel for torch_test_cpp_extension (pyproject.toml) ... [?25l- \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-12-04T16:23:46.7519927Z [?25h Created wheel for torch_test_cpp_extension: filename=torch_test_cpp_extension-0.0.0-cp310-cp310-linux_x86_64.whl size=13197897 sha256=880cc757bbe676d4eb1346bac6f2c1869d436d21321e303f0024a263962096cb 2025-12-04T16:23:46.7521576Z Stored in directory: /tmp/pip-ephem-wheel-cache-6gbsgw8g/wheels/2b/79/8d/635cf291e138cfea331292ca746c62b61fade208eb55a7e3a1 2025-12-04T16:23:46.7541769Z Successfully built torch_test_cpp_extension 2025-12-04T16:23:47.3684793Z Installing collected packages: torch_test_cpp_extension 2025-12-04T16:23:47.6123281Z Successfully installed torch_test_cpp_extension-0.0.0 2025-12-04T16:23:51.9638843Z 2025-12-04T16:23:51.9639532Z Running tests... 2025-12-04T16:23:51.9639970Z ---------------------------------------------------------------------- 2025-12-04T16:23:53.8235063Z . 2025-12-04T16:23:53.8235715Z ---------------------------------------------------------------------- 2025-12-04T16:23:53.8236403Z Ran 1 test in 1.860s 2025-12-04T16:23:53.8236708Z 2025-12-04T16:23:53.8236849Z OK 2025-12-04T16:23:53.8237066Z 2025-12-04T16:23:53.8237264Z Generating XML reports... 2025-12-04T16:23:54.6744944Z Finished test_autoload_enable 1/1 ... [2025-12-04 16:23:54.674109][25838.795320397], took 1.75min 2025-12-04T16:23:54.7153940Z Parsing testcases for test report: /var/lib/jenkins/workspace/test/test-reports/python-unittest/test_autoload/TEST-TestDeviceBackendAutoload-20251204162351.xml 2025-12-04T16:24:01.6244964Z Running test batch 'tests to run' cost 23948.9 seconds 2025-12-04T16:24:01.6261443Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.6265058Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4bff712d12d11f0be0b0242ac110002 2025-12-04T16:24:01.7277172Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4bff712d12d11f0be0b0242ac110002 2025-12-04T16:24:01.7292014Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.7294347Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4cfac98d12d11f0be0b0242ac110002 2025-12-04T16:24:01.7611126Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4cfac98d12d11f0be0b0242ac110002 2025-12-04T16:24:01.7626888Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.7629085Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4d4c7c8d12d11f0be0b0242ac110002 2025-12-04T16:24:01.8002061Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4d4c7c8d12d11f0be0b0242ac110002 2025-12-04T16:24:01.8016984Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.8019712Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4dabcf0d12d11f0be0b0242ac110002 2025-12-04T16:24:01.8449456Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4dabcf0d12d11f0be0b0242ac110002 2025-12-04T16:24:01.8465601Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.8467934Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4e1949ed12d11f0be0b0242ac110002 2025-12-04T16:24:01.8799786Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4e1949ed12d11f0be0b0242ac110002 2025-12-04T16:24:01.8814811Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.8816932Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4e6e8a4d12d11f0be0b0242ac110002 2025-12-04T16:24:01.9920364Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4e6e8a4d12d11f0be0b0242ac110002 2025-12-04T16:24:01.9935192Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:01.9937435Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4f80148d12d11f0be0b0242ac110002 2025-12-04T16:24:02.0598519Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865441_a4f80148d12d11f0be0b0242ac110002 2025-12-04T16:24:02.0613070Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:02.0615155Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a50258e6d12d11f0be0b0242ac110002 2025-12-04T16:24:02.0953710Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a50258e6d12d11f0be0b0242ac110002 2025-12-04T16:24:02.0968448Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:02.0970616Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a507c4d4d12d11f0be0b0242ac110002 2025-12-04T16:24:02.1294417Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a507c4d4d12d11f0be0b0242ac110002 2025-12-04T16:24:02.1309348Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:02.1311308Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a50cf864d12d11f0be0b0242ac110002 2025-12-04T16:24:02.1628111Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a50cf864d12d11f0be0b0242ac110002 2025-12-04T16:24:02.1643244Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:02.1645102Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a5120fcad12d11f0be0b0242ac110002 2025-12-04T16:24:02.1981269Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a5120fcad12d11f0be0b0242ac110002 2025-12-04T16:24:02.1996595Z Emitting td_test_failure_stats_v2 2025-12-04T16:24:02.1998598Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a51774f6d12d11f0be0b0242ac110002 2025-12-04T16:24:02.2362534Z Done! Finish writing document to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764865442_a51774f6d12d11f0be0b0242ac110002 2025-12-04T16:24:02.2364036Z inductor/test_aot_inductor 2/6 failed! 2025-12-04T16:24:02.2364427Z inductor/test_torchinductor 1/2 failed! 2025-12-04T16:24:02.2364961Z inductor/test_torchinductor_dynamic_shapes 3/5 failed! 2025-12-04T16:24:02.2365588Z inductor/test_torchinductor_codegen_dynamic_shapes 3/4 failed! 2025-12-04T16:24:02.2366137Z dynamo/test_activation_checkpointing 1/1 failed! 2025-12-04T16:24:02.2366656Z inductor/test_flex_attention 1/1 failed! 2025-12-04T16:24:02.2367101Z inductor/test_cuda_select_algorithm 1/5 failed! 2025-12-04T16:24:02.2367593Z inductor/test_compile_subprocess 1/3 failed! 2025-12-04T16:24:02.2368076Z inductor/test_op_dtype_prop 1/2 failed! 2025-12-04T16:24:02.2368454Z dynamo/test_compiler_bisector 1/1 failed! 2025-12-04T16:24:02.2368901Z test_linalg 1/1 failed! 2025-12-04T16:24:02.2369381Z test_sparse_csr 2/2 failed! 2025-12-04T16:24:03.0249606Z 2025-12-04T16:24:03.0250055Z real 399m17.307s 2025-12-04T16:24:03.0250430Z user 404m36.414s 2025-12-04T16:24:03.0250694Z sys 57m30.032s 2025-12-04T16:24:03.0251187Z + sccache_epilogue 2025-12-04T16:24:03.0251534Z + echo '::group::Sccache Compilation Log' 2025-12-04T16:24:03.0252373Z ##[group]Sccache Compilation Log 2025-12-04T16:24:03.0252865Z + echo '=================== sccache compilation log ===================' 2025-12-04T16:24:03.0253345Z =================== sccache compilation log =================== 2025-12-04T16:24:03.0254135Z + python /var/lib/jenkins/workspace/.ci/pytorch/print_sccache_log.py /var/lib/jenkins/sccache_error.log 2025-12-04T16:24:03.0404166Z + echo '=========== If your build fails, please take a look at the log above for possible reasons ===========' 2025-12-04T16:24:03.0405051Z =========== If your build fails, please take a look at the log above for possible reasons =========== 2025-12-04T16:24:03.0405713Z + sccache --show-stats 2025-12-04T16:24:03.0439611Z Compile requests 4661 2025-12-04T16:24:03.0440057Z Compile requests executed 364 2025-12-04T16:24:03.0440444Z Cache hits 177 2025-12-04T16:24:03.0440788Z Cache hits (C/C++) 177 2025-12-04T16:24:03.0441143Z Cache misses 185 2025-12-04T16:24:03.0441495Z Cache misses (C/C++) 185 2025-12-04T16:24:03.0441855Z Cache hits rate 48.90 % 2025-12-04T16:24:03.0442229Z Cache hits rate (C/C++) 48.90 % 2025-12-04T16:24:03.0442594Z Cache timeouts 0 2025-12-04T16:24:03.0442950Z Cache read errors 0 2025-12-04T16:24:03.0443292Z Forced recaches 0 2025-12-04T16:24:03.0443651Z Cache write errors 0 2025-12-04T16:24:03.0444005Z Cache errors 0 2025-12-04T16:24:03.0444349Z Compilations 185 2025-12-04T16:24:03.0444721Z Compilation failures 2 2025-12-04T16:24:03.0445100Z Non-cacheable compilations 0 2025-12-04T16:24:03.0445457Z Non-cacheable calls 224 2025-12-04T16:24:03.0445826Z Non-compilation calls 4073 2025-12-04T16:24:03.0446202Z Unsupported compiler calls 0 2025-12-04T16:24:03.0446568Z Average cache write 0.048 s 2025-12-04T16:24:03.0447091Z Average compiler 6.879 s 2025-12-04T16:24:03.0447467Z Average cache read hit 0.032 s 2025-12-04T16:24:03.0447858Z Failed distributed compilations 0 2025-12-04T16:24:03.0448111Z 2025-12-04T16:24:03.0448222Z Non-cacheable reasons: 2025-12-04T16:24:03.0448536Z unknown source language 152 2025-12-04T16:24:03.0448893Z -E 72 2025-12-04T16:24:03.0449132Z 2025-12-04T16:24:03.0449404Z Cache location s3, name: ossci-compiler-cache-circleci-v2, prefix: / 2025-12-04T16:24:03.0449939Z Version (client) 0.10.0 2025-12-04T16:24:03.0450302Z + sccache --stop-server 2025-12-04T16:24:03.0464546Z Stopping sccache server... 2025-12-04T16:24:03.0467739Z Compile requests 4661 2025-12-04T16:24:03.0468130Z Compile requests executed 364 2025-12-04T16:24:03.0468494Z Cache hits 177 2025-12-04T16:24:03.0468831Z Cache hits (C/C++) 177 2025-12-04T16:24:03.0469184Z Cache misses 185 2025-12-04T16:24:03.0469536Z Cache misses (C/C++) 185 2025-12-04T16:24:03.0469885Z Cache hits rate 48.90 % 2025-12-04T16:24:03.0470253Z Cache hits rate (C/C++) 48.90 % 2025-12-04T16:24:03.0470692Z Cache timeouts 0 2025-12-04T16:24:03.0471166Z Cache read errors 0 2025-12-04T16:24:03.0471598Z Forced recaches 0 2025-12-04T16:24:03.0471971Z Cache write errors 0 2025-12-04T16:24:03.0472327Z Cache errors 0 2025-12-04T16:24:03.0472812Z Compilations 185 2025-12-04T16:24:03.0473242Z Compilation failures 2 2025-12-04T16:24:03.0473619Z Non-cacheable compilations 0 2025-12-04T16:24:03.0474031Z Non-cacheable calls 224 2025-12-04T16:24:03.0474406Z Non-compilation calls 4073 2025-12-04T16:24:03.0474782Z Unsupported compiler calls 0 2025-12-04T16:24:03.0475164Z Average cache write 0.048 s 2025-12-04T16:24:03.0475524Z Average compiler 6.879 s 2025-12-04T16:24:03.0475899Z Average cache read hit 0.032 s 2025-12-04T16:24:03.0476282Z Failed distributed compilations 0 2025-12-04T16:24:03.0476532Z 2025-12-04T16:24:03.0476641Z Non-cacheable reasons: 2025-12-04T16:24:03.0476950Z unknown source language 152 2025-12-04T16:24:03.0477311Z -E 72 2025-12-04T16:24:03.0477544Z 2025-12-04T16:24:03.0477825Z Cache location s3, name: ossci-compiler-cache-circleci-v2, prefix: / 2025-12-04T16:24:03.0478348Z Version (client) 0.10.0 2025-12-04T16:24:03.0478712Z + echo ::endgroup:: 2025-12-04T16:24:03.0479257Z ##[endgroup] 2025-12-04T16:24:03.0479520Z + cleanup_workspace 2025-12-04T16:24:03.0480092Z + echo 'sudo may print the following warning message that can be ignored. The chown command will still run.' 2025-12-04T16:24:03.0481321Z sudo may print the following warning message that can be ignored. The chown command will still run. 2025-12-04T16:24:03.0482200Z + echo ' sudo: setrlimit(RLIMIT_STACK): Operation not permitted' 2025-12-04T16:24:03.0482914Z sudo: setrlimit(RLIMIT_STACK): Operation not permitted 2025-12-04T16:24:03.0483748Z + echo 'For more details refer to https://github.com/sudo-project/sudo/issues/42' 2025-12-04T16:24:03.0484466Z For more details refer to https://github.com/sudo-project/sudo/issues/42 2025-12-04T16:24:03.0485090Z + sudo chown -R 1000 /var/lib/jenkins/workspace 2025-12-04T16:24:03.8340895Z ##[error]Process completed with exit code 1. 2025-12-04T16:24:03.8422997Z Prepare all required actions 2025-12-04T16:24:03.8423464Z Getting action download info 2025-12-04T16:24:04.0166218Z ##[group]Run ./.github/actions/pytest-cache-upload 2025-12-04T16:24:04.0166619Z with: 2025-12-04T16:24:04.0166877Z cache_dir: .pytest_cache 2025-12-04T16:24:04.0167180Z shard: 2 2025-12-04T16:24:04.0167461Z sha: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T16:24:04.0167985Z test_config: legacy_nvidia_driver 2025-12-04T16:24:04.0168437Z job_identifier: periodic_linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T16:24:04.0168881Z env: 2025-12-04T16:24:04.0169116Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:04.0169425Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:04.0169795Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:04.0170432Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:04.0171020Z ##[endgroup] 2025-12-04T16:24:04.0209132Z ##[group]Run nick-fields/retry@v3.0.0 2025-12-04T16:24:04.0209543Z with: 2025-12-04T16:24:04.0209767Z shell: bash 2025-12-04T16:24:04.0210023Z timeout_minutes: 5 2025-12-04T16:24:04.0210305Z max_attempts: 5 2025-12-04T16:24:04.0210566Z retry_wait_seconds: 30 2025-12-04T16:24:04.0210958Z command: set -eu python3 -m pip install boto3==1.35.42 2025-12-04T16:24:04.0211407Z polling_interval_seconds: 1 2025-12-04T16:24:04.0211741Z warning_on_retry: true 2025-12-04T16:24:04.0212032Z continue_on_error: false 2025-12-04T16:24:04.0212326Z env: 2025-12-04T16:24:04.0212565Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:04.0212856Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:04.0213217Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:04.0213869Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:04.0214440Z ##[endgroup] 2025-12-04T16:24:04.6758395Z Defaulting to user installation because normal site-packages is not writeable 2025-12-04T16:24:06.0024978Z Collecting boto3==1.35.42 2025-12-04T16:24:06.0227353Z Downloading boto3-1.35.42-py3-none-any.whl (139 kB) 2025-12-04T16:24:07.4696490Z Collecting botocore<1.36.0,>=1.35.42 2025-12-04T16:24:07.4741708Z Downloading botocore-1.35.99-py3-none-any.whl (13.3 MB) 2025-12-04T16:24:07.6396546Z Requirement already satisfied: jmespath<2.0.0,>=0.7.1 in /usr/lib/python3.9/site-packages (from boto3==1.35.42) (0.10.0) 2025-12-04T16:24:07.6879549Z Collecting s3transfer<0.11.0,>=0.10.0 2025-12-04T16:24:07.6922875Z Downloading s3transfer-0.10.4-py3-none-any.whl (83 kB) 2025-12-04T16:24:07.7025810Z Requirement already satisfied: python-dateutil<3.0.0,>=2.1 in /usr/lib/python3.9/site-packages (from botocore<1.36.0,>=1.35.42->boto3==1.35.42) (2.8.1) 2025-12-04T16:24:07.7038214Z Requirement already satisfied: urllib3<1.27,>=1.25.4 in /usr/lib/python3.9/site-packages (from botocore<1.36.0,>=1.35.42->boto3==1.35.42) (1.25.10) 2025-12-04T16:24:07.9262352Z Requirement already satisfied: six>=1.5 in /usr/lib/python3.9/site-packages (from python-dateutil<3.0.0,>=2.1->botocore<1.36.0,>=1.35.42->boto3==1.35.42) (1.15.0) 2025-12-04T16:24:08.0302466Z Installing collected packages: botocore, s3transfer, boto3 2025-12-04T16:24:08.6640057Z Successfully installed boto3-1.35.42 botocore-1.35.99 s3transfer-0.10.4 2025-12-04T16:24:09.1089278Z Command completed after 1 attempt(s). 2025-12-04T16:24:09.1154659Z ##[group]Run python3 .github/scripts/pytest_cache.py \ 2025-12-04T16:24:09.1155344Z python3 .github/scripts/pytest_cache.py \ 2025-12-04T16:24:09.1155893Z  --upload \ 2025-12-04T16:24:09.1156348Z  --cache_dir "$GITHUB_WORKSPACE/$CACHE_DIR" \ 2025-12-04T16:24:09.1156917Z  --pr_identifier "$GITHUB_REF" \ 2025-12-04T16:24:09.1157506Z  --job_identifier "$JOB_IDENTIFIER" \ 2025-12-04T16:24:09.1157989Z  --sha "$SHA" \ 2025-12-04T16:24:09.1158434Z  --test_config "$TEST_CONFIG" \ 2025-12-04T16:24:09.1158939Z  --shard "$SHARD" \ 2025-12-04T16:24:09.1159546Z  --repo "$REPO" \ 2025-12-04T16:24:09.1160086Z  --temp_dir "$RUNNER_TEMP" \ 2025-12-04T16:24:09.1170922Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:09.1171521Z env: 2025-12-04T16:24:09.1171889Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:09.1172296Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:09.1172843Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:09.1173734Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:09.1174518Z CACHE_DIR: .pytest_cache 2025-12-04T16:24:09.1174991Z JOB_IDENTIFIER: periodic_linux-jammy-cuda12.4-py3.10-gcc11 2025-12-04T16:24:09.1175611Z SHA: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T16:24:09.1176162Z TEST_CONFIG: legacy_nvidia_driver 2025-12-04T16:24:09.1176565Z SHARD: 2 2025-12-04T16:24:09.1176938Z REPO: pytorch/pytorch 2025-12-04T16:24:09.1177376Z ##[endgroup] 2025-12-04T16:24:09.6375672Z PR identifier for `refs/heads/main` is `96e092540d6b3c4076e3d2bc6f1f9013` 2025-12-04T16:24:09.6378228Z Uploading cache with args Namespace(upload=True, download=False, cache_dir='/home/ec2-user/actions-runner/_work/pytorch/pytorch/.pytest_cache', pr_identifier='refs/heads/main', job_identifier='periodic_linux-jammy-cuda12.4-py3.10-gcc11', sha='ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32', test_config='legacy_nvidia_driver', shard='2', repo='pytorch/pytorch', temp_dir='/home/ec2-user/actions-runner/_work/_temp', bucket=None) 2025-12-04T16:24:09.6380866Z Zipping /home/ec2-user/actions-runner/_work/pytorch/pytorch/.pytest_cache 2025-12-04T16:24:09.6382536Z to /home/ec2-user/actions-runner/_work/_temp/zip-upload/pytest_cache/pytorch/pytorch/96e092540d6b3c4076e3d2bc6f1f9013/periodic_linux-jammy-cuda12_4-py3_10-gcc11/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32/legacy_nvidia_driver/2 2025-12-04T16:24:09.6385128Z Uploading /home/ec2-user/actions-runner/_work/_temp/zip-upload/pytest_cache/pytorch/pytorch/96e092540d6b3c4076e3d2bc6f1f9013/periodic_linux-jammy-cuda12_4-py3_10-gcc11/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32/legacy_nvidia_driver/2.zip 2025-12-04T16:24:09.6387944Z to s3://gha-artifacts/pytest_cache/pytorch/pytorch/96e092540d6b3c4076e3d2bc6f1f9013/periodic_linux-jammy-cuda12_4-py3_10-gcc11/ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32/legacy_nvidia_driver/2.zip 2025-12-04T16:24:09.6944300Z ##[group]Run cat test/**/*_toprint.log || true 2025-12-04T16:24:09.6944770Z cat test/**/*_toprint.log || true 2025-12-04T16:24:09.6952068Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:09.6952513Z env: 2025-12-04T16:24:09.6952762Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:09.6953059Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:09.6953423Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:09.6954079Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:09.6954664Z ##[endgroup] 2025-12-04T16:24:09.7057743Z cat: 'test/**/*_toprint.log': No such file or directory 2025-12-04T16:24:09.7088193Z ##[group]Run kill "$MONITOR_SCRIPT_PID" 2025-12-04T16:24:09.7088638Z kill "$MONITOR_SCRIPT_PID" 2025-12-04T16:24:09.7095152Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:09.7095596Z env: 2025-12-04T16:24:09.7095846Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:09.7096144Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:09.7096513Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:09.7097182Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:09.7097787Z MONITOR_SCRIPT_PID: 68667 2025-12-04T16:24:09.7098081Z ##[endgroup] 2025-12-04T16:24:09.7124262Z /home/ec2-user/actions-runner/_work/_temp/901a2231-81b7-4046-a75d-1d5bbabfeeec.sh: line 1: kill: (68667) - No such process 2025-12-04T16:24:09.7127172Z ##[error]Process completed with exit code 1. 2025-12-04T16:24:09.7276465Z Prepare all required actions 2025-12-04T16:24:09.7276969Z Getting action download info 2025-12-04T16:24:10.0511947Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-12-04T16:24:10.2903851Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T16:24:10.7302292Z ##[group]Run ./.github/actions/upload-test-artifacts 2025-12-04T16:24:10.7302698Z with: 2025-12-04T16:24:10.7303271Z file-suffix: test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T16:24:10.7303871Z s3-bucket: gha-artifacts 2025-12-04T16:24:10.7304153Z env: 2025-12-04T16:24:10.7304395Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:10.7304697Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:10.7305065Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:10.7305707Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:10.7306339Z ##[endgroup] 2025-12-04T16:24:10.7339250Z ##[group]Run # Remove any previous test jsons if they exist 2025-12-04T16:24:10.7339958Z # Remove any previous test jsons if they exist 2025-12-04T16:24:10.7340478Z rm -f test-jsons-*.zip 2025-12-04T16:24:10.7341092Z zip -r "test-jsons-${FILE_SUFFIX}.zip" test/test-reports -i '*.json' 2025-12-04T16:24:10.7348205Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:10.7348736Z env: 2025-12-04T16:24:10.7349151Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:10.7349568Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:10.7350018Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:10.7350822Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:10.7351746Z FILE_SUFFIX: test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T16:24:10.7352407Z ##[endgroup] 2025-12-04T16:24:10.7587306Z adding: test/test-reports/td_exclusions-385bdb830ed2ab45a360.json (deflated 82%) 2025-12-04T16:24:10.7593020Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-7ff7771e3c72f123.json (deflated 91%) 2025-12-04T16:24:10.7602201Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b312d84b484956a1.json (deflated 95%) 2025-12-04T16:24:10.7611042Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-9ee61a34966ac1ed.json (deflated 95%) 2025-12-04T16:24:10.7616073Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-31f94497420761eb.json (deflated 92%) 2025-12-04T16:24:10.7618257Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b252d51c09f06fd1.json (deflated 91%) 2025-12-04T16:24:10.7620473Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-446d02039a2c0737.json (deflated 91%) 2025-12-04T16:24:10.7623429Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b3aa6989e65545bb.json (deflated 92%) 2025-12-04T16:24:10.7625466Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-dc71db8b83a58b10.json (deflated 90%) 2025-12-04T16:24:10.7627438Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-d1d71be98185ea4a.json (deflated 90%) 2025-12-04T16:24:10.7629973Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-ec873e76fad9978d.json (deflated 94%) 2025-12-04T16:24:10.7659719Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-bf110f953f5c5355.json (deflated 95%) 2025-12-04T16:24:10.7661198Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-268cc96fdd12a701.json (deflated 87%) 2025-12-04T16:24:10.7663769Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-9d3344bbd0e95b72.json (deflated 87%) 2025-12-04T16:24:10.7665291Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-c49d705795b8a118.json (deflated 88%) 2025-12-04T16:24:10.7667192Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-03cc5e31909b0a52.json (deflated 89%) 2025-12-04T16:24:10.7669113Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-e6be60d41d66658c.json (deflated 89%) 2025-12-04T16:24:10.7670671Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-27e2dd5fd3a036de.json (deflated 81%) 2025-12-04T16:24:10.7693232Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-655beaec67eb1ac4.json (deflated 93%) 2025-12-04T16:24:10.7701179Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-3eb5ad48105c1cd2.json (deflated 97%) 2025-12-04T16:24:10.7708649Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-50486f79f4e32e59.json (deflated 97%) 2025-12-04T16:24:10.7717084Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-305489af275a491d.json (deflated 94%) 2025-12-04T16:24:10.7741841Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-3d2f5f1cdf9aac8a.json (deflated 95%) 2025-12-04T16:24:10.7743800Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-53adf85a57cb1562.json (deflated 86%) 2025-12-04T16:24:10.7745773Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-ae6d18145ba692e1.json (deflated 86%) 2025-12-04T16:24:10.7752822Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-7f9a6ec5b091c39f.json (deflated 93%) 2025-12-04T16:24:10.7757625Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-b99623f068a7d8ad.json (deflated 95%) 2025-12-04T16:24:10.7763415Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-5a150b3a059b9a30.json (deflated 96%) 2025-12-04T16:24:10.7769204Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-a7e0d08ca6140a5e.json (deflated 95%) 2025-12-04T16:24:10.7775366Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-102e4c3a904c277d.json (deflated 93%) 2025-12-04T16:24:10.7777093Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-ea0dea10ce3f91a8.json (deflated 88%) 2025-12-04T16:24:10.7778847Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-70230936f8120c6b.json (deflated 88%) 2025-12-04T16:24:10.7780515Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-327d67e6c79eec23.json (deflated 92%) 2025-12-04T16:24:10.7782124Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b8b19f7b3ae0b98c.json (deflated 86%) 2025-12-04T16:24:10.7783733Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-cdcb3ea635f515ef.json (deflated 85%) 2025-12-04T16:24:10.7785492Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-2c53b8c4a22bc4ac.json (deflated 85%) 2025-12-04T16:24:10.7787136Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c3b4fd201ecf8b1e.json (deflated 86%) 2025-12-04T16:24:10.7789542Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7b498694d0586007.json (deflated 85%) 2025-12-04T16:24:10.7791138Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-f04dfce55afddfa2.json (deflated 85%) 2025-12-04T16:24:10.7792845Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a20deb48b5165834.json (deflated 86%) 2025-12-04T16:24:10.7794443Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-688c2e6fbd08d334.json (deflated 85%) 2025-12-04T16:24:10.7811216Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a49a4c623eea732f.json (deflated 85%) 2025-12-04T16:24:10.7812783Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-9d29992f2be81c73.json (deflated 86%) 2025-12-04T16:24:10.7814252Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a570f380a97ffdf.json (deflated 85%) 2025-12-04T16:24:10.7815729Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b091dc2605282b32.json (deflated 85%) 2025-12-04T16:24:10.7817221Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a84aa230dcc2c72.json (deflated 86%) 2025-12-04T16:24:10.7818776Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-622e1501e18bdb77.json (deflated 85%) 2025-12-04T16:24:10.7820244Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-af0abf9f8a1870a5.json (deflated 85%) 2025-12-04T16:24:10.7821828Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a29cc44636754e7c.json (deflated 86%) 2025-12-04T16:24:10.7823294Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ad8948ec33d2b35b.json (deflated 85%) 2025-12-04T16:24:10.7824762Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-33a982dae5077c46.json (deflated 85%) 2025-12-04T16:24:10.7826231Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bd8b5496f00b6cd3.json (deflated 86%) 2025-12-04T16:24:10.7827708Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ef5bae0bde57e96b.json (deflated 85%) 2025-12-04T16:24:10.7829180Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c80e75c43f69568f.json (deflated 85%) 2025-12-04T16:24:10.7830651Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-528c2efc29bf6df4.json (deflated 86%) 2025-12-04T16:24:10.7832122Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ed81db7834398b8b.json (deflated 85%) 2025-12-04T16:24:10.7833581Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-261b1bf878c6e8ce.json (deflated 85%) 2025-12-04T16:24:10.7835054Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-753aefcc31d0e7df.json (deflated 86%) 2025-12-04T16:24:10.7836621Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-adc6d5470587661c.json (deflated 85%) 2025-12-04T16:24:10.7838093Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bbb1474dafa92442.json (deflated 85%) 2025-12-04T16:24:10.7839565Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-e5cba0d160bf2991.json (deflated 86%) 2025-12-04T16:24:10.7841075Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-df18bdebce6af33a.json (deflated 85%) 2025-12-04T16:24:10.7842551Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d98beca05e867fac.json (deflated 85%) 2025-12-04T16:24:10.7844033Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-3111b2456c98ee88.json (deflated 86%) 2025-12-04T16:24:10.7845499Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d129ff52aa3d5acd.json (deflated 85%) 2025-12-04T16:24:10.7846961Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-89ebab0d3f4369e8.json (deflated 85%) 2025-12-04T16:24:10.7848411Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-0dc61241fef4b8bc.json (stored 0%) 2025-12-04T16:24:10.7849847Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-5bc7c9553ad37db8.json (deflated 93%) 2025-12-04T16:24:10.7851283Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-c06a9a1a186bb263.json (deflated 91%) 2025-12-04T16:24:10.7852723Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-3aa7e69ca6b255e3.json (deflated 91%) 2025-12-04T16:24:10.7860968Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-696ac340ce65f280.json (deflated 96%) 2025-12-04T16:24:10.7862422Z adding: test/test-reports/python-pytest/inductor.test_deterministic/inductor.test_deterministic-b80250c7092ee9c7.json (deflated 83%) 2025-12-04T16:24:10.7863777Z adding: test/test-reports/python-pytest/inductor.test_deterministic/inductor.test_deterministic-4c3a96ba74540dee.json (deflated 77%) 2025-12-04T16:24:10.7865088Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-826c2562e296e7cd.json (deflated 83%) 2025-12-04T16:24:10.7866370Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-738ee4f0dcb0d1ea.json (deflated 83%) 2025-12-04T16:24:10.7867664Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a05156439a0d1496.json (deflated 83%) 2025-12-04T16:24:10.7868955Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a884406c7304f17c.json (deflated 83%) 2025-12-04T16:24:10.7870247Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c03b63e1d9e14442.json (deflated 83%) 2025-12-04T16:24:10.7871538Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a7e12d722ef68d61.json (deflated 83%) 2025-12-04T16:24:10.7872833Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-758c1dbc5b36fbbe.json (deflated 83%) 2025-12-04T16:24:10.7874136Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1356c8f9e8cb48ab.json (deflated 83%) 2025-12-04T16:24:10.7875424Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3d33fda2cceb2cbd.json (deflated 83%) 2025-12-04T16:24:10.7876791Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0982db42fabefc11.json (deflated 83%) 2025-12-04T16:24:10.7878075Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e365e000dec26b6e.json (deflated 83%) 2025-12-04T16:24:10.7879365Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c05ba049f6d7f244.json (deflated 83%) 2025-12-04T16:24:10.7880638Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e64a3e20d6733c67.json (deflated 83%) 2025-12-04T16:24:10.7881973Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-60656dccead73746.json (deflated 83%) 2025-12-04T16:24:10.7883255Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-74b07a56cd02ebb4.json (deflated 83%) 2025-12-04T16:24:10.7884546Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97c1aae6502fe9a4.json (deflated 86%) 2025-12-04T16:24:10.7885828Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9520e13942c1e127.json (deflated 83%) 2025-12-04T16:24:10.7887107Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5b20aba83da29ea2.json (deflated 83%) 2025-12-04T16:24:10.7888617Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4f43dfc4bd691e8.json (deflated 88%) 2025-12-04T16:24:10.7889910Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1ceca9168882f805.json (deflated 83%) 2025-12-04T16:24:10.7891197Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c6f480cabc56b496.json (deflated 83%) 2025-12-04T16:24:10.7892493Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9497acc525242c8a.json (deflated 85%) 2025-12-04T16:24:10.7893776Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-158a50fa606ffe95.json (deflated 83%) 2025-12-04T16:24:10.7895125Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-27359545a6656d51.json (deflated 83%) 2025-12-04T16:24:10.7896421Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1f2cd5255ddbd20d.json (deflated 83%) 2025-12-04T16:24:10.7897701Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6e021b62864c2856.json (deflated 83%) 2025-12-04T16:24:10.7899060Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-49627e719c8e890b.json (deflated 83%) 2025-12-04T16:24:10.7900341Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66537c67c1172e95.json (deflated 83%) 2025-12-04T16:24:10.7901611Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b567c9f6400d034b.json (deflated 83%) 2025-12-04T16:24:10.7902900Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2dff14642abe4d78.json (deflated 83%) 2025-12-04T16:24:10.7904182Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65b5f001c721490a.json (deflated 83%) 2025-12-04T16:24:10.7905471Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-51dd6a289fe92b8c.json (deflated 83%) 2025-12-04T16:24:10.7906760Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-fcdeca5855991d94.json (deflated 83%) 2025-12-04T16:24:10.7908050Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97fc9d1afafb45ba.json (deflated 83%) 2025-12-04T16:24:10.7909432Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58940be16b8df1f1.json (deflated 83%) 2025-12-04T16:24:10.7910723Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9349febb39593fe3.json (deflated 83%) 2025-12-04T16:24:10.7912000Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-610ad64c0d2affb1.json (deflated 83%) 2025-12-04T16:24:10.7913280Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-535142d7b2475ad0.json (deflated 83%) 2025-12-04T16:24:10.7914616Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-44b8ebfaa38db266.json (deflated 83%) 2025-12-04T16:24:10.7915892Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bba5c35945b28810.json (deflated 83%) 2025-12-04T16:24:10.7917173Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a3c90c249d149d10.json (deflated 83%) 2025-12-04T16:24:10.7918456Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c896d4e441aec272.json (deflated 83%) 2025-12-04T16:24:10.7919749Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0f626bbdc83148be.json (deflated 83%) 2025-12-04T16:24:10.7921024Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e68881511844a645.json (deflated 83%) 2025-12-04T16:24:10.7922299Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a161840b55d48987.json (deflated 83%) 2025-12-04T16:24:10.7923565Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-84da9e83d0a63a49.json (deflated 83%) 2025-12-04T16:24:10.7924852Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5ba1ce04884dd264.json (deflated 83%) 2025-12-04T16:24:10.7926137Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7aa6851dee05bf26.json (deflated 83%) 2025-12-04T16:24:10.7927453Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cff0107c6e55c745.json (deflated 85%) 2025-12-04T16:24:10.7928733Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65ccdb61caed4adb.json (deflated 83%) 2025-12-04T16:24:10.7930027Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-332efb6825e8ee00.json (deflated 83%) 2025-12-04T16:24:10.7931313Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cf5b975b5ed0b41.json (deflated 83%) 2025-12-04T16:24:10.7932687Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66c6826e1f29c7ae.json (deflated 83%) 2025-12-04T16:24:10.7934060Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2a970c00eea5bec9.json (deflated 83%) 2025-12-04T16:24:10.7935483Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-284ef1fed3dcf877.json (deflated 83%) 2025-12-04T16:24:10.7937030Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5553e88673e5c7d3.json (deflated 83%) 2025-12-04T16:24:10.7938630Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-93bd8f6af98a9b4c.json (deflated 83%) 2025-12-04T16:24:10.7940042Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58f36c545585b24b.json (deflated 85%) 2025-12-04T16:24:10.7941492Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6ef09e48fb8b226a.json (deflated 83%) 2025-12-04T16:24:10.7942990Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e71119cdd1d43b1e.json (deflated 83%) 2025-12-04T16:24:10.7944403Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c8dc22bc4fac83a2.json (deflated 83%) 2025-12-04T16:24:10.7945878Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8170ae01b738ee47.json (deflated 83%) 2025-12-04T16:24:10.7947283Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0595c5125e4115a2.json (deflated 83%) 2025-12-04T16:24:10.7948672Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ea830dd4e98e3dce.json (deflated 85%) 2025-12-04T16:24:10.7950148Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0ddafb1da2805e03.json (deflated 83%) 2025-12-04T16:24:10.7951579Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3b676b001d5055f4.json (deflated 83%) 2025-12-04T16:24:10.7953000Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-15a19448251eca38.json (deflated 83%) 2025-12-04T16:24:10.7954429Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7dd506857cedd350.json (deflated 83%) 2025-12-04T16:24:10.7955858Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a068bf6354320ba5.json (deflated 83%) 2025-12-04T16:24:10.7957276Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0bf202cdb375dae7.json (deflated 83%) 2025-12-04T16:24:10.7958688Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-796bdc853aa8ac88.json (deflated 83%) 2025-12-04T16:24:10.7960096Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-abef13434279995d.json (deflated 83%) 2025-12-04T16:24:10.7961533Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-47824ae1e1d0f6cb.json (deflated 83%) 2025-12-04T16:24:10.7963010Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4038060382adbf9a.json (deflated 83%) 2025-12-04T16:24:10.7964396Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d0c6827a4c24a96e.json (deflated 83%) 2025-12-04T16:24:10.7965871Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eb0f3705043be69a.json (deflated 83%) 2025-12-04T16:24:10.7967251Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b4b4a4cac36fd9fb.json (deflated 83%) 2025-12-04T16:24:10.7968645Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5f059c7d0f50bda4.json (deflated 83%) 2025-12-04T16:24:10.7970177Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c9ff80f1a6820ed2.json (deflated 83%) 2025-12-04T16:24:10.7971610Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c2ef3587eca8dd3f.json (deflated 83%) 2025-12-04T16:24:10.7973049Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f520a42aa1f82bba.json (deflated 83%) 2025-12-04T16:24:10.7974433Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-26d6067018821038.json (deflated 85%) 2025-12-04T16:24:10.7975840Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-323433d4d00bc615.json (deflated 83%) 2025-12-04T16:24:10.7977329Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-171ced1c6f8950ca.json (deflated 83%) 2025-12-04T16:24:10.7978890Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-25dfec8b4de20cd4.json (deflated 83%) 2025-12-04T16:24:10.7980259Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-301e5b291449627e.json (deflated 83%) 2025-12-04T16:24:10.7981728Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0b50a11bd83abca6.json (deflated 83%) 2025-12-04T16:24:10.7983151Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28422965aaa270fa.json (deflated 83%) 2025-12-04T16:24:10.7984588Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cc1d7ca9ed71a647.json (deflated 83%) 2025-12-04T16:24:10.7986133Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28fdda0d819ffd5b.json (deflated 83%) 2025-12-04T16:24:10.7987515Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2efd0ac46f0ffe8c.json (deflated 83%) 2025-12-04T16:24:10.7989133Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a06616e9f3ca63a0.json (deflated 83%) 2025-12-04T16:24:10.7990635Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ac835e731ca0e136.json (deflated 83%) 2025-12-04T16:24:10.7992049Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b97506f71fc95dd3.json (deflated 83%) 2025-12-04T16:24:10.7993395Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e0e0cb20cc598f84.json (deflated 83%) 2025-12-04T16:24:10.7994880Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7ccf4f5fc7381dbf.json (deflated 83%) 2025-12-04T16:24:10.7996295Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9c9073ed00c82380.json (deflated 83%) 2025-12-04T16:24:10.7997758Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-24dd415c882be612.json (deflated 83%) 2025-12-04T16:24:10.7999274Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ebb637005a79032b.json (deflated 83%) 2025-12-04T16:24:10.8000627Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-64323540e9247afe.json (deflated 83%) 2025-12-04T16:24:10.8002069Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a52cc93387944e77.json (deflated 83%) 2025-12-04T16:24:10.8003550Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c3ebbd7913c9335f.json (deflated 83%) 2025-12-04T16:24:10.8004987Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4acaad8a10b8235e.json (deflated 83%) 2025-12-04T16:24:10.8006430Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7532210fc6e9ae4a.json (deflated 83%) 2025-12-04T16:24:10.8007806Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d5d3164b3ad67fad.json (deflated 83%) 2025-12-04T16:24:10.8009206Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4e5ad1e60a526460.json (deflated 83%) 2025-12-04T16:24:10.8010669Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4c3c91d6f895ec60.json (deflated 83%) 2025-12-04T16:24:10.8012077Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-95a17199eb5946d1.json (deflated 83%) 2025-12-04T16:24:10.8013516Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-81b9e43a20c07e3d.json (deflated 83%) 2025-12-04T16:24:10.8014988Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1de8202a5dd1888a.json (deflated 83%) 2025-12-04T16:24:10.8016417Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c241afcfaa8e2579.json (deflated 83%) 2025-12-04T16:24:10.8017865Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dbee4fdc7ea3ecd0.json (deflated 83%) 2025-12-04T16:24:10.8019456Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-163cc6583cdf0c02.json (deflated 83%) 2025-12-04T16:24:10.8020885Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a5ca9a61ebc16b36.json (deflated 83%) 2025-12-04T16:24:10.8022328Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4c3d903e4063d32.json (deflated 83%) 2025-12-04T16:24:10.8023770Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-967683d4f4b3f6ea.json (deflated 83%) 2025-12-04T16:24:10.8025183Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dd59b77eb3bfc158.json (deflated 83%) 2025-12-04T16:24:10.8026573Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-16ca540d5e1f852f.json (deflated 83%) 2025-12-04T16:24:10.8028013Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3fca4d8c1020266e.json (deflated 83%) 2025-12-04T16:24:10.8029433Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d72e902ee6d4abee.json (deflated 83%) 2025-12-04T16:24:10.8030881Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ec9f5a8da88c6473.json (deflated 83%) 2025-12-04T16:24:10.8032266Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cd768c9b8fb615f9.json (deflated 83%) 2025-12-04T16:24:10.8033688Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eed072ccebef54d5.json (deflated 83%) 2025-12-04T16:24:10.8035284Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e17832bbca358cbf.json (deflated 83%) 2025-12-04T16:24:10.8036698Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2596f02ad0864233.json (deflated 83%) 2025-12-04T16:24:10.8038086Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-35234e0a942debc1.json (deflated 83%) 2025-12-04T16:24:10.8039511Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-11b937d6d736747a.json (deflated 83%) 2025-12-04T16:24:10.8040937Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cef76ce1b71d0a4.json (deflated 83%) 2025-12-04T16:24:10.8042333Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f5efe4ba459d5b44.json (deflated 83%) 2025-12-04T16:24:10.8043808Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-92f9442ed796f73f.json (deflated 83%) 2025-12-04T16:24:10.8045171Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bcaaee0a01e62e43.json (deflated 83%) 2025-12-04T16:24:10.8046567Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-76ac61a88acbe22b.json (deflated 83%) 2025-12-04T16:24:10.8048048Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8647c42278291c82.json (deflated 97%) 2025-12-04T16:24:10.8049583Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a0ddea104bebcf1c.json (deflated 82%) 2025-12-04T16:24:10.8051026Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c80d90eb9dbd0365.json (deflated 82%) 2025-12-04T16:24:10.8052403Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-98804e2c83f300d6.json (stored 0%) 2025-12-04T16:24:10.8053745Z adding: test/test-reports/python-pytest/dynamo.test_callback/dynamo.test_callback-042e319b1bc49e9e.json (deflated 61%) 2025-12-04T16:24:10.8055151Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-9d75f5d2563f16c8.json (deflated 88%) 2025-12-04T16:24:10.8056677Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-704b849fb73fff94.json (deflated 88%) 2025-12-04T16:24:10.8058115Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-64c9d19a2b1bbb58.json (deflated 88%) 2025-12-04T16:24:10.8059699Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-c370485b718ab3d2.json (deflated 75%) 2025-12-04T16:24:10.8061414Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-cfe1d81d1ed43800.json (deflated 93%) 2025-12-04T16:24:10.8063339Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-1783f83983666f0c.json (deflated 57%) 2025-12-04T16:24:10.8065270Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-04ae990c731b58f8.json (deflated 92%) 2025-12-04T16:24:10.8066989Z adding: test/test-reports/python-pytest/inductor.test_collective_autotuning/inductor.test_collective_autotuning-0f27caa637154718.json (deflated 68%) 2025-12-04T16:24:10.8068611Z adding: test/test-reports/python-pytest/higher_order_ops.test_local_map/higher_order_ops.test_local_map-9df8631d0f17132c.json (deflated 85%) 2025-12-04T16:24:10.8069978Z adding: test/test-reports/python-pytest/test_dataloader/test_dataloader-94259eed0293b6e4.json (deflated 90%) 2025-12-04T16:24:10.8071180Z adding: test/test-reports/python-pytest/test_decomp/test_decomp-f51ad4b3b0738d41.json (deflated 95%) 2025-12-04T16:24:10.8072215Z adding: test/test-reports/python-pytest/test_decomp/test_decomp-6b78cf4133d2553c.json (deflated 95%) 2025-12-04T16:24:10.8078914Z adding: test/test-reports/python-pytest/test_decomp/test_decomp-91f7fdd24463f867.json (deflated 95%) 2025-12-04T16:24:10.8121891Z adding: test/test-reports/python-pytest/test_ops_fwd_gradients/test_ops_fwd_gradients-dba31c3696d52244.json (deflated 97%) 2025-12-04T16:24:10.8294878Z adding: test/test-reports/python-pytest/test_meta/test_meta-a3131b0e7fce7ef2.json (deflated 97%) 2025-12-04T16:24:10.8368710Z adding: test/test-reports/python-pytest/test_ops_gradients/test_ops_gradients-8f336b5d99061bf3.json (deflated 97%) 2025-12-04T16:24:10.8383930Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-77c9be2be0e8d8f0.json (deflated 97%) 2025-12-04T16:24:10.8391270Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-87fea4734ba7fa90.json (deflated 97%) 2025-12-04T16:24:10.8398221Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-cdf15fa2abf8389b.json (deflated 97%) 2025-12-04T16:24:10.8399193Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-301657435283457f.json (deflated 85%) 2025-12-04T16:24:10.8406077Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-274ad688fc221da2.json (deflated 97%) 2025-12-04T16:24:10.8413082Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-c003f5ff32bae5e6.json (deflated 97%) 2025-12-04T16:24:10.8416749Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-e9e5772b33a862c1.json (deflated 96%) 2025-12-04T16:24:10.8427324Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-67ba282cb3193875.json (deflated 97%) 2025-12-04T16:24:10.8437590Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-f88fa3b886672d5c.json (deflated 97%) 2025-12-04T16:24:10.8452600Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-83ecadeff448aeea.json (deflated 96%) 2025-12-04T16:24:10.8533870Z adding: test/test-reports/python-pytest/test_ops/test_ops-2fce3d3e4cb230c2.json (deflated 96%) 2025-12-04T16:24:10.8608487Z adding: test/test-reports/python-pytest/test_ops/test_ops-b0df9953dc6aab8d.json (deflated 96%) 2025-12-04T16:24:10.8657562Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-bc7545635763bbdc.json (deflated 97%) 2025-12-04T16:24:10.8658852Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-fad3091b7781ac2f.json (deflated 88%) 2025-12-04T16:24:10.8660133Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-74bf247f22e5b0d6.json (deflated 88%) 2025-12-04T16:24:10.8661582Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-4c8f61ef5d858418.json (deflated 90%) 2025-12-04T16:24:10.8662757Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-c19baf682148544c.json (deflated 88%) 2025-12-04T16:24:10.8663997Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-a8d85ae14c598d51.json (deflated 88%) 2025-12-04T16:24:10.8665238Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-8c50d88405c7b0b3.json (deflated 88%) 2025-12-04T16:24:10.8666449Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-dad902e1dd428e6d.json (deflated 88%) 2025-12-04T16:24:10.8667606Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-d765bceda0ac4437.json (deflated 88%) 2025-12-04T16:24:10.8669441Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-42fbcb95d0a0c581.json (deflated 91%) 2025-12-04T16:24:10.8670647Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-da54dbb08fbaa628.json (deflated 88%) 2025-12-04T16:24:10.8671892Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-90277c9f3d74bea5.json (deflated 88%) 2025-12-04T16:24:10.8673084Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-559456aeb9be5546.json (deflated 89%) 2025-12-04T16:24:10.8718587Z adding: test/test-reports/python-pytest/functorch.test_ops/functorch.test_ops-c51c09e8b0651326.json (deflated 95%) 2025-12-04T16:24:10.8719874Z adding: test/test-reports/python-pytest/inductor.test_group_batch_fusion/inductor.test_group_batch_fusion-c943193060216670.json (deflated 84%) 2025-12-04T16:24:10.8738253Z adding: test/test-reports/python-pytest/inductor.test_cpu_repro/inductor.test_cpu_repro-2a01ea2f1377f4e8.json (deflated 97%) 2025-12-04T16:24:10.8746034Z adding: test/test-reports/python-pytest/inductor.test_fused_attention/inductor.test_fused_attention-93034b4b3ea7a646.json (deflated 96%) 2025-12-04T16:24:10.8749419Z adding: test/test-reports/python-pytest/inductor.test_inductor_freezing/inductor.test_inductor_freezing-a1df3eccdba273d4.json (deflated 94%) 2025-12-04T16:24:10.8767564Z adding: test/test-reports/python-pytest/dynamo.test_higher_order_ops/dynamo.test_higher_order_ops-53ad11142cf6dd1d.json (deflated 93%) 2025-12-04T16:24:10.8777103Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor_arrayref/inductor.test_aot_inductor_arrayref-b86518472a5506a3.json (deflated 95%) 2025-12-04T16:24:10.8778565Z adding: test/test-reports/python-pytest/inductor.test_autoheuristic/inductor.test_autoheuristic-96f2d55568fe5d93.json (stored 0%) 2025-12-04T16:24:10.8779883Z adding: test/test-reports/python-pytest/inductor.test_flex_flash/inductor.test_flex_flash-19911cd6feadbffe.json (deflated 97%) 2025-12-04T16:24:10.8781185Z adding: test/test-reports/python-pytest/inductor.test_segmented_tree/inductor.test_segmented_tree-1f8a73d877a0e0af.json (deflated 89%) 2025-12-04T16:24:10.8782704Z adding: test/test-reports/python-pytest/inductor.test_kernel_optimization/inductor.test_kernel_optimization-8ba4093a849c99fb.json (deflated 39%) 2025-12-04T16:24:10.8784053Z adding: test/test-reports/python-pytest/inductor.test_metrics/inductor.test_metrics-ba304e492604cb49.json (deflated 78%) 2025-12-04T16:24:10.8785377Z adding: test/test-reports/python-pytest/export.test_unflatten_training_ir/export.test_unflatten_training_ir-dcf5c0e2417d22fe.json (deflated 94%) 2025-12-04T16:24:10.8786704Z adding: test/test-reports/python-pytest/inductor.test_fx_fusion/inductor.test_fx_fusion-740800c94204dd1f.json (deflated 79%) 2025-12-04T16:24:10.8797476Z adding: test/test-reports/python-pytest/dynamo.test_inline_and_install/dynamo.test_inline_and_install-ee70bed9693bb3f7.json (deflated 92%) 2025-12-04T16:24:10.8798890Z adding: test/test-reports/python-pytest/inductor.test_inductor_scheduler/inductor.test_inductor_scheduler-f799e81908d876d8.json (deflated 85%) 2025-12-04T16:24:10.8840129Z adding: test/test-reports/python-pytest/export.test_export_strict/export.test_export_strict-16708e34d1c82f51.json (deflated 96%) 2025-12-04T16:24:10.8856142Z adding: test/test-reports/python-pytest/inductor.test_compiled_optimizers/inductor.test_compiled_optimizers-8408ff78148caebd.json (deflated 96%) 2025-12-04T16:24:10.8857590Z adding: test/test-reports/python-pytest/dynamo.test_graph_region_tracker/dynamo.test_graph_region_tracker-e7e3f06a1f8d4793.json (deflated 90%) 2025-12-04T16:24:10.8858978Z adding: test/test-reports/python-pytest/dynamo.test_unittest/dynamo.test_unittest-1a49598617666801.json (deflated 37%) 2025-12-04T16:24:10.8860176Z adding: test/test-reports/python-pytest/inductor.test_compile/inductor.test_compile-f393f964f8c8ee63.json (deflated 87%) 2025-12-04T16:24:10.8882070Z adding: test/test-reports/python-pytest/dynamo.test_functions/dynamo.test_functions-e9d213b32d56adfd.json (deflated 93%) 2025-12-04T16:24:10.8889436Z adding: test/test-reports/python-pytest/inductor.test_ordered_set/inductor.test_ordered_set-b674b38a5211e06f.json (deflated 97%) 2025-12-04T16:24:10.8891030Z adding: test/test-reports/python-pytest/dynamo.test_install_free_tensors/dynamo.test_install_free_tensors-4c842a4ed6e31f3f.json (deflated 87%) 2025-12-04T16:24:10.8892788Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_config_overrides/inductor.test_torchinductor_codegen_config_overrides-15f0302c88c6ef46.json (deflated 82%) 2025-12-04T16:24:10.8894298Z adding: test/test-reports/python-pytest/export.test_passes/export.test_passes-d27457a3d9237a22.json (deflated 89%) 2025-12-04T16:24:10.8898032Z adding: test/test-reports/python-pytest/dynamo.test_autograd_function/dynamo.test_autograd_function-bbde79f54df96d93.json (deflated 89%) 2025-12-04T16:24:10.8913680Z adding: test/test-reports/python-pytest/inductor.test_codecache/inductor.test_codecache-d532a0bd02778de2.json (deflated 95%) 2025-12-04T16:24:10.8929724Z adding: test/test-reports/python-pytest/inductor.test_auto_functionalize/inductor.test_auto_functionalize-1caf9ad6930ca9ac.json (deflated 95%) 2025-12-04T16:24:10.8931176Z adding: test/test-reports/python-pytest/inductor.test_efficient_conv_bn_eval/inductor.test_efficient_conv_bn_eval-93e786f7489a3e47.json (deflated 71%) 2025-12-04T16:24:10.8932545Z adding: test/test-reports/python-pytest/inductor.test_torchbind/inductor.test_torchbind-9cdc6632b9077562.json (deflated 86%) 2025-12-04T16:24:10.8933774Z adding: test/test-reports/python-pytest/dynamo.test_flat_apply/dynamo.test_flat_apply-3a6f00a2fe27363c.json (deflated 74%) 2025-12-04T16:24:10.8935076Z adding: test/test-reports/python-pytest/dynamo.test_input_attr_tracking/dynamo.test_input_attr_tracking-e306635654774769.json (deflated 83%) 2025-12-04T16:24:10.8936474Z adding: test/test-reports/python-pytest/dynamo.test_graph_deduplication/dynamo.test_graph_deduplication-72be7e1eac6e503d.json (deflated 90%) 2025-12-04T16:24:10.8938025Z adding: test/test-reports/python-pytest/inductor.test_distributed_patterns/inductor.test_distributed_patterns-7db9a3d44802273a.json (deflated 90%) 2025-12-04T16:24:10.8939492Z adding: test/test-reports/python-pytest/dynamo.test_bytecode_utils/dynamo.test_bytecode_utils-4132ad7230faf1a5.json (deflated 87%) 2025-12-04T16:24:10.8940744Z adding: test/test-reports/python-pytest/export.test_nativert/export.test_nativert-bbcdef66a0abda26.json (deflated 88%) 2025-12-04T16:24:10.8941874Z adding: test/test-reports/python-pytest/export.test_hop/export.test_hop-06f913442140df32.json (deflated 94%) 2025-12-04T16:24:10.8978559Z adding: test/test-reports/python-pytest/export.test_export/export.test_export-f3f94f66dfdfed6b.json (deflated 96%) 2025-12-04T16:24:10.8979810Z adding: test/test-reports/python-pytest/test_utils_config_module/test_utils_config_module-5af1ad5b3b73495f.json (deflated 90%) 2025-12-04T16:24:10.8980937Z adding: test/test-reports/python-pytest/test_hop_infra/test_hop_infra-7733a804aa6d4c14.json (deflated 72%) 2025-12-04T16:24:10.8982138Z adding: test/test-reports/python-pytest/test_appending_byte_serializer/test_appending_byte_serializer-65501f8364394f57.json (deflated 76%) 2025-12-04T16:24:10.8983319Z adding: test/test-reports/python-pytest/test_license/test_license-56f2b4cb04cabe36.json (deflated 59%) 2025-12-04T16:24:10.8984350Z adding: test/test-reports/python-pytest/test_ao_sparsity/test_ao_sparsity-c7bfc0a40ffe86a3.json (deflated 91%) 2025-12-04T16:24:10.8987629Z adding: test/test-reports/python-pytest/test_proxy_tensor/test_proxy_tensor-25f8d712045ddeef.json (deflated 95%) 2025-12-04T16:24:10.8988953Z adding: test/test-reports/python-pytest/torch_np.test_dtype/torch_np.test_dtype-a1152bdff0bb5e89.json (deflated 96%) 2025-12-04T16:24:10.8990048Z adding: test/test-reports/python-pytest/test_file_check/test_file_check-21e904b3669672b6.json (deflated 62%) 2025-12-04T16:24:10.8991107Z adding: test/test-reports/python-pytest/test_flop_counter/test_flop_counter-83e0741caa94eff2.json (deflated 89%) 2025-12-04T16:24:10.8992298Z adding: test/test-reports/python-pytest/backends.xeon.test_launch/backends.xeon.test_launch-35338ec25f37484c.json (deflated 62%) 2025-12-04T16:24:10.8993472Z adding: test/test-reports/python-pytest/test_show_pickle/test_show_pickle-865976a650f0e2c8.json (deflated 35%) 2025-12-04T16:24:10.8994560Z adding: test/test-reports/python-pytest/test_openmp/test_openmp-ab09b4ea4a98cceb.json (deflated 61%) 2025-12-04T16:24:10.8996764Z adding: test/test-reports/python-pytest/test_expanded_weights/test_expanded_weights-e8def21a8b982d80.json (deflated 96%) 2025-12-04T16:24:10.8998008Z adding: test/test-reports/python-pytest/functorch.test_ac_knapsack/functorch.test_ac_knapsack-e93133af03c79b6b.json (deflated 87%) 2025-12-04T16:24:10.9033493Z adding: test/test-reports/python-pytest/torch_np.test_nep50_examples/torch_np.test_nep50_examples-4b9983ced205ee32.json (deflated 99%) 2025-12-04T16:24:10.9055156Z adding: test/test-reports/python-pytest/test_torch/test_torch-f1fe0f79113cbdd9.json (deflated 95%) 2025-12-04T16:24:10.9056671Z adding: test/test-reports/python-pytest/higher_order_ops.test_with_effects/higher_order_ops.test_with_effects-e6b12b25e8a894fd.json (deflated 88%) 2025-12-04T16:24:10.9057843Z adding: test/test-reports/python-pytest/test_weak/test_weak-d0e879aa5e6c216f.json (deflated 95%) 2025-12-04T16:24:10.9058867Z adding: test/test-reports/python-pytest/test_complex/test_complex-85789b75771d41fd.json (deflated 88%) 2025-12-04T16:24:10.9079054Z adding: test/test-reports/python-pytest/test_optim/test_optim-e2224a41ca12c5b8.json (deflated 96%) 2025-12-04T16:24:10.9080021Z adding: test/test-reports/python-pytest/xpu.test_conv/xpu.test_conv-725b55c77ef2b365.json (stored 0%) 2025-12-04T16:24:10.9081119Z adding: test/test-reports/python-pytest/nn.test_packed_sequence/nn.test_packed_sequence-cbb1d818037c9c38.json (deflated 91%) 2025-12-04T16:24:10.9082261Z adding: test/test-reports/python-pytest/test_numa_binding/test_numa_binding-1b1d6199d4053b8e.json (deflated 86%) 2025-12-04T16:24:10.9083435Z adding: test/test-reports/python-pytest/test_pruning_op/test_pruning_op-4970668223f4717e.json (deflated 64%) 2025-12-04T16:24:10.9255739Z adding: test/test-reports/python-pytest/test_jit_fuser_te/test_jit_fuser_te-c194bc30f4bbb26c.json (deflated 99%) 2025-12-04T16:24:10.9259932Z adding: test/test-reports/python-pytest/test_sympy_utils/test_sympy_utils-922738ab31cc19f0.json (deflated 96%) 2025-12-04T16:24:10.9261018Z adding: test/test-reports/python-pytest/test_mkldnn_fusion/test_mkldnn_fusion-51238045ef241025.json (deflated 81%) 2025-12-04T16:24:10.9372780Z adding: test/test-reports/python-pytest/test_schema_check/test_schema_check-489dd02163956a84.json (deflated 97%) 2025-12-04T16:24:10.9373870Z adding: test/test-reports/python-pytest/test_mkldnn/test_mkldnn-2c6972170cf2deca.json (stored 0%) 2025-12-04T16:24:10.9771649Z adding: test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-52555684d34478cf.json (deflated 98%) 2025-12-04T16:24:10.9772854Z adding: test/test-reports/python-unittest/test_autoload/TEST-TestDeviceBackendAutoload-20251204162351.json (deflated 38%) 2025-12-04T16:24:10.9802967Z ##[group]Run # Remove any previous test reports if they exist 2025-12-04T16:24:10.9803530Z # Remove any previous test reports if they exist 2025-12-04T16:24:10.9803976Z rm -f test-reports-*.zip 2025-12-04T16:24:10.9804535Z zip -r "test-reports-${FILE_SUFFIX}.zip" test/test-reports -i '*.xml' -i '*.csv' 2025-12-04T16:24:10.9811408Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:10.9811832Z env: 2025-12-04T16:24:10.9812079Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:10.9812391Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:10.9812747Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:10.9813391Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:10.9814199Z FILE_SUFFIX: test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T16:24:10.9814771Z ##[endgroup] 2025-12-04T16:24:10.9948866Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-7ff7771e3c72f123.xml (deflated 89%) 2025-12-04T16:24:10.9959802Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b312d84b484956a1.xml (deflated 93%) 2025-12-04T16:24:10.9971968Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-9ee61a34966ac1ed.xml (deflated 93%) 2025-12-04T16:24:10.9978086Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-31f94497420761eb.xml (deflated 91%) 2025-12-04T16:24:10.9981288Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b252d51c09f06fd1.xml (deflated 90%) 2025-12-04T16:24:10.9984508Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-446d02039a2c0737.xml (deflated 90%) 2025-12-04T16:24:10.9989439Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-b3aa6989e65545bb.xml (deflated 91%) 2025-12-04T16:24:10.9992051Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-dc71db8b83a58b10.xml (deflated 90%) 2025-12-04T16:24:10.9994297Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-d1d71be98185ea4a.xml (deflated 90%) 2025-12-04T16:24:10.9996805Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-ec873e76fad9978d.xml (deflated 91%) 2025-12-04T16:24:11.0023031Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-bf110f953f5c5355.xml (deflated 94%) 2025-12-04T16:24:11.0024381Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-268cc96fdd12a701.xml (deflated 86%) 2025-12-04T16:24:11.0025733Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-9d3344bbd0e95b72.xml (deflated 86%) 2025-12-04T16:24:11.0028515Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-c49d705795b8a118.xml (deflated 87%) 2025-12-04T16:24:11.0030178Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-03cc5e31909b0a52.xml (deflated 89%) 2025-12-04T16:24:11.0032033Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-e6be60d41d66658c.xml (deflated 89%) 2025-12-04T16:24:11.0033372Z adding: test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-27e2dd5fd3a036de.xml (deflated 76%) 2025-12-04T16:24:11.0053350Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-655beaec67eb1ac4.xml (deflated 92%) 2025-12-04T16:24:11.0063354Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-3eb5ad48105c1cd2.xml (deflated 94%) 2025-12-04T16:24:11.0073347Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-50486f79f4e32e59.xml (deflated 94%) 2025-12-04T16:24:11.0081024Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-305489af275a491d.xml (deflated 92%) 2025-12-04T16:24:11.0103179Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-3d2f5f1cdf9aac8a.xml (deflated 93%) 2025-12-04T16:24:11.0105006Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-53adf85a57cb1562.xml (deflated 85%) 2025-12-04T16:24:11.0106824Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-ae6d18145ba692e1.xml (deflated 85%) 2025-12-04T16:24:11.0113079Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_dynamic_shapes/inductor.test_torchinductor_codegen_dynamic_shapes-7f9a6ec5b091c39f.xml (deflated 92%) 2025-12-04T16:24:11.0116694Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-b99623f068a7d8ad.xml (deflated 92%) 2025-12-04T16:24:11.0121228Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-5a150b3a059b9a30.xml (deflated 93%) 2025-12-04T16:24:11.0125788Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-a7e0d08ca6140a5e.xml (deflated 92%) 2025-12-04T16:24:11.0131518Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-102e4c3a904c277d.xml (deflated 92%) 2025-12-04T16:24:11.0133060Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-ea0dea10ce3f91a8.xml (deflated 87%) 2025-12-04T16:24:11.0134865Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-70230936f8120c6b.xml (deflated 87%) 2025-12-04T16:24:11.0136752Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-327d67e6c79eec23.xml (deflated 90%) 2025-12-04T16:24:11.0138316Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b8b19f7b3ae0b98c.xml (deflated 85%) 2025-12-04T16:24:11.0139792Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-cdcb3ea635f515ef.xml (deflated 84%) 2025-12-04T16:24:11.0141257Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-2c53b8c4a22bc4ac.xml (deflated 84%) 2025-12-04T16:24:11.0142852Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c3b4fd201ecf8b1e.xml (deflated 85%) 2025-12-04T16:24:11.0144330Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7b498694d0586007.xml (deflated 84%) 2025-12-04T16:24:11.0145788Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-f04dfce55afddfa2.xml (deflated 84%) 2025-12-04T16:24:11.0147260Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a20deb48b5165834.xml (deflated 85%) 2025-12-04T16:24:11.0148774Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-688c2e6fbd08d334.xml (deflated 84%) 2025-12-04T16:24:11.0150250Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a49a4c623eea732f.xml (deflated 84%) 2025-12-04T16:24:11.0151727Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-9d29992f2be81c73.xml (deflated 85%) 2025-12-04T16:24:11.0153181Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a570f380a97ffdf.xml (deflated 84%) 2025-12-04T16:24:11.0154658Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-b091dc2605282b32.xml (deflated 84%) 2025-12-04T16:24:11.0156125Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-7a84aa230dcc2c72.xml (deflated 85%) 2025-12-04T16:24:11.0157594Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-622e1501e18bdb77.xml (deflated 84%) 2025-12-04T16:24:11.0159074Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-af0abf9f8a1870a5.xml (deflated 84%) 2025-12-04T16:24:11.0160528Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-a29cc44636754e7c.xml (deflated 85%) 2025-12-04T16:24:11.0162029Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ad8948ec33d2b35b.xml (deflated 84%) 2025-12-04T16:24:11.0163497Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-33a982dae5077c46.xml (deflated 84%) 2025-12-04T16:24:11.0164977Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bd8b5496f00b6cd3.xml (deflated 85%) 2025-12-04T16:24:11.0166440Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ef5bae0bde57e96b.xml (deflated 84%) 2025-12-04T16:24:11.0167917Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-c80e75c43f69568f.xml (deflated 84%) 2025-12-04T16:24:11.0169390Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-528c2efc29bf6df4.xml (deflated 85%) 2025-12-04T16:24:11.0170858Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-ed81db7834398b8b.xml (deflated 84%) 2025-12-04T16:24:11.0172334Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-261b1bf878c6e8ce.xml (deflated 84%) 2025-12-04T16:24:11.0173789Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-753aefcc31d0e7df.xml (deflated 85%) 2025-12-04T16:24:11.0175257Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-adc6d5470587661c.xml (deflated 84%) 2025-12-04T16:24:11.0176798Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-bbb1474dafa92442.xml (deflated 84%) 2025-12-04T16:24:11.0178358Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-e5cba0d160bf2991.xml (deflated 85%) 2025-12-04T16:24:11.0179823Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-df18bdebce6af33a.xml (deflated 84%) 2025-12-04T16:24:11.0181357Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d98beca05e867fac.xml (deflated 84%) 2025-12-04T16:24:11.0182830Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-3111b2456c98ee88.xml (deflated 85%) 2025-12-04T16:24:11.0184303Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-d129ff52aa3d5acd.xml (deflated 84%) 2025-12-04T16:24:11.0185774Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-89ebab0d3f4369e8.xml (deflated 84%) 2025-12-04T16:24:11.0187228Z adding: test/test-reports/python-pytest/inductor.test_cuda_select_algorithm/inductor.test_cuda_select_algorithm-0dc61241fef4b8bc.xml (deflated 28%) 2025-12-04T16:24:11.0189017Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-5bc7c9553ad37db8.xml (deflated 92%) 2025-12-04T16:24:11.0190470Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-c06a9a1a186bb263.xml (deflated 90%) 2025-12-04T16:24:11.0191911Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-3aa7e69ca6b255e3.xml (deflated 90%) 2025-12-04T16:24:11.0214695Z adding: test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-696ac340ce65f280.xml (deflated 96%) 2025-12-04T16:24:11.0216081Z adding: test/test-reports/python-pytest/inductor.test_deterministic/inductor.test_deterministic-b80250c7092ee9c7.xml (deflated 79%) 2025-12-04T16:24:11.0217527Z adding: test/test-reports/python-pytest/inductor.test_deterministic/inductor.test_deterministic-4c3a96ba74540dee.xml (deflated 71%) 2025-12-04T16:24:11.0218996Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-826c2562e296e7cd.xml (deflated 82%) 2025-12-04T16:24:11.0220306Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-738ee4f0dcb0d1ea.xml (deflated 82%) 2025-12-04T16:24:11.0221584Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a05156439a0d1496.xml (deflated 82%) 2025-12-04T16:24:11.0222874Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a884406c7304f17c.xml (deflated 82%) 2025-12-04T16:24:11.0224175Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c03b63e1d9e14442.xml (deflated 82%) 2025-12-04T16:24:11.0225466Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a7e12d722ef68d61.xml (deflated 82%) 2025-12-04T16:24:11.0226757Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-758c1dbc5b36fbbe.xml (deflated 82%) 2025-12-04T16:24:11.0228037Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1356c8f9e8cb48ab.xml (deflated 82%) 2025-12-04T16:24:11.0229339Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3d33fda2cceb2cbd.xml (deflated 82%) 2025-12-04T16:24:11.0230631Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0982db42fabefc11.xml (deflated 82%) 2025-12-04T16:24:11.0232059Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e365e000dec26b6e.xml (deflated 82%) 2025-12-04T16:24:11.0233343Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c05ba049f6d7f244.xml (deflated 82%) 2025-12-04T16:24:11.0234640Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e64a3e20d6733c67.xml (deflated 82%) 2025-12-04T16:24:11.0235940Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-60656dccead73746.xml (deflated 82%) 2025-12-04T16:24:11.0237276Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-74b07a56cd02ebb4.xml (deflated 82%) 2025-12-04T16:24:11.0238559Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97c1aae6502fe9a4.xml (deflated 85%) 2025-12-04T16:24:11.0239856Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9520e13942c1e127.xml (deflated 82%) 2025-12-04T16:24:11.0241150Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5b20aba83da29ea2.xml (deflated 82%) 2025-12-04T16:24:11.0242450Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4f43dfc4bd691e8.xml (deflated 86%) 2025-12-04T16:24:11.0243749Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1ceca9168882f805.xml (deflated 82%) 2025-12-04T16:24:11.0245044Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c6f480cabc56b496.xml (deflated 82%) 2025-12-04T16:24:11.0246336Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9497acc525242c8a.xml (deflated 84%) 2025-12-04T16:24:11.0247632Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-158a50fa606ffe95.xml (deflated 82%) 2025-12-04T16:24:11.0248919Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-27359545a6656d51.xml (deflated 82%) 2025-12-04T16:24:11.0250232Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1f2cd5255ddbd20d.xml (deflated 82%) 2025-12-04T16:24:11.0251522Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6e021b62864c2856.xml (deflated 82%) 2025-12-04T16:24:11.0252812Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-49627e719c8e890b.xml (deflated 82%) 2025-12-04T16:24:11.0254096Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66537c67c1172e95.xml (deflated 82%) 2025-12-04T16:24:11.0255375Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b567c9f6400d034b.xml (deflated 82%) 2025-12-04T16:24:11.0256686Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2dff14642abe4d78.xml (deflated 82%) 2025-12-04T16:24:11.0257990Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65b5f001c721490a.xml (deflated 82%) 2025-12-04T16:24:11.0259353Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-51dd6a289fe92b8c.xml (deflated 82%) 2025-12-04T16:24:11.0260647Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-fcdeca5855991d94.xml (deflated 82%) 2025-12-04T16:24:11.0261934Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-97fc9d1afafb45ba.xml (deflated 82%) 2025-12-04T16:24:11.0263236Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58940be16b8df1f1.xml (deflated 82%) 2025-12-04T16:24:11.0264600Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9349febb39593fe3.xml (deflated 82%) 2025-12-04T16:24:11.0265897Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-610ad64c0d2affb1.xml (deflated 82%) 2025-12-04T16:24:11.0267173Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-535142d7b2475ad0.xml (deflated 82%) 2025-12-04T16:24:11.0268472Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-44b8ebfaa38db266.xml (deflated 82%) 2025-12-04T16:24:11.0269807Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bba5c35945b28810.xml (deflated 82%) 2025-12-04T16:24:11.0271095Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a3c90c249d149d10.xml (deflated 82%) 2025-12-04T16:24:11.0272377Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c896d4e441aec272.xml (deflated 82%) 2025-12-04T16:24:11.0273688Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0f626bbdc83148be.xml (deflated 82%) 2025-12-04T16:24:11.0274984Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e68881511844a645.xml (deflated 82%) 2025-12-04T16:24:11.0276276Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a161840b55d48987.xml (deflated 82%) 2025-12-04T16:24:11.0277563Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-84da9e83d0a63a49.xml (deflated 82%) 2025-12-04T16:24:11.0278849Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5ba1ce04884dd264.xml (deflated 82%) 2025-12-04T16:24:11.0280154Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7aa6851dee05bf26.xml (deflated 82%) 2025-12-04T16:24:11.0281931Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cff0107c6e55c745.xml (deflated 84%) 2025-12-04T16:24:11.0284227Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-65ccdb61caed4adb.xml (deflated 82%) 2025-12-04T16:24:11.0286263Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-332efb6825e8ee00.xml (deflated 82%) 2025-12-04T16:24:11.0288265Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cf5b975b5ed0b41.xml (deflated 82%) 2025-12-04T16:24:11.0289588Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-66c6826e1f29c7ae.xml (deflated 82%) 2025-12-04T16:24:11.0290886Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2a970c00eea5bec9.xml (deflated 82%) 2025-12-04T16:24:11.0292180Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-284ef1fed3dcf877.xml (deflated 82%) 2025-12-04T16:24:11.0293474Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5553e88673e5c7d3.xml (deflated 82%) 2025-12-04T16:24:11.0294768Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-93bd8f6af98a9b4c.xml (deflated 82%) 2025-12-04T16:24:11.0296063Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-58f36c545585b24b.xml (deflated 84%) 2025-12-04T16:24:11.0297341Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-6ef09e48fb8b226a.xml (deflated 82%) 2025-12-04T16:24:11.0298732Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e71119cdd1d43b1e.xml (deflated 82%) 2025-12-04T16:24:11.0300526Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c8dc22bc4fac83a2.xml (deflated 82%) 2025-12-04T16:24:11.0301844Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8170ae01b738ee47.xml (deflated 82%) 2025-12-04T16:24:11.0303143Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0595c5125e4115a2.xml (deflated 82%) 2025-12-04T16:24:11.0304423Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ea830dd4e98e3dce.xml (deflated 84%) 2025-12-04T16:24:11.0305783Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0ddafb1da2805e03.xml (deflated 82%) 2025-12-04T16:24:11.0307079Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3b676b001d5055f4.xml (deflated 82%) 2025-12-04T16:24:11.0308375Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-15a19448251eca38.xml (deflated 82%) 2025-12-04T16:24:11.0309652Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7dd506857cedd350.xml (deflated 82%) 2025-12-04T16:24:11.0310947Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a068bf6354320ba5.xml (deflated 82%) 2025-12-04T16:24:11.0312237Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0bf202cdb375dae7.xml (deflated 82%) 2025-12-04T16:24:11.0313533Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-796bdc853aa8ac88.xml (deflated 82%) 2025-12-04T16:24:11.0314817Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-abef13434279995d.xml (deflated 82%) 2025-12-04T16:24:11.0316116Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-47824ae1e1d0f6cb.xml (deflated 82%) 2025-12-04T16:24:11.0317413Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4038060382adbf9a.xml (deflated 82%) 2025-12-04T16:24:11.0318751Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d0c6827a4c24a96e.xml (deflated 82%) 2025-12-04T16:24:11.0320035Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eb0f3705043be69a.xml (deflated 82%) 2025-12-04T16:24:11.0321338Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b4b4a4cac36fd9fb.xml (deflated 82%) 2025-12-04T16:24:11.0322647Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5f059c7d0f50bda4.xml (deflated 82%) 2025-12-04T16:24:11.0323953Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c9ff80f1a6820ed2.xml (deflated 82%) 2025-12-04T16:24:11.0325259Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c2ef3587eca8dd3f.xml (deflated 82%) 2025-12-04T16:24:11.0326552Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f520a42aa1f82bba.xml (deflated 82%) 2025-12-04T16:24:11.0327847Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-26d6067018821038.xml (deflated 84%) 2025-12-04T16:24:11.0329131Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-323433d4d00bc615.xml (deflated 82%) 2025-12-04T16:24:11.0330423Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-171ced1c6f8950ca.xml (deflated 82%) 2025-12-04T16:24:11.0331710Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-25dfec8b4de20cd4.xml (deflated 82%) 2025-12-04T16:24:11.0333056Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-301e5b291449627e.xml (deflated 82%) 2025-12-04T16:24:11.0334382Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-0b50a11bd83abca6.xml (deflated 82%) 2025-12-04T16:24:11.0335678Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28422965aaa270fa.xml (deflated 82%) 2025-12-04T16:24:11.0336980Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cc1d7ca9ed71a647.xml (deflated 82%) 2025-12-04T16:24:11.0338389Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-28fdda0d819ffd5b.xml (deflated 82%) 2025-12-04T16:24:11.0339699Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2efd0ac46f0ffe8c.xml (deflated 82%) 2025-12-04T16:24:11.0341009Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a06616e9f3ca63a0.xml (deflated 82%) 2025-12-04T16:24:11.0342304Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ac835e731ca0e136.xml (deflated 82%) 2025-12-04T16:24:11.0343593Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-b97506f71fc95dd3.xml (deflated 82%) 2025-12-04T16:24:11.0344891Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e0e0cb20cc598f84.xml (deflated 82%) 2025-12-04T16:24:11.0346187Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7ccf4f5fc7381dbf.xml (deflated 82%) 2025-12-04T16:24:11.0347474Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-9c9073ed00c82380.xml (deflated 82%) 2025-12-04T16:24:11.0348747Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-24dd415c882be612.xml (deflated 82%) 2025-12-04T16:24:11.0350037Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ebb637005a79032b.xml (deflated 82%) 2025-12-04T16:24:11.0351324Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-64323540e9247afe.xml (deflated 82%) 2025-12-04T16:24:11.0352655Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a52cc93387944e77.xml (deflated 82%) 2025-12-04T16:24:11.0353931Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c3ebbd7913c9335f.xml (deflated 82%) 2025-12-04T16:24:11.0355224Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4acaad8a10b8235e.xml (deflated 82%) 2025-12-04T16:24:11.0356528Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-7532210fc6e9ae4a.xml (deflated 82%) 2025-12-04T16:24:11.0357828Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d5d3164b3ad67fad.xml (deflated 82%) 2025-12-04T16:24:11.0359110Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4e5ad1e60a526460.xml (deflated 82%) 2025-12-04T16:24:11.0360407Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-4c3c91d6f895ec60.xml (deflated 82%) 2025-12-04T16:24:11.0361700Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-95a17199eb5946d1.xml (deflated 82%) 2025-12-04T16:24:11.0362989Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-81b9e43a20c07e3d.xml (deflated 82%) 2025-12-04T16:24:11.0364284Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-1de8202a5dd1888a.xml (deflated 82%) 2025-12-04T16:24:11.0365635Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c241afcfaa8e2579.xml (deflated 82%) 2025-12-04T16:24:11.0366950Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dbee4fdc7ea3ecd0.xml (deflated 82%) 2025-12-04T16:24:11.0368246Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-163cc6583cdf0c02.xml (deflated 82%) 2025-12-04T16:24:11.0369545Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a5ca9a61ebc16b36.xml (deflated 82%) 2025-12-04T16:24:11.0370854Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c4c3d903e4063d32.xml (deflated 82%) 2025-12-04T16:24:11.0372145Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-967683d4f4b3f6ea.xml (deflated 82%) 2025-12-04T16:24:11.0373446Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-dd59b77eb3bfc158.xml (deflated 82%) 2025-12-04T16:24:11.0374743Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-16ca540d5e1f852f.xml (deflated 82%) 2025-12-04T16:24:11.0376025Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-3fca4d8c1020266e.xml (deflated 82%) 2025-12-04T16:24:11.0377324Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-d72e902ee6d4abee.xml (deflated 82%) 2025-12-04T16:24:11.0378723Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-ec9f5a8da88c6473.xml (deflated 82%) 2025-12-04T16:24:11.0380030Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-cd768c9b8fb615f9.xml (deflated 82%) 2025-12-04T16:24:11.0381315Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-eed072ccebef54d5.xml (deflated 82%) 2025-12-04T16:24:11.0382624Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-e17832bbca358cbf.xml (deflated 82%) 2025-12-04T16:24:11.0383946Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-2596f02ad0864233.xml (deflated 82%) 2025-12-04T16:24:11.0385280Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-35234e0a942debc1.xml (deflated 82%) 2025-12-04T16:24:11.0386577Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-11b937d6d736747a.xml (deflated 82%) 2025-12-04T16:24:11.0388052Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-5cef76ce1b71d0a4.xml (deflated 82%) 2025-12-04T16:24:11.0389361Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-f5efe4ba459d5b44.xml (deflated 82%) 2025-12-04T16:24:11.0390672Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-92f9442ed796f73f.xml (deflated 82%) 2025-12-04T16:24:11.0391979Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-bcaaee0a01e62e43.xml (deflated 82%) 2025-12-04T16:24:11.0393272Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-76ac61a88acbe22b.xml (deflated 82%) 2025-12-04T16:24:11.0394572Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-8647c42278291c82.xml (deflated 97%) 2025-12-04T16:24:11.0395880Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-a0ddea104bebcf1c.xml (deflated 81%) 2025-12-04T16:24:11.0397193Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-c80d90eb9dbd0365.xml (deflated 81%) 2025-12-04T16:24:11.0398592Z adding: test/test-reports/python-pytest/inductor.test_op_dtype_prop/inductor.test_op_dtype_prop-98804e2c83f300d6.xml (deflated 28%) 2025-12-04T16:24:11.0399831Z adding: test/test-reports/python-pytest/dynamo.test_callback/dynamo.test_callback-042e319b1bc49e9e.xml (deflated 55%) 2025-12-04T16:24:11.0401102Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-9d75f5d2563f16c8.xml (deflated 87%) 2025-12-04T16:24:11.0402455Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-704b849fb73fff94.xml (deflated 88%) 2025-12-04T16:24:11.0403795Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-64c9d19a2b1bbb58.xml (deflated 88%) 2025-12-04T16:24:11.0405190Z adding: test/test-reports/python-pytest/dynamo.test_compiler_bisector/dynamo.test_compiler_bisector-c370485b718ab3d2.xml (deflated 72%) 2025-12-04T16:24:11.0406762Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-cfe1d81d1ed43800.xml (deflated 92%) 2025-12-04T16:24:11.0408540Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-1783f83983666f0c.xml (deflated 55%) 2025-12-04T16:24:11.0410315Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees_expandable_segments/inductor.test_cudagraph_trees_expandable_segments-04ae990c731b58f8.xml (deflated 90%) 2025-12-04T16:24:11.0411938Z adding: test/test-reports/python-pytest/inductor.test_collective_autotuning/inductor.test_collective_autotuning-0f27caa637154718.xml (deflated 62%) 2025-12-04T16:24:11.0413371Z adding: test/test-reports/python-pytest/higher_order_ops.test_local_map/higher_order_ops.test_local_map-9df8631d0f17132c.xml (deflated 80%) 2025-12-04T16:24:11.0414579Z adding: test/test-reports/python-pytest/test_dataloader/test_dataloader-94259eed0293b6e4.xml (deflated 84%) 2025-12-04T16:24:11.0415612Z adding: test/test-reports/python-pytest/test_decomp/test_decomp-f51ad4b3b0738d41.xml (deflated 91%) 2025-12-04T16:24:11.0416570Z adding: test/test-reports/python-pytest/test_decomp/test_decomp-6b78cf4133d2553c.xml (deflated 91%) 2025-12-04T16:24:11.0417580Z adding: test/test-reports/python-pytest/test_decomp/test_decomp-91f7fdd24463f867.xml (deflated 91%) 2025-12-04T16:24:11.0446930Z adding: test/test-reports/python-pytest/test_ops_fwd_gradients/test_ops_fwd_gradients-dba31c3696d52244.xml (deflated 95%) 2025-12-04T16:24:11.0597918Z adding: test/test-reports/python-pytest/test_meta/test_meta-a3131b0e7fce7ef2.xml (deflated 96%) 2025-12-04T16:24:11.0661728Z adding: test/test-reports/python-pytest/test_ops_gradients/test_ops_gradients-8f336b5d99061bf3.xml (deflated 96%) 2025-12-04T16:24:11.0674254Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-77c9be2be0e8d8f0.xml (deflated 96%) 2025-12-04T16:24:11.0682146Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-87fea4734ba7fa90.xml (deflated 96%) 2025-12-04T16:24:11.0690274Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-cdf15fa2abf8389b.xml (deflated 96%) 2025-12-04T16:24:11.0691248Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-301657435283457f.xml (deflated 84%) 2025-12-04T16:24:11.0699319Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-274ad688fc221da2.xml (deflated 96%) 2025-12-04T16:24:11.0706970Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-c003f5ff32bae5e6.xml (deflated 96%) 2025-12-04T16:24:11.0710039Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-e9e5772b33a862c1.xml (deflated 94%) 2025-12-04T16:24:11.0721971Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-67ba282cb3193875.xml (deflated 96%) 2025-12-04T16:24:11.0733980Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-f88fa3b886672d5c.xml (deflated 96%) 2025-12-04T16:24:11.0745456Z adding: test/test-reports/python-pytest/test_linalg/test_linalg-83ecadeff448aeea.xml (deflated 94%) 2025-12-04T16:24:11.0813315Z adding: test/test-reports/python-pytest/test_ops/test_ops-2fce3d3e4cb230c2.xml (deflated 94%) 2025-12-04T16:24:11.0873943Z adding: test/test-reports/python-pytest/test_ops/test_ops-b0df9953dc6aab8d.xml (deflated 94%) 2025-12-04T16:24:11.0913021Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-bc7545635763bbdc.xml (deflated 95%) 2025-12-04T16:24:11.0914174Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-fad3091b7781ac2f.xml (deflated 88%) 2025-12-04T16:24:11.0915407Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-74bf247f22e5b0d6.xml (deflated 88%) 2025-12-04T16:24:11.0916735Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-4c8f61ef5d858418.xml (deflated 88%) 2025-12-04T16:24:11.0917915Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-c19baf682148544c.xml (deflated 87%) 2025-12-04T16:24:11.0919067Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-a8d85ae14c598d51.xml (deflated 87%) 2025-12-04T16:24:11.0920229Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-8c50d88405c7b0b3.xml (deflated 87%) 2025-12-04T16:24:11.0921397Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-dad902e1dd428e6d.xml (deflated 87%) 2025-12-04T16:24:11.0922545Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-d765bceda0ac4437.xml (deflated 87%) 2025-12-04T16:24:11.0924111Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-42fbcb95d0a0c581.xml (deflated 90%) 2025-12-04T16:24:11.0925320Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-da54dbb08fbaa628.xml (deflated 87%) 2025-12-04T16:24:11.0926526Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-90277c9f3d74bea5.xml (deflated 87%) 2025-12-04T16:24:11.0927574Z adding: test/test-reports/python-pytest/test_sparse_csr/test_sparse_csr-559456aeb9be5546.xml (deflated 87%) 2025-12-04T16:24:11.0965040Z adding: test/test-reports/python-pytest/functorch.test_ops/functorch.test_ops-c51c09e8b0651326.xml (deflated 93%) 2025-12-04T16:24:11.0966306Z adding: test/test-reports/python-pytest/inductor.test_group_batch_fusion/inductor.test_group_batch_fusion-c943193060216670.xml (deflated 75%) 2025-12-04T16:24:11.0983058Z adding: test/test-reports/python-pytest/inductor.test_cpu_repro/inductor.test_cpu_repro-2a01ea2f1377f4e8.xml (deflated 97%) 2025-12-04T16:24:11.0990135Z adding: test/test-reports/python-pytest/inductor.test_fused_attention/inductor.test_fused_attention-93034b4b3ea7a646.xml (deflated 96%) 2025-12-04T16:24:11.0993144Z adding: test/test-reports/python-pytest/inductor.test_inductor_freezing/inductor.test_inductor_freezing-a1df3eccdba273d4.xml (deflated 92%) 2025-12-04T16:24:11.1009199Z adding: test/test-reports/python-pytest/dynamo.test_higher_order_ops/dynamo.test_higher_order_ops-53ad11142cf6dd1d.xml (deflated 91%) 2025-12-04T16:24:11.1017441Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor_arrayref/inductor.test_aot_inductor_arrayref-b86518472a5506a3.xml (deflated 93%) 2025-12-04T16:24:11.1018956Z adding: test/test-reports/python-pytest/inductor.test_autoheuristic/inductor.test_autoheuristic-96f2d55568fe5d93.xml (deflated 28%) 2025-12-04T16:24:11.1020262Z adding: test/test-reports/python-pytest/inductor.test_flex_flash/inductor.test_flex_flash-19911cd6feadbffe.xml (deflated 96%) 2025-12-04T16:24:11.1021563Z adding: test/test-reports/python-pytest/inductor.test_segmented_tree/inductor.test_segmented_tree-1f8a73d877a0e0af.xml (deflated 79%) 2025-12-04T16:24:11.1022959Z adding: test/test-reports/python-pytest/inductor.test_kernel_optimization/inductor.test_kernel_optimization-8ba4093a849c99fb.xml (deflated 37%) 2025-12-04T16:24:11.1024281Z adding: test/test-reports/python-pytest/inductor.test_metrics/inductor.test_metrics-ba304e492604cb49.xml (deflated 69%) 2025-12-04T16:24:11.1025733Z adding: test/test-reports/python-pytest/export.test_unflatten_training_ir/export.test_unflatten_training_ir-dcf5c0e2417d22fe.xml (deflated 92%) 2025-12-04T16:24:11.1027063Z adding: test/test-reports/python-pytest/inductor.test_fx_fusion/inductor.test_fx_fusion-740800c94204dd1f.xml (deflated 63%) 2025-12-04T16:24:11.1033186Z adding: test/test-reports/python-pytest/dynamo.test_inline_and_install/dynamo.test_inline_and_install-ee70bed9693bb3f7.xml (deflated 89%) 2025-12-04T16:24:11.1034574Z adding: test/test-reports/python-pytest/inductor.test_inductor_scheduler/inductor.test_inductor_scheduler-f799e81908d876d8.xml (deflated 79%) 2025-12-04T16:24:11.1069787Z adding: test/test-reports/python-pytest/export.test_export_strict/export.test_export_strict-16708e34d1c82f51.xml (deflated 95%) 2025-12-04T16:24:11.1084271Z adding: test/test-reports/python-pytest/inductor.test_compiled_optimizers/inductor.test_compiled_optimizers-8408ff78148caebd.xml (deflated 95%) 2025-12-04T16:24:11.1085710Z adding: test/test-reports/python-pytest/dynamo.test_graph_region_tracker/dynamo.test_graph_region_tracker-e7e3f06a1f8d4793.xml (deflated 84%) 2025-12-04T16:24:11.1086989Z adding: test/test-reports/python-pytest/dynamo.test_unittest/dynamo.test_unittest-1a49598617666801.xml (deflated 38%) 2025-12-04T16:24:11.1088330Z adding: test/test-reports/python-pytest/inductor.test_compile/inductor.test_compile-f393f964f8c8ee63.xml (deflated 83%) 2025-12-04T16:24:11.1106460Z adding: test/test-reports/python-pytest/dynamo.test_functions/dynamo.test_functions-e9d213b32d56adfd.xml (deflated 90%) 2025-12-04T16:24:11.1111314Z adding: test/test-reports/python-pytest/inductor.test_ordered_set/inductor.test_ordered_set-b674b38a5211e06f.xml (deflated 94%) 2025-12-04T16:24:11.1112668Z adding: test/test-reports/python-pytest/dynamo.test_install_free_tensors/dynamo.test_install_free_tensors-4c842a4ed6e31f3f.xml (deflated 82%) 2025-12-04T16:24:11.1114306Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_codegen_config_overrides/inductor.test_torchinductor_codegen_config_overrides-15f0302c88c6ef46.xml (deflated 78%) 2025-12-04T16:24:11.1115788Z adding: test/test-reports/python-pytest/export.test_passes/export.test_passes-d27457a3d9237a22.xml (deflated 83%) 2025-12-04T16:24:11.1119215Z adding: test/test-reports/python-pytest/dynamo.test_autograd_function/dynamo.test_autograd_function-bbde79f54df96d93.xml (deflated 87%) 2025-12-04T16:24:11.1133083Z adding: test/test-reports/python-pytest/inductor.test_codecache/inductor.test_codecache-d532a0bd02778de2.xml (deflated 94%) 2025-12-04T16:24:11.1148806Z adding: test/test-reports/python-pytest/inductor.test_auto_functionalize/inductor.test_auto_functionalize-1caf9ad6930ca9ac.xml (deflated 95%) 2025-12-04T16:24:11.1150277Z adding: test/test-reports/python-pytest/inductor.test_efficient_conv_bn_eval/inductor.test_efficient_conv_bn_eval-93e786f7489a3e47.xml (deflated 65%) 2025-12-04T16:24:11.1151633Z adding: test/test-reports/python-pytest/inductor.test_torchbind/inductor.test_torchbind-9cdc6632b9077562.xml (deflated 81%) 2025-12-04T16:24:11.1152860Z adding: test/test-reports/python-pytest/dynamo.test_flat_apply/dynamo.test_flat_apply-3a6f00a2fe27363c.xml (deflated 63%) 2025-12-04T16:24:11.1154140Z adding: test/test-reports/python-pytest/dynamo.test_input_attr_tracking/dynamo.test_input_attr_tracking-e306635654774769.xml (deflated 79%) 2025-12-04T16:24:11.1155531Z adding: test/test-reports/python-pytest/dynamo.test_graph_deduplication/dynamo.test_graph_deduplication-72be7e1eac6e503d.xml (deflated 87%) 2025-12-04T16:24:11.1156961Z adding: test/test-reports/python-pytest/inductor.test_distributed_patterns/inductor.test_distributed_patterns-7db9a3d44802273a.xml (deflated 87%) 2025-12-04T16:24:11.1158347Z adding: test/test-reports/python-pytest/dynamo.test_bytecode_utils/dynamo.test_bytecode_utils-4132ad7230faf1a5.xml (deflated 81%) 2025-12-04T16:24:11.1159583Z adding: test/test-reports/python-pytest/export.test_nativert/export.test_nativert-bbcdef66a0abda26.xml (deflated 83%) 2025-12-04T16:24:11.1160803Z adding: test/test-reports/python-pytest/export.test_hop/export.test_hop-06f913442140df32.xml (deflated 93%) 2025-12-04T16:24:11.1193936Z adding: test/test-reports/python-pytest/export.test_export/export.test_export-f3f94f66dfdfed6b.xml (deflated 95%) 2025-12-04T16:24:11.1195128Z adding: test/test-reports/python-pytest/test_utils_config_module/test_utils_config_module-5af1ad5b3b73495f.xml (deflated 82%) 2025-12-04T16:24:11.1196251Z adding: test/test-reports/python-pytest/test_hop_infra/test_hop_infra-7733a804aa6d4c14.xml (deflated 56%) 2025-12-04T16:24:11.1197518Z adding: test/test-reports/python-pytest/test_appending_byte_serializer/test_appending_byte_serializer-65501f8364394f57.xml (deflated 61%) 2025-12-04T16:24:11.1198710Z adding: test/test-reports/python-pytest/test_license/test_license-56f2b4cb04cabe36.xml (deflated 50%) 2025-12-04T16:24:11.1199737Z adding: test/test-reports/python-pytest/test_ao_sparsity/test_ao_sparsity-c7bfc0a40ffe86a3.xml (deflated 85%) 2025-12-04T16:24:11.1201443Z adding: test/test-reports/python-pytest/test_proxy_tensor/test_proxy_tensor-25f8d712045ddeef.xml (deflated 90%) 2025-12-04T16:24:11.1202559Z adding: test/test-reports/python-pytest/torch_np.test_dtype/torch_np.test_dtype-a1152bdff0bb5e89.xml (deflated 94%) 2025-12-04T16:24:11.1203651Z adding: test/test-reports/python-pytest/test_file_check/test_file_check-21e904b3669672b6.xml (deflated 48%) 2025-12-04T16:24:11.1204691Z adding: test/test-reports/python-pytest/test_flop_counter/test_flop_counter-83e0741caa94eff2.xml (deflated 81%) 2025-12-04T16:24:11.1205878Z adding: test/test-reports/python-pytest/backends.xeon.test_launch/backends.xeon.test_launch-35338ec25f37484c.xml (deflated 48%) 2025-12-04T16:24:11.1207040Z adding: test/test-reports/python-pytest/test_show_pickle/test_show_pickle-865976a650f0e2c8.xml (deflated 37%) 2025-12-04T16:24:11.1208053Z adding: test/test-reports/python-pytest/test_openmp/test_openmp-ab09b4ea4a98cceb.xml (deflated 48%) 2025-12-04T16:24:11.1209143Z adding: test/test-reports/python-pytest/test_expanded_weights/test_expanded_weights-e8def21a8b982d80.xml (deflated 94%) 2025-12-04T16:24:11.1210371Z adding: test/test-reports/python-pytest/functorch.test_ac_knapsack/functorch.test_ac_knapsack-e93133af03c79b6b.xml (deflated 79%) 2025-12-04T16:24:11.1240793Z adding: test/test-reports/python-pytest/torch_np.test_nep50_examples/torch_np.test_nep50_examples-4b9983ced205ee32.xml (deflated 99%) 2025-12-04T16:24:11.1258302Z adding: test/test-reports/python-pytest/test_torch/test_torch-f1fe0f79113cbdd9.xml (deflated 92%) 2025-12-04T16:24:11.1259625Z adding: test/test-reports/python-pytest/higher_order_ops.test_with_effects/higher_order_ops.test_with_effects-e6b12b25e8a894fd.xml (deflated 83%) 2025-12-04T16:24:11.1260787Z adding: test/test-reports/python-pytest/test_weak/test_weak-d0e879aa5e6c216f.xml (deflated 89%) 2025-12-04T16:24:11.1261744Z adding: test/test-reports/python-pytest/test_complex/test_complex-85789b75771d41fd.xml (deflated 81%) 2025-12-04T16:24:11.1277267Z adding: test/test-reports/python-pytest/test_optim/test_optim-e2224a41ca12c5b8.xml (deflated 94%) 2025-12-04T16:24:11.1278247Z adding: test/test-reports/python-pytest/xpu.test_conv/xpu.test_conv-725b55c77ef2b365.xml (deflated 28%) 2025-12-04T16:24:11.1279358Z adding: test/test-reports/python-pytest/nn.test_packed_sequence/nn.test_packed_sequence-cbb1d818037c9c38.xml (deflated 81%) 2025-12-04T16:24:11.1280487Z adding: test/test-reports/python-pytest/test_numa_binding/test_numa_binding-1b1d6199d4053b8e.xml (deflated 75%) 2025-12-04T16:24:11.1281533Z adding: test/test-reports/python-pytest/test_pruning_op/test_pruning_op-4970668223f4717e.xml (deflated 51%) 2025-12-04T16:24:11.1440064Z adding: test/test-reports/python-pytest/test_jit_fuser_te/test_jit_fuser_te-c194bc30f4bbb26c.xml (deflated 99%) 2025-12-04T16:24:11.1443053Z adding: test/test-reports/python-pytest/test_sympy_utils/test_sympy_utils-922738ab31cc19f0.xml (deflated 93%) 2025-12-04T16:24:11.1444255Z adding: test/test-reports/python-pytest/test_mkldnn_fusion/test_mkldnn_fusion-51238045ef241025.xml (deflated 72%) 2025-12-04T16:24:11.1524530Z adding: test/test-reports/python-pytest/test_schema_check/test_schema_check-489dd02163956a84.xml (deflated 96%) 2025-12-04T16:24:11.1525566Z adding: test/test-reports/python-pytest/test_mkldnn/test_mkldnn-2c6972170cf2deca.xml (deflated 28%) 2025-12-04T16:24:11.1797778Z adding: test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-52555684d34478cf.xml (deflated 97%) 2025-12-04T16:24:11.1798954Z adding: test/test-reports/python-unittest/test_autoload/TEST-TestDeviceBackendAutoload-20251204162351.xml (deflated 42%) 2025-12-04T16:24:11.1826710Z ##[group]Run # Remove any previous usage logs if they exist 2025-12-04T16:24:11.1827257Z # Remove any previous usage logs if they exist 2025-12-04T16:24:11.1827689Z rm -f logs-*.zip 2025-12-04T16:24:11.1828102Z zip "logs-${FILE_SUFFIX}.zip" 'usage_log.txt' || true 2025-12-04T16:24:11.1828703Z zip -r "logs-${FILE_SUFFIX}.zip" test/test-reports -i '*.log' || true 2025-12-04T16:24:11.1835404Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:11.1835846Z env: 2025-12-04T16:24:11.1836091Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:11.1836401Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:11.1836770Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:11.1837420Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:11.1838233Z FILE_SUFFIX: test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T16:24:11.1838811Z ##[endgroup] 2025-12-04T16:24:11.1902768Z adding: usage_log.txt (deflated 58%) 2025-12-04T16:24:11.1988035Z adding: test/test-reports/inductor.test_aot_inductor_2.6_195f61f6b7f24fa5_.log (deflated 95%) 2025-12-04T16:24:11.2017066Z adding: test/test-reports/inductor.test_torchinductor_1.2_9aedcc93d70c8453_.log (deflated 92%) 2025-12-04T16:24:11.2045598Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_3.5_c72f035bd2cef633_.log (deflated 93%) 2025-12-04T16:24:11.2069892Z adding: test/test-reports/inductor.test_torchinductor_codegen_dynamic_shapes_3.4_8476a077570d9691_.log (deflated 93%) 2025-12-04T16:24:11.2076638Z adding: test/test-reports/inductor.test_torchinductor_opinfo_4.17_3a9725e48b5f31f6_.log (deflated 91%) 2025-12-04T16:24:11.2085065Z adding: test/test-reports/inductor.test_torchinductor_opinfo_9.17_99683e4d5a5c32ee_.log (deflated 91%) 2025-12-04T16:24:11.2092777Z adding: test/test-reports/inductor.test_torchinductor_opinfo_14.17_f7d5878415016f45_.log (deflated 90%) 2025-12-04T16:24:11.2098475Z adding: test/test-reports/dynamo.test_activation_checkpointing_1.1_2178259ee06de350_.log (deflated 92%) 2025-12-04T16:24:11.2099433Z adding: test/test-reports/inductor.test_flex_attention_1.1_20245f741d40e72b_.log (deflated 29%) 2025-12-04T16:24:11.2146050Z adding: test/test-reports/inductor.test_cuda_select_algorithm_1.5_fb546dd2d7cc6229_.log (deflated 96%) 2025-12-04T16:24:11.2189044Z adding: test/test-reports/inductor.test_compile_subprocess_1.3_2d3d7d60d67c3c74_.log (deflated 95%) 2025-12-04T16:24:11.2190270Z adding: test/test-reports/inductor.test_deterministic_3.8_b560ed3aac269ff0_.log (deflated 70%) 2025-12-04T16:24:11.2191238Z adding: test/test-reports/inductor.test_deterministic_8.8_75991ffd80c13502_.log (deflated 67%) 2025-12-04T16:24:11.2264718Z adding: test/test-reports/inductor.test_op_dtype_prop_1.2_88d9121bb179f5e4_.log (deflated 98%) 2025-12-04T16:24:11.2265577Z adding: test/test-reports/dynamo.test_callback_1.1_3d6aff0243b6db4a_.log (deflated 58%) 2025-12-04T16:24:11.2269772Z adding: test/test-reports/dynamo.test_compiler_bisector_1.1_331143819e86e237_.log (deflated 94%) 2025-12-04T16:24:11.2280060Z adding: test/test-reports/inductor.test_cudagraph_trees_expandable_segments_1.1_4e86984065370f2d_.log (deflated 89%) 2025-12-04T16:24:11.2281128Z adding: test/test-reports/inductor.test_collective_autotuning_1.1_e338f0430869aa69_.log (deflated 66%) 2025-12-04T16:24:11.2282270Z adding: test/test-reports/higher_order_ops.test_local_map_1.1_356472a63c965b37_.log (deflated 73%) 2025-12-04T16:24:11.2285679Z adding: test/test-reports/test_dataloader_1.2_13cbd3bba7ddab60_.log (deflated 86%) 2025-12-04T16:24:11.2301626Z adding: test/test-reports/test_decomp_4.17_574ee9916c2803b5_.log (deflated 89%) 2025-12-04T16:24:11.2317440Z adding: test/test-reports/test_decomp_9.17_c2558e85bd6204cf_.log (deflated 89%) 2025-12-04T16:24:11.2333407Z adding: test/test-reports/test_decomp_14.17_c45af0ac343f4cce_.log (deflated 89%) 2025-12-04T16:24:11.2378124Z adding: test/test-reports/test_ops_fwd_gradients_2.2_d78504ac4b58b9c4_.log (deflated 94%) 2025-12-04T16:24:11.2587063Z adding: test/test-reports/test_meta_3.5_0884fc4739a02c0f_.log (deflated 93%) 2025-12-04T16:24:11.2658724Z adding: test/test-reports/test_ops_gradients_1.2_48c762d3544cab24_.log (deflated 94%) 2025-12-04T16:24:11.2729069Z adding: test/test-reports/test_linalg_1.1_2a5b6f269d53327b_.log (deflated 96%) 2025-12-04T16:24:11.2815478Z adding: test/test-reports/test_ops_4.11_cc5a88e4afdf55b7_.log (deflated 91%) 2025-12-04T16:24:11.2902238Z adding: test/test-reports/test_ops_9.11_a89556c3277781cc_.log (deflated 91%) 2025-12-04T16:24:11.2998504Z adding: test/test-reports/test_sparse_csr_2.2_9a8f6f98dd0804ce_.log (deflated 95%) 2025-12-04T16:24:11.3039948Z adding: test/test-reports/functorch.test_ops_5.7_4f81116d2780ad42_.log (deflated 92%) 2025-12-04T16:24:11.3041175Z adding: test/test-reports/inductor.test_group_batch_fusion_1.1_8a4a2e0db31add28_.log (deflated 79%) 2025-12-04T16:24:11.3053690Z adding: test/test-reports/inductor.test_cpu_repro_2.3_d08b33d3f2a080e9_.log (deflated 93%) 2025-12-04T16:24:11.3054494Z adding: test/test-reports/inductor.test_smoke_1.1_b380630b8c684182_.log (stored 0%) 2025-12-04T16:24:11.3057452Z adding: test/test-reports/inductor.test_fused_attention_1.1_0d21780331237b8e_.log (deflated 91%) 2025-12-04T16:24:11.3059475Z adding: test/test-reports/inductor.test_inductor_freezing_1.1_a351f78acb34ecfb_.log (deflated 85%) 2025-12-04T16:24:11.3066844Z adding: test/test-reports/dynamo.test_higher_order_ops_1.1_d050168149600c3d_.log (deflated 90%) 2025-12-04T16:24:11.3076103Z adding: test/test-reports/inductor.test_aot_inductor_arrayref_1.2_d25289a5cf5642e3_.log (deflated 90%) 2025-12-04T16:24:11.3077057Z adding: test/test-reports/inductor.test_triton_cpu_backend_1.1_6ce1ac8654c4e721_.log (deflated 8%) 2025-12-04T16:24:11.3077971Z adding: test/test-reports/inductor.test_autoheuristic_1.1_b0f24ef6b05122fc_.log (deflated 50%) 2025-12-04T16:24:11.3078872Z adding: test/test-reports/inductor.test_flex_flash_1.1_f4efdeec9f55a014_.log (deflated 92%) 2025-12-04T16:24:11.3079754Z adding: test/test-reports/inductor.test_segmented_tree_1.1_902748e42a4b3c21_.log (deflated 74%) 2025-12-04T16:24:11.3080685Z adding: test/test-reports/inductor.test_kernel_optimization_1.1_b508de360be55d82_.log (deflated 54%) 2025-12-04T16:24:11.3081591Z adding: test/test-reports/inductor.test_metrics_1.1_0adb01e2e59a8204_.log (deflated 64%) 2025-12-04T16:24:11.3082992Z adding: test/test-reports/export.test_unflatten_training_ir_1.1_abb2865a8ff8868a_.log (deflated 85%) 2025-12-04T16:24:11.3083894Z adding: test/test-reports/inductor.test_fx_fusion_1.1_98b847c3047c6d7b_.log (deflated 62%) 2025-12-04T16:24:11.3098398Z adding: test/test-reports/dynamo.test_inline_and_install_1.1_9d45f7d839a55bb8_.log (deflated 92%) 2025-12-04T16:24:11.3099335Z adding: test/test-reports/inductor.test_inductor_scheduler_1.1_5f88a9d31f2f290d_.log (deflated 73%) 2025-12-04T16:24:11.3139820Z adding: test/test-reports/export.test_export_strict_1.1_4ea191e22f87eaf9_.log (deflated 91%) 2025-12-04T16:24:11.3140882Z adding: test/test-reports/inductor.test_compiled_optimizers_3.3_f162c7ad1675ca04_.log (deflated 92%) 2025-12-04T16:24:11.3141844Z adding: test/test-reports/dynamo.test_graph_region_tracker_1.1_f5ae5cf5f828f9b1_.log (deflated 77%) 2025-12-04T16:24:11.3142883Z adding: test/test-reports/dynamo.test_unittest_1.1_d68e34e45e740e8c_.log (deflated 51%) 2025-12-04T16:24:11.3143718Z adding: test/test-reports/inductor.test_compile_1.1_b13c9a106be3e4f3_.log (deflated 73%) 2025-12-04T16:24:11.3148743Z adding: test/test-reports/dynamo.test_functions_1.1_6ca2c48a93208d7e_.log (deflated 88%) 2025-12-04T16:24:11.3157694Z adding: test/test-reports/inductor.test_ordered_set_1.1_77140872da5a7126_.log (deflated 91%) 2025-12-04T16:24:11.3158538Z adding: test/test-reports/inductor.test_pallas_1.1_cb3d6eceb8711a40_.log (deflated 8%) 2025-12-04T16:24:11.3159536Z adding: test/test-reports/dynamo.test_install_free_tensors_1.1_dc3bcfdcc8893d8d_.log (deflated 82%) 2025-12-04T16:24:11.3160584Z adding: test/test-reports/inductor.test_torchinductor_codegen_config_overrides_1.1_1597e2f7db8ab1bf_.log (deflated 68%) 2025-12-04T16:24:11.3161981Z adding: test/test-reports/export.test_passes_1.1_5fdccc59815cb06b_.log (deflated 89%) 2025-12-04T16:24:11.3163738Z adding: test/test-reports/dynamo.test_autograd_function_1.1_12e04931a7d6b73d_.log (deflated 83%) 2025-12-04T16:24:11.3174191Z adding: test/test-reports/inductor.test_codecache_1.1_750e07d8e3fe4035_.log (deflated 92%) 2025-12-04T16:24:11.3175658Z adding: test/test-reports/inductor.test_auto_functionalize_1.1_de84d14995b669d1_.log (deflated 85%) 2025-12-04T16:24:11.3176615Z adding: test/test-reports/inductor.test_efficient_conv_bn_eval_1.1_efda8428304aed40_.log (deflated 56%) 2025-12-04T16:24:11.3179344Z adding: test/test-reports/inductor.test_torchbind_1.1_fc2e846084c707a2_.log (deflated 92%) 2025-12-04T16:24:11.3180195Z adding: test/test-reports/dynamo.test_flat_apply_1.1_ce4f5252908fcc19_.log (deflated 61%) 2025-12-04T16:24:11.3181076Z adding: test/test-reports/dynamo.test_input_attr_tracking_1.1_c0c1747b0b64a184_.log (deflated 77%) 2025-12-04T16:24:11.3181994Z adding: test/test-reports/dynamo.test_graph_deduplication_1.1_1391af197500fccc_.log (deflated 80%) 2025-12-04T16:24:11.3183065Z adding: test/test-reports/inductor.test_distributed_patterns_1.1_a3ab6d55458abecf_.log (deflated 81%) 2025-12-04T16:24:11.3184007Z adding: test/test-reports/dynamo.test_bytecode_utils_1.1_b447a5fdf5287190_.log (deflated 79%) 2025-12-04T16:24:11.3184916Z adding: test/test-reports/export.test_nativert_1.1_85e109ef4797d863_.log (deflated 67%) 2025-12-04T16:24:11.3194307Z adding: test/test-reports/export.test_hop_1.1_4e0a4ce0cd3157fe_.log (deflated 95%) 2025-12-04T16:24:11.3211308Z adding: test/test-reports/export.test_export_1.1_0d008746ac2eaaa3_.log (deflated 86%) 2025-12-04T16:24:11.3212261Z adding: test/test-reports/test_utils_config_module_1.1_ab32cc157b19bec7_.log (deflated 80%) 2025-12-04T16:24:11.3213055Z adding: test/test-reports/test_hop_infra_1.1_6308f2f4df0add62_.log (deflated 57%) 2025-12-04T16:24:11.3213887Z adding: test/test-reports/test_appending_byte_serializer_1.1_b51fec74da87aec6_.log (deflated 62%) 2025-12-04T16:24:11.3214707Z adding: test/test-reports/test_license_1.1_af7f8f91f8ff69a5_.log (deflated 52%) 2025-12-04T16:24:11.3217353Z adding: test/test-reports/test_ao_sparsity_1.1_435c6c94cb7afb29_.log (deflated 87%) 2025-12-04T16:24:11.3222501Z adding: test/test-reports/test_proxy_tensor_1.1_6f0b68fe764e3723_.log (deflated 88%) 2025-12-04T16:24:11.3223790Z adding: test/test-reports/torch_np.test_dtype_1.1_d9bfb103178d1533_.log (deflated 88%) 2025-12-04T16:24:11.3224550Z adding: test/test-reports/test_file_check_1.1_1900e06915def041_.log (deflated 53%) 2025-12-04T16:24:11.3225510Z adding: test/test-reports/test_flop_counter_1.1_4d9092c128608eaa_.log (deflated 75%) 2025-12-04T16:24:11.3226340Z adding: test/test-reports/backends.xeon.test_launch_1.1_fba14d6e358f5987_.log (deflated 52%) 2025-12-04T16:24:11.3227146Z adding: test/test-reports/test_show_pickle_1.1_979efe7958d3bd6b_.log (deflated 50%) 2025-12-04T16:24:11.3227897Z adding: test/test-reports/test_openmp_1.1_db907b0eb0691969_.log (deflated 53%) 2025-12-04T16:24:11.3234053Z adding: test/test-reports/test_expanded_weights_1.1_db5bdda645780846_.log (deflated 93%) 2025-12-04T16:24:11.3235224Z adding: test/test-reports/functorch.test_ac_knapsack_1.1_69e4982e1e9d819e_.log (deflated 78%) 2025-12-04T16:24:11.3265566Z adding: test/test-reports/torch_np.test_nep50_examples_1.1_7e6125dc6a887b5d_.log (deflated 96%) 2025-12-04T16:24:11.3291134Z adding: test/test-reports/test_torch_1.1_3a3aff20d8da01a9_.log (deflated 91%) 2025-12-04T16:24:11.3292412Z adding: test/test-reports/higher_order_ops.test_with_effects_1.1_2e0c561acdbd87a0_.log (deflated 79%) 2025-12-04T16:24:11.3293632Z adding: test/test-reports/test_weak_1.1_9028b921eb7afade_.log (deflated 82%) 2025-12-04T16:24:11.3294538Z adding: test/test-reports/test_complex_1.1_f2afd153be3495f4_.log (deflated 76%) 2025-12-04T16:24:11.3320237Z adding: test/test-reports/test_optim_1.1_ef0be9bf78c331a7_.log (deflated 93%) 2025-12-04T16:24:11.3320966Z adding: test/test-reports/xpu.test_conv_1.1_02560067eb16cf7d_.log (deflated 48%) 2025-12-04T16:24:11.3321920Z adding: test/test-reports/nn.test_packed_sequence_1.1_69614db4f9bab953_.log (deflated 75%) 2025-12-04T16:24:11.3325046Z adding: test/test-reports/test_numa_binding_1.1_69471f0d5f25dce0_.log (deflated 91%) 2025-12-04T16:24:11.3325801Z adding: test/test-reports/test_pruning_op_1.1_67dd68b7168ba8bb_.log (deflated 55%) 2025-12-04T16:24:11.3470350Z adding: test/test-reports/test_jit_fuser_te_1.1_986011f95e3b5bd3_.log (deflated 96%) 2025-12-04T16:24:11.3475598Z adding: test/test-reports/test_sympy_utils_1.1_8a60d70855d2b894_.log (deflated 90%) 2025-12-04T16:24:11.3476442Z adding: test/test-reports/test_mkldnn_fusion_1.1_e7401bbd89a7874f_.log (deflated 65%) 2025-12-04T16:24:11.3618898Z adding: test/test-reports/test_schema_check_1.1_a822f87ec4dc8cdd_.log (deflated 96%) 2025-12-04T16:24:11.3619649Z adding: test/test-reports/test_mkldnn_1.1_15c2989d3d5eef99_.log (deflated 48%) 2025-12-04T16:24:11.4155295Z adding: test/test-reports/test_unary_ufuncs_1.1_47fb7c62f96091ec_.log (deflated 96%) 2025-12-04T16:24:11.4183503Z ##[group]Run # Remove any previous debugging artifacts if they exist 2025-12-04T16:24:11.4184133Z # Remove any previous debugging artifacts if they exist 2025-12-04T16:24:11.4184596Z rm -f debug-*.zip 2025-12-04T16:24:11.4185002Z if [ -d 'test/debug' ]; then 2025-12-04T16:24:11.4185422Z  zip -r "debug-${FILE_SUFFIX}.zip" test/debug 2025-12-04T16:24:11.4185811Z fi 2025-12-04T16:24:11.4192639Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:11.4193080Z env: 2025-12-04T16:24:11.4193314Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:11.4193630Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:11.4193994Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:11.4194647Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:11.4195441Z FILE_SUFFIX: test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259 2025-12-04T16:24:11.4196020Z ##[endgroup] 2025-12-04T16:24:11.4281496Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-12-04T16:24:11.4281883Z with: 2025-12-04T16:24:11.4282135Z s3-bucket: gha-artifacts 2025-12-04T16:24:11.4282500Z s3-prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:11.4282909Z retention-days: 14 2025-12-04T16:24:11.4283188Z if-no-files-found: warn 2025-12-04T16:24:11.4283496Z path: test-jsons-*.zip 2025-12-04T16:24:11.4283785Z name: artifact 2025-12-04T16:24:11.4284030Z region: us-east-1 2025-12-04T16:24:11.4284282Z env: 2025-12-04T16:24:11.4284520Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:11.4284811Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:11.4285170Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:11.4285826Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:11.4286396Z ##[endgroup] 2025-12-04T16:24:11.8152503Z NOTE: s3-prefix specified, ignoring name parameter 2025-12-04T16:24:11.8153045Z With the provided path, there will be 1 file uploaded 2025-12-04T16:24:11.8153880Z Uploading to s3 prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:11.8207945Z Starting upload of test-jsons-test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259.zip 2025-12-04T16:24:11.9725773Z Finished upload of test-jsons-test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259.zip 2025-12-04T16:24:11.9930162Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-12-04T16:24:11.9930546Z with: 2025-12-04T16:24:11.9930801Z s3-bucket: gha-artifacts 2025-12-04T16:24:11.9931176Z s3-prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:11.9931694Z retention-days: 14 2025-12-04T16:24:11.9931975Z if-no-files-found: error 2025-12-04T16:24:11.9932295Z path: test-reports-*.zip 2025-12-04T16:24:11.9932594Z name: artifact 2025-12-04T16:24:11.9932841Z region: us-east-1 2025-12-04T16:24:11.9933097Z env: 2025-12-04T16:24:11.9933333Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:11.9933624Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:11.9934001Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:11.9934657Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:11.9935226Z ##[endgroup] 2025-12-04T16:24:12.3880542Z NOTE: s3-prefix specified, ignoring name parameter 2025-12-04T16:24:12.3881074Z With the provided path, there will be 1 file uploaded 2025-12-04T16:24:12.3881593Z Uploading to s3 prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:12.3935657Z Starting upload of test-reports-test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259.zip 2025-12-04T16:24:12.6310613Z Finished upload of test-reports-test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259.zip 2025-12-04T16:24:12.6518528Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-12-04T16:24:12.6518923Z with: 2025-12-04T16:24:12.6519183Z s3-bucket: gha-artifacts 2025-12-04T16:24:12.6519548Z s3-prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:12.6519963Z retention-days: 14 2025-12-04T16:24:12.6520278Z if-no-files-found: ignore 2025-12-04T16:24:12.6520580Z path: logs-*.zip 2025-12-04T16:24:12.6520846Z name: artifact 2025-12-04T16:24:12.6521108Z region: us-east-1 2025-12-04T16:24:12.6521479Z env: 2025-12-04T16:24:12.6521711Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:12.6522023Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:12.6522395Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:12.6523044Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:12.6523640Z ##[endgroup] 2025-12-04T16:24:13.0181244Z NOTE: s3-prefix specified, ignoring name parameter 2025-12-04T16:24:13.0181801Z With the provided path, there will be 1 file uploaded 2025-12-04T16:24:13.0182336Z Uploading to s3 prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:13.0236841Z Starting upload of logs-test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259.zip 2025-12-04T16:24:13.2107224Z Finished upload of logs-test-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu_57119749259.zip 2025-12-04T16:24:13.2313569Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-12-04T16:24:13.2313949Z with: 2025-12-04T16:24:13.2314216Z s3-bucket: gha-artifacts 2025-12-04T16:24:13.2314585Z s3-prefix: pytorch/pytorch/19922826259/1/artifact 2025-12-04T16:24:13.2314994Z retention-days: 14 2025-12-04T16:24:13.2315277Z if-no-files-found: ignore 2025-12-04T16:24:13.2315587Z path: debug-*.zip 2025-12-04T16:24:13.2315854Z name: artifact 2025-12-04T16:24:13.2316111Z region: us-east-1 2025-12-04T16:24:13.2316385Z env: 2025-12-04T16:24:13.2316627Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:13.2316922Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:13.2317293Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:13.2317953Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:13.2318542Z ##[endgroup] 2025-12-04T16:24:13.5913663Z No files were found with the provided path: debug-*.zip. No artifacts will be uploaded. 2025-12-04T16:24:13.6113752Z ##[group]Run # shellcheck disable=SC2156 2025-12-04T16:24:13.6114202Z # shellcheck disable=SC2156 2025-12-04T16:24:13.6114908Z find . -iname "core.[1-9]*" -exec docker exec "${DOCKER_CONTAINER_ID}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2025-12-04T16:24:13.6121930Z shell: /usr/bin/bash -e {0} 2025-12-04T16:24:13.6122245Z env: 2025-12-04T16:24:13.6122496Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:13.6122795Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:13.6123265Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:13.6123927Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:13.6124519Z ##[endgroup] 2025-12-04T16:24:13.9834121Z ##[group]Run seemethere/upload-artifact-s3@baba72d0712b404f646cebe0730933554ebce96a 2025-12-04T16:24:13.9834727Z with: 2025-12-04T16:24:13.9835143Z name: coredumps-legacy_nvidia_driver-2-5-linux.g4dn.4xlarge.nvidia.gpu 2025-12-04T16:24:13.9835657Z retention-days: 14 2025-12-04T16:24:13.9835957Z if-no-files-found: ignore 2025-12-04T16:24:13.9836282Z path: ./**/core.[1-9]* 2025-12-04T16:24:13.9836585Z s3-bucket: gha-artifacts 2025-12-04T16:24:13.9836893Z region: us-east-1 2025-12-04T16:24:13.9837160Z env: 2025-12-04T16:24:13.9837392Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:13.9837701Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:13.9838069Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:13.9838715Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:13.9839312Z ##[endgroup] 2025-12-04T16:24:23.0447747Z No files were found with the provided path: ./**/core.[1-9]*. No artifacts will be uploaded. 2025-12-04T16:24:23.0780944Z Prepare all required actions 2025-12-04T16:24:23.0781411Z Getting action download info 2025-12-04T16:24:23.3113067Z Download action repository 'actions/setup-python@v6' (SHA:83679a892e2d95755f2dac6acb0bfd1e9ac5d548) 2025-12-04T16:24:23.6188737Z ##[group]Run ./.github/actions/upload-utilization-stats 2025-12-04T16:24:23.6189169Z with: 2025-12-04T16:24:23.6189401Z job_id: 57119749259 2025-12-04T16:24:23.6190219Z job_name: linux-jammy-cuda12.4-py3.10-gcc11 / test (legacy_nvidia_driver, 2, 5, linux.g4dn.4xlarge.nvidia.gpu, mem_leak_check, unstable) 2025-12-04T16:24:23.6191019Z workflow_name: periodic 2025-12-04T16:24:23.6191319Z workflow_run_id: 19922826259 2025-12-04T16:24:23.6191639Z workflow_attempt: 1 2025-12-04T16:24:23.6191910Z env: 2025-12-04T16:24:23.6192149Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:23.6192460Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:23.6192829Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:23.6193530Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:23.6194116Z ##[endgroup] 2025-12-04T16:24:23.6237142Z ##[group]Run actions/setup-python@v6 2025-12-04T16:24:23.6237502Z with: 2025-12-04T16:24:23.6237763Z python-version: 3.10 2025-12-04T16:24:23.6238046Z check-latest: false 2025-12-04T16:24:23.6238437Z token: *** 2025-12-04T16:24:23.6238705Z update-environment: true 2025-12-04T16:24:23.6239043Z allow-prereleases: false 2025-12-04T16:24:23.6239340Z freethreaded: false 2025-12-04T16:24:23.6239614Z env: 2025-12-04T16:24:23.6239855Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:23.6240142Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:23.6240504Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:23.6241159Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:23.6241736Z ##[endgroup] 2025-12-04T16:24:24.1172175Z ##[group]Installed versions 2025-12-04T16:24:24.1182291Z Version 3.10 was not found in the local cache 2025-12-04T16:24:24.1483917Z (node:309939) [DEP0040] DeprecationWarning: The `punycode` module is deprecated. Please use a userland alternative instead. 2025-12-04T16:24:24.1484996Z (Use `node --trace-deprecation ...` to show where the warning was created) 2025-12-04T16:24:24.5520658Z ##[error]The version '3.10' with architecture 'x64' was not found for this operating system. The list of all available versions can be found here: https://raw.githubusercontent.com/actions/python-versions/main/versions-manifest.json 2025-12-04T16:24:24.5706102Z ##[group]Run pytorch/test-infra/.github/actions/teardown-linux@main 2025-12-04T16:24:24.5706615Z with: 2025-12-04T16:24:24.5706848Z env: 2025-12-04T16:24:24.5707082Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:24.5707507Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:24.5707877Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:24.5708527Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:24.5709444Z ##[endgroup] 2025-12-04T16:24:24.5727669Z ##[group]Run set -eou pipefail 2025-12-04T16:24:24.5728035Z set -eou pipefail 2025-12-04T16:24:24.5728341Z  2025-12-04T16:24:24.5728771Z echo "Holding runner for 2 hours until all ssh sessions have logged out" 2025-12-04T16:24:24.5729320Z for _ in $(seq 1440); do 2025-12-04T16:24:24.5729693Z  # Break if no ssh session exists anymore 2025-12-04T16:24:24.5730107Z  if [ "$(who)" = "" ]; then 2025-12-04T16:24:24.5730491Z  break 2025-12-04T16:24:24.5730765Z  fi 2025-12-04T16:24:24.5731028Z  echo "." 2025-12-04T16:24:24.5731306Z  sleep 5 2025-12-04T16:24:24.5731564Z done 2025-12-04T16:24:24.5738380Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:24.5738842Z env: 2025-12-04T16:24:24.5739093Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:24.5739392Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:24.5739760Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:24.5740419Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:24.5741010Z ##[endgroup] 2025-12-04T16:24:24.5767773Z Holding runner for 2 hours until all ssh sessions have logged out 2025-12-04T16:24:24.5862379Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T16:24:24.5863128Z # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T16:24:24.5863641Z # shellcheck disable=SC2046 2025-12-04T16:24:24.5864033Z docker stop $(docker ps -q) || true 2025-12-04T16:24:24.5864437Z # Prune all of the docker images 2025-12-04T16:24:24.5864804Z docker system prune -af 2025-12-04T16:24:24.5871099Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:24.5871536Z env: 2025-12-04T16:24:24.5871767Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:24.5872074Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:24.5872437Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:24.5873075Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:24.5873660Z ##[endgroup] 2025-12-04T16:24:35.9204716Z fffd24d16954 2025-12-04T16:24:39.5543356Z Deleted Containers: 2025-12-04T16:24:39.5543880Z fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:39.5544301Z 2025-12-04T16:24:46.4194077Z Deleted Images: 2025-12-04T16:24:46.4199203Z untagged: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-cuda12.4-cudnn9-py3-gcc11-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T16:24:46.4200764Z untagged: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image@sha256:ae30f11a5b50741bd652aa0c94ad89ef791c4e50157eff642748620825cf7940 2025-12-04T16:24:46.4201864Z deleted: sha256:5465aa79632b68f6240c23f0d0b021df4d0fd595333b61a40d36a0cf73656024 2025-12-04T16:24:46.4202620Z deleted: sha256:f57a578c46f36a858c2be92210a89558688ee36b619af78c698952c0e3ef05ad 2025-12-04T16:24:46.4203511Z deleted: sha256:ce0698bd1efc811ccead0ecdad944b4839bf17bff387495b58e64cf8db0e210c 2025-12-04T16:24:46.4204294Z deleted: sha256:f0ee66f328fa98c40f336c64fee9a4b42e51a793cceea7f81932068bdc7bd315 2025-12-04T16:24:46.4205065Z deleted: sha256:ea24b30a25c161bd4bd564bfd90c36d88674a1aa59ef3e65647e926c76685be0 2025-12-04T16:24:46.4205838Z deleted: sha256:15bc0847ce5e60cc1a9b36d25283dc5648fb45e04aa9a8dec984af3c193e2f0b 2025-12-04T16:24:46.4206942Z deleted: sha256:3639aa26691090ef45641c75bffcb2e3f427f5e282abc93d607de4433bf90488 2025-12-04T16:24:46.4207710Z deleted: sha256:86258272ba477934c917d08b21e0da6000c268b60f5a9ae907038e7bf3236532 2025-12-04T16:24:46.4208471Z deleted: sha256:ba8e0040c98ddbf87acbc3ae6575b2933c09421ac7094a96e027d1fc9356fbb6 2025-12-04T16:24:46.4209461Z deleted: sha256:ca0176fc0de6cc059c4dbfc313434b5dea2c90dc24f2dc3a1061b941c7b3e6ca 2025-12-04T16:24:46.4210229Z deleted: sha256:cc6a480ab9e6091c6c206bc9b340611b3863258975e835769bd8f2a38b5d8c13 2025-12-04T16:24:46.4210987Z deleted: sha256:8465c24f0b284d8589ea191edeb80d1da07e4a59dfcfdcfa153bdf3d5d678d3e 2025-12-04T16:24:46.4211749Z deleted: sha256:b93bfbd3b55899c606fb98c5edbd21fd63114862a4f5a5b67c7aa63fc9ada9a3 2025-12-04T16:24:46.4212519Z deleted: sha256:6b7582e3ce445d82e9d2ae7769502119c39c1edbf5fe11c195615db8da846931 2025-12-04T16:24:46.4213281Z deleted: sha256:9d79615a9d9ae67110cc9da697933492b385b1e4708d30c2211625bea5d42f27 2025-12-04T16:24:46.4214143Z deleted: sha256:7132c6db5e7d5692786167dfb22dea62d8203dc7837b2d1de435c6e5c85e906e 2025-12-04T16:24:46.4215072Z deleted: sha256:d61bc13a0957d633ff633186c6cbdf48da1c551991d814281262e58709e225a8 2025-12-04T16:24:46.4215834Z deleted: sha256:0c348bbc3988acd329b3e42de4d2c73d5dc4942618716ca312d389d4f704f4bb 2025-12-04T16:24:46.4216602Z deleted: sha256:28d30dd15686ab6819c2f03388c9999bbdaef35e8756817297d795e00dd623fc 2025-12-04T16:24:46.4217346Z deleted: sha256:0a57608df6cffb31a0b24f2537b4dfe7a55bbe6ea02216703cc3172062ab9d75 2025-12-04T16:24:46.4218118Z deleted: sha256:43d23f49f4d70a54b4aff6f4f10d5c5a3d75b100abbbf281ad510177cc80cd99 2025-12-04T16:24:46.4218986Z deleted: sha256:f9e33c2e4c7b8e7179fba052da4d7c4acdc8287f253c95328ae04055755f88a4 2025-12-04T16:24:46.4219756Z deleted: sha256:cfce0930cf33c7136fc92511b9bcad570958363b55e9e0c82e9b8ebc29301356 2025-12-04T16:24:46.4220508Z deleted: sha256:9a709ae20528f500f51271ad2ce6a3d7196fe814a28ae73881901ecef9748c2a 2025-12-04T16:24:46.4221382Z deleted: sha256:68a1d16e9392be6fe939a58c5f941a0919408b5852e52cb04027b0b8777e2b0e 2025-12-04T16:24:46.4222142Z deleted: sha256:042a0022b3eea78f54015f4cf2888bcfa3b91deb0b08830a33c2814b93285dd9 2025-12-04T16:24:46.4222907Z deleted: sha256:a7ba703ff0aa305a608f3b4afd89c2ecd0d1244b127629145a2e691490abb271 2025-12-04T16:24:46.4223679Z deleted: sha256:be44f5fbae55066faba60eebf7065a082abf517ab8f2ebf8ece69e74d45def07 2025-12-04T16:24:46.4224556Z deleted: sha256:a01f1b0d88a8936d648f78787f56579bdb6617edf4620d0410ab6b118351bbb2 2025-12-04T16:24:46.4225317Z deleted: sha256:dc93f45553adafb5c6e7473711c833996f6884dab2da708ffc76b5cf65b8db9d 2025-12-04T16:24:46.4226072Z deleted: sha256:ffdba9ecb5890a9cb23368d781ff5484270b7f13c6d5629feca3512b58b9a0ac 2025-12-04T16:24:46.4226822Z deleted: sha256:268a91c420865628895871795b524436f5cc4403aa53d71f457db21bf42dd530 2025-12-04T16:24:46.4227568Z deleted: sha256:72450bfd97986ccc53d8fa76252130b464fdb3c5fd8e688546e8c3ce0b9d4394 2025-12-04T16:24:46.4228335Z deleted: sha256:63954235d3be0420af6ad2dae2b24849e3eee1edb10cf86d29137c3e19621f47 2025-12-04T16:24:46.4229089Z deleted: sha256:1c4e2d3e68e8a166d1965962077fe194ea00cad2ee636399c0c17ba5a94bdb9c 2025-12-04T16:24:46.4229865Z deleted: sha256:361cacbab7154a0cb62486f57d75b112feedbcc751a7d8f7bb02ec7a61b1fe0d 2025-12-04T16:24:46.4230636Z deleted: sha256:e653f6af92265f4300717bd617aab954cfbf049d4be32e890e57c2e8135be7f9 2025-12-04T16:24:46.4231383Z deleted: sha256:bfffeb2974ffc58c0669724812f701df860257ac3d047a7315a100beb0ea0507 2025-12-04T16:24:46.4232153Z deleted: sha256:6ae48d8efc75420f721058928fe8b1ccf48aa1bdc92de539b1f0db9248a41fcf 2025-12-04T16:24:46.4233053Z deleted: sha256:535c7026785a690366fc69ecbc9a81f1b58a46f63c782620591c1297406a2731 2025-12-04T16:24:46.4234167Z deleted: sha256:8462076c3cc8db6030f38e1137bfbef1aad85404ed4231285c1e06cd414d3e57 2025-12-04T16:24:46.4234925Z deleted: sha256:fe340d63ccb66e5b395b7900c1002a513e4afd7f610e9df5e7262c4f71e93bef 2025-12-04T16:24:46.4235866Z deleted: sha256:b61085386114396fe42144a4aa739b2a0b45f0c30a083462a2ea7b9b675c02aa 2025-12-04T16:24:46.4236782Z deleted: sha256:7772f25c05bcd5ede631d287b826aa108db67c773e377db98ffa73b0917f3629 2025-12-04T16:24:46.4237543Z deleted: sha256:3ea8a43d8193d05ecd6aa473b523a3569e11ae691eed9e6ffd693f23b0106035 2025-12-04T16:24:46.4238309Z deleted: sha256:34647b4087d29cf48a18668bb935a95fc8b2dac3522c2581397f0f27227047fd 2025-12-04T16:24:46.4239119Z deleted: sha256:b6a169f1ab01281c16562ad43b462a1a47a33be8d3cfae0a117ffa5c47d0b532 2025-12-04T16:24:46.4239887Z deleted: sha256:664173a33cd21248a2d73d2eba7887602e36fbc96002d991eb0bd0a2d574ac88 2025-12-04T16:24:46.4240641Z deleted: sha256:d67fdfe94c9a0228f17991cd3e958e36da96d4d597b46773cb7eed98c489f947 2025-12-04T16:24:46.4241407Z deleted: sha256:f2be0722250908742f067756b56ed3fa169daa2f1c8201a7ed4335b2fed2cae5 2025-12-04T16:24:46.4242171Z deleted: sha256:8614db257d8dc9e0f0ee8398a4a4d3c061b2797d6017daaf0696dd7f87633b3e 2025-12-04T16:24:46.4242960Z deleted: sha256:23ee0908a1bf254f1d4dd0591cc0c6801571b4d93950b6fd4fee57ca7e361da0 2025-12-04T16:24:46.4243732Z deleted: sha256:f627a99df4c0f370bd7fc8ea6be7695d8027f988aed52b65233cbcf78b01989b 2025-12-04T16:24:46.4244498Z deleted: sha256:d5e92389b59d4134cdb96113af964186602e98c392e76a8f26d4ea6e54056ccc 2025-12-04T16:24:46.4245269Z deleted: sha256:cbfccf44b9dc670c109634fbf19c2bfff2a3d5243bfa351c851d9fad3f1acfc2 2025-12-04T16:24:46.4246026Z deleted: sha256:1242535e81ad4bd713910a6c5e1b38375b12ed1bcd1b48419813a5ef28a5c84c 2025-12-04T16:24:46.4246781Z deleted: sha256:10b1394079cfe756a1ad9aa9aa3a2995bd5e46ef1e18029eb9eae0398f6d4e88 2025-12-04T16:24:46.4247535Z deleted: sha256:1d32da9a5f10e10c4a97a839151a1943d4db18494e8080bea91a6c9784fde067 2025-12-04T16:24:46.4248293Z deleted: sha256:af2fd59653ebd685a032ef800f8227c0d7b9b0e5ef397b30d4301e001c943e8b 2025-12-04T16:24:46.4249047Z deleted: sha256:c48d351980e3bd24d533ae55d1acc6a27911dffcbb03b2ae552d7ccc3e4cd74f 2025-12-04T16:24:46.4249810Z deleted: sha256:e663afac609b1b6c812ab45265c27d870b92c9fc6849939f0b8635da83cbfb53 2025-12-04T16:24:46.4250619Z deleted: sha256:f79dc17668331d4214ef24000d5c54a0bb2ba70f152d8523f571e2b76a303f4f 2025-12-04T16:24:46.4251419Z deleted: sha256:00de9606a6cd2a2dfb4ceffcb076474d027a1f6273894677090aee7478035865 2025-12-04T16:24:46.4252298Z deleted: sha256:cf35fe1d0317253b75ee17c12783c2561faebf9bf2c59c07ad4712c053246586 2025-12-04T16:24:46.4253060Z deleted: sha256:06622801490739d9db884c23c05a31a1ee86c41e888b34c3ccef23d37f2bdbb5 2025-12-04T16:24:46.4253820Z deleted: sha256:df5dafcaee865ddfb66e22075c63769836e01a627d6fe46658b6f4b4a25318d3 2025-12-04T16:24:46.4254589Z deleted: sha256:7949ae5c4df921feb0e2cd7bac1e402e1ab9135e758fa41cd567880b354b40bc 2025-12-04T16:24:46.4255348Z deleted: sha256:9f19148d820adb1d6e86d0ce68e21fbcedafa7c7ec6c45c9004fa3a607096923 2025-12-04T16:24:46.4256121Z deleted: sha256:1d37d963e85ce22ffaab56a1cf35b3411f34f9432dc5e49ebbdf6f30816cdfa8 2025-12-04T16:24:46.4256893Z deleted: sha256:bac6d91e3830e51e96879deaa3e6d0d39da076fa802ebda68f81bdf7ef8342d5 2025-12-04T16:24:46.4257652Z deleted: sha256:ffd496b07151c90e7ddd68a81a36471f51a544187982db5e34621358e1b29681 2025-12-04T16:24:46.4258507Z deleted: sha256:890b2042bdb9e22a614cea1be88366cd3ae15159bf78ac510b9daa6f802493a6 2025-12-04T16:24:46.4259279Z deleted: sha256:ddd9a57b20a8b45ae0e8e350ec266d50a1b9e9a7ff4921470eb38f004d50eb20 2025-12-04T16:24:46.4260046Z deleted: sha256:2f4f91684b8221bc5cbc3f14c7e00bb693854027a1a6de5ad6bdcd000bb579f2 2025-12-04T16:24:46.4260800Z deleted: sha256:9c01ec5e73233284a0f9bb42de59696a1fa61caacacdf63d04df5ebd73895d77 2025-12-04T16:24:46.4261561Z deleted: sha256:f6153a90f0f5316b03f1464826325a1578231b89b3c1f1c83cc7cebdd41cee2a 2025-12-04T16:24:46.4262305Z deleted: sha256:4e89cd2181813af7fd2219923bae493e33111d8b4ebd76f257b7fb26744fda28 2025-12-04T16:24:46.4263065Z deleted: sha256:a0b77eb4054db8f2ea2ec957b3941b4aeee14b59e94a99a1521f90d6e41faf0e 2025-12-04T16:24:46.4263804Z deleted: sha256:1a1b2848f15aa5114f5a67e3705439512880bf1a7a6436cc67760c59b5f10c46 2025-12-04T16:24:46.4264537Z deleted: sha256:004fc01362840c164664c18580e479546fa0b7f9599487558f80190aec30e2b5 2025-12-04T16:24:46.4265402Z deleted: sha256:35f36e20799f0a0dead81bc3701732e43489264e6bee9fcb789b376a99e17e78 2025-12-04T16:24:46.4266142Z deleted: sha256:1207fd2ede86015c3f105620cb491e8199d2060a4a87490de358286d0ae52e4e 2025-12-04T16:24:46.4266902Z deleted: sha256:02dccb85ee744d1fbb819c6da618b2c52a3e4affc89e407f79b875e7b3bbb7df 2025-12-04T16:24:46.4267715Z deleted: sha256:d22e6ff9c3ac9dabbcc6052e1459f8dc4ebd19bd057bd0688615d6cc3ebb5cf0 2025-12-04T16:24:46.4268476Z deleted: sha256:73974f74b436f39a2fdb6461b1e3f7c3e41c73325776fa71d16b942a5b4a365b 2025-12-04T16:24:46.4269091Z untagged: public.ecr.aws/docker/library/python:3.13 2025-12-04T16:24:46.4269929Z untagged: public.ecr.aws/docker/library/python@sha256:3f986299a7b8b44b0d8cf9bda2b22361ce5c3058ef5d7cb17fb7452506680ab0 2025-12-04T16:24:46.4270922Z deleted: sha256:44438aecfedf7b6086fce506dae0db5ba7fc0027f9b743f1a75a6b5cbc7de70a 2025-12-04T16:24:46.4271680Z deleted: sha256:6f09a1f5d8a107c2532fbd116e75116cb75fa77b1a7d72d3bdf1ac12de152acd 2025-12-04T16:24:46.4272457Z deleted: sha256:fe5f3ac0be086125eb1e3cd10cc33e8e426f4e079381f7ce5a987b626e99fa67 2025-12-04T16:24:46.4273232Z deleted: sha256:79dd2061a22cf919cfc4f1f02704bfda09afadb017265e670ee54441d296c06c 2025-12-04T16:24:46.4274003Z deleted: sha256:9447ad402aafdbee17e999b0ec84ad89c2646dbebf054d469d4f8bee77f66212 2025-12-04T16:24:46.4274751Z deleted: sha256:7a4909f3c1975be52292f53107495ee1b41c17494918767ccedf1cf1688ae318 2025-12-04T16:24:46.4275494Z deleted: sha256:3474923d97f1f498237650a7d51bd4aea37d5e6b9d8a778777920584af5dd560 2025-12-04T16:24:46.4276244Z deleted: sha256:683afd1773444401a9cbd24842ee5d9154a11abb4fab63ddea5c03df788597ee 2025-12-04T16:24:46.4276693Z 2025-12-04T16:24:46.4276848Z Total reclaimed space: 35.48GB 2025-12-04T16:24:46.4312632Z ##[group]Run set +e 2025-12-04T16:24:46.4313021Z set +e 2025-12-04T16:24:46.4313290Z set -x 2025-12-04T16:24:46.4313533Z  2025-12-04T16:24:46.4313780Z nvidia-smi 2025-12-04T16:24:46.4314411Z # NB: Surprisingly, nvidia-smi command returns successfully with return code 0 even in 2025-12-04T16:24:46.4315233Z # the case where the driver has already crashed as it still can get the driver version 2025-12-04T16:24:46.4316038Z # and some basic information like the bus ID. However, the rest of the information 2025-12-04T16:24:46.4316657Z # would be missing (ERR!), for example: 2025-12-04T16:24:46.4317043Z # 2025-12-04T16:24:46.4317387Z # +-----------------------------------------------------------------------------+ 2025-12-04T16:24:46.4318018Z # | NVIDIA-SMI 525.89.02 Driver Version: 525.89.02 CUDA Version: 12.0 | 2025-12-04T16:24:46.4318688Z # |-------------------------------+----------------------+----------------------+ 2025-12-04T16:24:46.4319301Z # | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | 2025-12-04T16:24:46.4319991Z # | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | 2025-12-04T16:24:46.4320559Z # | | | MIG M. | 2025-12-04T16:24:46.4320991Z # |===============================+======================+======================| 2025-12-04T16:24:46.4321466Z # | 0 ERR! Off | 00000000:00:1E.0 Off | ERR! | 2025-12-04T16:24:46.4322038Z # |ERR! ERR! ERR! ERR! / ERR! | 4184MiB / 23028MiB | ERR! Default | 2025-12-04T16:24:46.4322552Z # | | | ERR! | 2025-12-04T16:24:46.4323055Z # +-------------------------------+----------------------+----------------------+ 2025-12-04T16:24:46.4323494Z # 2025-12-04T16:24:46.4323844Z # +-----------------------------------------------------------------------------+ 2025-12-04T16:24:46.4324386Z # | Processes: | 2025-12-04T16:24:46.4324952Z # | GPU GI CI PID Type Process name GPU Memory | 2025-12-04T16:24:46.4325466Z # | ID ID Usage | 2025-12-04T16:24:46.4325906Z # |=============================================================================| 2025-12-04T16:24:46.4326471Z # +-----------------------------------------------------------------------------+ 2025-12-04T16:24:46.4326918Z # 2025-12-04T16:24:46.4327354Z # This should be reported as a failure instead as it will guarantee to fail when 2025-12-04T16:24:46.4327956Z # Docker tries to run with --gpus all 2025-12-04T16:24:46.4328335Z # 2025-12-04T16:24:46.4328749Z # So, the correct check here is to query one of the missing piece of info like 2025-12-04T16:24:46.4329377Z # GPU name, so that the command can fail accordingly 2025-12-04T16:24:46.4329953Z nvidia-smi --query-gpu=gpu_name --format=csv,noheader --id=0 2025-12-04T16:24:46.4330464Z NVIDIA_SMI_STATUS=$? 2025-12-04T16:24:46.4330762Z  2025-12-04T16:24:46.4331280Z # These are acceptable return code from nvidia-smi as copied from setup-nvidia GitHub action 2025-12-04T16:24:46.4332059Z if [ "$NVIDIA_SMI_STATUS" -ne 0 ] && [ "$NVIDIA_SMI_STATUS" -ne 14 ]; then 2025-12-04T16:24:46.4332744Z  echo "NVIDIA driver installation has failed, shutting down the runner..." 2025-12-04T16:24:46.4333347Z  .github/scripts/stop_runner_service.sh 2025-12-04T16:24:46.4333735Z fi 2025-12-04T16:24:46.4333975Z  2025-12-04T16:24:46.4334545Z # For runner with multiple GPUs, we also want to confirm that the number of GPUs are the 2025-12-04T16:24:46.4335290Z # power of 2, i.e. 1, 2, 4, or 8. This is to avoid flaky test issue when one GPU fails 2025-12-04T16:24:46.4335962Z # https://github.com/pytorch/test-infra/issues/4000 2025-12-04T16:24:46.4336474Z GPU_COUNT=$(nvidia-smi --list-gpus | wc -l) 2025-12-04T16:24:46.4336877Z NVIDIA_SMI_STATUS=$? 2025-12-04T16:24:46.4337190Z  2025-12-04T16:24:46.4337708Z # These are acceptable return code from nvidia-smi as copied from setup-nvidia GitHub action 2025-12-04T16:24:46.4338575Z if [ "$NVIDIA_SMI_STATUS" -ne 0 ] && [ "$NVIDIA_SMI_STATUS" -ne 14 ]; then 2025-12-04T16:24:46.4339282Z  echo "NVIDIA driver installation has failed, shutting down the runner..." 2025-12-04T16:24:46.4339892Z  .github/scripts/stop_runner_service.sh 2025-12-04T16:24:46.4340282Z fi 2025-12-04T16:24:46.4340517Z  2025-12-04T16:24:46.4340803Z # Check the GPU count to be a power of 2 2025-12-04T16:24:46.4341465Z if [ "$GPU_COUNT" -le 8 ] && [ "$GPU_COUNT" -ne 1 ] && [ "$GPU_COUNT" -ne 2 ] && [ "$GPU_COUNT" -ne 4 ] && [ "$GPU_COUNT" -ne 8 ]; then 2025-12-04T16:24:46.4342355Z  echo "NVIDIA driver detects $GPU_COUNT GPUs. The runner has a broken GPU, shutting it down..." 2025-12-04T16:24:46.4343030Z  .github/scripts/stop_runner_service.sh 2025-12-04T16:24:46.4343409Z fi 2025-12-04T16:24:46.4353500Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:46.4353943Z env: 2025-12-04T16:24:46.4354195Z GIT_DEFAULT_BRANCH: main 2025-12-04T16:24:46.4354509Z HAS_NVIDIA_GPU: true 2025-12-04T16:24:46.4354858Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2025-12-04T16:24:46.4355513Z DOCKER_CONTAINER_ID: fffd24d1695495ee6e5261d13f6ea512c367e4b5932b9e51d95e58b019b8db58 2025-12-04T16:24:46.4356101Z ##[endgroup] 2025-12-04T16:24:46.4385445Z + nvidia-smi 2025-12-04T16:24:46.4587087Z Thu Dec 4 16:24:46 2025 2025-12-04T16:24:46.4587525Z +-----------------------------------------------------------------------------+ 2025-12-04T16:24:46.4588385Z | NVIDIA-SMI 525.105.17 Driver Version: 525.105.17 CUDA Version: 12.0 | 2025-12-04T16:24:46.4588995Z |-------------------------------+----------------------+----------------------+ 2025-12-04T16:24:46.4589579Z | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | 2025-12-04T16:24:46.4590232Z | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | 2025-12-04T16:24:46.4590886Z | | | MIG M. | 2025-12-04T16:24:46.4591292Z |===============================+======================+======================| 2025-12-04T16:24:46.4750294Z | 0 Tesla T4 On | 00000000:00:1E.0 Off | 0 | 2025-12-04T16:24:46.4750857Z | N/A 22C P8 9W / 70W | 2MiB / 15360MiB | 0% Default | 2025-12-04T16:24:46.4751322Z | | | N/A | 2025-12-04T16:24:46.4751786Z +-------------------------------+----------------------+----------------------+ 2025-12-04T16:24:46.4752244Z 2025-12-04T16:24:46.4752700Z +-----------------------------------------------------------------------------+ 2025-12-04T16:24:46.4753211Z | Processes: | 2025-12-04T16:24:46.4753724Z | GPU GI CI PID Type Process name GPU Memory | 2025-12-04T16:24:46.4754220Z | ID ID Usage | 2025-12-04T16:24:46.4754635Z |=============================================================================| 2025-12-04T16:24:46.4755619Z | No running processes found | 2025-12-04T16:24:46.4756178Z +-----------------------------------------------------------------------------+ 2025-12-04T16:24:46.5577011Z + nvidia-smi --query-gpu=gpu_name --format=csv,noheader --id=0 2025-12-04T16:24:46.5751506Z Tesla T4 2025-12-04T16:24:46.5785851Z + NVIDIA_SMI_STATUS=0 2025-12-04T16:24:46.5786162Z + '[' 0 -ne 0 ']' 2025-12-04T16:24:46.5791844Z ++ nvidia-smi --list-gpus 2025-12-04T16:24:46.5793189Z ++ wc -l 2025-12-04T16:24:46.5994226Z + GPU_COUNT=1 2025-12-04T16:24:46.5994519Z + NVIDIA_SMI_STATUS=0 2025-12-04T16:24:46.5994792Z + '[' 0 -ne 0 ']' 2025-12-04T16:24:46.5995053Z + '[' 1 -le 8 ']' 2025-12-04T16:24:46.5995322Z + '[' 1 -ne 1 ']' 2025-12-04T16:24:46.6073068Z Post job cleanup. 2025-12-04T16:24:46.6162193Z Post job cleanup. 2025-12-04T16:24:46.6213091Z Post job cleanup. 2025-12-04T16:24:46.7366410Z [command]/usr/bin/git version 2025-12-04T16:24:46.7427917Z git version 2.50.1 2025-12-04T16:24:46.7467887Z Copying '/home/ec2-user/.gitconfig' to '/home/ec2-user/actions-runner/_work/_temp/047827e5-76b9-4459-a843-459633717425/.gitconfig' 2025-12-04T16:24:46.7478001Z Temporarily overriding HOME='/home/ec2-user/actions-runner/_work/_temp/047827e5-76b9-4459-a843-459633717425' before making global git config changes 2025-12-04T16:24:46.7479534Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T16:24:46.7483694Z [command]/usr/bin/git config --global --add safe.directory /home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-12-04T16:24:46.7525666Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T16:24:46.7568348Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T16:24:46.7915477Z Entering 'android/libs/fbjni' 2025-12-04T16:24:46.7981848Z Entering 'third_party/FP16' 2025-12-04T16:24:46.8044952Z Entering 'third_party/FXdiv' 2025-12-04T16:24:46.8108263Z Entering 'third_party/NNPACK' 2025-12-04T16:24:46.8172167Z Entering 'third_party/NVTX' 2025-12-04T16:24:46.8235653Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T16:24:46.8298893Z Entering 'third_party/XNNPACK' 2025-12-04T16:24:46.8378755Z Entering 'third_party/aiter' 2025-12-04T16:24:46.8442118Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T16:24:46.8513998Z Entering 'third_party/benchmark' 2025-12-04T16:24:46.8577004Z Entering 'third_party/composable_kernel' 2025-12-04T16:24:46.8651463Z Entering 'third_party/cpp-httplib' 2025-12-04T16:24:46.8714661Z Entering 'third_party/cpuinfo' 2025-12-04T16:24:46.8779879Z Entering 'third_party/cudnn_frontend' 2025-12-04T16:24:46.8845105Z Entering 'third_party/cutlass' 2025-12-04T16:24:46.8920278Z Entering 'third_party/fbgemm' 2025-12-04T16:24:46.8988280Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T16:24:46.9050156Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T16:24:46.9125445Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T16:24:46.9188360Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T16:24:46.9259647Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T16:24:46.9323367Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T16:24:46.9384457Z Entering 'third_party/fbgemm/external/json' 2025-12-04T16:24:46.9449114Z Entering 'third_party/flash-attention' 2025-12-04T16:24:46.9513960Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T16:24:46.9583126Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T16:24:46.9654625Z Entering 'third_party/flatbuffers' 2025-12-04T16:24:46.9719738Z Entering 'third_party/fmt' 2025-12-04T16:24:46.9782271Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T16:24:46.9847806Z Entering 'third_party/gloo' 2025-12-04T16:24:46.9911009Z Entering 'third_party/googletest' 2025-12-04T16:24:46.9972659Z Entering 'third_party/ideep' 2025-12-04T16:24:47.0037196Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T16:24:47.0107709Z Entering 'third_party/ittapi' 2025-12-04T16:24:47.0169659Z Entering 'third_party/kineto' 2025-12-04T16:24:47.0232100Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T16:24:47.0297917Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T16:24:47.0361413Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T16:24:47.0423235Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T16:24:47.0485268Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T16:24:47.0546154Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T16:24:47.0612077Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T16:24:47.0673974Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T16:24:47.0735755Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T16:24:47.0800176Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T16:24:47.0862643Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T16:24:47.0924840Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T16:24:47.0992552Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T16:24:47.1059280Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T16:24:47.1120621Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T16:24:47.1184166Z Entering 'third_party/kleidiai' 2025-12-04T16:24:47.1247565Z Entering 'third_party/mimalloc' 2025-12-04T16:24:47.1314727Z Entering 'third_party/nlohmann' 2025-12-04T16:24:47.1379141Z Entering 'third_party/onnx' 2025-12-04T16:24:47.1460761Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T16:24:47.1526294Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T16:24:47.1591267Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T16:24:47.1651230Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T16:24:47.1712405Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T16:24:47.1772385Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T16:24:47.1838521Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T16:24:47.1900108Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T16:24:47.1959888Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T16:24:47.2020407Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T16:24:47.2085331Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T16:24:47.2149358Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T16:24:47.2234331Z Entering 'third_party/pocketfft' 2025-12-04T16:24:47.2297292Z Entering 'third_party/protobuf' 2025-12-04T16:24:47.2363770Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T16:24:47.2426276Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T16:24:47.2491942Z Entering 'third_party/psimd' 2025-12-04T16:24:47.2554498Z Entering 'third_party/pthreadpool' 2025-12-04T16:24:47.2618838Z Entering 'third_party/pybind11' 2025-12-04T16:24:47.2681064Z Entering 'third_party/python-peachpy' 2025-12-04T16:24:47.2743156Z Entering 'third_party/sleef' 2025-12-04T16:24:47.2807374Z Entering 'third_party/tensorpipe' 2025-12-04T16:24:47.2871428Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T16:24:47.2934249Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T16:24:47.2995609Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T16:24:47.3057754Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T16:24:47.3120199Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T16:24:47.3204537Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T16:24:47.3228720Z http.https://github.com/.extraheader 2025-12-04T16:24:47.3238820Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-12-04T16:24:47.3271540Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T16:24:47.3605789Z Entering 'android/libs/fbjni' 2025-12-04T16:24:47.3648690Z http.https://github.com/.extraheader 2025-12-04T16:24:47.3686201Z Entering 'third_party/FP16' 2025-12-04T16:24:47.3728488Z http.https://github.com/.extraheader 2025-12-04T16:24:47.3766371Z Entering 'third_party/FXdiv' 2025-12-04T16:24:47.3811270Z http.https://github.com/.extraheader 2025-12-04T16:24:47.3848931Z Entering 'third_party/NNPACK' 2025-12-04T16:24:47.3892030Z http.https://github.com/.extraheader 2025-12-04T16:24:47.3930544Z Entering 'third_party/NVTX' 2025-12-04T16:24:47.3972802Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4012557Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T16:24:47.4054905Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4096330Z Entering 'third_party/XNNPACK' 2025-12-04T16:24:47.4139337Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4194705Z Entering 'third_party/aiter' 2025-12-04T16:24:47.4237213Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4276157Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T16:24:47.4318765Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4366433Z Entering 'third_party/benchmark' 2025-12-04T16:24:47.4409466Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4447686Z Entering 'third_party/composable_kernel' 2025-12-04T16:24:47.4490352Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4537283Z Entering 'third_party/cpp-httplib' 2025-12-04T16:24:47.4579729Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4617664Z Entering 'third_party/cpuinfo' 2025-12-04T16:24:47.4660208Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4699079Z Entering 'third_party/cudnn_frontend' 2025-12-04T16:24:47.4741259Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4778817Z Entering 'third_party/cutlass' 2025-12-04T16:24:47.4822881Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4871489Z Entering 'third_party/fbgemm' 2025-12-04T16:24:47.4914532Z http.https://github.com/.extraheader 2025-12-04T16:24:47.4954228Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T16:24:47.4996022Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5033417Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T16:24:47.5076745Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5124330Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T16:24:47.5166140Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5203775Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T16:24:47.5246143Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5294179Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T16:24:47.5337096Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5373891Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T16:24:47.5416067Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5452048Z Entering 'third_party/fbgemm/external/json' 2025-12-04T16:24:47.5494127Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5533662Z Entering 'third_party/flash-attention' 2025-12-04T16:24:47.5576179Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5613396Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T16:24:47.5654442Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5700087Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T16:24:47.5741557Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5789722Z Entering 'third_party/flatbuffers' 2025-12-04T16:24:47.5834186Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5875353Z Entering 'third_party/fmt' 2025-12-04T16:24:47.5919611Z http.https://github.com/.extraheader 2025-12-04T16:24:47.5957657Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T16:24:47.6001132Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6038658Z Entering 'third_party/gloo' 2025-12-04T16:24:47.6081775Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6119671Z Entering 'third_party/googletest' 2025-12-04T16:24:47.6162561Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6202514Z Entering 'third_party/ideep' 2025-12-04T16:24:47.6244264Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6280466Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T16:24:47.6323539Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6371052Z Entering 'third_party/ittapi' 2025-12-04T16:24:47.6413379Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6450867Z Entering 'third_party/kineto' 2025-12-04T16:24:47.6493400Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6529940Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T16:24:47.6572809Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6611398Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T16:24:47.6653766Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6694622Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T16:24:47.6736735Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6775393Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T16:24:47.6820437Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6859936Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T16:24:47.6903081Z http.https://github.com/.extraheader 2025-12-04T16:24:47.6939302Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T16:24:47.6981997Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7023433Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T16:24:47.7065467Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7104917Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T16:24:47.7147209Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7185853Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T16:24:47.7228142Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7267602Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T16:24:47.7311694Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7349642Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T16:24:47.7393903Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7430445Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T16:24:47.7473437Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7514686Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T16:24:47.7557835Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7601108Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T16:24:47.7643150Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7680068Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T16:24:47.7722612Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7762067Z Entering 'third_party/kleidiai' 2025-12-04T16:24:47.7809786Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7848225Z Entering 'third_party/mimalloc' 2025-12-04T16:24:47.7892127Z http.https://github.com/.extraheader 2025-12-04T16:24:47.7929465Z Entering 'third_party/nlohmann' 2025-12-04T16:24:47.7971891Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8011755Z Entering 'third_party/onnx' 2025-12-04T16:24:47.8055199Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8115915Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T16:24:47.8158886Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8200332Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T16:24:47.8242782Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8282277Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T16:24:47.8325530Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8380136Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T16:24:47.8409054Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8445929Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T16:24:47.8488136Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8524770Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T16:24:47.8566232Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8606106Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T16:24:47.8648217Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8685667Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T16:24:47.8726997Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8763592Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T16:24:47.8807338Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8843810Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T16:24:47.8885755Z http.https://github.com/.extraheader 2025-12-04T16:24:47.8924843Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T16:24:47.8967106Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9007829Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T16:24:47.9049288Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9114716Z Entering 'third_party/pocketfft' 2025-12-04T16:24:47.9157511Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9194721Z Entering 'third_party/protobuf' 2025-12-04T16:24:47.9236865Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9276848Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T16:24:47.9319301Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9355686Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T16:24:47.9398083Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9438284Z Entering 'third_party/psimd' 2025-12-04T16:24:47.9481315Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9519407Z Entering 'third_party/pthreadpool' 2025-12-04T16:24:47.9562101Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9600297Z Entering 'third_party/pybind11' 2025-12-04T16:24:47.9642943Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9680557Z Entering 'third_party/python-peachpy' 2025-12-04T16:24:47.9723856Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9761221Z Entering 'third_party/sleef' 2025-12-04T16:24:47.9804195Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9841299Z Entering 'third_party/tensorpipe' 2025-12-04T16:24:47.9884147Z http.https://github.com/.extraheader 2025-12-04T16:24:47.9921851Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T16:24:47.9963728Z http.https://github.com/.extraheader 2025-12-04T16:24:48.0001884Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T16:24:48.0043018Z http.https://github.com/.extraheader 2025-12-04T16:24:48.0081196Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T16:24:48.0123381Z http.https://github.com/.extraheader 2025-12-04T16:24:48.0160478Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T16:24:48.0203304Z http.https://github.com/.extraheader 2025-12-04T16:24:48.0239093Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T16:24:48.0281568Z http.https://github.com/.extraheader 2025-12-04T16:24:48.0348854Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.0381675Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T16:24:48.0728294Z Entering 'android/libs/fbjni' 2025-12-04T16:24:48.0756843Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T16:24:48.0775181Z Entering 'third_party/FP16' 2025-12-04T16:24:48.0805746Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T16:24:48.0823417Z Entering 'third_party/FXdiv' 2025-12-04T16:24:48.0853154Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T16:24:48.0871900Z Entering 'third_party/NNPACK' 2025-12-04T16:24:48.0901611Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T16:24:48.0919737Z Entering 'third_party/NVTX' 2025-12-04T16:24:48.0949049Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T16:24:48.0968397Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T16:24:48.1000787Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T16:24:48.1019193Z Entering 'third_party/XNNPACK' 2025-12-04T16:24:48.1048085Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T16:24:48.1083872Z Entering 'third_party/aiter' 2025-12-04T16:24:48.1114383Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T16:24:48.1133428Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T16:24:48.1160944Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T16:24:48.1190254Z Entering 'third_party/benchmark' 2025-12-04T16:24:48.1219095Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T16:24:48.1241877Z Entering 'third_party/composable_kernel' 2025-12-04T16:24:48.1270791Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T16:24:48.1299605Z Entering 'third_party/cpp-httplib' 2025-12-04T16:24:48.1328397Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T16:24:48.1347674Z Entering 'third_party/cpuinfo' 2025-12-04T16:24:48.1376861Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T16:24:48.1396728Z Entering 'third_party/cudnn_frontend' 2025-12-04T16:24:48.1426074Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T16:24:48.1444631Z Entering 'third_party/cutlass' 2025-12-04T16:24:48.1474052Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T16:24:48.1504407Z Entering 'third_party/fbgemm' 2025-12-04T16:24:48.1534075Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T16:24:48.1555593Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T16:24:48.1583873Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T16:24:48.1601778Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T16:24:48.1630429Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T16:24:48.1657656Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T16:24:48.1686645Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T16:24:48.1704977Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T16:24:48.1732359Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T16:24:48.1759388Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T16:24:48.1787906Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T16:24:48.1805724Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T16:24:48.1833984Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T16:24:48.1851553Z Entering 'third_party/fbgemm/external/json' 2025-12-04T16:24:48.1879206Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T16:24:48.1900940Z Entering 'third_party/flash-attention' 2025-12-04T16:24:48.1929546Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T16:24:48.1948224Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T16:24:48.1975717Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T16:24:48.2001943Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T16:24:48.2029239Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T16:24:48.2059461Z Entering 'third_party/flatbuffers' 2025-12-04T16:24:48.2088370Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T16:24:48.2110455Z Entering 'third_party/fmt' 2025-12-04T16:24:48.2139478Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T16:24:48.2157756Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T16:24:48.2187697Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T16:24:48.2206252Z Entering 'third_party/gloo' 2025-12-04T16:24:48.2235741Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T16:24:48.2254460Z Entering 'third_party/googletest' 2025-12-04T16:24:48.2283186Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T16:24:48.2303078Z Entering 'third_party/ideep' 2025-12-04T16:24:48.2331954Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T16:24:48.2349293Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T16:24:48.2377057Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T16:24:48.2404675Z Entering 'third_party/ittapi' 2025-12-04T16:24:48.2433645Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T16:24:48.2451939Z Entering 'third_party/kineto' 2025-12-04T16:24:48.2481280Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T16:24:48.2499794Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T16:24:48.2527486Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T16:24:48.2545251Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T16:24:48.2573566Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T16:24:48.2593108Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T16:24:48.2621438Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T16:24:48.2639373Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T16:24:48.2667213Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T16:24:48.2685954Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T16:24:48.2716042Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T16:24:48.2732781Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T16:24:48.2761060Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T16:24:48.2781146Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T16:24:48.2810758Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T16:24:48.2828477Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T16:24:48.2856585Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T16:24:48.2875595Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T16:24:48.2908582Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T16:24:48.2927716Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T16:24:48.2956293Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T16:24:48.2974394Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T16:24:48.3004629Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T16:24:48.3022617Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T16:24:48.3051299Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T16:24:48.3072031Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T16:24:48.3102015Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T16:24:48.3124128Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T16:24:48.3152254Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T16:24:48.3169507Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T16:24:48.3198719Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T16:24:48.3219054Z Entering 'third_party/kleidiai' 2025-12-04T16:24:48.3249629Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T16:24:48.3269669Z Entering 'third_party/mimalloc' 2025-12-04T16:24:48.3299450Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T16:24:48.3318427Z Entering 'third_party/nlohmann' 2025-12-04T16:24:48.3347651Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T16:24:48.3367281Z Entering 'third_party/onnx' 2025-12-04T16:24:48.3396231Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T16:24:48.3434618Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T16:24:48.3463161Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T16:24:48.3483833Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T16:24:48.3513439Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T16:24:48.3533965Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T16:24:48.3561512Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T16:24:48.3579051Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T16:24:48.3607274Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T16:24:48.3624971Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T16:24:48.3652366Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T16:24:48.3669991Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T16:24:48.3697944Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T16:24:48.3717065Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T16:24:48.3744767Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T16:24:48.3762060Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T16:24:48.3790146Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T16:24:48.3808744Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T16:24:48.3836624Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T16:24:48.3853119Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T16:24:48.3880710Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T16:24:48.3902807Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T16:24:48.3930328Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T16:24:48.3950187Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T16:24:48.3977638Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T16:24:48.4017239Z Entering 'third_party/pocketfft' 2025-12-04T16:24:48.4048193Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T16:24:48.4065300Z Entering 'third_party/protobuf' 2025-12-04T16:24:48.4094125Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T16:24:48.4116194Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T16:24:48.4144049Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T16:24:48.4161352Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T16:24:48.4188834Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T16:24:48.4208894Z Entering 'third_party/psimd' 2025-12-04T16:24:48.4238106Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T16:24:48.4256367Z Entering 'third_party/pthreadpool' 2025-12-04T16:24:48.4285171Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T16:24:48.4305797Z Entering 'third_party/pybind11' 2025-12-04T16:24:48.4335048Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T16:24:48.4354018Z Entering 'third_party/python-peachpy' 2025-12-04T16:24:48.4383146Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T16:24:48.4402910Z Entering 'third_party/sleef' 2025-12-04T16:24:48.4432442Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T16:24:48.4450922Z Entering 'third_party/tensorpipe' 2025-12-04T16:24:48.4480522Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T16:24:48.4499514Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T16:24:48.4527010Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T16:24:48.4544622Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T16:24:48.4573219Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T16:24:48.4591957Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T16:24:48.4619247Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T16:24:48.4636986Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T16:24:48.4665535Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T16:24:48.4682598Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T16:24:48.4710676Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T16:24:48.4751238Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4781022Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4808876Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4835971Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4862771Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4889762Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4918868Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4945448Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4972733Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.4999775Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5026937Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5054018Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5080272Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5106970Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5132731Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5159392Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5186617Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5213626Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5239442Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5265720Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5292651Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5320173Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5351026Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5376830Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5402519Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5429248Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5455331Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5481540Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5512136Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5537918Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5563730Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5590106Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5616248Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5642968Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5669823Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5698236Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5724437Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5751775Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5777553Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5805141Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5833939Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5860297Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5895374Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5921763Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5947601Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.5974287Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6001513Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6030053Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6057793Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6084160Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6110873Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6136885Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6162401Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6191154Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6216894Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6242751Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6268789Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6295966Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6321346Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6348807Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6374730Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6402339Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6428577Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6454864Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6481355Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6518128Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6544169Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6570416Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6601727Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6628406Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6655093Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6681035Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6707913Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6734124Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6759953Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6786030Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6812622Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6839551Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6865781Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6892767Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.6918569Z [command]/usr/bin/git config --file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T16:24:48.7028813Z A job completed hook has been configured by the self-hosted runner administrator 2025-12-04T16:24:48.7044157Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-12-04T16:24:48.7050129Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T16:24:48.7050586Z ##[endgroup] 2025-12-04T16:24:57.0286867Z Cleaning up orphan processes